MATIH Platform is in active MVP development. Documentation reflects current implementation status.
17. Kubernetes & Helm
Monitoring Stack
Grafana

Grafana

Grafana provides visualization dashboards for all MATIH metrics, logs, and traces with provisioned data sources and pre-built dashboards.


Data Sources

SourceTypePurpose
Prometheus (CP)PrometheusControl plane metrics
Prometheus (DP)PrometheusData plane metrics
LokiLokiLog querying
TempoTempoTrace exploration
ClickHouseClickHouseBusiness analytics

Pre-Built Dashboards

DashboardMetrics SourceKey Panels
Platform OverviewPrometheusService health, request rate, error rate
AI ServicePrometheusInference latency, LLM costs, token usage
TrinoPrometheusQuery throughput, memory usage, queue depth
KafkaPrometheusBroker metrics, consumer lag, topic throughput
PostgreSQLPrometheusConnection pool, query duration, replication lag
Node ResourcesPrometheusCPU, memory, disk per node pool
Cost AttributionClickHousePer-tenant cost breakdown

Authentication

Grafana authenticates via the MATIH IAM service using OAuth2/OIDC. Admin credentials are stored in a Kubernetes secret:

# Password from External Secrets Operator
grafanaAdminPassword: "grafana-admin-password"  # Key Vault reference