MATIH Platform is in active MVP development. Documentation reflects current implementation status.

Monitoring Architecture

MATIH uses a Prometheus-based monitoring stack with Grafana for visualization, ServiceMonitor CRDs for automatic target discovery, and custom recording rules for SLO tracking. The monitoring infrastructure lives in the matih-monitoring namespace and collects metrics from all control plane and data plane services.
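As a rough illustration of the target discovery described above, a ServiceMonitor for a single service might look like the sketch below. The service name `example-service`, its namespace `example-namespace`, the `app.kubernetes.io/name` label, and the port name `metrics` are hypothetical placeholders, not taken from the MATIH codebase. The `release: monitoring` label assumes the chart's default behavior of selecting only ServiceMonitors labeled with the Helm release name.

```yaml
# Sketch only: names, labels, and the scrape interval are assumptions.
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: example-service            # hypothetical service name
  namespace: matih-monitoring
  labels:
    release: monitoring            # assumed to match the Helm release name
spec:
  # Look for the target Service outside the monitoring namespace.
  namespaceSelector:
    matchNames:
      - example-namespace          # hypothetical namespace of the service
  # Select Services carrying this label.
  selector:
    matchLabels:
      app.kubernetes.io/name: example-service
  endpoints:
    - port: metrics                # must match a named port on the Service
      path: /metrics
      interval: 30s
```

Prometheus picks up a manifest like this without a restart, because the Prometheus Operator regenerates the scrape configuration whenever a matching ServiceMonitor is created or changed.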


Components

| Component | Port | Description |
|---|---|---|
| Prometheus | 9090 | Metrics collection and storage |
| Grafana | 3000 | Dashboard visualization |
| Alertmanager | 9093 | Alert routing and notification |
| Node Exporter | 9100 | Host-level metrics |
| kube-state-metrics | 8080 | Kubernetes object metrics |

Subsections

| Page | Description |
|---|---|
| Prometheus Setup | Installation, configuration, and service discovery |
| Prometheus Rules | Recording rules and alerting rules |
| Grafana Dashboards | Pre-built dashboards for platform monitoring |
| Grafana Data Sources | Configuring Prometheus, Loki, and Tempo data sources |
| Custom Metrics | Application-level custom metrics instrumentation |
| SLO Monitoring | Service Level Objective tracking and error budgets |

Deployment

The monitoring stack is deployed using the kube-prometheus-stack Helm chart:

helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
helm install monitoring prometheus-community/kube-prometheus-stack \
  -f prometheus/values.yaml \
  --namespace matih-monitoring \
  --create-namespace
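
The contents of `prometheus/values.yaml` are not shown here. As a minimal sketch of what such a file could contain, the fragment below enables the five components listed earlier using value keys from the kube-prometheus-stack chart. The retention period is an illustrative assumption, not MATIH's actual setting.

```yaml
# Sketch only: an assumed minimal values file, not the real prometheus/values.yaml.
prometheus:
  prometheusSpec:
    retention: 15d        # illustrative retention period (assumption)

grafana:
  enabled: true           # deploy Grafana alongside Prometheus

alertmanager:
  enabled: true           # deploy Alertmanager for alert routing

nodeExporter:
  enabled: true           # host-level metrics from each node

kubeStateMetrics:
  enabled: true           # Kubernetes object metrics
```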

Metric Collection Flow

Services (metrics endpoint) --> ServiceMonitor CRD --> Prometheus --> Grafana
                                                           |
                                                     Alertmanager --> PagerDuty / Slack
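
The Alertmanager branch of the flow above could be configured roughly as follows, assuming the routing lives under the chart's `alertmanager.config` value. The receiver names, the Slack channel, the `severity` label convention, and both credentials are hypothetical placeholders.

```yaml
# Sketch only: receivers, channel, and keys are placeholders, not MATIH settings.
alertmanager:
  config:
    route:
      receiver: slack-default              # fallback for all alerts
      group_by: [alertname, namespace]
      routes:
        # Escalate alerts labeled severity="critical" to PagerDuty.
        - matchers:
            - 'severity = "critical"'
          receiver: pagerduty-critical
    receivers:
      - name: slack-default
        slack_configs:
          - api_url: https://hooks.slack.com/services/REPLACE/ME   # placeholder webhook
            channel: "#matih-alerts"                               # hypothetical channel
            send_resolved: true
      - name: pagerduty-critical
        pagerduty_configs:
          - routing_key: REPLACE_WITH_INTEGRATION_KEY              # placeholder key
```

Under this sketch, alerts without a `severity="critical"` label fall through to Slack, so PagerDuty is paged only for critical alerts. In practice the webhook URL and routing key would normally come from a Kubernetes Secret rather than from plain values.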