MATIH Platform is in active MVP development. Documentation reflects current implementation status.
17. Kubernetes & Helm
Flink

Flink

Apache Flink provides real-time stream processing for event aggregation, CDC processing, and streaming analytics.


Architecture

+----------------+     +------------------+
| Job Manager    |     | Task Managers    |
| (1 replica)    |---->| (2-8 replicas)   |
+----------------+     +------------------+
       |                       |
       v                       v
+----------------+     +------------------+
| Kafka          |     | State Backend    |
| (Source/Sink)  |     | (RocksDB + S3)   |
+----------------+     +------------------+

Flink SQL Jobs

MATIH defines streaming SQL jobs deployed via K8s manifests:

JobSourcePurpose
agent-performance-aggmatih.ai.agent-tracesAgent performance metrics aggregation
llm-operations-aggmatih.ai.llm-opsLLM cost and latency aggregation
session-analyticsmatih.ai.state-changesSession analytics and funnel tracking
state-transition-cdcmatih.ai.state-changesCDC for state transition history

Checkpointing

checkpointing:
  enabled: true
  interval: "60s"
  minPause: "30s"
  timeout: "600s"
  backend: "rocksdb"
  s3:
    endpoint: "http://minio.matih-data-plane.svc.cluster.local:9000"
    bucket: "flink-checkpoints"