MATIH Platform is in active MVP development. Documentation reflects current implementation status.
17. Kubernetes & Helm
Data Plane Charts
Pipeline Service

Pipeline Service Chart

The Pipeline Service orchestrates ETL pipelines with Airflow integration, dbt transformation support, and Spark job management.


Chart Configuration

pipeline-service:
  enabled: true
  replicaCount: 2
 
  resources:
    requests:
      cpu: "250m"
      memory: "512Mi"
    limits:
      cpu: "1"
      memory: "2Gi"
 
  config:
    airflow:
      baseUrl: "http://airflow-api-server:8080"
    dbt:
      enabled: true
      version: "1.7"
    spark:
      enabled: true
      masterUrl: "spark://spark-master:7077"

Pipeline Backends

BackendPurposeConnection
AirflowDAG orchestration, schedulingREST API on port 8080
dbtSQL transformationsCLI / API
SparkDistributed data processingSpark Master on port 7077