MATIH Platform is in active MVP development. Documentation reflects current implementation status.
17. Kubernetes & Helm
MinIO

MinIO

MinIO provides S3-compatible object storage for development and staging environments. In production, it is replaced by Azure Blob Storage, AWS S3, or Google Cloud Storage.


Configuration

# From matih-data-plane/values.yaml
global:
  storage:
    type: "minio"
    s3:
      endpoint: "http://minio.matih-data-plane.svc.cluster.local:9000"
      region: "us-east-1"

Storage Buckets

BucketPurposeConsumers
curated-dataIceberg tables, curated datasetsTrino, Spark, Pipeline Service
raw-dataRaw ingested dataPipeline Service
ml-artifactsMLflow model artifactsML Service, MLflow
spark-historySpark event logsSpark History Server
airflow-logsAirflow task logsAirflow

Secret Management

Even for dev MinIO, credentials use secretKeyRef:

# Correct: Reference from secret
artifactStore:
  s3:
    existingSecret: mlflow-s3-credentials
    accessKeyIdKey: aws-access-key-id
    secretAccessKeyKey: aws-secret-access-key

Dev vs Production

AspectDev (MinIO)Production
ProviderMinIO StatefulSetAzure Blob / AWS S3 / GCS
Endpointminio.matih-data-plane.svc:9000Cloud-native endpoint
AuthenticationK8s SecretWorkload Identity / IRSA
ReplicationNoneCross-region replication