MATIH Platform is in active MVP development. Documentation reflects current implementation status.

JupyterHub

JupyterHub provides multi-user notebook environments for data scientists and ML engineers. Configurable spawner profiles determine the CPU, memory, and GPU resources allocated to each user's notebook pod.


Architecture

+------------------+     +------------------+
| JupyterHub Proxy |     | JupyterHub       |
| (Port 80/443)    |---->| (Spawner)        |
+------------------+     +------------------+
                                 |
                          +------+------+
                          | User Pods   |
                          | (Notebooks) |
                          +-------------+

Spawner Configuration

Profile   CPU   Memory   GPU   Use Case
Small     2     4Gi      0     Data exploration
Medium    4     16Gi     0     Feature engineering
Large     8     32Gi     0     Model training (CPU)
GPU       4     16Gi     1     Deep learning
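
As an illustration, the profiles above could be expressed in the `profile_list` format that KubeSpawner accepts. This is a minimal sketch, not the MATIH configuration itself: it assumes the hub uses KubeSpawner, that GPUs are exposed through the NVIDIA device plugin resource name `nvidia.com/gpu`, and that the table's CPU and memory values are applied as limits. The helper name `build_profile_list` and the slugs are hypothetical.

```python
# Sketch: translate the spawner profile table into KubeSpawner-style
# profile entries. All names here are illustrative assumptions.

# (name, cpu cores, memory in GiB, gpu count, use case), taken from the table
PROFILES = [
    ("Small", 2, 4, 0, "Data exploration"),
    ("Medium", 4, 16, 0, "Feature engineering"),
    ("Large", 8, 32, 0, "Model training (CPU)"),
    ("GPU", 4, 16, 1, "Deep learning"),
]


def build_profile_list(profiles=PROFILES):
    """Return a list of dicts shaped like KubeSpawner's profile_list."""
    profile_list = []
    for name, cpu, mem_gib, gpu, use_case in profiles:
        override = {
            "cpu_limit": cpu,
            # JupyterHub byte specifications use the suffix "G" for
            # 1024-based gigabytes, so 4Gi is written as "4G" here.
            "mem_limit": f"{mem_gib}G",
        }
        if gpu:
            # Assumption: GPUs are scheduled via the NVIDIA device plugin.
            override["extra_resource_limits"] = {"nvidia.com/gpu": str(gpu)}
        profile_list.append(
            {
                "display_name": f"{name} ({cpu} CPU, {mem_gib}Gi RAM, {gpu} GPU)",
                "slug": name.lower(),
                "description": use_case,
                "kubespawner_override": override,
            }
        )
    return profile_list


if __name__ == "__main__":
    for profile in build_profile_list():
        print(profile["slug"], profile["kubespawner_override"])
```

In a `jupyterhub_config.py`, the result would typically be assigned with `c.KubeSpawner.profile_list = build_profile_list()`, so that users choose a profile on the spawn page.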

Authentication

JupyterHub authenticates users through the MATIH IAM service using OAuth2/OIDC:

auth:
  type: oauth2
  oauth2:
    authorizeUrl: "https://api.matih.ai/api/v1/auth/oauth2/authorize"
    tokenUrl: "https://api.matih.ai/api/v1/auth/oauth2/token"
    userInfoUrl: "https://api.matih.ai/api/v1/auth/oauth2/userinfo"
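
To show how the authorize endpoint above is used, the following sketch builds the redirect URL for the first step of a standard OAuth2 authorization code flow. Only the endpoint URL comes from the configuration; the client ID, the redirect URI, the `openid` scope, and the helper name `build_authorize_url` are assumptions chosen for illustration.

```python
# Sketch: construct the OAuth2 authorization request URL (RFC 6749,
# authorization code grant). Parameter values are hypothetical.
import secrets
from urllib.parse import urlencode

# Taken from the auth configuration above.
AUTHORIZE_URL = "https://api.matih.ai/api/v1/auth/oauth2/authorize"


def build_authorize_url(client_id, redirect_uri, state, scope="openid"):
    """Return the URL the browser is redirected to in order to log in."""
    params = {
        "response_type": "code",  # request an authorization code
        "client_id": client_id,
        "redirect_uri": redirect_uri,
        "scope": scope,  # assumption: the IAM service accepts "openid"
        "state": state,  # opaque value checked on callback (CSRF protection)
    }
    return f"{AUTHORIZE_URL}?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical client ID and hub hostname.
    url = build_authorize_url(
        client_id="jupyterhub",
        redirect_uri="https://hub.example.com/hub/oauth_callback",
        state=secrets.token_urlsafe(16),
    )
    print(url)
```

After the user signs in, the IAM service redirects back with a `code` parameter, which the hub exchanges at `tokenUrl` for an access token and then uses against `userInfoUrl` to identify the user.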

Pre-installed Packages

User notebook environments include pre-configured connections to all MATIH data infrastructure: Trino, Spark Connect, MLflow, Feast, and S3/MinIO.