Platform Status
The platform-status.sh script provides a comprehensive health diagnostic for the MATIH Kubernetes cluster. It reports pod status summaries, identifies failing pods, and provides per-pod diagnostics with error logs.
Source: scripts/tools/platform-status.sh
Usage
./scripts/tools/platform-status.shDiagnostic Functions
| Function | Description |
|---|---|
get_pod_summary | Count pods by status in a namespace |
get_failing_pods | List pods in error states across namespaces |
get_pod_error_logs | Retrieve last N log lines for a pod |
diagnose_pod | Full diagnostic for a single failing pod |
Failure Detection
The script detects the following pod states as failures:
| State | Description |
|---|---|
CrashLoopBackOff | Container repeatedly crashing |
Error | Container exited with error |
ImagePullBackOff | Cannot pull container image |
CreateContainerConfigError | Configuration error (missing secret, security context) |
Init: | Init container failure |
Output Format
Namespace Summary
matih-control-plane:
8 Running
1 CrashLoopBackOff
matih-data-plane:
15 Running
2 ErrorPod Diagnostics
For each failing pod, the script outputs:
Pod: ai-service-abc123 (matih-data-plane)
Status: CrashLoopBackOff
Restarts: 5
Last Error:
ModuleNotFoundError: No module named 'missing_package'Checked Namespaces
| Namespace | Components |
|---|---|
matih-control-plane | CP services, API gateway |
matih-data-plane | DP services, compute engines |
matih-monitoring-control-plane | CP monitoring |
matih-monitoring-data-plane | DP monitoring |
matih-ingress | Ingress controller |
Libraries Used
| Library | Purpose |
|---|---|
core/logging.sh | Color-coded output |
Related Pages
- Stage 18: Validation -- Post-deployment validation
- Service Build and Deploy -- Individual service deployment
- Health Check -- CD pipeline health checks