MATIH Platform is in active MVP development. Documentation reflects current implementation status.
9. Query Engine & SQL
Overview

Trino Integration Overview

Trino serves as the primary query execution engine for the MATIH platform. The Query Engine integrates with Trino for complex analytics, federated queries across multiple data sources, and high-performance SQL processing. This section covers configuration, catalog management, data federation, and connector setup.


Architecture

The Query Engine communicates with Trino via its REST API and JDBC interface. Each tenant's queries are routed to Trino with tenant-specific session properties and catalog access controls.

Query Engine              Trino Coordinator            Trino Workers
     |                          |                          |
     |--- Submit query -------->|                          |
     |    (with session props)  |--- Parse & plan -------->|
     |                          |--- Distribute tasks ---->|
     |                          |                          |--- Execute
     |                          |                          |--- Return results
     |<-- Query results --------|<-- Aggregate results ----|

Key Features

FeatureDescription
Federated queriesQuery across Delta Lake, Iceberg, PostgreSQL, and more
Session propertiesPer-tenant resource limits and configuration
Catalog isolationTenants access only their authorized catalogs
Query queueingResource groups manage concurrency per tenant
Cost-based optimizationAutomatic query plan optimization

Trino Deployment

Trino is deployed as part of the data infrastructure via Helm:

ComponentReplicasDescription
Coordinator1Query parsing, planning, and coordination
Workers2+ (auto-scaled)Distributed query execution

Section Pages

PageDescription
ConfigurationTrino connection settings and resource groups
CatalogsCatalog configuration for data sources
Data FederationCross-source query federation
ConnectorsAvailable Trino connectors and setup