MATIH Platform is in active MVP development. Documentation reflects current implementation status.
21. Industry Examples & Walkthroughs
Retail & E-Commerce
Industry Overview

Retail & E-Commerce

End-to-end walkthroughs showing how a mid-size e-commerce company uses the MATIH Platform to unify customer data, predict demand, optimize revenue, and make AI-driven strategic decisions.


Industry Context

Retail and e-commerce businesses generate some of the most diverse data in any industry. A single customer interaction can touch transactional databases, clickstream analytics, payment processors, marketing platforms, inventory systems, and shipping providers. The challenge is not data scarcity -- it is data fragmentation.

Most retail teams operate with siloed tools: one dashboard for revenue, another for marketing attribution, a third for inventory. Data scientists copy CSVs between systems. Executives wait days for ad-hoc analyses. The MATIH Platform consolidates these workflows into a single governed environment where every persona -- from data scientist to CEO -- works from the same trusted data.


Company Profile: NovaMart

All walkthroughs in this section follow employees at NovaMart, a fictional mid-size e-commerce company with the following profile:

AttributeValue
Annual Revenue$180M
Active Customers2.1M
Product SKUs45,000
Order Volume~8,000 orders/day
ChannelsWeb, mobile app, 12 retail stores
Data Team4 data scientists, 2 ML engineers, 3 BI analysts, 1 VP Strategy

Sample Datasets

These are the core datasets used across all four walkthroughs. In a production deployment, these tables live in their respective source systems and are ingested into the platform via Airbyte connectors or file imports.

DatasetSourceRowsDescription
ordersPostgreSQL12.4MOrder headers -- order_id, customer_id, order_date, total_amount, status, channel
order_itemsPostgreSQL38.7MLine items -- order_id, product_id, quantity, unit_price, discount
customersPostgreSQL2.1MCustomer profiles -- customer_id, email, signup_date, segment, lifetime_value
productsPostgreSQL45KProduct catalog -- product_id, name, category, subcategory, cost, current_price
inventoryPostgreSQL45KCurrent stock levels -- product_id, warehouse_id, quantity_on_hand, reorder_point
returnsPostgreSQL1.8MReturn records -- return_id, order_id, reason_code, refund_amount, return_date
clickstreamSnowflake340MWeb/app events -- session_id, customer_id, event_type, page_url, timestamp
marketing_campaignsGoogle Ads API2.3KCampaign performance -- campaign_id, spend, impressions, clicks, conversions
supplier_shipmentsCSV Import156KInbound shipments -- shipment_id, supplier_id, product_id, quantity, eta, actual_arrival
customer_surveysCSV Import48KNPS and satisfaction scores -- customer_id, survey_date, nps_score, comments

Data Sources

NovaMart's data lives in five systems. The platform connects to all of them through the Ingestion Service (Airbyte connectors) and the Query Engine (SQL federation).

SourceTypeConnectorSync ModeFrequency
NovaMart PostgreSQLTransactional DBAirbyte PostgreSQL CDCIncremental (WAL)Every 15 min
Snowflake DWHAnalytics WarehouseAirbyte SnowflakeIncremental (timestamp)Hourly
ShopifyE-Commerce APIAirbyte ShopifyIncremental (API cursor)Hourly
Google Analytics / AdsMarketingAirbyte Google AdsFull refreshDaily
CSV FilesManual exportsFile Import (Data Workbench)One-time / on-demandAs needed

Data Flow Architecture

                           NovaMart Data Flow
  ┌──────────────────────────────────────────────────────────────────┐
  │                        DATA SOURCES                             │
  │  ┌───────────┐ ┌───────────┐ ┌───────────┐ ┌────────┐ ┌─────┐  │
  │  │PostgreSQL │ │ Snowflake │ │  Shopify  │ │ Google │ │ CSV │  │
  │  │  (OLTP)   │ │  (DWH)    │ │  (API)    │ │  Ads   │ │     │  │
  │  └─────┬─────┘ └─────┬─────┘ └─────┬─────┘ └───┬────┘ └──┬──┘  │
  └────────┼──────────────┼─────────────┼───────────┼─────────┼─────┘
           │              │             │           │         │
           ▼              ▼             ▼           ▼         ▼
  ┌──────────────────────────────────────────────────────────────────┐
  │                   INGESTION SERVICE (Airbyte)                   │
  │         600+ connectors  |  CDC  |  Schema mapping              │
  └──────────────────────────────┬──────────────────────────────────┘


  ┌──────────────────────────────────────────────────────────────────┐
  │                     PLATFORM DATA LAYER                         │
  │  ┌──────────┐  ┌──────────┐  ┌──────────┐  ┌────────────────┐  │
  │  │ Catalog  │  │  Query   │  │ Quality  │  │  Governance    │  │
  │  │ Service  │  │  Engine  │  │ Service  │  │  Service       │  │
  │  │          │  │ (Trino)  │  │ (GX)     │  │ (masking,ACL)  │  │
  │  └──────────┘  └──────────┘  └──────────┘  └────────────────┘  │
  └──────────────────────────────┬──────────────────────────────────┘

           ┌─────────────────────┼─────────────────────┐
           ▼                     ▼                     ▼
  ┌──────────────┐    ┌──────────────┐     ┌──────────────────┐
  │ ML Workbench │    │ BI Workbench │     │ Agentic Workbench│
  │              │    │              │     │                  │
  │ Models,      │    │ Dashboards,  │     │ NL queries,      │
  │ Experiments, │    │ Semantic     │     │ Multi-agent,     │
  │ Feature      │    │ Layer,       │     │ Workflow         │
  │ Store        │    │ Reports      │     │ Generation       │
  └──────────────┘    └──────────────┘     └──────────────────┘

Business KPIs

NovaMart tracks these key performance indicators across all workbenches and dashboards. Each walkthrough shows how the platform computes, monitors, and acts on these metrics.

KPIDefinitionCurrent ValueTarget
Gross Merchandise Value (GMV)Total sales before returns and discounts$15.2M/month$18M/month
Average Order Value (AOV)Revenue / number of orders$67.40$75.00
Customer Churn Rate% of customers with no purchase in 90 days18.3%< 15%
Customer Lifetime Value (CLTV)Predicted total revenue per customer over 3 years$412$500
Inventory TurnoverCOGS / average inventory value8.2x/year10x/year
Conversion RateOrders / unique sessions3.1%4.0%
Return RateReturns / orders14.6%< 12%
Customer Acquisition Cost (CAC)Marketing spend / new customers acquired$34.20< $30
Return on Ad Spend (ROAS)Revenue from ads / ad spend4.8x6.0x
Net Promoter Score (NPS)Customer satisfaction metric (-100 to 100)4250+

Persona Walkthroughs

Each walkthrough follows one persona through all eight lifecycle stages, using real NovaMart data and scenarios. Start with the role closest to yours, or read all four to see how the platform enables cross-functional collaboration.

WalkthroughPersonaScenarioPrimary Workbenches
Data Scientist JourneyPriya, Senior Data ScientistPredicting customer churn to reduce the 18.3% churn rateML Workbench, Data Workbench
ML Engineer JourneyMarcus, ML EngineerBuilding a production demand forecasting system for 45K SKUsML Workbench, Pipeline Service
BI Lead JourneySofia, BI LeadCreating a real-time revenue command center for the executive teamBI Workbench, Semantic Layer
Executive Leadership JourneyDavid, VP of StrategyUsing AI-assisted analysis for strategic planning and board reportingAgentic Workbench, BI Dashboards

How the Walkthroughs Connect

These four personas work on the same data at NovaMart. Their work products feed into each other:

  Priya (Data Scientist)              Marcus (ML Engineer)
  ┌──────────────────────┐            ┌──────────────────────┐
  │ Churn prediction     │            │ Demand forecasting   │
  │ model (AUC 0.87)     │───────────▶│ pipeline (daily)     │
  │                      │  model     │                      │
  │ Feature engineering  │  registry  │ Ray Serve deployment │
  └──────────┬───────────┘            └──────────┬───────────┘
             │ churn scores                      │ forecasts
             ▼                                   ▼
  ┌──────────────────────┐            ┌──────────────────────┐
  │ Sofia (BI Lead)      │            │ David (VP Strategy)  │
  │                      │◀───────────│                      │
  │ Revenue Command      │ dashboard  │ AI-driven strategic  │
  │ Center dashboards    │ access     │ scenario analysis    │
  │                      │            │                      │
  └──────────────────────┘            └──────────────────────┘

Priya's churn model scores feed into Sofia's customer health dashboards. Marcus's demand forecasts inform inventory decisions that David reviews in strategic planning. The semantic layer ensures all four personas use the same metric definitions.


Prerequisites

Before following these walkthroughs, ensure you have:

  1. A running MATIH Platform instance (see Installation)
  2. The NovaMart sample dataset loaded (available in the platform's sample data catalog)
  3. Completed the Quickstart Tutorials for the workbenches you plan to use

Related Chapters