// Project 01 · 2022–2024
Enterprise Cloud Modernization Program
// Headline number
~1 TB
processed / day
// Architecture
[Architecture diagram: 9 nodes, 9 edges]
Company-wide migration from Alteryx / MySQL to a cloud-native GCP stack. Started as a POC, became the production architecture of a $100M+ org.
// The problem
Alteryx / MySQL couldn't keep up. ETL jobs ran for hours, with no visibility into runs. Every new pipeline was bottlenecked on engineering.
// My approach
Metadata-driven GCP architecture: BigQuery + Cloud Composer + Dataproc + PySpark, all managed as infrastructure as code with Terraform. Analysts add pipelines via config, with no engineering tickets. The migration was incremental, so production never stopped.
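A minimal sketch of what "pipelines via config" could look like, assuming a simple dict-based config. The field names (`name`, `source_table`, `target_table`, `engine`, `schedule`), the `PipelineConfig` class, and the `parse_config` / `build_task_plan` helpers are hypothetical illustrations, not the program's actual schema. In practice a plan like this would likely feed a DAG factory in Cloud Composer.

```python
from __future__ import annotations

from dataclasses import dataclass

# Hypothetical engine names; the real program's options are not documented here.
SUPPORTED_ENGINES = {"bigquery", "pyspark"}
REQUIRED_KEYS = {"name", "source_table", "target_table", "engine"}


@dataclass(frozen=True)
class PipelineConfig:
    """One analyst-authored pipeline entry (illustrative schema)."""
    name: str
    source_table: str
    target_table: str
    engine: str
    schedule: str = "@daily"


def parse_config(raw: dict) -> PipelineConfig:
    """Validate a raw config dict and return a typed config object."""
    missing = REQUIRED_KEYS - raw.keys()
    if missing:
        raise ValueError(f"missing required keys: {sorted(missing)}")
    if raw["engine"] not in SUPPORTED_ENGINES:
        raise ValueError(f"unsupported engine: {raw['engine']!r}")
    return PipelineConfig(**raw)


def build_task_plan(cfg: PipelineConfig) -> list[str]:
    """Expand a config into ordered task IDs an orchestrator could schedule."""
    if cfg.engine == "bigquery":
        transform = "run_bigquery_sql"
    else:
        transform = "submit_dataproc_job"
    steps = ("check_source", transform, "validate_target")
    return [f"{cfg.name}.{step}" for step in steps]


# Example: an analyst adds one config entry; no pipeline code is written.
example = parse_config({
    "name": "daily_sales",
    "source_table": "raw.sales",
    "target_table": "mart.sales_daily",
    "engine": "pyspark",
})
plan = build_task_plan(example)
```

The point of the design is that validation and task expansion live in one shared code path, so adding a pipeline means adding data rather than code.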
// Stack
// Outcome
- Runtime cut from hours to minutes
- ~1 TB processed daily
- Right-sized Dataproc clusters reduced cost
- Self-service onboarding for non-engineering teams