What it is
The Foundation Sprint is a short implementation phase that fixes the highest-impact blockers identified in the scan, so that reporting, insights, and AI work can move forward without constant friction. The goal is to deliver a “gold dataset”: a reliable, well-defined dataset that teams can use for dashboards, analysis, and future modeling.
What is a “gold dataset”?
A gold dataset is a curated, trusted dataset built from the most important sources. It has:
It’s the layer that prevents “multiple versions of the truth.”
How it works (3 steps)
1. Select the target outcome
Agree on the primary use for the gold dataset (for example, dashboards, analysis, or modeling).
2. Build the minimal reliable pipeline
Extract → clean → standardize → join → validate.
Focus on a small number of key tables and fields that unlock value quickly.
3. Deliver + handover
Document the definitions, the validation checks, and how to maintain the dataset going forward.
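The pipeline in the second step (extract → clean → standardize → join → validate) can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the sprint's actual implementation: the learner and completion tables, their field names, and the two validation checks are all hypothetical stand-ins for whatever sources and rules a given sprint selects.

```python
from datetime import date

# Extract: hypothetical raw rows standing in for two source systems.
RAW_LEARNERS = [
    {"learner_id": " 001 ", "email": "Ana@Example.com ", "country": "us"},
    {"learner_id": "002", "email": "bo@example.com", "country": "US"},
    {"learner_id": "002", "email": "bo@example.com", "country": "US"},  # duplicate
    {"learner_id": "", "email": "ghost@example.com", "country": "DE"},  # missing key
]
RAW_COMPLETIONS = [
    {"learner_id": "001", "course": "SQL 101", "completed_on": "2024-03-01"},
    {"learner_id": "002", "course": "SQL 101", "completed_on": "2024-03-05"},
    {"learner_id": "999", "course": "SQL 101", "completed_on": "2024-03-09"},  # no matching learner
]


def clean(rows, required):
    """Trim whitespace and drop rows that are missing a required field."""
    trimmed = [{k: v.strip() for k, v in row.items()} for row in rows]
    return [row for row in trimmed if all(row[f] for f in required)]


def dedupe(rows, key_fields):
    """Keep the first row seen for each combination of key fields."""
    seen, out = set(), []
    for row in rows:
        key = tuple(row[f] for f in key_fields)
        if key not in seen:
            seen.add(key)
            out.append(row)
    return out


def standardize_learner(row):
    """Lower-case emails and upper-case country codes."""
    return {**row, "email": row["email"].lower(), "country": row["country"].upper()}


def standardize_completion(row):
    """Parse ISO date strings into date objects."""
    return {**row, "completed_on": date.fromisoformat(row["completed_on"])}


def join(learners, completions):
    """Inner join on learner_id; also return completions with no matching learner."""
    by_id = {learner["learner_id"]: learner for learner in learners}
    joined, orphans = [], []
    for completion in completions:
        learner = by_id.get(completion["learner_id"])
        if learner is None:
            orphans.append(completion)
        else:
            joined.append({**learner, **completion})
    return joined, orphans


def validate(rows):
    """Return a list of problems found; an empty list means every check passed."""
    problems = []
    keys = [(r["learner_id"], r["course"]) for r in rows]
    if len(keys) != len(set(keys)):
        problems.append("duplicate (learner_id, course) rows")
    if any("@" not in r["email"] for r in rows):
        problems.append("malformed email")
    return problems


learners = [standardize_learner(r)
            for r in dedupe(clean(RAW_LEARNERS, ["learner_id"]), ["learner_id"])]
completions = [standardize_completion(r)
               for r in clean(RAW_COMPLETIONS, ["learner_id", "completed_on"])]
gold, orphans = join(learners, completions)
problems = validate(gold)
```

Returning the unmatched completions separately, rather than silently dropping them, is a deliberate choice in this sketch: unmatched keys are often exactly the kind of blocker worth surfacing during handover.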
What gets delivered
Gold dataset (v1)
Cleaned and joined dataset ready for reporting and analysis.
Metric definitions (“single source of truth”)
KPI dictionary + logic (e.g., “active learner”, “completion”, “customer”).
Join map + data model (practical)
How the main entities connect end-to-end.
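As a rough illustration of the last two deliverables, a KPI dictionary and a join map can both be kept as small, reviewable data structures next to the dataset. Everything specific in this sketch is an assumption made for the example: the 30-day window for “active learner”, the owner name, and the table and key names in the join map are hypothetical placeholders, not definitions taken from a real sprint.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Metric:
    name: str
    definition: str  # plain-language wording agreed with stakeholders
    owner: str


# Hypothetical KPI dictionary entry; the wording and window are placeholders.
KPI_DICTIONARY = {
    "active_learner": Metric(
        name="Active learner",
        definition="Learner with at least one activity in the last 30 days",
        owner="Learning team",
    ),
}

# Hypothetical join map: (left table, right table) -> shared key column.
JOIN_MAP = {
    ("customers", "learners"): "customer_id",
    ("learners", "enrollments"): "learner_id",
    ("enrollments", "completions"): "enrollment_id",
}


def is_active_learner(last_activity, today, window_days=30):
    """Executable form of the 'active_learner' definition above."""
    if last_activity is None:
        return False
    return 0 <= (today - last_activity).days <= window_days


def join_path(start, end):
    """Walk the join map from start to end, listing each hop and its key."""
    path, current = [], start
    while current != end:
        step = next((pair for pair in JOIN_MAP if pair[0] == current), None)
        if step is None:
            raise ValueError(f"no join from {current!r} towards {end!r}")
        path.append((step[0], step[1], JOIN_MAP[step]))
        current = step[1]
    return path
```

For example, `join_path("customers", "completions")` lists the three hops and the key used at each one, which is the end-to-end connection the join map is meant to document. Keeping the metric logic as a function beside its plain-language definition makes it harder for the two to drift apart.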