Course overview

How to Design Data Analytics & Decision Systems

53 modules
214 lessons
—
Part 1

Appendices

  1. Appendix A - Diagram Templates by StepSign in

  2. Appendix B - Mapping Concepts to Real-World Data StacksSign in

  3. Appendix C - Readiness Checklists (Step N -> Step N+1)Sign in

  4. Appendix D - Glossary (Canonical Definitions)Sign in

Part 2

Course Setup and the Incremental Ladder

  1. Course Setup and the Incremental LadderSign in

  2. Why "Facts to Forecasts"Sign in

  3. How to Use This CourseSign in

  4. The Incremental Ladder (Step 0 -> Step 7)Sign in

  5. The Course LensesSign in

  6. Diagram Legend and Notation TypesSign in

Part 3

What Is a Data Analytics & Decision System?

  1. What Is a Data Analytics & Decision System?Sign in

  2. Reports vs Analytics vs Decision SystemsSign in

  3. Descriptive to Prescriptive ThinkingSign in

  4. The Decision LoopSign in

Part 4

Data as a Model of the World

  1. Data as a Model of the WorldSign in

  2. Entities, Events, RelationshipsSign in

  3. Operational vs Analytical ViewsSign in

  4. Domain AlignmentSign in

Part 5

Facts, Metrics, and Dimensions

  1. Facts, Metrics, and DimensionsSign in

  2. Facts and DimensionsSign in

  3. Events as the Building BlockSign in

  4. From Facts to MetricsSign in

Part 6

People and Roles in Data Systems

  1. People and Roles in Data SystemsSign in

  2. Producer RolesSign in

  3. Modeling and Analysis RolesSign in

  4. Decision-Makers and ResponsibilitiesSign in

Part 7

Diagramming Data Analytics Systems

  1. Diagramming Data Analytics SystemsSign in

  2. End-to-End Flow DiagramsSign in

  3. Lineage and Dependency GraphsSign in

  4. Decision Flow MapsSign in

Part 8

Step 0 Data Modeling for Analytics

  1. Step 0 Data Modeling for AnalyticsSign in

  2. Entities vs EventsSign in

  3. Identifiers, Keys, RelationshipsSign in

  4. Slowly Changing DimensionsSign in

Part 9

Step 0 Descriptive Statistics

  1. Step 0 Descriptive StatisticsSign in

  2. Core AggregationsSign in

  3. Distributions and PercentilesSign in

  4. Variability and TrendsSign in

Part 10

Step 0 Analytical Queries

  1. Step 0 Analytical QueriesSign in

  2. Group-By ThinkingSign in

  3. Joins for MeaningSign in

  4. Rolling Windows and ComparisonsSign in

Part 11

Step 0 Common Analytical Patterns

  1. Step 0 Common Analytical PatternsSign in

  2. Cohorts and FunnelsSign in

  3. Retention and ChurnSign in

  4. Conversion and AttributionSign in

Part 12

Step 0 First Analytics Environment

  1. Step 0 First Analytics EnvironmentSign in

  2. A Minimal Analytical DatabaseSign in

  3. Core Tables and ViewsSign in

  4. From Manual Queries to First DashboardsSign in

Part 13

ETL vs ELT (Conceptual)

  1. ETL vs ELT (Conceptual)Sign in

  2. ETL and ELT PrimitivesSign in

  3. Modern ELT ContextSign in

  4. Trade-OffsSign in

Part 14

Sources of Data

  1. Sources of DataSign in

  2. Operational Databases and APIsSign in

  3. Event Streams and LogsSign in

  4. External and Third-Party DataSign in

Part 15

Batch vs Streaming Pipelines (Conceptual)

  1. Batch vs Streaming Pipelines (Conceptual)Sign in

  2. Batch IngestionSign in

  3. Streaming and Near-Real-TimeSign in

  4. Selecting the ModeSign in

Part 16

Pipeline Orchestration

  1. Pipeline OrchestrationSign in

  2. DAGs of Tasks and JobsSign in

  3. Scheduling, Dependencies, RetriesSign in

  4. Failures and IdempotencySign in

Part 17

Data Cleaning and Transformation

  1. Data Cleaning and TransformationSign in

  2. Type Normalization and UnitsSign in

  3. Deduplication and ReconciliationSign in

  4. Missing and Noisy DataSign in

Part 18

Data Quality and Validation

  1. Data Quality and ValidationSign in

  2. Expectations and RulesSign in

  3. Checks at Ingestion vs in the WarehouseSign in

  4. Alerting and Health ReportingSign in

Part 19

Analytical Storage Architectures

  1. Analytical Storage ArchitecturesSign in

  2. WarehousesSign in

  3. Lakes and Lakehouse ConceptsSign in

  4. Selecting an Analytical Storage ArchitectureSign in

Part 20

Schemas for Analytics

  1. Schemas for AnalyticsSign in

  2. Dimensional ModelsSign in

  3. Fact and Dimension Tables in PracticeSign in

  4. Normalized vs Denormalized Trade-OffsSign in

Part 21

Tables, Views, and Semantic Layers

  1. Tables, Views, and Semantic LayersSign in

  2. Raw vs Curated LayersSign in

  3. Views and Semantic LayersSign in

  4. Logical vs Physical ModelsSign in

Part 22

Partitioning, Clustering, and Performance

  1. Partitioning, Clustering, and PerformanceSign in

  2. Partitioning StrategiesSign in

  3. Indexing and Clustering ConceptsSign in

  4. Cost and PerformanceSign in

Part 23

Governance, Access, and Security

  1. Governance, Access, and SecuritySign in

  2. Access Control BoundariesSign in

  3. Masking and AnonymizationSign in

  4. Privacy and Compliance PostureSign in

Part 24

Multi-Environment and Multi-Region Warehousing

  1. Multi-Environment and Multi-Region WarehousingSign in

  2. Dev, Stage, and Prod for DataSign in

  3. Residency and Multi-Region ConstraintsSign in

  4. Replication and DR ConceptsSign in

Part 25

BI Tools and Visualization

  1. BI Tools and VisualizationSign in

  2. What BI Tools ProvideSign in

  3. Choosing Chart TypesSign in

  4. Visual Storytelling PatternsSign in

Part 26

Dashboards, Reports, and Ad Hoc Analysis

  1. Dashboards, Reports, and Ad Hoc AnalysisSign in

  2. Dashboard TypesSign in

  3. Static Reports vs Interactive ExplorationSign in

  4. Drill-Down and Cross-FilteringSign in

Part 27

Semantic Metrics Layer

  1. Semantic Metrics LayerSign in

  2. Define Metrics OnceSign in

  3. Preventing Metric FragmentationSign in

  4. Serving Metrics to Tools and SystemsSign in

Part 28

Designing Dashboards for Decisions

  1. Designing Dashboards for DecisionsSign in

  2. Leading vs Lagging IndicatorsSign in

  3. Guardrails and Trade-OffsSign in

  4. Owners and DecisionsSign in

Part 29

Self-Service Data Culture

  1. Self-Service Data CultureSign in

  2. Analyst vs Self-Serve FlowsSign in

  3. Templates, Curated Datasets, TrainingSign in

  4. Empowerment with GovernanceSign in

Part 30

BI Platform Operations

  1. BI Platform OperationsSign in

  2. Content Sprawl and StalenessSign in

  3. Performance Tuning for BI WorkloadsSign in

  4. Monitoring Usage and UsefulnessSign in

Part 31

Why Experimentation?

  1. Why Experimentation?Sign in

  2. Correlation vs CausationSign in

  3. Experiments as Intervention TestsSign in

  4. When Not to ExperimentSign in

Part 32

Experiment Design Basics

  1. Experiment Design BasicsSign in

  2. Treatment and ControlSign in

  3. Randomization and AssignmentSign in

  4. Sample Size and Duration IntuitionSign in

Part 33

Experimentation System Architecture

  1. Experimentation System ArchitectureSign in

  2. Exposure Logging and Event CaptureSign in

  3. Experiment Configuration SurfacesSign in

  4. Bucketing, Flags, and Rollout ControlsSign in

Part 34

Metrics for Experiments

  1. Metrics for ExperimentsSign in

  2. Primary vs Secondary MetricsSign in

  3. Guardrail Metrics for SafetySign in

  4. Metric Alignment with DecisionsSign in

Part 35

Analyzing Experiments (Conceptual)

  1. Analyzing Experiments (Conceptual)Sign in

  2. Differences Between GroupsSign in

  3. Confidence and UncertaintySign in

  4. Overlaps and Multiple ExperimentsSign in

Part 36

Operationalizing Experimentation

  1. Operationalizing ExperimentationSign in

  2. Experiment Review ProcessesSign in

  3. Libraries of LearningsSign in

  4. Integrating into RhythmsSign in

Part 37

From Descriptions to Predictions

  1. From Descriptions to PredictionsSign in

  2. Prediction-Shaped QuestionsSign in

  3. Supervised Learning at a High LevelSign in

  4. Prediction as a Decision InputSign in

Part 38

Data for Predictive Models

  1. Data for Predictive ModelsSign in

  2. Feature Engineering ConceptsSign in

  3. Splits and LeakageSign in

  4. Drift ConceptsSign in

Part 39

Evaluating Predictive Models

  1. Evaluating Predictive ModelsSign in

  2. Accuracy, Error, CalibrationSign in

  3. Precision vs Recall Trade-OffsSign in

  4. Offline vs Online EvaluationSign in

Part 40

Forecasting Time-Series

  1. Forecasting Time-SeriesSign in

  2. Time-Series StructureSign in

  3. Horizons and GranularitySign in

  4. Uncertainty and IntervalsSign in

Part 41

Model Serving and Integration

  1. Model Serving and IntegrationSign in

  2. Batch vs Real-Time ScoringSign in

  3. Feature Stores ConceptuallySign in

  4. Wiring Predictions into FlowsSign in

Part 42

Monitoring Models in Production

  1. Monitoring Models in ProductionSign in

  2. Performance Over TimeSign in

  3. Drift, Label Delay, Feedback GapsSign in

  4. Retraining and Lifecycle ManagementSign in

Part 43

Decisions and Decision Frameworks

  1. Decisions and Decision FrameworksSign in

  2. Repeated vs One-Off DecisionsSign in

  3. Thresholds, Playbooks, PoliciesSign in

  4. Human vs Automated BoundariesSign in

Part 44

Decision Rules and Policy Engines

  1. Decision Rules and Policy EnginesSign in

  2. Rules Engines ConceptuallySign in

  3. Combining Metrics and Model OutputsSign in

  4. Versioning and Auditing RulesSign in

Part 45

Resource Allocation and Optimization (High-Level)

  1. Resource Allocation and Optimization (High-Level)Sign in

  2. Allocation ProblemsSign in

  3. Optimization as Constraint SatisfactionSign in

  4. Heuristics vs Exact MethodsSign in

Part 46

Human-in-the-Loop Decision Systems

  1. Human-in-the-Loop Decision SystemsSign in

  2. Recommendations vs Auto-ApprovalSign in

  3. Escalations, Overrides, and AnnotationsSign in

  4. UX for Decision MakersSign in

Part 47

Observability and Feedback Loops for Decisions

  1. Observability and Feedback Loops for DecisionsSign in

  2. Logging Decisions and OutcomesSign in

  3. Post-Decision Analysis and RecalibrationSign in

  4. Continuous Improvement LoopsSign in

Part 48

End-to-End Analytics Platform Architecture

  1. End-to-End Analytics Platform ArchitectureSign in

  2. End-to-End Flow in a Platform ArchitectureSign in

  3. Logical vs Physical ComponentsSign in

  4. Control Planes and Platform ControlSign in

Part 49

Multi-Tenant and Multi-Domain Data Platforms

  1. Multi-Tenant and Multi-Domain Data PlatformsSign in

  2. Multi-Team Platforms and Shared Failure DomainsSign in

  3. Data Contracts and Domain OwnershipSign in

  4. Autonomy vs Standardization in Multi-Domain PlatformsSign in

Part 50

Governance, Ethics, and Compliance

  1. Governance, Ethics, and ComplianceSign in

  2. Access, Privacy, Usage Boundaries as Platform PropertiesSign in

  3. Bias and Fairness Concerns in Data and Decision SystemsSign in

  4. Documentation, Audits, and Accountability for Reviewable BehaviorSign in

Part 51

Operating Analytics and Decision Systems

  1. Operating Analytics and Decision SystemsSign in

  2. SLOs for Pipelines, Warehouses, and BISign in

  3. Incident Handling and Data Outage PlaybooksSign in

  4. Capacity Planning and Cost ManagementSign in

Part 52

Organizational Adoption and Data Culture

  1. Organizational Adoption and Data CultureSign in

  2. Maturity Transitions - From Reports to Decision SystemsSign in

  3. Training, Documentation, and Evangelism as System InterfacesSign in

  4. Incentives and Alignment - Rewarding Correct DecisionsSign in

Part 53

Reference Architectures and Maturity Models

  1. Reference Architectures and Maturity ModelsSign in

  2. Early Stage Reference Architecture - One Database, One Truth BoundarySign in

  3. Growth Stage Reference Architecture - From Single Slice to Shared PipelinesSign in

  4. Mature Stage Reference Architecture - Unified Semantics, Decisions, and GovernanceSign in