§0.4
The Data-to-Decision Loop
The modern data system is a loop. Human and machine activity creates records. Records move into storage. Storage feeds transformations, metrics, charts, models, search indexes, and AI workflows. Those evidence assets inform decisions. Decisions change customer experience, operations, pricing, policy, product, staffing, or automation. Those changes create new data. The loop starts again.
The executive question: where are we in the data-to-decision loop?
Most analytics mistakes come from losing track of the loop. A team starts with a model but has not named the action. A dashboard monitors a metric but has no threshold. A causal analysis estimates an effect but does not connect to a decision cadence. An AI workflow answers questions but has no evaluation or escalation path. Each failure is a broken link between data and action.
The data-to-decision loop
Figure 1 is the front-door operating model for the book. Part I begins in the source record. Part II turns records into visual evidence. Part III asks whether an action caused an outcome. Part IV predicts and ranks future cases. Part V brings text, documents, images, embeddings, and language models into the loop. Part VI asks how the organization runs the loop repeatedly without losing ownership, quality, or governance.
Data-driven versus data-decorated
A decision is data-driven only when three things are named:
- The action. A specific lever someone can pull.
- The comparison. What would happen if the firm did not act or acted differently.
- The threshold. The signal, effect size, ROI, quality level, or risk standard required to act.
Anything missing one of these may still be useful description. It may be a good dashboard, a good analysis, or a promising model. But it is not yet a decision.
| Failure pattern | What it looks like | Missing link |
|---|---|---|
| Dashboard without action | Weekly KPI review shows a metric falling | No owner, threshold, drill path, or response playbook |
| Causal claim without counterfactual | Customers who received an email spent more | What those same customers would have spent without the email |
| Prediction without workflow | A churn score ranks customers every Monday | Which offer, threshold, queue, and follow-up action the score triggers |
| AI without evaluation | A chatbot answers from company documents | Grounding tests, refusal rules, escalation, monitoring, and ownership |
The decision ladder
The book's evidence languages climb a ladder of business questions. The lower rungs do not disappear when the upper rungs arrive. A prediction model still depends on a clean target. A causal design still depends on a well-defined unit, timing, and outcome. A language model workflow still needs source data, storage, evaluation, monitoring, and human review.
The decision ladder
The ladder is a routing device:
- What happened? Read the data and define the metric.
- Where and for whom? Visualize, segment, compare, and diagnose.
- What caused it? Construct a credible counterfactual.
- How much does the lever matter? Estimate effects, elasticity, and heterogeneity.
- What is likely next? Predict, rank, and evaluate.
- What does the text, image, or document say? Use unstructured data and AI workflows.
- How do we operate this? Monitor, govern, communicate, and learn.
Six evidence languages
Six evidence languages, one per Part
| Decision question | Evidence language | Part | Studio |
|---|---|---|---|
| What happened? | Description, metrics | I | Data Language Studio (§4.1) |
| What should the eye see first? | Visual evidence | II | Visual Decision Brief (§8.2) |
| What caused it? | Causal designs | III | Pricing & Promotion (§13.4) |
| What is likely next? | Prediction & segmentation | IV | Customer Intelligence (§17.4) |
| What does the text or image say? | AI workflows | V | Customer Voice Intelligence (§22.2) |
| How do we operate this? | System view | VI | Final Integrative Case (§25.1) |
Each Part teaches one evidence language and ends with a Studio that ships its capstone artefact.
The important point is not the numbering. It is the discipline of choosing the evidence language that fits the decision. A dashboard should not be asked to prove causality. A predictive model should not be treated as an intervention. A language model should not be trusted because it is fluent. A causal estimate should not ship if no one knows what action it changes.
The artifacts that survive the work
The book does not end each method with "and now you know the method." It ends with an artifact that a firm can reuse, audit, and refresh.
The artefact family — five one-page documents that survive the work
Each artefact extends the discipline of the one above. The card you write at §9.1 grows into the memo you sign at §24.1.
The artifact family matters because modern analytics is not a sequence of one-off clever analyses. It is infrastructure for repeated decisions. A metric card can be reused in a dashboard. A predictive task contract can be reused by a modeling team. A model card can be used by risk, legal, product, and operations. An AI workflow card can be audited when the workflow changes. A decision memo can show what evidence led to action and how the firm will learn afterward.
The cases
One through-line company, Bean & Basket Coffee, appears throughout the book. Standalone cases add real empirical grounding where a specific method needs a richer dataset.
The case portfolio
A multi-store specialty coffee chain with reviews, tickets, transactions, panel data, campaigns, products, stores, and an internal knowledge base. Appears in every Part.
Standalone cases are appended outside chapter prose. They give the methods a second testing ground beyond the Bean & Basket through-line.
The purpose of the cases is not to decorate chapters. It is to make the loop concrete. Every case asks: what activity generated the data, where is it stored, what evidence language fits the decision, what artifact should survive, and how would the organization monitor what happens next?
Where Part I begins
Part I now begins after the full system is visible. We zoom in from the operating loop to the basic object inside it: a dataset. The first question becomes almost physical: what does one row mean? That question sounds small, but it controls everything that follows. Grain, structure, variable type, joins, reshaping, metrics, and data quality are the mechanics that make the larger loop trustworthy.
Concept check
Three questions about the loop and the evidence language it requires.
- 1.A dashboard shows that customers who received a marketing email spent more than customers who did not. The team wants to roll out the email nationally. What is missing?
- 2.A customer-facing AI assistant answers from policy documents. Which artifact is most natural?
- 3.Why does Part I begin with rows, columns, and grain after the broader Part 0 map?