Part IV · Chapter 9

Predictive Task Design

Get the task contract and the features right, and the algorithm almost picks itself; get them wrong, and no model can save you.

This chapter opens Part IV by writing the prediction problem down honestly — the step where most production failures are born or avoided. It traces the ladder from manager intuition to hand-coded rule to statistical score to machine-learned model, then pins the supervised task to four decisions — target, features, unit, and label timing — condensed into a one-sentence Task Contract. From there it builds the generalization toolkit (random, time-based, and group splits, plus cross-validation) alongside a gallery of leakage traps, and closes on feature engineering, where a manager's domain knowledge actually enters the model. The Bean & Basket churn model runs throughout as a reminder that the human leverage has migrated from picking algorithms to defining the task.

Start reading

Topics covered

the rules-to-algorithms ladderthe predictive-modeling lifecyclethe Task Contract (unit, target, horizon, feature cut-off)label-timing rules and horizon leakagetrain/test splits (random, time-based, group)cross-validation and stable estimatesthe data-leakage galleryoverfitting vs. underfittingRFM and engagement feature catalogs

Topics covered

In this chapter