Part IV · Chapter 10

Supervised Models and Business Evaluation

A model isn't evaluated until its scores meet the firm's cost matrix and ship with a card.

This chapter focuses on building, grading, and shipping the models that fill the predictive task. It opens with logistic regression as a defensible first churn scorer, then assembles the full grading toolkit — confusion matrix, ROC and PR curves, calibration, and lift — culminating in the chart a manager should read first: the threshold-profit curve that puts the firm's own cost matrix on the y-axis. From there it covers numeric prediction graded in business dollars, trees and ensembles for the interactions a linear model misses, and the AutoML-era reality that promotes the manager to task-definer and model-card author. A RentHop case ties it together, turning thousands of messy New York apartment listings into a ranked “Hot listings” queue.

Start reading

Topics covered

log-odds coefficients and odds ratiosPR-AUC vs. ROC-AUC under class imbalancecalibration and lift curvesthe threshold-profit curveMAE, RMSE, and R² for numeric errorresidual diagnostics and heteroskedasticityrandom forests and gradient boostingthe bias-variance trade-offpermutation importance, partial dependence, and SHAPmodel cards as deployment contracts

Topics covered

In this chapter