Low Stakes Stats · Chapter 2

Two predictors
are better than one

Runs don't just come from home runs — walks put runners on base too. Multiple linear regression lets you use several predictors at once. Same idea as Chapter 1, one step up. First, the short primer.

Start the primer arrow_downward Skip to the example
Two predictors feeding into an outcome
The short primer

From one predictor to many

Five ideas — most of them you already know from Chapter 1.

Two inputs feeding one output
sports_baseball

Two predictors

x?, x₂

Now we use two inputs: home runs (x?) and walks (x₂). Both are numeric, both plausibly drive scoring. MLR handles any number of predictors — we'll use two to keep the arithmetic readable.

A runner and scoreboard
scoreboard

One outcome

y

The response is still a single number — runs scored (y). We're asking how runs depend on home runs and walks together.

A fitted line
functions

The model

One predictor gave a line. Two predictors give a tilted plane: y = β₀ + β₁x? + β₂x₂ + ε. Each β is the effect of its own variable.

One slider moving, one locked
lock

"Holding the others constant"

This is the one genuinely new idea. β₁ is the effect of home runs while walks stay fixed — the extra scoring from a home run that walks can't already explain. Every coefficient is read this way.

A design matrix
grid_on

What you'll solve for

The coefficients come from one matrix formula, β̂ = (XᵀX)⁻¹Xᵀy — the grown-up version of Chapter 1's Sxy/Sxx. Then you judge the model with adjusted R² and an F-test.

table_chart

The data you'll work with

TeamHome runs (x?)Walks (x₂)Runs (y)
Yankees245520690
Dodgers221540696
Red Sox198505652
Astros214560706
Cubs177480623
Padres162500632

Three columns now: two predictors and one outcome. This teaching sample is small on purpose; the workflow is what transfers.

Ready? Let's fit the plane.

We'll set up the matrices, read the coefficients, and judge the model — step by step.

Open the worked example arrow_forward