Simple linear regression models the relationship between one predictor and one outcome. In practice, business outcomes are rarely driven by a single factor. Multiple regression extends the model to include two or more predictors, allowing us to estimate each predictor's effect while holding the others constant.
The predicted value of the dependent variable is a linear combination of all predictors:
ŷ = b0 + b1x1 + b2x2 + … + bkxk

where b0 is the intercept, b1, b2, …, bk are the partial regression coefficients, and x1, x2, …, xk are the predictor variables. In Excel, the coefficients come from the LINEST() array formula or from Data Analysis ToolPak > Regression.
Each coefficient bj represents the expected change in the dependent variable for a one-unit increase in xj, holding all other predictors constant. This "holding constant" interpretation is what distinguishes multiple regression from running separate simple regressions.
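For readers working outside Excel, a minimal Python sketch of fitting a multiple regression by ordinary least squares is shown below; the data, variable names, and the use of NumPy's lstsq in place of LINEST() are illustrative assumptions, not part of the chapter's dataset.

import numpy as np

# Hypothetical data: rows are observations, columns are the predictors x1 and x2.
X = np.array([[10.0, 3], [12.0, 4], [15.0, 4], [18.0, 5], [20.0, 6], [22.0, 6]])
y = np.array([120.0, 135.0, 150.0, 172.0, 188.0, 195.0])  # outcome variable

# Prepend a column of ones so the first fitted coefficient is the intercept b0.
A = np.column_stack([np.ones(len(X)), X])
coefs, *_ = np.linalg.lstsq(A, y, rcond=None)
b0, b1, b2 = coefs
print(f"intercept = {b0:.2f}, b1 = {b1:.2f}, b2 = {b2:.2f}")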
NorthStar wants to predict quarterly revenue (in thousands) using four predictors: advertising spend (x1, in thousands), number of salespeople (x2), average product price (x3, in dollars), and an economic index (x4, 0–100 scale). Fitting the model to 20 quarters of historical data yields:
Revenue = −120 + 2.8(Ad Spend) + 340(Salespeople) + 0.95(Avg Price) + 15.2(Econ Index)
The coefficient b2 = 340 means that each additional salesperson is associated with a $340,000 increase in quarterly revenue, holding the other three predictors constant.
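To see how the fitted equation is used for prediction, the short sketch below plugs one quarter's values into NorthStar's equation; the predict_revenue name and the sample inputs are hypothetical.

def predict_revenue(ad_spend, salespeople, avg_price, econ_index):
    # Coefficients from NorthStar's fitted model; the result is in $ thousands.
    return -120 + 2.8 * ad_spend + 340 * salespeople + 0.95 * avg_price + 15.2 * econ_index

# Hypothetical quarter: $200k ad spend, 8 salespeople, $55 average price, index of 70.
print(predict_revenue(200, 8, 55, 70))  # about 4276, i.e. roughly $4.28M predicted revenue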
R² (coefficient of determination) measures the proportion of variance in the dependent variable explained by the set of predictors. However, R² has a fundamental flaw: it never decreases when you add a predictor, even if that predictor is useless noise.
Adjusted R² penalizes the model for each additional predictor, only increasing when a new predictor improves the model more than expected by chance alone.
Adjusted R² = 1 − (1 − R²)(n − 1) / (n − k − 1)

where R² is the unadjusted coefficient of determination, n is the sample size, and k is the number of predictors. In Excel: =1-(1-RSQ)*(n-1)/(n-k-1), or read it directly from the Regression output.
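The adjustment is also easy to script; a minimal Python helper, shown here with arbitrary example values, might look like this.

def adjusted_r2(r2, n, k):
    # Penalize R-squared for the number of predictors k, given sample size n.
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

print(adjusted_r2(0.75, 30, 3))  # example values chosen only for illustration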
NorthStar's 4-predictor model yields R² = 0.81 and Adjusted R² = 0.76. An analyst adds a fifth predictor (average employee tenure), and R² rises slightly to 0.812, but Adjusted R² drops to about 0.74.
The decline in Adjusted R² signals that the fifth predictor adds complexity without meaningful explanatory power. NorthStar should keep the simpler 4-predictor model.
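Plugging the chapter's figures (n = 20 quarters) straight into the Adjusted R² formula confirms the comparison.

print(1 - (1 - 0.810) * (20 - 1) / (20 - 4 - 1))  # 4 predictors: about 0.76
print(1 - (1 - 0.812) * (20 - 1) / (20 - 5 - 1))  # 5 predictors: about 0.74, despite the higher R-squared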
When two or more predictors are highly correlated with each other, the regression suffers from multicollinearity. This inflates the standard errors of the coefficients, making individual predictors appear non-significant even when the overall model is strong.
The VIF quantifies how much the variance of a coefficient is inflated due to collinearity with other predictors. It is calculated by regressing each predictor on all other predictors.
VIFi = 1 / (1 − Ri²)

where Ri² is the R-squared from regressing predictor xi on all other predictors; in Excel, =1/(1-RSQ(auxiliary)). A VIF of 1 means no collinearity; a VIF above 5 (or, by a looser convention, 10) signals problematic multicollinearity.
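For analysts working in Python rather than Excel, a rough sketch of the VIF calculation by auxiliary regressions could look like the following; the vif function and the simulated data (with marketing built to be nearly collinear with ad spend) are assumptions for illustration, not the chapter's NorthStar dataset.

import numpy as np

def vif(X):
    # One VIF per column of X (rows = observations, columns = predictors).
    n, k = X.shape
    vifs = []
    for i in range(k):
        target = X[:, i]
        others = np.delete(X, i, axis=1)
        A = np.column_stack([np.ones(n), others])      # auxiliary design matrix
        coefs, *_ = np.linalg.lstsq(A, target, rcond=None)
        resid = target - A @ coefs
        r2_aux = 1 - (resid @ resid) / ((target - target.mean()) @ (target - target.mean()))
        vifs.append(1 / (1 - r2_aux))                  # VIF_i = 1 / (1 - R_i^2)
    return vifs

# Simulated predictors: marketing is deliberately almost a copy of ad spend.
rng = np.random.default_rng(0)
ad_spend = rng.normal(100, 20, 40)
marketing = 0.9 * ad_spend + rng.normal(0, 5, 40)
econ_index = rng.normal(50, 10, 40)
print(vif(np.column_stack([ad_spend, marketing, econ_index])))  # first two VIFs come out high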
NorthStar's analyst notices that advertising spend (x1) and marketing budget (a new candidate predictor) are highly correlated — both measure how much the company invests in promotion. An auxiliary regression of advertising spend on marketing budget yields R² = 0.917.
VIF = 1 / (1 − 0.917) = 12.0
With VIF = 12, including both predictors severely inflates the standard errors. The analyst drops marketing budget and retains advertising spend, which has a clearer business interpretation.
Multiple regression is powerful but requires careful model building. Always check VIF for multicollinearity, compare R² vs. Adjusted R² when adding predictors, and remember that each coefficient represents the effect of that predictor holding all others constant. Adding more predictors is not always better; parsimony and interpretability matter.
This chapter introduced multiple regression, the distinction between R² and Adjusted R², and the problem of multicollinearity.
Multiple Regression: Predicts an outcome from multiple predictors. Each coefficient measures the effect of one predictor, holding all others constant.
Adjusted R²: Penalizes for adding useless predictors. Use it to compare models with different numbers of predictors.
Multicollinearity: When predictors are highly correlated, VIF > 5 or 10 signals problems. Address by removing or combining correlated variables.