Skip to content

Module 3 challenge :Regression Analysis: Simplify Complex Data Relationships (Google Advanced Data Analytics Professional Certificate) Answers 2025

1. Regression with 1 dependent + 3 independent variables

Multiple linear regression
❌ One hot encoding
❌ Simple linear regression
❌ Interaction terms


2. Which are categorical variables?

Shirt country of manufacture
Shirt size
Shirt type
❌ Shirt inventory (numeric → NOT categorical)


3. No two independent variables are highly correlated — testing which assumption?

No multicollinearity assumption
❌ No homoscedasticity
❌ No linearity
❌ No normality


4. Interaction terms affect changes in the _____ of the dependent variable.

mean
❌ assumption
❌ category
❌ multicollinearity


5. Adjusted R² — correct statements:

It can vary from 0 to 1.
It penalizes unnecessary explanatory variables.
It is a regression evaluation metric.
❌ It is greater than 1.


6. Stepwise method starting with full model?

Backward elimination
❌ Extra-sum-of-squares F-test
❌ Forward selection
❌ Overfit selection


7. Performs well on training but poorly on unseen data = too much?

Variance
❌ Bias
❌ Entropy
❌ Leakage


8. Which regularization removes variables completely?

Lasso regression
❌ Elastic net regression
❌ Ridge regression
❌ Independent regression


9. Suitable for multiple linear regression?

Utility company predicting electricity usage (many predictors)
Marketing team predicting sales from multiple inputs
❌ Educational researcher (only 1 predictor → simple regression)
❌ Bank fraud detection (classification, not regression)


10. Higher residuals for women → violates which assumption?

Homoscedasticity (unequal variance across groups)
❌ Linearity
❌ Independent observations
❌ No multicollinearity


🧾 Summary Table of All Answers

Q No. Correct Answer(s) Incorrect Options
1 Multiple linear regression One hot encoding, Simple linear regression, Interaction terms
2 Country of manufacture, Shirt size, Shirt type Shirt inventory
3 No multicollinearity assumption Homoscedasticity, Linearity, Normality
4 Mean Assumption, Category, Multicollinearity
5 Can vary 0–1; Penalizes variables; Regression metric Greater than 1
6 Backward elimination F-test, Forward selection, Overfit selection
7 Variance Bias, Entropy, Leakage
8 Lasso regression Elastic net, Ridge, Independent regression
9 Utility usage; Marketing sales Student study hours; Fraud detection
10 Homoscedasticity Linearity, Independent, Multicollinearity