Skip to content

Probability, AUC, and Excel Linest Function :Mastering Data Analysis in Excel (Excel to MySQL: Analytic Techniques for Business Specialization) Answers 2025

  1. Question 1
    Approximate probability a new outcome falls in bin 1 ([-3,-1)):

✔️ 4%
❌ .4
❌ 5
❌ 5%


  1. Question 2
    Skewness of Uniform[47, 51.5]:

✔️ 0
❌ 2.17
❌ 1.69
❌ 49.25


  1. Question 3
    Entropy of Uniform[-12, 20] (in bits):

✔️ 5 bits
❌ 6 bits
❌ 4 bits
❌ 3 bits


  1. Question 4
    Gaussian mean=3, sd=4. Value f(3.5):

✔️ 0.099
❌ 4.05
❌ .352
❌ .550


  1. Question 5
    Gaussian CDF at x = 7 (mean=3, sd=4):

✔️ 0.841
❌ .960
❌ .060
❌ 1.00


  1. Question 6
    If modification factor changes from -1 to -2, change in AUC:

✔️ The area increases
❌ No change
❌ The area decreases


  1. Question 7
    Threshold (row 10) that gives lowest cost per event after change:

✔️ 1.3
❌ .9
❌ 3.5
❌ .45


  1. Question 8
    Model AUC .91 on 200, AUC .5 on 5000 → most likely:

✔️ The model overfit the training set data and will need to be improved to work better on the new data.
❌ The original model identified signal as noise and has no predictive value on new data.
❌ The original model is expected to perform worse on test set data and is functioning acceptably.


  1. Question 9
    Which statement is FALSE about adding age-in-months x(7) when beta(1)=0.4 for age in years?

✔️ If the x(1) data are removed, the new beta(7) on the new x(7) data will be 0.4. (This is FALSE.)
❌ Using Excel linest, and including x(1) and x(7) data, the new beta(7) on the age in months will be 0.
❌ If the x(1) data are removed, the new beta(7) on the new x(7) data will be .033


  1. Question 10
    Correlation R for the regression shown in the example:

✔️ 0.606
❌ 0.367
❌ 0.778 or -0.778


🧾 Summary Table

Q No. Correct Answer
1 4%
2 0
3 5 bits
4 0.099
5 0.841
6 The area increases
7 1.3
8 Model overfit the training set
9 Statement: “If x1 removed, beta7 = 0.4” — FALSE
10 0.606