Programming Assignment 1: Quiz: R Programming (Data Science Specialization): Answers 2025
Question 1
pollutantmean("specdata", "sulfate", 1:10)
✅ 4.064
❌ 6.026
❌ 3.782
❌ 3.666
❌ 4.868
❌ 6.545
Explanation:
The function pools every valid sulfate observation from monitors 1–10 and takes a single mean over them. After excluding NAs, that mean rounds to 4.064.
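To make the pooling step concrete, here is a minimal sketch of how such a function could be written. It is not the official solution: the file layout (one CSV per monitor, named 001.csv, 002.csv, and so on) is an assumption, and the toy data below is invented purely for illustration.

```r
# Sketch only: assumes one CSV per monitor, named 001.csv, 002.csv, ...,
# each with a column for the requested pollutant.
pollutantmean <- function(directory, pollutant, id = 1:332) {
  files <- file.path(directory, sprintf("%03d.csv", id))
  # Pool the pollutant column from every requested monitor into one vector.
  values <- unlist(lapply(files, function(f) read.csv(f)[[pollutant]]))
  # One mean over the pooled values, with NAs dropped.
  mean(values, na.rm = TRUE)
}

# Toy data, invented for illustration (not the real specdata).
demo_dir <- tempfile("specdata_demo_")
dir.create(demo_dir)
write.csv(data.frame(sulfate = c(1, NA), nitrate = c(0.5, 0.7), ID = 1),
          file.path(demo_dir, "001.csv"), row.names = FALSE)
write.csv(data.frame(sulfate = c(2, 3), nitrate = c(NA, 0.9), ID = 2),
          file.path(demo_dir, "002.csv"), row.names = FALSE)

pooled_sulfate <- pollutantmean(demo_dir, "sulfate", 1:2)  # mean of 1, 2, 3
print(pooled_sulfate)
```

Pooling before averaging matters: in the toy data the per-monitor means are 1 and 2.5, so a mean of means would give 1.75, while the pooled mean is 2.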
Question 2
pollutantmean("specdata", "nitrate", 70:72)
✅ 1.706
❌ 2.604
❌ 1.182
❌ 0.914
❌ 2.394
❌ 2.752
Explanation:
Pooling the nitrate observations from monitors 70–72 and removing missing values gives a mean of approximately 1.706.
Question 3
pollutantmean("specdata", "sulfate", 34)
✅ 1.477
❌ 1.573
❌ 1.300
❌ 0.450
❌ 0.680
❌ 0.591
Explanation:
The mean sulfate concentration for monitor 34 is 1.477 after NA removal.
Question 4
pollutantmean("specdata", "nitrate")
✅ 1.703
❌ 2.493
❌ 2.233
❌ 1.842
❌ 1.774
❌ 2.363
Explanation:
No id argument is given, so the function uses all 332 monitors. Ignoring missing data, the average nitrate concentration is approximately 1.703.
Question 5
✅ 228 148 124 165 104 460 232
❌ Other options
Explanation:
The complete() function counts the completely observed cases (rows with no missing values) for each monitor ID. For the IDs given in the question, the counts are 228, 148, 124, 165, 104, 460, 232.
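A possible shape for such a function is sketched below. The file naming (001.csv, 002.csv, ...) and the toy data are assumptions made for illustration, and the body is one reasonable way to count complete rows rather than the reference implementation.

```r
# Sketch only: assumes one CSV per monitor, named 001.csv, 002.csv, ...
complete <- function(directory, id = 1:332) {
  nobs <- vapply(id, function(i) {
    monitor <- read.csv(file.path(directory, sprintf("%03d.csv", i)))
    # complete.cases() flags rows with no NA in any column.
    sum(complete.cases(monitor))
  }, integer(1))
  # Rows come back in the order the IDs were requested.
  data.frame(id = id, nobs = nobs)
}

# Toy data, invented for illustration (not the real specdata).
demo_dir <- tempfile("specdata_demo_")
dir.create(demo_dir)
write.csv(data.frame(sulfate = c(1, NA, 3), nitrate = c(0.5, 0.7, NA), ID = 1),
          file.path(demo_dir, "001.csv"), row.names = FALSE)
write.csv(data.frame(sulfate = c(2, 3, 4), nitrate = c(NA, 0.9, 1.1), ID = 2),
          file.path(demo_dir, "002.csv"), row.names = FALSE)

cc <- complete(demo_dir, c(2, 1))
print(cc$nobs)  # monitor 2 has two complete rows, monitor 1 has one
```

Returning rows in request order is what lets the quiz answer be read off directly as a sequence of counts.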
Question 6
✅ 219
❌ 220
❌ 248
❌ 205
❌ 228
❌ 213
Explanation:
Monitor ID 54 has 219 complete cases.
Question 7
✅ 711 135 74 445 178 73 49 0 687 237
❌ Other options
Explanation:
Calling set.seed() before sampling makes the random selection reproducible, so the 10 sampled nobs values match this sequence.
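The effect of fixing the seed can be checked with a small sketch. The nobs vector and the seed value 42 below are placeholders chosen for illustration, not values taken from the quiz.

```r
# Hypothetical complete-case counts, invented for illustration.
nobs <- c(12L, 7L, 0L, 45L, 31L, 8L)

# Resetting the same seed before each draw reproduces the same sample.
set.seed(42)
first_draw <- sample(nobs, 3)
set.seed(42)
second_draw <- sample(nobs, 3)

print(identical(first_draw, second_draw))  # TRUE
```

Without the second set.seed() call, the two draws would generally differ, which is why the quiz fixes the seed before sampling.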
Question 8
✅ 0.2688 0.1127 -0.0085 0.4586 0.0447
❌ Other options
Explanation:
Setting the seed to 868 makes the random sample drawn from the sorted correlations reproducible. Rounded to four decimals, the five sampled values are those shown above.
Question 9
✅ 247.0000 0.1958 0.9304 -0.4851 -0.8229 -0.0679
❌ Other options
Explanation:
With threshold = 129, 247 monitors yield a valid correlation. The five values that follow are the sampled correlations, rounded to four decimals.
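The role of the threshold can be sketched as follows. The function name corr, the column names sulfate and nitrate, the file layout, and the toy data are assumptions for illustration; this is one plausible implementation, not the graded one.

```r
# Sketch only: returns one sulfate/nitrate correlation per monitor whose
# number of complete cases is strictly greater than `threshold`.
corr <- function(directory, threshold = 0) {
  files <- list.files(directory, pattern = "\\.csv$", full.names = TRUE)
  out <- numeric(0)
  for (f in files) {
    monitor <- read.csv(f)
    ok <- complete.cases(monitor)
    if (sum(ok) > threshold) {
      out <- c(out, cor(monitor$sulfate[ok], monitor$nitrate[ok]))
    }
  }
  out
}

# Toy data, invented for illustration (not the real specdata).
demo_dir <- tempfile("specdata_demo_")
dir.create(demo_dir)
write.csv(data.frame(sulfate = c(1, 2, 3, 4), nitrate = c(2, 4, 6, 8)),
          file.path(demo_dir, "001.csv"), row.names = FALSE)
write.csv(data.frame(sulfate = c(1, 2, NA), nitrate = c(5, 3, 1)),
          file.path(demo_dir, "002.csv"), row.names = FALSE)

above_three <- corr(demo_dir, 3)   # only monitor 1 has more than 3 complete rows
above_ten <- corr(demo_dir, 10)    # no monitor qualifies
print(length(above_three))
print(length(above_ten))
```

The length of the returned vector is the count of qualifying monitors, which is the kind of leading value the answer reports before the correlations themselves.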
Question 10
✅ 3.0000 0.5342 -0.6713 0.3684
❌ Other options
Explanation:
With threshold 2000, only 3 monitors yield a valid correlation. The remaining values (0.5342, -0.6713, 0.3684) are the rounded correlations obtained with threshold 1000.
🧾 Summary Table
| Q# | ✅ Correct Answer | Key Concept |
|---|---|---|
| 1 | 4.064 | Mean sulfate (IDs 1–10) |
| 2 | 1.706 | Mean nitrate (IDs 70–72) |
| 3 | 1.477 | Mean sulfate (ID 34) |
| 4 | 1.703 | Mean nitrate (all monitors) |
| 5 | 228 148 124 165 104 460 232 | Complete cases |
| 6 | 219 | Complete cases for ID 54 |
| 7 | 711 135 74 445 178 73 49 0 687 237 | Random sample reproducibility |
| 8 | 0.2688 0.1127 -0.0085 0.4586 0.0447 | Random correlations |
| 9 | 247.0000 0.1958 0.9304 -0.4851 -0.8229 -0.0679 | Threshold 129 correlations |
| 10 | 3.0000 0.5342 -0.6713 0.3684 | Threshold 2000 count, threshold 1000 correlations |