Module 2 challenge :Process Data from Dirty to Clean (Google Data Analytics Professional Certificate) Answers 2025
Question 1
Merging client order and shipping datasets — ensure compatibility by:
✅ Map the data
❌ Spotcheck for null values
❌ Apply a data structure
❌ Use a visualization
Explanation:
Data mapping aligns fields (like “Client_ID” or “Order_Date”) between datasets so they match correctly before merging. It ensures consistency and prevents mismatched or lost data.
Question 2
Fill in the blank: When typing a LEN function, the correct _____ to follow is =LEN(range).
✅ Syntax
❌ System
❌ Algorithm
❌ Validation
Explanation:
Syntax means the correct way to write a formula or function (like =LEN(A1)), ensuring the spreadsheet interprets it properly.
Question 3
Which function extracts Che Price’s four-digit postcode (NT 0860)?
✅ =RIGHT(C3,4)
❌ =LEFT(C3,4)
❌ =LEFT(4,C3)
❌ =RIGHT(4,C3)
Explanation:
The RIGHT() function extracts characters from the end of a text string.
Here, =RIGHT(C3,4) correctly returns 0860.
Question 4
In a VLOOKUP function, the third argument tells VLOOKUP to return the value in the _____ row from the specified column.
✅ Same
❌ Neighboring
❌ False
❌ Right
Explanation:
The third argument specifies the column index (not row). VLOOKUP always returns the value from the same row as the lookup value but from the specified column.
Question 5
To create ID “Life_vest04”:
✅ =CONCATENATE(A4,B4)
❌ =CONCATENATE(B4,A4)
❌ =CONCATENATE(A4*B4)
❌ =CONCATENATE(A4+B4)
Explanation:=CONCATENATE(A4,B4) joins text from both cells.
Result: Life_vest + 04 = Life_vest04
Question 6
Count cells from A2–A50 with numbers below 100:
✅ =COUNTIF(A2:A50,”<100″)
❌ >=100
❌ <=100
❌ >100
Explanation:COUNTIF needs a range and a condition in quotes.
Here " <100 " counts all values less than 100.
Question 7
The hyphen’s function when splitting data like “Tofu-Eggs”:
✅ Delimiter
❌ Substring
❌ Duplicate
❌ String
Explanation:
A delimiter separates text into distinct values. Here, the hyphen (-) is the character that divides each item.
Question 8
Tool used to highlight misspelled client names:
✅ Conditional formatting
❌ Field length
❌ Data merging
❌ Cell filtering
Explanation:
Conditional formatting changes how cells look based on criteria (e.g., highlight cells where text contains “Prce” instead of “Price”).
Question 9
Combining hospital systems’ data — what’s the most important step?
✅ Data mapping to standardize, merge, and clean the data from both databases.
❌ Develop marketing campaigns
❌ Use VLOOKUP
❌ Design visualizations
Explanation:
Data mapping and cleaning ensure that patient records from both systems align correctly (e.g., same formats for ID, DOB, diagnosis codes) before analysis.
Question 10
Sales doubled but profit decreased — possible explanations?
✅ Profit values may be missing from the data.
✅ There might be errors in the data entry for sales figures.
✅ Extra data skewed the results.
❌ The original sales data was not backed up properly.
Explanation:
Decreased profit despite higher sales can occur due to:
-
Missing or incorrect data entries,
-
Extra/unfiltered data inflating sales counts,
not from backup issues (which affect availability, not accuracy).
🧾 Summary Table
| Q# | ✅ Correct Answer(s) | Key Concept |
|---|---|---|
| 1 | Map the data | Dataset alignment |
| 2 | Syntax | Correct formula structure |
| 3 | =RIGHT(C3,4) | Extracting text |
| 4 | Same | VLOOKUP logic |
| 5 | =CONCATENATE(A4,B4) | Combining text |
| 6 | =COUNTIF(A2:A50,”<100″) | Conditional counting |
| 7 | Delimiter | Text splitting |
| 8 | Conditional formatting | Visual flag for issues |
| 9 | Data mapping & cleaning | Pre-analysis prep |
| 10 | 1, 2, 4 ✅ | Data quality & consistency |