Skip to content

Module 2 challenge :Process Data from Dirty to Clean (Google Data Analytics Professional Certificate) Answers 2025

Question 1

Merging client order and shipping datasets — ensure compatibility by:

Map the data
❌ Spotcheck for null values
❌ Apply a data structure
❌ Use a visualization

Explanation:
Data mapping aligns fields (like “Client_ID” or “Order_Date”) between datasets so they match correctly before merging. It ensures consistency and prevents mismatched or lost data.


Question 2

Fill in the blank: When typing a LEN function, the correct _____ to follow is =LEN(range).

Syntax
❌ System
❌ Algorithm
❌ Validation

Explanation:
Syntax means the correct way to write a formula or function (like =LEN(A1)), ensuring the spreadsheet interprets it properly.


Question 3

Which function extracts Che Price’s four-digit postcode (NT 0860)?

=RIGHT(C3,4)
❌ =LEFT(C3,4)
❌ =LEFT(4,C3)
❌ =RIGHT(4,C3)

Explanation:
The RIGHT() function extracts characters from the end of a text string.
Here, =RIGHT(C3,4) correctly returns 0860.


Question 4

In a VLOOKUP function, the third argument tells VLOOKUP to return the value in the _____ row from the specified column.

Same
❌ Neighboring
❌ False
❌ Right

Explanation:
The third argument specifies the column index (not row). VLOOKUP always returns the value from the same row as the lookup value but from the specified column.


Question 5

To create ID “Life_vest04”:

=CONCATENATE(A4,B4)
❌ =CONCATENATE(B4,A4)
❌ =CONCATENATE(A4*B4)
❌ =CONCATENATE(A4+B4)

Explanation:
=CONCATENATE(A4,B4) joins text from both cells.
Result: Life_vest + 04 = Life_vest04


Question 6

Count cells from A2–A50 with numbers below 100:

=COUNTIF(A2:A50,”<100″)
❌ >=100
❌ <=100
❌ >100

Explanation:
COUNTIF needs a range and a condition in quotes.
Here " <100 " counts all values less than 100.


Question 7

The hyphen’s function when splitting data like “Tofu-Eggs”:

Delimiter
❌ Substring
❌ Duplicate
❌ String

Explanation:
A delimiter separates text into distinct values. Here, the hyphen (-) is the character that divides each item.


Question 8

Tool used to highlight misspelled client names:

Conditional formatting
❌ Field length
❌ Data merging
❌ Cell filtering

Explanation:
Conditional formatting changes how cells look based on criteria (e.g., highlight cells where text contains “Prce” instead of “Price”).


Question 9

Combining hospital systems’ data — what’s the most important step?

Data mapping to standardize, merge, and clean the data from both databases.
❌ Develop marketing campaigns
❌ Use VLOOKUP
❌ Design visualizations

Explanation:
Data mapping and cleaning ensure that patient records from both systems align correctly (e.g., same formats for ID, DOB, diagnosis codes) before analysis.


Question 10

Sales doubled but profit decreased — possible explanations?

Profit values may be missing from the data.
There might be errors in the data entry for sales figures.
Extra data skewed the results.
❌ The original sales data was not backed up properly.

Explanation:
Decreased profit despite higher sales can occur due to:

  • Missing or incorrect data entries,

  • Extra/unfiltered data inflating sales counts,
    not from backup issues (which affect availability, not accuracy).


🧾 Summary Table

Q# ✅ Correct Answer(s) Key Concept
1 Map the data Dataset alignment
2 Syntax Correct formula structure
3 =RIGHT(C3,4) Extracting text
4 Same VLOOKUP logic
5 =CONCATENATE(A4,B4) Combining text
6 =COUNTIF(A2:A50,”<100″) Conditional counting
7 Delimiter Text splitting
8 Conditional formatting Visual flag for issues
9 Data mapping & cleaning Pre-analysis prep
10 1, 2, 4 ✅ Data quality & consistency