Graded Quiz – Languages :Tools for Data Science (IBM Data Science Professional Certificate) Answers 2025
1️⃣ Question 1
What does Python use for Natural Language Processing?
-
❌ TensorFlow
-
✅ Natural Language Toolkit
-
❌ Pandas
-
❌ Keras
Explanation:
The Natural Language Toolkit (NLTK) is the standard Python library for NLP tasks.
2️⃣ Question 2
Which library is not used for Data Science?
-
❌ Pandas
-
❌ SciPy
-
✅ Caffe
-
❌ Natural Language Toolkit
Explanation:
Caffe is a deep learning framework, not a general-purpose data science library in Python.
3️⃣ Question 3
Which of the following is developed using the R language?
-
❌ Automation
-
❌ Relational DBMS
-
✅ Graphing
-
❌ Image processing
Explanation:
R is widely known for its visualization packages (ggplot2, lattice).
4️⃣ Question 4
Which statement is true about open-source and free software?
-
❌ Open source is more business focused; free software is more values focused.
-
❌ Both do not support collaboration.
-
❌ Open source is values-focused; free software is business-focused.
-
✅ None of the above.
Explanation:
Both open-source and free software support collaboration and share philosophical overlap; none of the given statements are accurate.
5️⃣ Question 5
Which is a global R community?
-
✅ useR
-
❌ RealPython
-
❌ HackerEarth
-
❌ PyLadies
Explanation:
useR! is the global user conference/community for R programmers.
6️⃣ Question 6
Is the statement true or false: R integrates well with C and C++?
-
✅ True
-
❌ False
Explanation:
R can easily call C/C++ code via Rcpp, improving performance.
7️⃣ Question 7
Which is not an SQL language element?
-
✅ Spreadsheets
-
❌ Predicate
-
❌ Clauses
-
❌ Queries
Explanation:
Spreadsheets are not part of SQL; the others are core SQL elements.
8️⃣ Question 8
Why is SQL an ANSI standard?
-
❌ SQL manages data
-
❌ SQL language is subdivided into several elements
-
❌ SQL can speed up workflows
-
✅ SQL knowledge applies to multiple databases
Explanation:
ANSI standardized SQL so it can work across many database systems (MySQL, PostgreSQL, Oracle, etc.).
9️⃣ Question 9
Which Scala program is used for data science?
-
✅ Apache Spark
-
❌ MongoDB
-
❌ Weka
-
❌ Caffe
Explanation:
Apache Spark is a Scala-based distributed computing framework used widely in data science.
🔟 Question 10
Which language was used to develop Hadoop?
-
❌ C++
-
❌ Ruby
-
❌ Python
-
✅ Java
Explanation:
Hadoop’s core framework, including HDFS and MapReduce, was written in Java.
🧾 Summary Table
| Q | Correct Answer | Key Concept |
|---|---|---|
| 1 | Natural Language Toolkit | Python NLP |
| 2 | Caffe | Not a DS library in Python |
| 3 | Graphing | R strengths |
| 4 | None of the above | Open-source & free software basics |
| 5 | useR | Global R community |
| 6 | True | R integrates with C/C++ |
| 7 | Spreadsheets | Not SQL element |
| 8 | SQL applies to multiple DBs | ANSI standard reason |
| 9 | Apache Spark | Scala-based DS tool |
| 10 | Java | Hadoop framework language |