Graded Quiz – Languages: Tools for Data Science (IBM Data Science Professional Certificate) Answers 2025

1️⃣ Question 1

What does Python use for Natural Language Processing?

  • ❌ TensorFlow

  • Natural Language Toolkit

  • ❌ Pandas

  • ❌ Keras

Explanation:
The Natural Language Toolkit (NLTK) is a widely used Python library for NLP tasks such as tokenization, stemming, and tagging.
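
As a rough illustration of what NLTK is used for, here is a minimal sketch that tokenizes a sentence, stems the words, and counts them. It assumes the `nltk` package is installed; the sample sentence and variable names are made up for this example. It uses only the regex-based `wordpunct_tokenize`, `PorterStemmer`, and `FreqDist`, which should not need any extra NLTK data downloads.

```python
from nltk import FreqDist
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize

# Hypothetical sample text, chosen only for illustration.
text = "Data scientists love data. Cleaning data takes time!"

# Split the lowercased text into word and punctuation tokens.
tokens = wordpunct_tokenize(text.lower())

# Keep alphabetic tokens only, then reduce each one to its stem.
stemmer = PorterStemmer()
stems = [stemmer.stem(tok) for tok in tokens if tok.isalpha()]

# Count how often each stem occurs.
freq = FreqDist(stems)
print(freq.most_common(3))
```

TensorFlow, Pandas, and Keras have no comparable text-processing helpers of this kind, which is why they are marked wrong here.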


2️⃣ Question 2

Which library is not used for Data Science?

  • ❌ Pandas

  • ❌ SciPy

  • Caffe

  • ❌ Natural Language Toolkit

Explanation:
Caffe is a deep learning framework (written in C++, with Python bindings) rather than a general-purpose Python data science library.


3️⃣ Question 3

Which of the following is developed using the R language?

  • ❌ Automation

  • ❌ Relational DBMS

  • Graphing

  • ❌ Image processing

Explanation:
R is widely known for its visualization packages (ggplot2, lattice).


4️⃣ Question 4

Which statement is true about open-source and free software?

  • ❌ Open source is more business focused; free software is more values focused.

  • ❌ Both do not support collaboration.

  • ❌ Open source is values-focused; free software is business-focused.

  • None of the above.

Explanation:
Both open-source and free software support collaboration and overlap philosophically; none of the given statements is accurate.


5️⃣ Question 5

Which is a global R community?

  • useR

  • ❌ RealPython

  • ❌ HackerEarth

  • ❌ PyLadies

Explanation:
useR! is the international conference and community for R users.


6️⃣ Question 6

Is the statement true or false: R integrates well with C and C++?

  • True

  • ❌ False

Explanation:
R can call C/C++ code, for example through the Rcpp package, which helps speed up performance-critical code.


7️⃣ Question 7

Which is not an SQL language element?

  • Spreadsheets

  • ❌ Predicate

  • ❌ Clauses

  • ❌ Queries

Explanation:
Spreadsheets are not part of SQL; the others are core SQL elements.
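
To make the three real SQL elements concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The `employees` table, its columns, and the sample rows are hypothetical and exist only for this example.

```python
import sqlite3

# In-memory database, so the sketch leaves no files behind.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [("Ada", "Data", 120), ("Grace", "Data", 110), ("Linus", "Ops", 90)],
)

# The whole statement below is a query.
# SELECT, FROM, WHERE and ORDER BY each introduce a clause.
# "salary > 100" is a predicate: a condition that is true or false per row.
query = """
    SELECT name, salary
    FROM employees
    WHERE salary > 100
    ORDER BY salary DESC
"""
rows = conn.execute(query).fetchall()
print(rows)
conn.close()
```

A spreadsheet can hold similar rows and columns, but it is an application for storing data, not an element of the SQL language.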


8️⃣ Question 8

Why is SQL an ANSI standard?

  • ❌ SQL manages data

  • ❌ SQL language is subdivided into several elements

  • ❌ SQL can speed up workflows

  • SQL knowledge applies to multiple databases

Explanation:
Because SQL is an ANSI standard, the same core language works across many database systems (MySQL, PostgreSQL, Oracle, etc.), so SQL knowledge transfers between them.


9️⃣ Question 9

Which Scala program is used for data science?

  • Apache Spark

  • ❌ MongoDB

  • ❌ Weka

  • ❌ Caffe

Explanation:
Apache Spark is a Scala-based distributed computing framework used widely in data science.


🔟 Question 10

Which language was used to develop Hadoop?

  • ❌ C++

  • ❌ Ruby

  • ❌ Python

  • Java

Explanation:
Hadoop’s core framework, including HDFS and MapReduce, was written in Java.


🧾 Summary Table

| Q  | Correct Answer              | Key Concept                        |
|----|-----------------------------|------------------------------------|
| 1  | Natural Language Toolkit    | Python NLP                         |
| 2  | Caffe                       | Not a DS library in Python         |
| 3  | Graphing                    | R strengths                        |
| 4  | None of the above           | Open-source & free software basics |
| 5  | useR                        | Global R community                 |
| 6  | True                        | R integrates with C/C++            |
| 7  | Spreadsheets                | Not an SQL element                 |
| 8  | SQL applies to multiple DBs | ANSI standard reason               |
| 9  | Apache Spark                | Scala-based DS tool                |
| 10 | Java                        | Hadoop framework language          |