Knowledge Check :Building Data Lakes on AWS (AWS Cloud Solutions Architect Professional Certificate) Answers 2025
1. Question 1 (Select TWO)
Services used for data ingestion into a data lake
-
✅ Amazon Kinesis Data Firehose
-
❌ Amazon QuickSight
-
❌ Amazon Athena
-
✅ AWS Storage Gateway
-
❌ Amazon Redshift
Explanation:
-
Kinesis Data Firehose → real-time streaming ingestion into S3.
-
AWS Storage Gateway → hybrid ingestion from on-premises to S3.
-
Athena & QuickSight are querying/visualization tools, not ingestion.
-
Redshift is a data warehouse, not an ingestion mechanism.
2. Question 2
Continuous replication into Redshift & S3
-
❌ AWS Storage Gateway
-
❌ AWS SCT
-
✅ AWS Database Migration Service (AWS DMS)
-
❌ Kinesis Data Firehose
Explanation:
AWS DMS supports ongoing replication and streams data continuously into Amazon Redshift and Amazon S3.
3. Question 3
AWS Glue feature that catalogs your data
-
✅ AWS Glue Crawler
-
❌ AWS Glue DataBrew
-
❌ AWS Glue Studio
-
❌ AWS Glue Streaming ETL
Explanation:
Glue Crawlers automatically scan data sources and build the Data Catalog with schemas.
4. Question 4
Query multiple data stores (S3 + RDS + DynamoDB) with a single query
-
✅ Amazon Athena Federated Query
-
❌ Redshift Query Editor
-
❌ SQL Workbench
-
❌ AWS Glue DataBrew
Explanation:
Athena Federated Query lets you query S3, RDS, DynamoDB, and more in one SQL query without moving data.
🧾 Summary Table
| Q# | Correct Answer | Key Concept |
|---|---|---|
| 1 | Kinesis Firehose, Storage Gateway | Data ingestion |
| 2 | AWS DMS | Continuous replication |
| 3 | Glue Crawler | Data cataloging |
| 4 | Athena Federated Query | Multi-source querying |