QUESTION: 45

A company uses Amazon Athena to run SQL queries for extract, transform, and load (ETL) tasks by using Create Table As Select (CTAS). The company must use Apache Spark instead of SQL to generate analytics.
Which solution will give the company the ability to use Spark to access Athena?

A. Athena query settings
B. Athena workgroup
C. Athena data source
D. Athena query editor

Answer(s): B
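
To illustrate why the workgroup is the relevant setting: Athena runs Spark code in sessions that are started against a Spark-enabled workgroup. The sketch below assumes a boto3 Athena client is passed in and that a Spark-enabled workgroup already exists. The function names, the workgroup name, and the DPU limit are hypothetical values chosen for illustration, not part of the question.

```python
def start_spark_session(athena_client, workgroup, max_dpus=4):
    """Start an Athena Spark session in a Spark-enabled workgroup.

    athena_client is assumed to be boto3.client("athena").
    max_dpus is an illustrative limit, not a recommended value.
    """
    response = athena_client.start_session(
        WorkGroup=workgroup,
        EngineConfiguration={"MaxConcurrentDpus": max_dpus},
    )
    return response["SessionId"]


def submit_spark_code(athena_client, session_id, code):
    """Submit a block of PySpark code to an existing session.

    A real script would first poll get_session_status until the
    session is idle; that wait is omitted here for brevity.
    """
    response = athena_client.start_calculation_execution(
        SessionId=session_id,
        CodeBlock=code,
    )
    return response["CalculationExecutionId"]
```

With a real client, a call might look like `start_spark_session(boto3.client("athena"), "my-spark-workgroup")`, where the workgroup name is a placeholder.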

QUESTION: 46

A company needs to partition the Amazon S3 storage that the company uses for a data lake. The partitioning will use a path of the S3 object keys in the following format: s3://bucket/prefix/year=2023/month=01/day=01.
A data engineer must ensure that the AWS Glue Data Catalog synchronizes with the S3 storage when the company adds new partitions to the bucket.
Which solution will meet these requirements with the LEAST latency?

A. Schedule an AWS Glue crawler to run every morning.
B. Manually run the AWS Glue CreatePartition API twice each day.
C. Use code that writes data to Amazon S3 to invoke the Boto3 AWS Glue create_partition API call.
D. Run the MSCK REPAIR TABLE command from the AWS Glue console.

Answer(s): C
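
Option C can be sketched as follows: the code that writes an object to Amazon S3 registers the matching partition right after the write, so the Data Catalog does not wait for a crawler or a manual run. This is a minimal sketch under stated assumptions: the Glue client is assumed to be boto3.client("glue"), the helper names are hypothetical, and the partition keys are taken from the path format in the question. It reuses the table's storage descriptor so the new partition inherits the table's format settings.

```python
from urllib.parse import urlparse


def partition_values_from_uri(s3_uri):
    """Extract Hive-style name=value pairs from an S3 URI."""
    path = urlparse(s3_uri).path
    values = {}
    for segment in path.strip("/").split("/"):
        if "=" in segment:
            name, value = segment.split("=", 1)
            values[name] = value
    return values


def register_partition(glue_client, database, table, s3_uri,
                       partition_keys=("year", "month", "day")):
    """Register the partition located at s3_uri in the Glue Data Catalog."""
    parsed = partition_values_from_uri(s3_uri)
    values = [parsed[key] for key in partition_keys]

    # Copy the table's storage descriptor and point it at the partition path.
    table_info = glue_client.get_table(DatabaseName=database, Name=table)
    descriptor = dict(table_info["Table"]["StorageDescriptor"])
    descriptor["Location"] = s3_uri

    glue_client.create_partition(
        DatabaseName=database,
        TableName=table,
        PartitionInput={"Values": values, "StorageDescriptor": descriptor},
    )
    return values
```

Calling `register_partition(glue, "my_db", "my_table", "s3://bucket/prefix/year=2023/month=01/day=01")` would register the partition with values 2023, 01, and 01; the database and table names here are placeholders.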

QUESTION: 47

A media company uses software as a service (SaaS) applications to gather data by using third-party tools. The company needs to store the data in an Amazon S3 bucket. The company will use Amazon Redshift to perform analytics based on the data.
Which AWS service or feature will meet these requirements with the LEAST operational overhead?

A. Amazon Managed Streaming for Apache Kafka (Amazon MSK)
B. Amazon AppFlow
C. AWS Glue Data Catalog
D. Amazon Kinesis

Answer(s): B

QUESTION: 48

A data engineer is using Amazon Athena to analyze sales data that is in Amazon S3. The data engineer writes a query to retrieve sales amounts for 2023 for several products from a table named sales_data. However, the query does not return results for all of the products that are in the sales_data table. The data engineer needs to troubleshoot the query to resolve the issue.
The data engineer's original query is as follows:
SELECT product_name, sum(sales_amount)
FROM sales_data
WHERE year = 2023
GROUP BY product_name
How should the data engineer modify the Athena query to meet these requirements?

A. Replace sum(sales_amount) with count(*) for the aggregation.
B. Change WHERE year = 2023 to WHERE extract(year FROM sales_data) = 2023.
C. Add HAVING sum(sales_amount) > 0 after the GROUP BY clause.
D. Remove the GROUP BY clause.

Answer(s): B
