Free DAS-C01 Exam Braindumps


A company's data analyst needs to ensure that queries run in Amazon Athena cannot scan more than a prescribed amount of data for cost control purposes.
Queries that exceed the prescribed threshold must be canceled immediately.
What should the data analyst do to achieve this?

  1. Configure Athena to invoke an AWS Lambda function that terminates queries when the prescribed threshold is crossed.
  2. For each workgroup, set the control limit for each query to the prescribed threshold.
  3. Enforce the prescribed threshold on all Amazon S3 bucket policies.
  4. For each workgroup, set the workgroup-wide data usage control limit to the prescribed threshold.

Answer(s): B


Reference:

https://docs.aws.amazon.com/athena/latest/ug/workgroups-setting-control-limits-cloudwatch.html
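
A note on why B beats D: the per-query control limit cancels a running query as soon as its scanned bytes exceed the threshold, while workgroup-wide data usage controls only publish Amazon CloudWatch metrics and trigger actions such as SNS notifications. Below is a minimal boto3 sketch of option B; the workgroup name and the 10 GB threshold are illustrative, not from the question.

    import boto3

    athena = boto3.client("athena")

    # Set the per-query data usage control limit on an existing workgroup.
    # Athena cancels any query in this workgroup whose scanned data exceeds
    # BytesScannedCutoffPerQuery.
    athena.update_work_group(
        WorkGroup="analyst-workgroup",  # hypothetical workgroup name
        ConfigurationUpdates={
            "BytesScannedCutoffPerQuery": 10 * 1024**3  # 10 GB, illustrative
        },
    )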



A marketing company is using Amazon EMR clusters for its workloads. The company manually installs third-party libraries on the clusters by logging in to the master nodes. A data analyst needs to create an automated solution to replace the manual process.
Which options can fulfill these requirements? (Choose two.)

  1. Place the required installation scripts in Amazon S3 and execute them using custom bootstrap actions.
  2. Place the required installation scripts in Amazon S3 and execute them through Apache Spark in Amazon EMR.
  3. Install the required third-party libraries in the existing EMR master node. Create an AMI out of that master node and use that custom AMI to re-create the EMR cluster.
  4. Use an Amazon DynamoDB table to store the list of required applications. Trigger an AWS Lambda function with DynamoDB Streams to install the software.
  5. Launch an Amazon EC2 instance with Amazon Linux and install the required third-party libraries on the instance. Create an AMI and use that AMI to create the EMR cluster.

Answer(s): A, E
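
Below is a minimal boto3 sketch of option A, assuming an installation script has already been uploaded to S3; the bucket, script path, cluster name, and instance settings are all hypothetical. EMR runs bootstrap actions on every node before applications start, which replaces the manual logins to the master node.

    import boto3

    emr = boto3.client("emr")

    # Launch a cluster whose nodes run the S3-hosted install script at startup.
    emr.run_job_flow(
        Name="analytics-cluster",            # hypothetical cluster name
        ReleaseLabel="emr-6.10.0",           # illustrative EMR release
        Applications=[{"Name": "Spark"}],
        Instances={
            "MasterInstanceType": "m5.xlarge",
            "SlaveInstanceType": "m5.xlarge",
            "InstanceCount": 3,
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        BootstrapActions=[
            {
                "Name": "Install third-party libraries",
                "ScriptBootstrapAction": {
                    # hypothetical script location in S3
                    "Path": "s3://my-bucket/bootstrap/install-libs.sh"
                },
            }
        ],
        JobFlowRole="EMR_EC2_DefaultRole",
        ServiceRole="EMR_DefaultRole",
    )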



A data engineering team within a shared workspace company wants to build a centralized logging system for all weblogs generated by the space reservation system. The company has a fleet of Amazon EC2 instances that process requests for shared space reservations on its website. The data engineering team wants to ingest all weblogs into a service that will provide a near-real-time search engine. The team does not want to manage the maintenance and operation of the logging system.
Which solution allows the data engineering team to efficiently set up the web logging system within AWS?

  1. Set up the Amazon CloudWatch agent to stream weblogs to CloudWatch Logs and subscribe the Amazon Kinesis data stream to CloudWatch. Choose Amazon OpenSearch Service (Amazon Elasticsearch Service) as the end destination of the weblogs.
  2. Set up the Amazon CloudWatch agent to stream weblogs to CloudWatch Logs and subscribe the Amazon Kinesis Data Firehose delivery stream to CloudWatch. Choose Amazon OpenSearch Service (Amazon Elasticsearch Service) as the end destination of the weblogs.
  3. Set up the Amazon CloudWatch agent to stream weblogs to CloudWatch Logs and subscribe the Amazon Kinesis data stream to CloudWatch. Configure Splunk as the end destination of the weblogs.
  4. Set up the Amazon CloudWatch agent to stream weblogs to CloudWatch Logs and subscribe the Amazon Kinesis Data Firehose delivery stream to CloudWatch. Configure Amazon DynamoDB as the end destination of the weblogs.

Answer(s): B
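
Kinesis Data Firehose fits here because it is fully managed and delivers to Amazon OpenSearch Service natively, satisfying the no-maintenance requirement; a plain Kinesis data stream (options A and C) would still need consumers the team must operate. Below is a minimal boto3 sketch of the CloudWatch Logs subscription from option B; the log group, delivery stream ARN, and IAM role are hypothetical.

    import boto3

    logs = boto3.client("logs")

    # Subscribe a Firehose delivery stream to the weblog log group so every
    # log event is forwarded to Firehose, which delivers it to OpenSearch.
    logs.put_subscription_filter(
        logGroupName="/weblogs/reservation-system",  # hypothetical log group
        filterName="weblogs-to-firehose",
        filterPattern="",  # empty pattern matches every log event
        destinationArn=(
            "arn:aws:firehose:us-east-1:123456789012:"
            "deliverystream/weblogs-to-opensearch"  # hypothetical stream
        ),
        # hypothetical role allowing CloudWatch Logs to write to Firehose
        roleArn="arn:aws:iam::123456789012:role/CWLtoFirehoseRole",
    )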



A company wants to research user turnover by analyzing the past 3 months of user activities. With millions of users, 1.5 TB of uncompressed data is generated each day. A 30-node Amazon Redshift cluster with 2.56 TB of solid state drive (SSD) storage for each node is required to meet the query performance goals.
The company wants to run an additional analysis on a year's worth of historical data to examine trends indicating which features are most popular. This analysis will be done once a week.
What is the MOST cost-effective solution?

  1. Increase the size of the Amazon Redshift cluster to 120 nodes so it has enough storage capacity to hold 1 year of data. Then use Amazon Redshift for the additional analysis.
  2. Keep the data from the last 90 days in Amazon Redshift. Move data older than 90 days to Amazon S3 and store it in Apache Parquet format partitioned by date. Then use Amazon Redshift Spectrum for the additional analysis.
  3. Keep the data from the last 90 days in Amazon Redshift. Move data older than 90 days to Amazon S3 and store it in Apache Parquet format partitioned by date. Then provision a persistent Amazon EMR cluster and use Apache Presto for the additional analysis.
  4. Resize the cluster node type to the dense storage node type (DS2) for an additional 16 TB storage capacity on each individual node in the Amazon Redshift cluster. Then use Amazon Redshift for the additional analysis.

Answer(s): B
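
Option B keeps the hot 90 days on the SSD cluster and moves cold history to low-cost S3 storage that Redshift Spectrum can query in place, with no additional cluster to run (unlike the persistent EMR cluster in option C). Below is a minimal sketch of the external schema setup using the Redshift Data API via boto3; the cluster, database, user, Glue database, and IAM role names are hypothetical.

    import boto3

    rsd = boto3.client("redshift-data")

    # Register a Glue Data Catalog database as an external schema so Redshift
    # Spectrum can query the Parquet-on-S3 history without loading it.
    rsd.execute_statement(
        ClusterIdentifier="analytics-cluster",  # hypothetical cluster
        Database="dev",                         # hypothetical database
        DbUser="analyst",                       # hypothetical user
        Sql=(
            "CREATE EXTERNAL SCHEMA IF NOT EXISTS spectrum_history "
            "FROM DATA CATALOG DATABASE 'user_activity' "
            "IAM_ROLE 'arn:aws:iam::123456789012:role/RedshiftSpectrumRole' "
            "CREATE EXTERNAL DATABASE IF NOT EXISTS;"
        ),
    )

The weekly trend analysis can then join these external tables with the in-cluster tables in ordinary SQL, paying only for the S3 data each query scans.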





