Free AWS-Certified-Big-Data-Specialty Exam Braindumps (page: 17)


Does the EMR Hadoop input connector for Kinesis enable continuous stream processing?

  A. Only in some regions
  B. Yes
  C. No
  D. Only if the iteration process succeeds

Answer(s): C

Explanation:

The Hadoop MapReduce framework is a batch processing system. As such, it does not support continuous queries. However, there is an emerging set of Hadoop ecosystem frameworks, such as Twitter Storm and Spark Streaming, that enable developers to build applications for continuous stream processing. A Storm connector for Kinesis is available on GitHub, and AWS provides a tutorial explaining how to set up Spark Streaming on EMR and run continuous queries.
Additionally, developers can use the Kinesis Client Library (KCL) to develop real-time stream processing applications.
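As a minimal sketch of continuous processing against Kinesis outside of MapReduce, the loop below polls a shard with boto3 (a simplified version of what the KCL manages for you). The stream name is a placeholder, and the AWS-calling function is not meant to run without credentials:

```python
# Sketch: continuous Kinesis consumption with boto3, unlike a finite
# MapReduce batch job. "my-stream" is a hypothetical stream name.
import json

def process_records(records):
    """Decode Kinesis record payloads; each record's Data field is bytes."""
    events = []
    for record in records:
        payload = record["Data"]
        if isinstance(payload, (bytes, bytearray)):
            payload = payload.decode("utf-8")
        events.append(json.loads(payload))
    return events

def consume_forever(stream_name="my-stream"):
    # Requires AWS credentials; shown for illustration only.
    import time
    import boto3
    kinesis = boto3.client("kinesis")
    shard_id = kinesis.describe_stream(StreamName=stream_name)[
        "StreamDescription"]["Shards"][0]["ShardId"]
    iterator = kinesis.get_shard_iterator(
        StreamName=stream_name, ShardId=shard_id,
        ShardIteratorType="LATEST")["ShardIterator"]
    while True:  # a continuous query loop: it never "completes"
        resp = kinesis.get_records(ShardIterator=iterator, Limit=100)
        for event in process_records(resp["Records"]):
            print(event)
        iterator = resp["NextShardIterator"]
        time.sleep(1)
```

In production, the KCL additionally handles checkpointing, shard rebalancing, and multi-shard coordination, which this single-shard sketch omits.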


Reference:

https://aws.amazon.com/elasticmapreduce/faqs/



In AWS Data Pipeline, which preconditions (pipeline components containing conditional statements) must be true before an activity can run? (choose three)

  A. Check whether an Amazon S3 key is present
  B. Check whether source data is present before a pipeline activity attempts to copy it
  C. Check if the Hive script has compile errors in it
  D. Check whether a database table exists

Answer(s): A,B,D

Explanation:

The following preconditions must be true before an AWS Data Pipeline activity will run: check whether source data is present before a pipeline activity attempts to copy it, check whether a database table exists, and check whether an Amazon S3 key is present.
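As a sketch, these checks correspond to built-in precondition object types in a pipeline definition (the format accepted by boto3's put_pipeline_definition). The bucket, key, and table names below are placeholders, and the database check is illustrated with the DynamoDB variant:

```python
# Sketch of precondition objects in Data Pipeline's object/field format.
# All resource names are hypothetical placeholders.
pipeline_objects = [
    {"id": "S3KeyCheck", "name": "S3KeyCheck", "fields": [
        {"key": "type", "stringValue": "S3KeyExists"},          # S3 key present
        {"key": "s3Key", "stringValue": "s3://example-bucket/input/data.csv"}]},
    {"id": "SourceDataCheck", "name": "SourceDataCheck", "fields": [
        {"key": "type", "stringValue": "Exists"}]},             # source data node exists
    {"id": "TableCheck", "name": "TableCheck", "fields": [
        {"key": "type", "stringValue": "DynamoDBTableExists"},  # table exists
        {"key": "tableName", "stringValue": "example-table"}]},
]

# Each object's "type" field names the built-in conditional check.
types = [f["stringValue"] for obj in pipeline_objects
         for f in obj["fields"] if f["key"] == "type"]
print(types)
```

An activity then references a precondition via a `precondition` field, and Data Pipeline evaluates it before scheduling the activity.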


Reference:

http://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-managingpipeline.html



What are the supported ways you can use Task Runner to process your AWS Data Pipeline tasks? (choose three)

  A. Install Task Runner on a long-running EC2 instance.
  B. Install Task Runner on a computational resource that you manage.
  C. Install Task Runner on an AWS Database Migration Service instance.
  D. Enable AWS Data Pipeline to install Task Runner for you on resources that are launched and managed by the AWS Data Pipeline web service.

Answer(s): A,B,D

Explanation:

Task Runner supports two use cases: enable AWS Data Pipeline to install Task Runner for you on resources that are launched and managed by the AWS Data Pipeline web service, or install Task Runner on a computational resource that you manage, such as a long-running EC2 instance or an on-premises server.
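For the self-managed case, Task Runner is essentially a polling worker. A minimal sketch of that loop using boto3's Data Pipeline API is below; the worker group name and the `do_work` helper are hypothetical, and the AWS-calling function is not meant to run without credentials:

```python
# Sketch of the polling loop Task Runner performs on a resource you
# manage. "my-worker-group" and do_work() are placeholders.
def task_status_for(exit_code):
    """Map a task's exit code to a Data Pipeline task status."""
    return "FINISHED" if exit_code == 0 else "FAILED"

def run_worker(worker_group="my-worker-group"):
    # Requires AWS credentials; shown for illustration only.
    import boto3
    dp = boto3.client("datapipeline")
    while True:
        resp = dp.poll_for_task(workerGroup=worker_group)
        task = resp.get("taskObject")
        if not task:
            continue  # long poll returned no work; poll again
        exit_code = do_work(task)  # placeholder for your task logic
        dp.set_task_status(taskId=task["taskId"],
                           taskStatus=task_status_for(exit_code))
```

The real Task Runner adds heartbeating via report_task_progress and richer error reporting, but the poll/execute/report cycle is the core of both deployment modes.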


Reference:

http://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-managingpipeline.html



In AWS Data Pipeline, data nodes are used for (choose two)

  A. Loading data to the target
  B. Accessing data from the source
  C. Processing data transformations
  D. Storing logs

Answer(s): A,B

Explanation:

In AWS Data Pipeline, data nodes are used for accessing data from the source and loading data to the target.
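As a sketch of both roles, the definition below declares two S3DataNode objects, with a CopyActivity reading from one (source) and writing to the other (target). All paths and ids are placeholders:

```python
# Sketch: data nodes as source and target of a CopyActivity, in Data
# Pipeline's object/field format. Paths are hypothetical placeholders.
pipeline_objects = [
    {"id": "InputNode", "name": "InputNode", "fields": [
        {"key": "type", "stringValue": "S3DataNode"},
        {"key": "filePath", "stringValue": "s3://example-bucket/source/data.csv"}]},
    {"id": "OutputNode", "name": "OutputNode", "fields": [
        {"key": "type", "stringValue": "S3DataNode"},
        {"key": "filePath", "stringValue": "s3://example-bucket/target/data.csv"}]},
    {"id": "CopyData", "name": "CopyData", "fields": [
        {"key": "type", "stringValue": "CopyActivity"},
        {"key": "input", "refValue": "InputNode"},    # data node: access source
        {"key": "output", "refValue": "OutputNode"},  # data node: load target
    ]},
]

# The activity transforms/moves data; the data nodes only describe
# where data lives, which is why options C and D are wrong.
node_ids = [o["id"] for o in pipeline_objects
            if {"key": "type", "stringValue": "S3DataNode"} in o["fields"]]
print(node_ids)
```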


Reference:

http://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-concepts-datanodes.html





