Free Professional Data Engineer Exam Braindumps (page: 28)

Page 27 of 95

Which of these rules apply when you add preemptible workers to a Dataproc cluster (select 2 answers)?

  1. Preemptible workers cannot use persistent disk.
  2. Preemptible workers cannot store data.
  3. If a preemptible worker is reclaimed, then a replacement worker must be added manually.
  4. A Dataproc cluster cannot have only preemptible workers.

Answer(s): B,D

Explanation:

The following rules will apply when you use preemptible workers with a Cloud Dataproc cluster:

. Processing only--Since preemptibles can be reclaimed at any time, preemptible workers do not store data. Preemptibles added to a Cloud Dataproc cluster only function as processing nodes.

. No preemptible-only clusters--To ensure clusters do not lose all workers, Cloud Dataproc cannot create preemptible-only clusters.

. Persistent disk size--As a default, all preemptible workers are created with the smaller of 100GB or the primary worker boot disk size. This disk space is used for local caching of data and is not available through HDFS.

The managed group automatically re-adds workers lost due to reclamation as capacity permits.


Reference:

https://cloud.google.com/dataproc/docs/concepts/preemptible-vms



When using Cloud Dataproc clusters, you can access the YARN web interface by configuring a browser to connect through a ____ proxy.

  1. HTTPS
  2. VPN
  3. SOCKS
  4. HTTP

Answer(s): C

Explanation:

When using Cloud Dataproc clusters, configure your browser to use the SOCKS proxy. The SOCKS proxy routes data intended for the Cloud Dataproc cluster through an SSH tunnel.


Reference:

https://cloud.google.com/dataproc/docs/concepts/cluster-web-interfaces#interfaces



Cloud Dataproc is a managed Apache Hadoop and Apache _____ service.

  1. Blaze
  2. Spark
  3. Fire
  4. Ignite

Answer(s): B

Explanation:

Cloud Dataproc is a managed Apache Spark and Apache Hadoop service that lets you use open source data tools for batch processing, querying, streaming, and machine learning.


Reference:

https://cloud.google.com/dataproc/docs/



Which action can a Cloud Dataproc Viewer perform?

  1. Submit a job.
  2. Create a cluster.
  3. Delete a cluster.
  4. List the jobs.

Answer(s): D

Explanation:

A Cloud Dataproc Viewer is limited in its actions based on its role. A viewer can only list clusters, get cluster details, list jobs, get job details, list operations, and get operation details.


Reference:

https://cloud.google.com/dataproc/docs/concepts/iam#iam_roles_and_cloud_dataproc_operations _summary






Post your Comments and Discuss Google Professional Data Engineer exam with other Community members:

Exam Discussions & Posts