Free Professional Data Engineer Exam Braindumps (page: 29)

Page 28 of 95

Dataproc clusters contain many configuration files. To update these files, you will need to use the -- properties option. The format for the option is: file_prefix:property=_____.

  1. details
  2. value
  3. null
  4. id

Answer(s): B

Explanation:

To make updating files and properties easy, the --properties command uses a special format to specify the configuration file and the property and value within the file that should be updated. The formatting is as follows: file_prefix:property=value.


Reference:

https://cloud.google.com/dataproc/docs/concepts/cluster-properties#formatting



Scaling a Cloud Dataproc cluster typically involves ____.

  1. increasing or decreasing the number of worker nodes
  2. increasing or decreasing the number of master nodes
  3. moving memory to run more applications on a single node
  4. deleting applications from unused nodes periodically

Answer(s): A

Explanation:

After creating a Cloud Dataproc cluster, you can scale the cluster by increasing or decreasing the number of worker nodes in the cluster at any time, even when jobs are running on the cluster. Cloud Dataproc clusters are typically scaled to:

1) increase the number of workers to make a job run faster

2) decrease the number of workers to save money

3) increase the number of nodes to expand available Hadoop Distributed Filesystem (HDFS) storage


Reference:

https://cloud.google.com/dataproc/docs/concepts/scaling-clusters



Cloud Dataproc charges you only for what you really use with _____ billing.

  1. month-by-month
  2. minute-by-minute
  3. week-by-week
  4. hour-by-hour

Answer(s): B

Explanation:

One of the advantages of Cloud Dataproc is its low cost. Dataproc charges for what you really use with minute-by-minute billing and a low, ten-minute-minimum billing period.


Reference:

https://cloud.google.com/dataproc/docs/concepts/overview



The YARN ResourceManager and the HDFS NameNode interfaces are available on a Cloud Dataproc cluster ____.

  1. application node
  2. conditional node
  3. master node
  4. worker node

Answer(s): C

Explanation:

The YARN ResourceManager and the HDFS NameNode interfaces are available on a Cloud Dataproc cluster master node. The cluster master-host-name is the name of your Cloud Dataproc cluster followed by an -m suffix--for example, if your cluster is named "my-cluster", the master-host-name would be "my-cluster-m".


Reference:

https://cloud.google.com/dataproc/docs/concepts/cluster-web-interfaces#interfaces






Post your Comments and Discuss Google Professional Data Engineer exam with other Community members:

Exam Discussions & Posts