Free Professional Data Engineer Exam Braindumps (page: 30)

Page 29 of 95

Which of these is NOT a way to customize the software on Dataproc cluster instances?

  1. Set initialization actions
  2. Modify configuration files using cluster properties
  3. Configure the cluster using Cloud Deployment Manager
  4. Log into the master node and make changes from there

Answer(s): C

Explanation:

You can access the master node of the cluster by clicking the SSH button next to it in the Cloud Console.

You can easily use the --properties option of the dataproc command in the Google Cloud SDK to modify many common configuration files when creating a cluster.

When creating a Cloud Dataproc cluster, you can specify initialization actions in executables and/or scripts that Cloud Dataproc will run on all nodes in your Cloud Dataproc cluster immediately after the cluster is set up. [https://cloud.google.com/dataproc/docs/concepts/configuring-clusters/init- actions]


Reference:

https://cloud.google.com/dataproc/docs/concepts/configuring-clusters/cluster- properties



In order to securely transfer web traffic data from your computer's web browser to the Cloud Dataproc cluster you should use a(n) _____.

  1. VPN connection
  2. Special browser
  3. SSH tunnel
  4. FTP connection

Answer(s): C

Explanation:

To connect to the web interfaces, it is recommended to use an SSH tunnel to create a secure connection to the master node.


Reference:

https://cloud.google.com/dataproc/docs/concepts/cluster-web- interfaces#connecting_to_the_web_interfaces



All Google Cloud Bigtable client requests go through a front-end server ______ they are sent to a Cloud Bigtable node.

  1. before
  2. after
  3. only if
  4. once

Answer(s): A

Explanation:

In a Cloud Bigtable architecture all client requests go through a front-end server before they are sent to a Cloud Bigtable node.

The nodes are organized into a Cloud Bigtable cluster, which belongs to a Cloud Bigtable instance, which is a container for the cluster. Each node in the cluster handles a subset of the requests to the cluster.

When additional nodes are added to a cluster, you can increase the number of simultaneous requests that the cluster can handle, as well as the maximum throughput for the entire cluster.


Reference:

https://cloud.google.com/bigtable/docs/overview



What is the general recommendation when designing your row keys for a Cloud Bigtable schema?

  1. Include multiple time series values within the row key
  2. Keep the row keep as an 8 bit integer
  3. Keep your row key reasonably short
  4. Keep your row key as long as the field permits

Answer(s): C

Explanation:

A general guide is to, keep your row keys reasonably short. Long row keys take up additional memory and storage and increase the time it takes to get responses from the Cloud Bigtable server.


Reference:

https://cloud.google.com/bigtable/docs/schema-design#row-keys






Post your Comments and Discuss Google Professional Data Engineer exam with other Community members:

Exam Discussions & Posts