Which of the following IAM roles does your Compute Engine service account require in order to run pipeline jobs?
Answer(s): A
The dataflow.worker role provides the permissions necessary for a Compute Engine service account to execute work units for a Dataflow pipeline.
https://cloud.google.com/dataflow/access-control
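As a rough illustration of the explanation above, the sketch below builds the kind of IAM policy binding that would grant the worker role to a Compute Engine service account. It assumes the project's default Compute Engine service account is the one running the workers; the helper name `dataflow_worker_binding` and the project number are hypothetical, and the code only constructs a dictionary without calling any Google Cloud API.

```python
# Minimal sketch: build an IAM policy binding for the Dataflow worker role.
# Assumption: the default Compute Engine service account is used, whose
# address has the form PROJECT_NUMBER-compute@developer.gserviceaccount.com.

def dataflow_worker_binding(project_number: str) -> dict:
    """Return a policy binding dict (hypothetical helper, not a Google API)."""
    member = (
        f"serviceAccount:{project_number}-compute"
        "@developer.gserviceaccount.com"
    )
    return {"role": "roles/dataflow.worker", "members": [member]}


# Placeholder project number used purely for illustration.
binding = dataflow_worker_binding("123456789012")
print(binding)
```

A binding like this would normally be merged into the project's IAM policy with the console, the gcloud CLI, or a client library rather than printed.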
Which of the following is not true about Dataflow pipelines?
Answer(s): D
The data and transforms in a pipeline are unique to, and owned by, that pipeline. While your program can create multiple pipelines, pipelines cannot share data or transforms.
https://cloud.google.com/dataflow/model/pipelines
By default, which of the following windowing behaviors does Dataflow apply to unbounded data sets?
Answer(s): B
Dataflow's default windowing behavior is to assign all elements of a PCollection to a single, global window, even for unbounded PCollections.
https://cloud.google.com/dataflow/model/pcollection
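To make the default concrete, here is a small plain-Python sketch that contrasts a single global window with fixed-size windows. It does not use the Dataflow or Apache Beam SDK; the function names, the `GLOBAL_WINDOW` label, and the sample events are all hypothetical and exist only to show how elements end up grouped.

```python
from collections import defaultdict

GLOBAL_WINDOW = "GlobalWindow"  # hypothetical label for the single window


def assign_global(elements):
    """Default-style behavior: every element lands in one shared window."""
    windows = defaultdict(list)
    for _timestamp, value in elements:
        windows[GLOBAL_WINDOW].append(value)
    return dict(windows)


def assign_fixed(elements, size_seconds):
    """Alternative: bucket elements into fixed windows by timestamp."""
    windows = defaultdict(list)
    for timestamp, value in elements:
        start = timestamp - (timestamp % size_seconds)
        windows[(start, start + size_seconds)].append(value)
    return dict(windows)


# (timestamp in seconds, value) pairs standing in for an unbounded stream.
events = [(0, "a"), (30, "b"), (65, "c"), (130, "d")]

global_result = assign_global(events)
fixed_result = assign_fixed(events, 60)
print(global_result)
print(fixed_result)
```

With the global window, all four values share one group regardless of timestamp, which is why grouping an unbounded PCollection usually calls for a non-global windowing function or a trigger.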
Which of the following job types are supported by Cloud Dataproc (select 3 answers)?
Answer(s): A,B,D
Cloud Dataproc provides out-of-the-box, end-to-end support for many of the most popular job types, including Spark, Spark SQL, PySpark, MapReduce, Hive, and Pig jobs.
https://cloud.google.com/dataproc/docs/resources/faq#what_type_of_jobs_can_i_run