Free Certified Data Engineer Professional Exam Braindumps (page: 16)

Page 16 of 46

A member of the data engineering team has submitted a short notebook that they wish to schedule as part of a larger data pipeline. Assume that the commands provided below produce the logically correct results when run as presented.

Which command should be removed from the notebook before scheduling it as a job?

  1. Cmd 2
  2. Cmd 3
  3. Cmd 4
  4. Cmd 5
  5. Cmd 6

Answer(s): E



The business reporting team requires that data for their dashboards be updated every hour. The total processing time for the pipeline that extracts transforms, and loads the data for their pipeline runs in 10 minutes.

Assuming normal operating conditions, which configuration will meet their service-level agreement requirements with the lowest cost?

  1. Manually trigger a job anytime the business reporting team refreshes their dashboards
  2. Schedule a job to execute the pipeline once an hour on a new job cluster
  3. Schedule a Structured Streaming job with a trigger interval of 60 minutes
  4. Schedule a job to execute the pipeline once an hour on a dedicated interactive cluster
  5. Configure a job that executes every time new data lands in a given directory

Answer(s): B



A Databricks SQL dashboard has been configured to monitor the total number of records present in a collection of Delta Lake tables using the following query pattern:
SELECT COUNT (*) FROM table

Which of the following describes how results are generated each time the dashboard is updated?

  1. The total count of rows is calculated by scanning all data files
  2. The total count of rows will be returned from cached results unless REFRESH is run
  3. The total count of records is calculated from the Delta transaction logs
  4. The total count of records is calculated from the parquet file metadata
  5. The total count of records is calculated from the Hive metastore

Answer(s): C



A Delta Lake table was created with the below query:

Consider the following query:
DROP TABLE prod.sales_by_store

If this statement is executed by a workspace admin, which result will occur?

  1. Nothing will occur until a COMMIT command is executed.
  2. The table will be removed from the catalog but the data will remain in storage.
  3. The table will be removed from the catalog and the data will be deleted.
  4. An error will occur because Delta Lake prevents the deletion of production data.
  5. Data will be marked as deleted but still recoverable with Time Travel.

Answer(s): C



Page 16 of 46



Post your Comments and Discuss Databricks Certified Data Engineer Professional exam with other Community members:

Puran commented on September 18, 2024
Good material and very honest and knowledgeable support team. Contacted the support team and got a reply in less than 30 minutes.
New Zealand
upvote