Databricks Certified Data Engineer Associate Exam

Updated On: 19-Jan-2026

Which of the following data lakehouse features results in improved data quality over a traditional data lake?

  A. A data lakehouse provides storage solutions for structured and unstructured data.
  B. A data lakehouse supports ACID-compliant transactions.
  C. A data lakehouse allows the use of SQL queries to examine data.
  D. A data lakehouse stores data in open formats.
  E. A data lakehouse enables machine learning and artificial intelligence workloads.

Answer(s): B

Explanation:

ACID transactions guarantee that each write to a table either completes in its entirety or has no effect, and that concurrent readers never see a partially written result. A traditional data lake offers no such guarantee, so a failed or concurrent job can leave partial or inconsistent files behind. By supporting ACID transactions, a data lakehouse reduces the risk of data corruption and therefore provides improved data quality over a traditional data lake.
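
As an illustration of the all-or-nothing guarantee, here is a minimal Python sketch. It uses the standard-library sqlite3 module as a stand-in for a transactional store, not Delta Lake itself; the sales table, its columns, and the insert_batch helper are hypothetical names chosen for the example.

```python
import sqlite3


def insert_batch(conn, rows):
    """Insert all rows in one transaction; if any row fails, none are kept."""
    try:
        # Used as a context manager, the connection commits on success
        # and rolls back the whole transaction if an exception is raised.
        with conn:
            conn.executemany(
                "INSERT INTO sales (id, amount) VALUES (?, ?)", rows
            )
        return True
    except sqlite3.IntegrityError:
        return False


# Hypothetical table: amount must never be NULL.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (id INTEGER PRIMARY KEY, amount REAL NOT NULL)"
)

good = insert_batch(conn, [(1, 10.0), (2, 20.0)])
# The second row violates NOT NULL, so the first row of this batch
# is rolled back as well.
bad = insert_batch(conn, [(3, 30.0), (4, None)])

row_count = conn.execute("SELECT COUNT(*) FROM sales").fetchone()[0]
print(good, bad, row_count)
```

Only the two rows from the valid batch remain in the table; the failed batch leaves no partial result behind, which is the property that distinguishes a transactional table from loose files in a data lake.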



A data engineer needs to determine whether to use the built-in Databricks Notebooks versioning or version their project using Databricks Repos.

Which of the following is an advantage of using Databricks Repos over the Databricks Notebooks versioning?

  A. Databricks Repos automatically saves development progress
  B. Databricks Repos supports the use of multiple branches
  C. Databricks Repos allows users to revert to previous versions of a notebook
  D. Databricks Repos provides the ability to comment on specific changes
  E. Databricks Repos is wholly housed within the Databricks Lakehouse Platform

Answer(s): B



A data engineer has left the organization. The data team needs to transfer ownership of the data engineer’s Delta tables to a new data engineer. The new data engineer is the lead engineer on the data team.

Assuming the original data engineer no longer has access, which of the following individuals must be the one to transfer ownership of the Delta tables in Data Explorer?

  A. Databricks account representative
  B. This transfer is not possible
  C. Workspace administrator
  D. New lead data engineer
  E. Original data engineer

Answer(s): C


Reference:

https://www.databricks.com/blog/2022/08/26/databricks-workspace-administration-best-practices-for-account-workspace-and-metastore-admins.html



A data analyst has created a Delta table sales that is used by the entire data analysis team. They want help from the data engineering team to implement a series of tests to ensure the data is clean. However, the data engineering team uses Python for its tests rather than SQL.

Which of the following commands could the data engineering team use to access sales in PySpark?

  A. SELECT * FROM sales
  B. There is no way to share data between PySpark and SQL.
  C. spark.sql("sales")
  D. spark.delta.table("sales")
  E. spark.table("sales")

Answer(s): E

Explanation:

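spark.table("sales") returns the registered table as a PySpark DataFrame, so Python tests can run against it directly. For example, create a table: create or replace table delta_su (id INT, nombre STRING)
Insert rows into the table, then retrieve the stored values with: spark.table("delta_su").show()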
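
A rough sketch of how such a Python test might look once the table is loaded with spark.table. The function name run_sales_checks, the key column id, and the specific checks are assumptions for illustration; the question does not describe the schema of sales or the tests the team wants. Only DataFrame.count and DataFrame.filter with a SQL expression string are used.

```python
def run_sales_checks(spark, table_name="sales", key_column="id"):
    """Load a table as a DataFrame and compute two simple quality metrics.

    `spark` is an active SparkSession (predefined in a Databricks notebook).
    `key_column` is a hypothetical column name; adjust it to the real schema.
    """
    df = spark.table(table_name)  # the table as a PySpark DataFrame

    row_count = df.count()
    # filter() accepts a SQL expression string, so no extra imports are needed.
    null_keys = df.filter(f"{key_column} IS NULL").count()

    return {"row_count": row_count, "null_keys": null_keys}


def assert_sales_clean(spark):
    """Fail loudly if the table is empty or contains NULL keys."""
    result = run_sales_checks(spark)
    assert result["row_count"] > 0, "sales is empty"
    assert result["null_keys"] == 0, "sales has rows with a NULL key"
```

In a notebook, calling assert_sales_clean(spark) would run the checks against the shared sales table without writing any SQL cells.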


