Databricks Certified Data Engineer Associate Exam Questions

Updated On: 16-Feb-2026

A data engineer has left the organization. The data team needs to transfer ownership of the data engineer’s Delta tables to a new data engineer. The new data engineer is the lead engineer on the data team.

Assuming the original data engineer no longer has access, which of the following individuals must be the one to transfer ownership of the Delta tables in Data Explorer?

  1. Databricks account representative
  2. This transfer is not possible
  3. Workspace administrator
  4. New lead data engineer
  5. Original data engineer

Answer(s): C (option 3, Workspace administrator)


Reference:

https://www.databricks.com/blog/2022/08/26/databricks-workspace-administration-best-practices-for-account-workspace-and-metastore-admins.html
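The question is about the Data Explorer UI, but ownership can also be changed in SQL with ALTER TABLE ... OWNER TO, run by a user with sufficient privileges. The following is a minimal sketch that only builds the statement; the helper name, table name, and e-mail address are hypothetical.

```python
def transfer_ownership_sql(table_name: str, new_owner: str) -> str:
    """Build an ALTER TABLE ... OWNER TO statement for one table."""
    # Backticks quote the principal, since e-mail addresses contain '@' and '.'.
    return f"ALTER TABLE {table_name} OWNER TO `{new_owner}`"


# Hypothetical table and principal, for illustration only.
statement = transfer_ownership_sql("sales", "lead.engineer@example.com")
print(statement)
```

In a notebook, an administrator could pass the resulting string to spark.sql(...) to apply the change.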



A data analyst has created a Delta table sales that is used by the entire data analysis team. They want help from the data engineering team to implement a series of tests to ensure the data is clean. However, the data engineering team uses Python for its tests rather than SQL.

Which of the following commands could the data engineering team use to access sales in PySpark?

  1. SELECT * FROM sales
  2. There is no way to share data between PySpark and SQL.
  3. spark.sql("sales")
  4. spark.delta.table("sales")
  5. spark.table("sales")

Answer(s): E (option 5, spark.table("sales"))

Explanation:

We create a table: create or replace table delta_su (id INT , nombre STRING)
We insert rows into the table and then retrieve the stored values with: spark.table("delta_su").show()
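As a sketch of the kind of Python test the data engineering team might write against sales, the helper below loads the table with spark.table and counts NULL values in one column. The function name and the column name used in the usage note are assumptions, not part of the question; spark is expected to be an active SparkSession.

```python
def count_nulls(spark, table_name: str, column: str) -> int:
    """Return how many rows of `table_name` have NULL in `column`."""
    # spark.table returns a DataFrame over the same table that SQL users query.
    df = spark.table(table_name)
    # DataFrame.filter accepts a SQL expression string as its condition.
    return df.filter(f"{column} IS NULL").count()
```

In a notebook this could be used as: assert count_nulls(spark, "sales", "id") == 0, where "id" is a hypothetical column.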



Which of the following commands will return the location of database customer360?

  1. DESCRIBE LOCATION customer360;
  2. DROP DATABASE customer360;
  3. DESCRIBE DATABASE customer360;
  4. ALTER DATABASE customer360 SET DBPROPERTIES ('location' = '/user');
  5. USE DATABASE customer360;

Answer(s): C (option 3, DESCRIBE DATABASE customer360;)
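DESCRIBE DATABASE returns a small result set of name/value rows, one of which holds the location. The sketch below reads that row from PySpark. It assumes the row is labelled "Location" and that the label and value are the first two fields; the exact layout can differ between runtime versions, so treat this as illustrative. The helper name is hypothetical.

```python
def database_location(spark, database: str):
    """Return the location reported by DESCRIBE DATABASE, or None if absent."""
    rows = spark.sql(f"DESCRIBE DATABASE {database}").collect()
    for row in rows:
        # Assumption: first field is the item label, second is its value.
        if row[0] == "Location":
            return row[1]
    return None
```

Rows are read by position rather than by column name because the column names of this result are not the same in every version.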



A data engineer wants to create a new table containing the names of customers that live in France. They have written the following command:


A senior data engineer mentions that it is organization policy to include a table property indicating that the new table includes personally identifiable information (PII).

Which of the following lines of code fills in the above blank to successfully complete the task?

  1. There is no way to indicate whether a table contains PII.
  2. "COMMENT PII"
  3. TBLPROPERTIES PII
  4. COMMENT "Contains PII"
  5. PII

Answer(s): D (option 4, COMMENT "Contains PII")

Explanation:

CREATE TABLE my_table (id INT COMMENT 'Unique Identification Number', name STRING COMMENT 'PII', age INT COMMENT 'PII')
TBLPROPERTIES ('contains_pii'=True)
COMMENT 'Contains PII';


Reference:

https://www.databricks.com/discover/pages/data-quality-management



Which of the following benefits is provided by the array functions from Spark SQL?

  1. An ability to work with data in a variety of types at once
  2. An ability to work with data within certain partitions and windows
  3. An ability to work with time-related data in specified intervals
  4. An ability to work with complex, nested data ingested from JSON files
  5. An ability to work with an array of tables for procedural automation

Answer(s): D (option 4, complex, nested data ingested from JSON files)

Explanation:

Spark SQL array functions, such as explode and array_contains, operate on array-typed columns, which is how nested data ingested from JSON files is typically represented.
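To make the explanation concrete, here is a minimal sketch that flattens an array column with explode and reports its length with size. The helper name and the columns order_id and items are hypothetical; spark is expected to be an active SparkSession and the table is assumed to have been loaded from JSON with items as an array.

```python
def explode_items(spark, table_name: str):
    """Return one row per element of the hypothetical `items` array column."""
    query = (
        "SELECT order_id, size(items) AS item_count, explode(items) AS item "
        f"FROM {table_name}"
    )
    # size() gives the array length; explode() emits one output row per element.
    return spark.sql(query)
```

Spark allows one generator such as explode per SELECT clause, which is why the sketch flattens a single array column.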





