Databricks Certified Data Engineer Quiz part 3

Authored by esra tepebasi

31 questions

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data engineer has realized that the data files associated with a Delta table are incredibly small. They want to compact the small files to form larger files to improve performance. Which keyword can be used to compact the small files?

OPTIMIZE

VACUUM

COMPACTION

REPARTITION
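
As a rough illustration of what compacting small files means, the sketch below greedily packs a list of small file sizes into larger files up to a target size. It is a toy model in plain Python, not Delta Lake's actual algorithm; the function name `compact`, the file sizes, and the 64 MB target are all assumptions made for the example.

```python
def compact(file_sizes_mb, target_mb):
    """Greedily group small files into larger ones of at most target_mb (toy model)."""
    compacted = []  # sizes of the rewritten, larger files
    current = 0     # size of the output file currently being filled
    for size in file_sizes_mb:
        # Start a new output file if adding this one would exceed the target.
        if current and current + size > target_mb:
            compacted.append(current)
            current = 0
        current += size
    if current:
        compacted.append(current)
    return compacted


# Hypothetical small-file sizes in MB, for illustration only.
small_files = [5, 10, 20, 8, 40, 30, 2]
large_files = compact(small_files, target_mb=64)
print(len(small_files), "files ->", len(large_files), "files:", large_files)
```

The total amount of data is unchanged; only the number of files a reader has to open goes down.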

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data engineer wants to create a data entity from a couple of tables. The data entity must be used by other data engineers in other sessions. It also must be saved to a physical location. Which of the following data entities should the data engineer create?

Table

Function

View

Temporary view

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data engineer runs a statement every day to copy the previous day’s sales into the table transactions. Each day’s sales are in their own file in the location "/transactions/raw". Today, the data engineer runs a COPY INTO command to complete this task (the command was shown as an image and is not reproduced here). After running the command today, the data engineer notices that the number of records in the table transactions has not changed. What explains why the statement might not have copied any new records into the table?

The format of the files to be copied was not included with the FORMAT_OPTIONS keyword.

The COPY INTO statement requires the table to be refreshed to view the copied rows.

The previous day’s file has already been copied into the table.

The PARQUET file format does not support COPY INTO.
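
COPY INTO is documented as an idempotent operation: source files that have already been loaded are skipped on later runs. The sketch below mimics that behaviour with a toy class in plain Python. It does not call Databricks; the class `ToyTable`, its `copy_into` method, and the file name are hypothetical names introduced only for illustration.

```python
class ToyTable:
    """Toy stand-in for a table that remembers which source files it has loaded."""

    def __init__(self):
        self.rows = []
        self.loaded_files = set()

    def copy_into(self, source_files):
        """Load rows only from files not seen before; return the number of rows added."""
        added = 0
        for name, rows in source_files.items():
            if name in self.loaded_files:
                continue  # already ingested on an earlier run, so skip it
            self.rows.extend(rows)
            self.loaded_files.add(name)
            added += len(rows)
        return added


# Hypothetical file name and contents, for illustration only.
raw = {"day1.parquet": [("a", 10), ("b", 20)]}
transactions = ToyTable()
first_run = transactions.copy_into(raw)
second_run = transactions.copy_into(raw)  # same file again: nothing new is added
print(first_run, second_run, len(transactions.rows))
```

Rerunning the load against an unchanged source directory leaves the row count as it was, which is the situation the question describes.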

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which command can be used to write data into a Delta table while avoiding the writing of duplicate records?

DROP

INSERT

MERGE

APPEND
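
Writing without duplicating records amounts to an upsert: rows whose key already exists are updated, and only rows with new keys are inserted. The sketch below shows that logic in plain Python on lists of dictionaries. It is a toy model under assumed data, not the Delta Lake implementation; the function name `merge` and the `id` and `amount` columns are hypothetical.

```python
def merge(target, updates, key):
    """Upsert: update rows whose key matches, insert the rest (toy model)."""
    by_key = {row[key]: dict(row) for row in target}
    for row in updates:
        # A matching key overwrites the existing row instead of adding a copy.
        by_key[row[key]] = dict(row)
    return list(by_key.values())


# Hypothetical rows, for illustration only.
target = [{"id": 1, "amount": 10}, {"id": 2, "amount": 20}]
updates = [{"id": 2, "amount": 25}, {"id": 3, "amount": 30}]
result = merge(target, updates, key="id")
print(result)
```

A plain append of the same `updates` would leave two rows with `id` 2; the keyed upsert keeps exactly one.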

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data analyst has created a Delta table sales that is used by the entire data analysis team. They want help from the data engineering team to implement a series of tests to ensure the data is clean. However, the data engineering team uses Python for its tests rather than SQL. Which command could the data engineering team use to access sales in PySpark?

SELECT * FROM sales

spark.table("sales")

spark.sql("sales")

spark.delta.table("sales")

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data engineer has created a new database using the following command: CREATE DATABASE IF NOT EXISTS customer360; In which location will the customer360 database be located?

dbfs:/user/hive/database/customer360

dbfs:/user/hive/warehouse

dbfs:/user/hive/customer360

dbfs:/user/hive/database

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data engineer is attempting to drop a Spark SQL table my_table and runs the following command: DROP TABLE IF EXISTS my_table; After running this command, the engineer notices that the data files and metadata files have been deleted from the file system. What is the reason behind the deletion of all these files?

The table was managed

The table's data was smaller than 10 GB

The table did not have a location

The table was external
