
Databricks Certified Data Engineer Quiz part 3
Authored by esra tepebasi
Computers
Professional Development
Used 5+ times

AI Actions
Add similar questions
Adjust reading levels
Convert to real-world scenario
Translate activity
More...
Content View
Student View
31 questions
Show all answers
1.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
A data engineer has realized that the data files associated with a Delta table are incredibly small. They want to compact the small files to form larger files to improve performance. Which keyword can be used to compact the small files?
OPTIMIZE
VACUUM
COMPACTION
REPARTITION
2.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
A data engineer wants to create a data entity from a couple of tables. The data entity must be used by other data engineers in other sessions. It also must be saved to a physical location. Which of the following data entities should the data engineer create?
Table
Function
View
Temporary view
3.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
A data engineer runs a statement every day to copy the previous day’s sales into the table transactions. Each day’s sales are in their own file in the location "/transactions/raw". Today, the data engineer runs the following command to complete this task: After running the command today, the data engineer notices that the number of records in table transactions has not changed. What explains why the statement might not have copied any new records into the table?
The format of the files to be copied were not included with the FORMAT_OPTIONS keyword.
The COPY INTO statement requires the table to be refreshed to view the copied rows.
The previous day’s file has already been copied into the table.
The PARQUET file format does not support COPY INTO.
4.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Which command can be used to write data into a Delta table while avoiding the writing of duplicate records?
DROP
INSERT
MERGE
APPEND
5.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
A data analyst has created a Delta table sales that is used by the entire data analysis team. They want help from the data engineering team to implement a series of tests to ensure the data is clean. However, the data engineering team uses Python for its tests rather than SQL. Which command could the data engineering team use to access sales in PySpark?
SELECT * FROM sales
spark.table("sales")
spark.sql("sales")
spark.delta.table("sales")
6.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
A data engineer has created a new database using the following command: CREATE DATABASE IF NOT EXISTS customer360; In which location will the customer360 database be located?
dbfs:/user/hive/database/customer360
dbfs:/user/hive/warehouse
dbfs:/user/hive/customer360
dbfs:/user/hive/database
7.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
A data engineer is attempting to drop a Spark SQL table my_table and runs the following command: DROP TABLE IF EXISTS my_table; After running this command, the engineer notices that the data files and metadata files have been deleted from the file system. What is the reason behind the deletion of all these files?
The table was managed
The table's data was smaller than 10 GB
The table did not have a location
The table was external
Access all questions and much more by creating a free account
Create resources
Host any resource
Get auto-graded reports

Continue with Google

Continue with Email

Continue with Classlink

Continue with Clever
or continue with

Microsoft
%20(1).png)
Apple
Others
Already have an account?