
SESION 9 - Examen de Práctica de Databricks
Authored by licibeth delacruz
English
Professional Development

AI Actions
Add similar questions
Adjust reading levels
Convert to real-world scenario
Translate activity
More...
Content View
Student View
28 questions
Show all answers
1.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Which of the following locations houses the controller and worker nodes of a cluster managed by Databricks?
Data plane
Control plane
Databricks file system
JDBC data source
Databricks web application
2.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Two junior data engineers are drafting separate parts of a single data pipeline notebook. They are working on separate Git branches to be able to pair program in the same notebook simultaneously. A senior data engineer with experience in Databricks suggests that there is a better alternative for this type of collaboration. Which of the following supports the senior data engineer's statement?
Databricks notebooks support automatic change tracking and versioning.
Databricks notebooks support real-time co-authoring in a single notebook.
Databricks notebooks support comments and notification comments.
Databricks notebooks support the use of multiple languages in the same notebook.
Databricks notebooks support the creation of interactive data visualizations.
3.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?
Databricks Repos can facilitate the pull request, review, and approval process before merging branches.
Databricks Repos can merge changes from a Git secondary branch into a Git main branch.
Databricks Repos can be used to design, develop, and activate Git automation pipelines.
Databricks Repos can store the Git repository as the single source of truth.
Databricks Repos can commit or push code changes to trigger a CI/CD process.
4.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
A data architect has determined that a table in the following format is necessary:
What of the following code blocks uses SQL DDL commands to create an empty Delta table in the above format, regardless of whether a table with this name already exists?
CREATE OR REPLACE TABLE table_name AS SELECT id STRING, birthDate DATE, avgRating FLOAT USING DELTA
CREATE OR REPLACE TABLE table_name (id STRING, birthDate DATE, avgRating FLOAT)
CREATE TABLE IF NOT EXISTS table_name (id STRING, birthDate DATE, avgRating FLOAT)
CREATE TABLE table_name AS SELECT id STRING, birthDate DATE, avgRating FLOAT
CREATE OR REPLACE TABLE table_name WITH COLUMNS (id STRING, birthDate DATE, avgRating FLOAT) USING DELTA
5.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
A data engineering team needs to query a Delta table to extract rows that all meet the same condition. However, the team has noticed that the query is running slowly. The team has already adjusted the size of the data files. After investigating, the team concluded that the rows meeting the condition are located sparsely throughout each of the data files. Based on the scenario, which of the following optimization techniques could speed up the query?
Data skipping
Z-Ordering
Bin packing
Write as a Parquet file
Tuning the file size
6.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
A data engineer wants to create a relational object by extracting data from two tables. The relational object should be used by other data engineers in other sessions. To save on storage costs, the data engineer wants to avoid copying and storing physical data. Which of the following relational objects should the data engineer create?
View
Temporary view
Delta table
Database
Spark SQL table
7.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
A data engineer has ingested a JSON file into a raw_table with the following schema:
cart_id STRING,
items ARRAY <item_id:STRING>
The junior data engineer would like to unnest the items column in raw_table to result in a new table with the following schema:
cart_id STRING, item_id STRING.
Which of the following commands should the junior data engineer run to complete this task?
SELECT cart_id, filter(items) AS item_id FROM raw_table;
SELECT cart_id, flatten(items) AS item_id FROM raw_table;
SELECT cart_id, reduce(items) AS item_id FROM raw_table;
SELECT cart_id, explode(items) AS item_id FROM raw_table;
SELECT cart_id, slice(items) AS item_id FROM raw_table;
Access all questions and much more by creating a free account
Create resources
Host any resource
Get auto-graded reports

Continue with Google

Continue with Email

Continue with Classlink

Continue with Clever
or continue with

Microsoft
%20(1).png)
Apple
Others
Already have an account?