SESSION 9 - Databricks Practice Exam

Professional Development

28 Qs

Assessment

Quiz

English

Hard

Created by

licibeth delacruz

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which of the following locations houses the controller and worker nodes of a cluster managed by Databricks?

Data plane

Control plane

Databricks file system

JDBC data source

Databricks web application

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Two junior data engineers are drafting separate parts of a single data pipeline notebook. They are working on separate Git branches to be able to pair program in the same notebook simultaneously. A senior data engineer with experience in Databricks suggests that there is a better alternative for this type of collaboration. Which of the following supports the senior data engineer's statement?

Databricks notebooks support automatic change tracking and versioning.

Databricks notebooks support real-time co-authoring in a single notebook.

Databricks notebooks support comments and notification comments.

Databricks notebooks support the use of multiple languages in the same notebook.

Databricks notebooks support the creation of interactive data visualizations.

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?

Databricks Repos can facilitate the pull request, review, and approval process before merging branches.

Databricks Repos can merge changes from a Git secondary branch into a Git main branch.

Databricks Repos can be used to design, develop, and activate Git automation pipelines.

Databricks Repos can store the Git repository as the single source of truth.

Databricks Repos can commit or push code changes to trigger a CI/CD process.

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data architect has determined that a table in the following format is necessary:

[Image: table format not preserved; the answer options use the columns id STRING, birthDate DATE, avgRating FLOAT]

Which of the following code blocks uses SQL DDL commands to create an empty Delta table in the above format, regardless of whether a table with this name already exists?

CREATE OR REPLACE TABLE table_name AS SELECT id STRING, birthDate DATE, avgRating FLOAT USING DELTA

CREATE OR REPLACE TABLE table_name (id STRING, birthDate DATE, avgRating FLOAT)

CREATE TABLE IF NOT EXISTS table_name (id STRING, birthDate DATE, avgRating FLOAT)

CREATE TABLE table_name AS SELECT id STRING, birthDate DATE, avgRating FLOAT

CREATE OR REPLACE TABLE table_name WITH COLUMNS (id STRING, birthDate DATE, avgRating FLOAT) USING DELTA

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data engineering team needs to query a Delta table to extract rows that all meet the same condition. However, the team has noticed that the query is running slowly. The team has already adjusted the size of the data files. After investigating, the team concluded that the rows meeting the condition are located sparsely throughout each of the data files. Based on the scenario, which of the following optimization techniques could speed up the query?

Data skipping

Z-Ordering

Bin packing

Write as a Parquet file

Tuning the file size
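
The scenario in this question can be sketched in plain Python. This is a toy model, not Delta Lake code: the file names and values are hypothetical, and it assumes a reader that can skip a data file only when the filter value falls outside that file's minimum and maximum for the column. Because the matching rows are spread across every file, no file can be skipped.

```python
# Hypothetical data files, each holding values of the filtered column.
# Rows matching the condition (value == 7) are scattered across every file.
files = {
    "part-0": [1, 7, 15, 22],
    "part-1": [3, 7, 18, 30],
    "part-2": [2, 9, 7, 25],
}


def files_to_scan(files, target):
    """Return the files whose [min, max] range could contain target."""
    return [
        name
        for name, values in files.items()
        if min(values) <= target <= max(values)
    ]


# Every file's range includes 7, so all three files must be read.
scanned = files_to_scan(files, 7)
print(scanned)
```

In this sketch, resizing the files would not change the outcome, since each file would still contain a mix of matching and non-matching values.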

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data engineer wants to create a relational object by extracting data from two tables. The relational object should be used by other data engineers in other sessions. To save on storage costs, the data engineer wants to avoid copying and storing physical data. Which of the following relational objects should the data engineer create?

View

Temporary view

Delta table

Database

Spark SQL table

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A junior data engineer has ingested a JSON file into a table raw_table with the following schema:

cart_id STRING, items ARRAY<item_id:STRING>

The junior data engineer would like to unnest the items column in raw_table to result in a new table with the following schema:

cart_id STRING, item_id STRING.

Which of the following commands should the junior data engineer run to complete this task?

SELECT cart_id, filter(items) AS item_id FROM raw_table;

SELECT cart_id, flatten(items) AS item_id FROM raw_table;

SELECT cart_id, reduce(items) AS item_id FROM raw_table;

SELECT cart_id, explode(items) AS item_id FROM raw_table;

SELECT cart_id, slice(items) AS item_id FROM raw_table;
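
The unnesting task described in this question can be sketched in plain Python. This is only an illustration of the desired row shape, not Spark SQL: the sample rows and the helper `unnest_items` are hypothetical. Each input row carries an array of items, and the target table has one row per array element.

```python
# Hypothetical sample rows shaped like raw_table: (cart_id, items),
# where items is a list of records with an item_id field.
raw_table = [
    ("c1", [{"item_id": "i1"}, {"item_id": "i2"}]),
    ("c2", [{"item_id": "i3"}]),
    ("c3", []),
]


def unnest_items(rows):
    """Yield one (cart_id, item_id) pair per element of the items array."""
    for cart_id, items in rows:
        for item in items:
            yield (cart_id, item["item_id"])


unnested = list(unnest_items(raw_table))
for row in unnested:
    print(row)
```

In this sketch, a cart with an empty array contributes no rows to the result.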
