SESION 9 - Examen de Práctica de Databricks

SESION 9 - Examen de Práctica de Databricks

Professional Development

28 Qs

quiz-placeholder

Similar activities

8 pm QUIZ 07/06

8 pm QUIZ 07/06

7th Grade - Professional Development

25 Qs

General English

General English

Professional Development

25 Qs

TOEIC Short Conversation 1

TOEIC Short Conversation 1

Professional Development

25 Qs

Possessive Adjectives

Possessive Adjectives

4th Grade - Professional Development

23 Qs

Review Level 1 final exam

Review Level 1 final exam

Professional Development

24 Qs

A32 test _Unit 1 Working Day

A32 test _Unit 1 Working Day

Professional Development

24 Qs

Who, Whom, Which and That

Who, Whom, Which and That

9th Grade - Professional Development

25 Qs

ASSESSMENT OF VITAL SIGNS FOR NURSING STUDENTS

ASSESSMENT OF VITAL SIGNS FOR NURSING STUDENTS

Professional Development

24 Qs

SESION 9 - Examen de Práctica de Databricks

SESION 9 - Examen de Práctica de Databricks

Assessment

Quiz

English

Professional Development

Practice Problem

Hard

Created by

licibeth delacruz

FREE Resource

AI

Enhance your content in a minute

Add similar questions
Adjust reading levels
Convert to real-world scenario
Translate activity
More...

28 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which of the following locations houses the controller and worker nodes of a cluster managed by Databricks?

Data plane

Control plane

Databricks file system

JDBC data source

Databricks web application

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Two junior data engineers are drafting separate parts of a single data pipeline notebook. They are working on separate Git branches to be able to pair program in the same notebook simultaneously. A senior data engineer with experience in Databricks suggests that there is a better alternative for this type of collaboration. Which of the following supports the senior data engineer's statement?

Databricks notebooks support automatic change tracking and versioning.

Databricks notebooks support real-time co-authoring in a single notebook.

Databricks notebooks support comments and notification comments.

Databricks notebooks support the use of multiple languages in the same notebook.

Databricks notebooks support the creation of interactive data visualizations.

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?

Databricks Repos can facilitate the pull request, review, and approval process before merging branches.

Databricks Repos can merge changes from a Git secondary branch into a Git main branch.

Databricks Repos can be used to design, develop, and activate Git automation pipelines.

Databricks Repos can store the Git repository as the single source of truth.

Databricks Repos can commit or push code changes to trigger a CI/CD process.

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Media Image

A data architect has determined that a table in the following format is necessary:

What of the following code blocks uses SQL DDL commands to create an empty Delta table in the above format, regardless of whether a table with this name already exists?

CREATE OR REPLACE TABLE table_name AS SELECT id STRING, birthDate DATE, avgRating FLOAT USING DELTA

CREATE OR REPLACE TABLE table_name (id STRING, birthDate DATE, avgRating FLOAT)

CREATE TABLE IF NOT EXISTS table_name (id STRING, birthDate DATE, avgRating FLOAT)

CREATE TABLE table_name AS SELECT id STRING, birthDate DATE, avgRating FLOAT

CREATE OR REPLACE TABLE table_name WITH COLUMNS (id STRING, birthDate DATE, avgRating FLOAT) USING DELTA

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data engineering team needs to query a Delta table to extract rows that all meet the same condition. However, the team has noticed that the query is running slowly. The team has already adjusted the size of the data files. After investigating, the team concluded that the rows meeting the condition are located sparsely throughout each of the data files. Based on the scenario, which of the following optimization techniques could speed up the query?

Data skipping

Z-Ordering

Bin packing

Write as a Parquet file

Tuning the file size

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data engineer wants to create a relational object by extracting data from two tables. The relational object should be used by other data engineers in other sessions. To save on storage costs, the data engineer wants to avoid copying and storing physical data. Which of the following relational objects should the data engineer create?

View

Temporary view

Delta table

Database

Spark SQL table

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A data engineer has ingested a JSON file into a raw_table with the following schema:

cart_id STRING,

items ARRAY <item_id:STRING>

The junior data engineer would like to unnest the items column in raw_table to result in a new table with the following schema:

cart_id STRING, item_id STRING.

Which of the following commands should the junior data engineer run to complete this task?

SELECT cart_id, filter(items) AS item_id FROM raw_table;

SELECT cart_id, flatten(items) AS item_id FROM raw_table;

SELECT cart_id, reduce(items) AS item_id FROM raw_table;

SELECT cart_id, explode(items) AS item_id FROM raw_table;

SELECT cart_id, slice(items) AS item_id FROM raw_table;

Create a free account and access millions of resources

Create resources

Host any resource

Get auto-graded reports

Google

Continue with Google

Email

Continue with Email

Classlink

Continue with Classlink

Clever

Continue with Clever

or continue with

Microsoft

Microsoft

Apple

Apple

Others

Others

Already have an account?