Pertemuan 6 - Azure Data Engineer

Pertemuan 6 - Azure Data Engineer

Professional Development

7 Qs

quiz-placeholder

Similar activities

Cloud QUIZ

Cloud QUIZ

University - Professional Development

12 Qs

Data Validation

Data Validation

KG - Professional Development

11 Qs

Cloud computing 2

Cloud computing 2

Professional Development

10 Qs

Post Test Pelatihan Kelas Digital

Post Test Pelatihan Kelas Digital

Professional Development

10 Qs

OWASP Top 10

OWASP Top 10

Professional Development

10 Qs

Computer Hardware and Software Quiz

Computer Hardware and Software Quiz

Professional Development

10 Qs

Tablero

Tablero

Professional Development

12 Qs

Python Test 1

Python Test 1

Professional Development

10 Qs

Pertemuan 6 - Azure Data Engineer

Pertemuan 6 - Azure Data Engineer

Assessment

Quiz

Computers

Professional Development

Practice Problem

Medium

Created by

Herlina Herlina

Used 1+ times

FREE Resource

AI

Enhance your content in a minute

Add similar questions
Adjust reading levels
Convert to real-world scenario
Translate activity
More...

7 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which definition best describes Apache Spark?

A highly scalable relational database management system

A virtual server with a Python runtime

A distributed platform for parallel data processing using multiple languages

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

You need to use Spark to analyze data in a parquet file. What should you do?

Load the parquet file into a dataframe

Import the data into a table in a serverless SQL pool

Convert the data to CSV format

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which method of the Dataframe object is used to save a dataframe as a file?

toFile()

write()

save()

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which method is used to split the data across folders when saving a dataframe?

splitBy()

distributeBy()

partitionBy()

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What happens if you drop an external table that is based on existing files?

An error – you must delete the files first

The table is dropped from the metastore but the files remain unaffected

The table is dropped from the metastore and the files are deleted

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

You want to write code in a notebook cell that uses a SQL query to retrieve data from a view in the Spark catalog. Which magic should you use?

%%spark

%%pyspark

%%sql

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

You've loaded a Spark dataframe with data, that you now want to use in a Delta Lake table. What format should you use to write the dataframe to storage?

CSV

PARQUET

DELTA