Session 6 - Azure Data Engineer

Professional Development

7 Qs

Assessment

Quiz

Computers

Professional Development

Medium

Created by

Herlina Herlina

7 questions

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which definition best describes Apache Spark?

A highly scalable relational database management system

A virtual server with a Python runtime

A distributed platform for parallel data processing using multiple languages

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

You need to use Spark to analyze data in a parquet file. What should you do?

Load the parquet file into a dataframe

Import the data into a table in a serverless SQL pool

Convert the data to CSV format
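For reference, a minimal sketch of how a Parquet file is typically read in PySpark. It assumes an existing `SparkSession` is passed in; the helper name `load_parquet` and any path given to it are hypothetical.

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Imported for type hints only; the Spark runtime supplies these at run time.
    from pyspark.sql import DataFrame, SparkSession


def load_parquet(spark: "SparkSession", path: str) -> "DataFrame":
    """Read a Parquet file (or folder of Parquet files) into a dataframe."""
    # Parquet stores its own schema, so no header or schema options are needed.
    return spark.read.parquet(path)
```

In a notebook this could be called as `df = load_parquet(spark, "<path to parquet data>")`, followed by `df.printSchema()` or `df.show(10)` to inspect the result.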

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which method of the Dataframe object is used to save a dataframe as a file?

toFile()

write()

save()

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which method is used to split the data across folders when saving a dataframe?

splitBy()

distributeBy()

partitionBy()
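The two questions above concern saving a dataframe. A minimal sketch of a partitioned save in PySpark follows; the helper name `save_partitioned`, the output path, and the partition column are hypothetical. Note that in PySpark `write` is accessed as a property that returns a `DataFrameWriter`, not called as a function.

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Imported for type hints only; the Spark runtime supplies this at run time.
    from pyspark.sql import DataFrame


def save_partitioned(df: "DataFrame", path: str, partition_col: str) -> None:
    """Save a dataframe as Parquet, one subfolder per value of partition_col."""
    (
        df.write                        # DataFrameWriter for this dataframe
        .mode("overwrite")              # replace any existing output at path
        .partitionBy(partition_col)     # creates folders such as <col>=<value>
        .parquet(path)
    )
```

For example, `save_partitioned(df, "<output folder>", "Year")` would be expected to produce folders named `Year=<value>` under the output path.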

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What happens if you drop an external table that is based on existing files?

An error – you must delete the files first

The table is dropped from the metastore but the files remain unaffected

The table is dropped from the metastore and the files are deleted
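As an illustration of the scenario in this question, here is a minimal sketch of creating and dropping an external table in PySpark. The helper names, table name, and path are hypothetical; it assumes the Spark session is backed by a metastore.

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Imported for type hints only; the Spark runtime supplies these at run time.
    from pyspark.sql import DataFrame, SparkSession


def create_external_table(df: "DataFrame", table_name: str, path: str) -> None:
    """Register a table whose data files live at an explicit path."""
    # Supplying a "path" option makes the table external (unmanaged).
    df.write.option("path", path).saveAsTable(table_name)


def drop_table(spark: "SparkSession", table_name: str) -> None:
    """Drop the table definition from the metastore."""
    # For an external table this removes only the metastore entry;
    # the data files at the table's path are left in place.
    spark.sql(f"DROP TABLE IF EXISTS {table_name}")
```

After `drop_table(spark, "<table name>")`, the files can still be read directly from their path, for example with `spark.read.parquet(...)` if they were written as Parquet.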

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

You want to write code in a notebook cell that uses a SQL query to retrieve data from a view in the Spark catalog. Which magic should you use?

%%spark

%%pyspark

%%sql
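Outside of a SQL cell, a comparable query can be issued from Python code through `spark.sql`. The sketch below assumes an existing `SparkSession` and dataframe; the helper name `query_view` and the view name are hypothetical.

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Imported for type hints only; the Spark runtime supplies these at run time.
    from pyspark.sql import DataFrame, SparkSession


def query_view(spark: "SparkSession", df: "DataFrame", view_name: str) -> "DataFrame":
    """Expose df as a temporary view, then query it with SQL."""
    # The temporary view is registered in the session's catalog.
    df.createOrReplaceTempView(view_name)
    # spark.sql returns the query result as a new dataframe.
    return spark.sql(f"SELECT * FROM {view_name}")
```

A call such as `query_view(spark, df, "products_view").show()` would display the rows returned by the query.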

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

You've loaded a Spark dataframe with data that you now want to use in a Delta Lake table. What format should you use to write the dataframe to storage?

CSV

PARQUET

DELTA
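For reference, a minimal sketch of writing a dataframe in Delta format with PySpark. The helper name `save_as_delta` and the path are hypothetical, and the sketch assumes Delta Lake is available in the Spark runtime (as it is in Azure Synapse Analytics Spark pools and Azure Databricks).

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Imported for type hints only; the Spark runtime supplies this at run time.
    from pyspark.sql import DataFrame


def save_as_delta(df: "DataFrame", path: str) -> None:
    """Write a dataframe to storage in Delta format."""
    (
        df.write
        .format("delta")      # Parquet data files plus a _delta_log transaction log
        .mode("overwrite")    # replace any existing table data at path
        .save(path)
    )
```

The saved data can later be loaded with `spark.read.format("delta").load("<same path>")`.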