PySpark Quiz Round

PySpark Quiz Round

Professional Development

11 Qs

quiz-placeholder

Similar activities

Basic Pandas (wrapup)

Basic Pandas (wrapup)

Professional Development

10 Qs

Data Analysis Class Quiz 1

Data Analysis Class Quiz 1

Professional Development

10 Qs

Office Administration Skill

Office Administration Skill

Professional Development

10 Qs

MsSQL Server - Quiz-3

MsSQL Server - Quiz-3

Professional Development

10 Qs

Grand Chase Dream - Evento Independência do Brasil

Grand Chase Dream - Evento Independência do Brasil

Professional Development

7 Qs

CSM EDAPI Quiz - Room 1

CSM EDAPI Quiz - Room 1

Professional Development

10 Qs

Live Events

Live Events

Professional Development

15 Qs

Desafio Professores 3

Desafio Professores 3

Professional Development

10 Qs

PySpark Quiz Round

PySpark Quiz Round

Assessment

Quiz

Other

Professional Development

Hard

Created by

Ankita Chatterjee

Used 1+ times

FREE Resource

11 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which of the following is a transformation operation in PySpark?

count()

filter()

reduce()

collect()

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which of the following is true for RDD?

RDD is programming paradigm

RDD in Apache Spark is an immutable collection of objects

It is a database

None of the above

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

words_list = sc.parallelize ( ["pyspark", "quiz", "questions", "at", "quiz.com"] )

filtered_words = words_list.filter(lambda x: 'quiz' in x)

matched_words= filtered_words.collect()

print(matched_words)

[ "quiz", "quiz.com" ]

[ "quiz" ]

["quiz.com" ]

Error

4.

MULTIPLE CHOICE QUESTION

30 sec • 2 pts

Let us consider, we have a data frame "df". Then what does the expression '[.]{2,}' signify for the following transformation?

df = df.withColumn('var_addrss', sf.regexp_replace('var_addrss', '[.]{2,}', ''))

A single dot (".") followed by 2 integers

A single dot (".") followed by the integer '2'

Single dot (".") appearing twice consecutively

None of these

5.

MULTIPLE CHOICE QUESTION

30 sec • 2 pts

Let us consider, we have a data frame "df". Then what does the expression '^[0]*' signify for the following transformation?

df = df.withColumn('var_addrss', sf.regexp_replace('var_addrss', '^[0]*', ''))

The value starts with 0 OR followed by a sequence of 0s

The value starts with 0 and ends with 0

The value starts with 0 and followed by a sequence of 0s

The value starts with anything other than 0

6.

MULTIPLE SELECT QUESTION

45 sec • 1 pt

Media Image

Let's assume we have the following data frame "df".

How to display the 'age' column in descending order?

display(df.orderBy(df.age.desc()))

display(df.sort(df.age.desc()))

display(df.orderBy(df.age, sort = desc()))

None of these

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What will the data type of the columns for the following PySpark data frame "df"?

df = spark.read.format("csv").option("header", "true").option("inferSchema", "false").option("delimeter", ",").load("/mnt/temp/test.csv")

Data types of columns will be int

Data types of columns will be read as per the data types defined in the file

Data types of all columns will be string

None of the above

Create a free account and access millions of resources

Create resources
Host any resource
Get auto-graded reports
or continue with
Microsoft
Apple
Others
By signing up, you agree to our Terms of Service & Privacy Policy
Already have an account?