M14 : Processing - EMR

M14 : Processing - EMR

Professional Development

10 Qs

quiz-placeholder

Similar activities

AWS Basic Quiz

AWS Basic Quiz

Professional Development

10 Qs

Hardisk and Filesystem

Hardisk and Filesystem

Professional Development

11 Qs

Aula dia 1

Aula dia 1

Professional Development

10 Qs

Spitfire-2020

Spitfire-2020

Professional Development

15 Qs

Class 2 (Introduction to Cloud Computing; IAM & S3) Sat 22, 2023

Class 2 (Introduction to Cloud Computing; IAM & S3) Sat 22, 2023

Professional Development

6 Qs

FinTech 13-1 AWS

FinTech 13-1 AWS

Professional Development

9 Qs

aws-teste

aws-teste

Professional Development

11 Qs

vSAN Penalty Shoot Out

vSAN Penalty Shoot Out

Professional Development

10 Qs

M14 : Processing - EMR

M14 : Processing - EMR

Assessment

Quiz

Other

Professional Development

Medium

Created by

Carina Martin

Used 2+ times

FREE Resource

10 questions

Show all answers

1.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Which are components of Elastic MapReduce ? (Choose all that apply)

Master nodes

Core nodes

Leader nodes

Task nodes

S3 or HDFS

2.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

When would you use Elastic MapReduce (EMR)? (Choose all that apply)

for analysis of structured data

for on-demand EC2 billing

for analysis of unstructured data

for serverless querying

3.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Which are true statements regarding Elastic MapReduce (EMR)? (Choose all that apply)

EMR clusters can't be used in conjunction with Auto Scaling groups.

It uses S3 to store data for its cluster.

It is a customer-managed, EC2 cluster-based product.

It is an AWS-managed, EC2 cluster-based product.

It is an AWS product that allows the analysis of large sets of structured and unstructured data.

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

You have advertising campaign information stored in a DynamoDB table. You need to write queries that join clickstream data to identify the most effective categories of ads that are displayed on websites. Which tool should you use?

Quicksight

Kinesis data streams

EMR

Data Pipeline

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

You need to store and process data quickly in a cost-effective manner. You can move data easily from its location on disk to wherever you'd like without needing to stream the data. Also, you do not know how much data you will be handling in 6 months, and your processing needs spike intermittently. Specifically, you need to transform the data that comes in by aggregating the different disparate metrics into summary information. Which Big Data tools should you use?

DynamoDB and Redshift

Kinesis Data Streams and DynamoDB

S3 and Spark on EMR

S3 and Amazon Machine Learning

6.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Your EMR cluster uses 12 m4.large instances and runs 24 hours per day, but it is only used for processing and reporting during business hours. Which options can you use to reduce the costs? ​

(Choose two answers)

Run 12 d2.8xlarge instead without turn-off.

Use Spot instances for task nodes when needed.

Use the ReduceMapper distribution of EMR.

Migrate the data from HDFS to S3 using S3DistCp and turn off the cluster when not in use.

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is AWS Glue ?

Fully managed extract, transform and load service.

Petabyte scale cloud data warehouse

Real time data streaming service

None of the above

Create a free account and access millions of resources

Create resources
Host any resource
Get auto-graded reports
or continue with
Microsoft
Apple
Others
By signing up, you agree to our Terms of Service & Privacy Policy
Already have an account?