
M14 : Processing - EMR

Quiz
•
Other
•
Professional Development
•
Medium

Carina Martin
Used 2+ times
FREE Resource
10 questions
Show all answers
1.
MULTIPLE SELECT QUESTION
30 sec • 1 pt
Which are components of Elastic MapReduce ? (Choose all that apply)
Master nodes
Core nodes
Leader nodes
Task nodes
S3 or HDFS
2.
MULTIPLE SELECT QUESTION
30 sec • 1 pt
When would you use Elastic MapReduce (EMR)? (Choose all that apply)
for analysis of structured data
for on-demand EC2 billing
for analysis of unstructured data
for serverless querying
3.
MULTIPLE SELECT QUESTION
30 sec • 1 pt
Which are true statements regarding Elastic MapReduce (EMR)? (Choose all that apply)
EMR clusters can't be used in conjunction with Auto Scaling groups.
It uses S3 to store data for its cluster.
It is a customer-managed, EC2 cluster-based product.
It is an AWS-managed, EC2 cluster-based product.
It is an AWS product that allows the analysis of large sets of structured and unstructured data.
4.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
You have advertising campaign information stored in a DynamoDB table. You need to write queries that join clickstream data to identify the most effective categories of ads that are displayed on websites. Which tool should you use?
Quicksight
Kinesis data streams
EMR
Data Pipeline
5.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
You need to store and process data quickly in a cost-effective manner. You can move data easily from its location on disk to wherever you'd like without needing to stream the data. Also, you do not know how much data you will be handling in 6 months, and your processing needs spike intermittently. Specifically, you need to transform the data that comes in by aggregating the different disparate metrics into summary information. Which Big Data tools should you use?
DynamoDB and Redshift
Kinesis Data Streams and DynamoDB
S3 and Spark on EMR
S3 and Amazon Machine Learning
6.
MULTIPLE SELECT QUESTION
30 sec • 1 pt
Your EMR cluster uses 12 m4.large instances and runs 24 hours per day, but it is only used for processing and reporting during business hours. Which options can you use to reduce the costs?
(Choose two answers)
Run 12 d2.8xlarge instead without turn-off.
Use Spot instances for task nodes when needed.
Use the ReduceMapper distribution of EMR.
Migrate the data from HDFS to S3 using S3DistCp and turn off the cluster when not in use.
7.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
What is AWS Glue ?
Fully managed extract, transform and load service.
Petabyte scale cloud data warehouse
Real time data streaming service
None of the above
Create a free account and access millions of resources
Similar Resources on Wayground
15 questions
Cloud Guardians - Network Security

Quiz
•
1st Grade - Professio...
7 questions
cloudformation

Quiz
•
Professional Development
10 questions
Quiz 2 - EHS Taxo-Trivia ROUND 2

Quiz
•
Professional Development
15 questions
AWS Quiz Show 2023 (Week 4)

Quiz
•
Professional Development
15 questions
AWS Quiz Show 2023 (Week 2)

Quiz
•
Professional Development
15 questions
AWS Quiz Show 2023 (Week 3)

Quiz
•
Professional Development
7 questions
AWS Architect Test 3 - parte 2

Quiz
•
Professional Development
6 questions
Class 2 (Introduction to Cloud Computing; IAM & S3) Sat 22, 2023

Quiz
•
Professional Development
Popular Resources on Wayground
50 questions
Trivia 7/25

Quiz
•
12th Grade
11 questions
Standard Response Protocol

Quiz
•
6th - 8th Grade
11 questions
Negative Exponents

Quiz
•
7th - 8th Grade
12 questions
Exponent Expressions

Quiz
•
6th Grade
4 questions
Exit Ticket 7/29

Quiz
•
8th Grade
20 questions
Subject-Verb Agreement

Quiz
•
9th Grade
20 questions
One Step Equations All Operations

Quiz
•
6th - 7th Grade
18 questions
"A Quilt of a Country"

Quiz
•
9th Grade