AWS Certified Data Analytics Specialty 2021 - Hands-On! - What is Glue? + Partitioning your Data Lake

AWS Certified Data Analytics Specialty 2021 - Hands-On! - What is Glue? + Partitioning your Data Lake

Assessment

Interactive Video

Information Technology (IT), Architecture

University

Hard

Created by

Quizizz Content

FREE Resource

The video tutorial covers AWS Glue, a serverless ETL service that plays a crucial role in AWS exams. It explains Glue's ability to automatically handle table definitions and schema discovery, serving as a central metadata repository. The tutorial highlights Glue's use of Apache Spark for ETL jobs and its integration with tools like Athena and Redshift. It also discusses the Glue Crawler and Data Catalog, which infer schemas from unstructured data in S3. Finally, it provides strategies for partitioning data in S3 to optimize query performance.

Read more

1 questions

Show all answers

1.

OPEN ENDED QUESTION

3 mins • 1 pt

What new insight or understanding did you gain from this video?

Evaluate responses using AI:

OFF