Data Fundamentals Post-Test

Data Fundamentals Post-Test

Professional Development

7 Qs

quiz-placeholder

Similar activities

WORKSHOP: HEALTHCARE DATA LAKEHOUSE IN ACTION

WORKSHOP: HEALTHCARE DATA LAKEHOUSE IN ACTION

Professional Development

10 Qs

DE INTRODUCTION

DE INTRODUCTION

Professional Development

10 Qs

Fabric

Fabric

Professional Development

9 Qs

Introduction to Data Science Quiz

Introduction to Data Science Quiz

Professional Development

10 Qs

Implikasi UU PDP terhadap penerapan cybersecurity di Bank

Implikasi UU PDP terhadap penerapan cybersecurity di Bank

Professional Development

10 Qs

GitLab Architecture

GitLab Architecture

Professional Development

6 Qs

GD Catalog การใช้ประโยชน์ระบบบัญชีข้อมูลภาครัฐ

GD Catalog การใช้ประโยชน์ระบบบัญชีข้อมูลภาครัฐ

Professional Development

11 Qs

Knowledge Check 2

Knowledge Check 2

Professional Development

10 Qs

Data Fundamentals Post-Test

Data Fundamentals Post-Test

Assessment

Quiz

Information Technology (IT)

Professional Development

Hard

Created by

Irsyad Firsandi Wahyudi

FREE Resource

7 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

Which of the following is NOT a characteristic of unstructured data?

It often requires advanced AI/ML techniques for processing.

It is typically organized into tables with rows and columns.

Examples include images, audio files, and plain text documents.

It accounts for the majority of data generated in the world.

2.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

You are designing a data pipeline where the schema of your incoming data (e.g., user profiles) might change frequently, with new optional fields being added over time, but you need to ensure backward compatibility for existing consumers. Which data storage format would be most suitable for this scenario?

CSV

JSON

Parquet

Avro

3.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

A data engineer is building a system to generate daily sales reports by summing up individual transaction amounts from the previous day. Which data ingestion and processing approach is most appropriate for this task?

Real-time streaming and data enrichment.

Batch processing and data aggregation.

Continuous ingestion and data cleaning.

Manual data entry and normalization.

4.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

Why is viewing data as a valuable 'organizational asset' important for a junior data engineer?

It helps them choose the cheapest data storage solutions.

It enables them to avoid working with complex data types.

It provides context for their technical work, highlighting how data quality and availability impact business decisions and competitive advantage.

It ensures they only work with structured data.

5.

OPEN ENDED QUESTION

10 mins • 3 pts

Describe a real-world scenario where a company would likely combine all three data types (structured, semi-structured, and unstructured data). Explain briefly how each type might be used in that scenario.

  • Scenario Example (E-commerce Company)

Evaluate responses using AI:

OFF

6.

OPEN ENDED QUESTION

10 mins • 3 pts

Compare and contrast Parquet and Avro data storage formats. Discuss at least two key differences in their design philosophies or primary use cases.

Evaluate responses using AI:

OFF

7.

OPEN ENDED QUESTION

1 min • 3 pts

You are tasked with building a data pipeline for a company. Data arrives from an external API as JSON files, needs to be combined with customer data from a relational database, cleaned, and then loaded into a data warehouse for analytical reporting. Identify at least three distinct transformation steps you would consider applying in this pipeline.

Evaluate responses using AI:

OFF