Search Header Logo

10 CT Analysing Data Wk12 PostQuiz

Authored by Alex Song

Computers

9th Grade

10 CT Analysing Data Wk12 PostQuiz
AI

AI Actions

Add similar questions

Adjust reading levels

Convert to real-world scenario

Translate activity

More...

    Content View

    Student View

11 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which open-source framework lets you create “expectations” that automatically test whether each batch of data meets defined quality rules before it is used for analysis?

dbt Core

Apache NiFi

Great Expectations

Apache Superset

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

During an ETL workflow, at which stage is schema validation most effective for catching type mismatches and missing mandatory fields?

A. Extraction

B. Transformation

C. Loading

D. Visualization

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Amazon’s Deequ library is designed primarily to:

Generate business-intelligence dashboards

Orchestrate distributed machine-learning training jobs

Define unit tests for data quality on Apache Spark datasets

Replace relational database indexing

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which validation test would best detect the sudden appearance of an unexpected new category in a “payment_type” column?

Null-value check

Row-count check

Domain (allowed-values) check

Duplicate-row check

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Talend Data Quality, Informatica, and Great Expectations all include features that allow analysts to:

Clean and validate data

Visualize data trends

Build machine learning models

Store large volumes of data

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which of the following is an example of profiling datasets and quantifying data-quality metrics?

Build GPU-accelerated neural networks

Profile datasets and quantify data-quality metrics (e.g., completeness, uniqueness, validity)

Encrypt column-level data at rest

Perform A/B testing on web applications

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

A pipeline ingests 15,000 IoT sensor readings every second and triggers near-real-time alerts. Which of the 3 V's is the primary challenge to evaluate in this scenario?

Volume

Variety

Velocity

Veracity

Access all questions and much more by creating a free account

Create resources

Host any resource

Get auto-graded reports

Google

Continue with Google

Email

Continue with Email

Classlink

Continue with Classlink

Clever

Continue with Clever

or continue with

Microsoft

Microsoft

Apple

Apple

Others

Others

Already have an account?