Data Cleansing and Missing Data Quiz

Data Cleansing and Missing Data Quiz

12th Grade

15 Qs

quiz-placeholder

Similar activities

Season 3 #Spaic Machine learning Weekly Quiz

Season 3 #Spaic Machine learning Weekly Quiz

KG - Professional Development

20 Qs

Data Engineering y BigQuery V1

Data Engineering y BigQuery V1

12th Grade

10 Qs

CN IT - LO3 Collecting, storing & using data

CN IT - LO3 Collecting, storing & using data

8th - 12th Grade

20 Qs

Python Data Science - Naive Bayes

Python Data Science - Naive Bayes

9th - 12th Grade

18 Qs

Vertex AI Pipelines V1

Vertex AI Pipelines V1

12th Grade

10 Qs

Machine Learning

Machine Learning

8th - 12th Grade

10 Qs

Week 2: AI and Big Data Quiz

Week 2: AI and Big Data Quiz

12th Grade

16 Qs

SQL: DML DDL DCL

SQL: DML DDL DCL

11th - 12th Grade

13 Qs

Data Cleansing and Missing Data Quiz

Data Cleansing and Missing Data Quiz

Assessment

Quiz

Computers

12th Grade

Easy

Created by

Chris Keeble

Used 2+ times

FREE Resource

15 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the main goal of data cleansing?

To delete irrelevant data

To ensure data is accurate, consistent, and reliable

To create duplicates of the data

To remove outliers from the dataset

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which of the following best describes Missing Completely at Random (MCAR)?

Data missing due to external reasons related to the data itself

Data missing in a pattern linked to observed data

Data missing without any identifiable pattern or reason

Data missing due to systematic errors in collection

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Outliers are:

Data points that follow the same trend as the rest

Data points that are duplicated in the dataset

Data points that significantly differ from other observations

Data points that are missing completely at random

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is a common issue when different data formats are used in a dataset?

Outliers become more frequent

The dataset becomes smaller

Analysis can be inaccurate or fail entirely

Data will automatically standardise

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which type of missing data is likely to occur due to a survey question that respondents prefer not to answer?

Missing Completely at Random (MCAR)

Missing at Random (MAR)

Missing Not at Random (MNAR)

Systematic Missing Data

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Why are duplicates a significant problem in datasets?

They provide too much data

They skew analysis and results

They increase the overall accuracy

They improve data quality

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Inconsistent data types (e.g., mixing text and numbers in a field) can cause issues because:

It makes the dataset look messy

It causes errors in calculations and analysis

It results in more missing data

It automatically corrects the data format

Create a free account and access millions of resources

Create resources
Host any resource
Get auto-graded reports
or continue with
Microsoft
Apple
Others
By signing up, you agree to our Terms of Service & Privacy Policy
Already have an account?