Search Header Logo

Recap Quiz

Authored by Timilehin Aderinola

Information Technology (IT)

University

Used 1+ times

Recap Quiz
AI

AI Actions

Add similar questions

Adjust reading levels

Convert to real-world scenario

Translate activity

More...

    Content View

    Student View

10 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

In the CRISP-DM methodology, what is the primary purpose of the Data Understanding phase?

To clean and transform raw data into a usable format

To explore and gather initial insights about the data through tables and visualizations

To build and evaluate machine learning models

To deploy the final model into production

2.

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

During data cleaning, why might you choose to clamp outliers rather than remove those rows entirely?

Clamping is required by most machine learning algorithms.

Clamping is faster computationally than deleting rows.

Clamping preserves all data points and reduces the influence of extremes without losing information about those cases.

Removing rows corrupts the original dataset permanently.

3.

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

When preparing data for a machine learning model, why is stratified sampling important when splitting your data into training and test sets?

It ensures both the training and test sets have similar class distributions for the target variable.

It guarantees equal numbers of every class in each set.

It automatically normalizes the numeric features.

It prevents the model from overfitting to the majority class.

4.

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

A basketball player's weight is standardized using a z-score transformation. If a player's weight is exactly equal to the mean weight of the dataset, what will their standardized value be?

-1

0

1

0.5

5.

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

A linear regression model evaluates its predictions and achieves a Mean Absolute Error (MAE) of 25 and a Root Mean Squared Error (RMSE) of 45. Why is the RMSE noticeably larger than the MAE?

MAE cannot exceed RMSE by definition.

RMSE uses entirely different units of measurement than MAE.

There was likely a calculation error during evaluation.

RMSE penalizes large errors more heavily due to the squaring of the residuals.

6.

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

What is the primary output of a logistic regression model before any decision threshold is applied?

A probability score for class membership

A definitive class label (e.g., Class 0 or Class 1)

The distance to the nearest cluster centroid

A continuous numeric prediction with no upper bound

7.

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

In an imbalanced classification setting (e.g., a dataset with 95% negative cases and 5% positive cases), why can evaluating the model solely on "Accuracy" be misleading?

Accuracy cannot be computed for binary classification tasks.

Accuracy ignores class distribution and a model can appear highly accurate simply by predicting the majority class every time.

Accuracy requires probabilities rather than discrete labels.

Accuracy is only a valid metric when using cross-validation.

Access all questions and much more by creating a free account

Create resources

Host any resource

Get auto-graded reports

Google

Continue with Google

Email

Continue with Email

Classlink

Continue with Classlink

Clever

Continue with Clever

or continue with

Microsoft

Microsoft

Apple

Apple

Others

Others

Already have an account?