ML Checkpoint 2

ML Checkpoint 2

Professional Development

10 Qs

quiz-placeholder

Similar activities

DevOps

DevOps

Professional Development

14 Qs

CIT JULY 2022

CIT JULY 2022

Professional Development

10 Qs

Quiz  VMware Horizon DAV Visão Geral Técnica do Produto

Quiz VMware Horizon DAV Visão Geral Técnica do Produto

Professional Development

10 Qs

CSS flexbox quiz

CSS flexbox quiz

Professional Development

10 Qs

CloudBinary_Apache_Tomcat_Quiz- 2

CloudBinary_Apache_Tomcat_Quiz- 2

Professional Development

15 Qs

Cloud Computing

Cloud Computing

KG - Professional Development

11 Qs

Module 3 - Dark web

Module 3 - Dark web

Professional Development

7 Qs

WORKING IN IT

WORKING IN IT

Professional Development

10 Qs

ML Checkpoint 2

ML Checkpoint 2

Assessment

Quiz

Computers

Professional Development

Hard

Created by

Miguel Saavedra

Used 29+ times

FREE Resource

10 questions

Show all answers

1.

MULTIPLE SELECT QUESTION

3 mins • 1 pt

A data scientist wants to visualize the correlation between features in their dataset. What tool(s) can they use to visualize a heatmap for a correlation matrix?

Pandas

Numpy

Scikitlearn

Seaborn

2.

MULTIPLE CHOICE QUESTION

3 mins • 1 pt

You are preprocessing a dataset that includes categorical features. You want to determine which categories of particular features are most common in your dataset. Which basic descriptive statistic could you use?

Mode

Median

Standard Deviation

Mean

3.

MULTIPLE CHOICE QUESTION

3 mins • 1 pt

You've plotted the correlation matrix of your dataset's features and realized that two of the features present a high negative correlation (-0.95). What should you do?

Do nothing because correlated features won't affect the model performance

Do nothing as the correlation is not positive

Remove both features

Remove one of the features

4.

MULTIPLE SELECT QUESTION

3 mins • 1 pt

You are in charge of preprocessing the data your publishing company wants to use for a new ML model they’re building, which aims to predict the influence an academic journal will have in its field. The preprocessing step is necessary to prepare the data for model training.


What type of issue with the data might you encounter during this preprocessing phase. (Select TWO.)

Outliers

Overfit data

Missing values

Residuals

Insufficient batch size

5.

MULTIPLE SELECT QUESTION

3 mins • 1 pt

A Machine Learning Engineer is creating and preparing data for a linear regression model. However, while preparing the data, the Engineer notices that about 20% of the numerical data contains missing values in the same two columns. The shape of the data is 500 rows by 4 columns, including the target column.


How can the Engineer handle the missing values in the data? (Select TWO.)

Remove the rows containing the missing values

Remove the columns containing the missing values

Fill the missing values with mean of the column

Impute the missing values using regression

Add regularization to the model

6.

MULTIPLE CHOICE QUESTION

3 mins • 1 pt

A Data Scientist created a correlation matrix between nine variables and the target variable. The correlation coefficient between two of the numerical variables, variable 1 and variable 5, is -0.95. How should the Data Scientist interpret the correlation coefficient?

As variable 1 increases, variable 5 increases

As variable 1 increases, variable 5 decreases

Variable 1 does not have any influence on variable 5

The data is not sufficient to make a well-informed interpretation

7.

MULTIPLE CHOICE QUESTION

3 mins • 1 pt

An advertising and analytics company uses machine learning to predict user response to online advertisements using a custom XGBoost model. The company wants to improve its ML pipeline by porting its training and inference code, to Amazon SageMaker, and do so with minimal changes to the existing code.


How should the company set up this new pipeline?

Use the Amazon pre-built R container option and port the existing code over to the container. Register the container in Amazon Elastic Container Registry (Amazon ECR). Finally, run the training and inference jobs using this container.

Use Amazon in-built algorithms to run their training and inference jobs.

Use the Build Your Own Container (BYOC) Amazon SageMaker option. Create a new Docker container with the existing code. Register the container in Amazon Elastic Container Registry (ECR). Finally, run the training and inference jobs using this container.

Create a new Amazon SageMaker notebook instance. Copy the existing code into an Amazon SageMaker notebook. Then, run the pipeline from this notebook.

Create a free account and access millions of resources

Create resources
Host any resource
Get auto-graded reports
or continue with
Microsoft
Apple
Others
By signing up, you agree to our Terms of Service & Privacy Policy
Already have an account?