ML Chapter 06

ML Chapter 06

University

15 Qs

quiz-placeholder

Similar activities

NLP-Transformers Last  Quiz

NLP-Transformers Last Quiz

University

10 Qs

DL LMS QUIZ-1

DL LMS QUIZ-1

University

20 Qs

Module 1 Neural Networks

Module 1 Neural Networks

University

20 Qs

Machine Learning Basics

Machine Learning Basics

University

20 Qs

AI Bootcamp quiz 2

AI Bootcamp quiz 2

University

20 Qs

Pretest Deep Learning

Pretest Deep Learning

University

20 Qs

TOOLS ADOBE PHOTOSHOP

TOOLS ADOBE PHOTOSHOP

University

15 Qs

DataQuest_Quiz

DataQuest_Quiz

University

15 Qs

ML Chapter 06

ML Chapter 06

Assessment

Quiz

Computers

University

Medium

Created by

Jhonston Benjumea

Used 1+ times

FREE Resource

15 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What does SGD stand for in neural network training?
Soft Gradient Descent
Stochastic Gradient Descent
Strong Graph Derivative
Semi-Gain Depth

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the main idea behind Stochastic Gradient Descent (SGD)?
Using the full dataset for every update
Adding randomness to initialization
Updating weights using small random batches
Freezing weights during training

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What problem does the Momentum method solve in SGD?
Overfitting
Vanishing gradient
Oscillations in gradient updates
Data imbalance

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

How does AdaGrad adjust the learning rate?
Keeps it constant
Increases it exponentially
Adapts it for each parameter based on past gradients
Resets it every epoch

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the main feature of the Adam optimizer?
Ignores momentum
Uses only recent gradients
Combines Momentum and AdaGrad
Requires no tuning

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Why is initializing weights with a standard deviation of 0.01 sometimes problematic?
It slows down learning
It may cause vanishing gradients
It improves generalization
It speeds up convergence too much

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the Xavier initialization designed for?
ReLU activations
Linear regression
Layers with sigmoid/tanh activations
Binary classification

Create a free account and access millions of resources

Create resources
Host any resource
Get auto-graded reports
or continue with
Microsoft
Apple
Others
By signing up, you agree to our Terms of Service & Privacy Policy
Already have an account?