ML Chapter 06

ML Chapter 06

University

15 Qs

quiz-placeholder

Similar activities

iGCSE ICT: Software

iGCSE ICT: Software

KG - University

16 Qs

Chapter 8

Chapter 8

University

10 Qs

DataFeud Round 1

DataFeud Round 1

University

20 Qs

Neural Networks Quiz

Neural Networks Quiz

University

10 Qs

Neuron Network

Neuron Network

University

14 Qs

6CSM1 B2 QUIZZ Optimization

6CSM1 B2 QUIZZ Optimization

University

11 Qs

ทวิศึกษา รุ่น 4 (พะลาน) 2

ทวิศึกษา รุ่น 4 (พะลาน) 2

University

15 Qs

DL_unit-2

DL_unit-2

University

14 Qs

ML Chapter 06

ML Chapter 06

Assessment

Quiz

Computers

University

Medium

Created by

Jhonston Benjumea

Used 1+ times

FREE Resource

15 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What does SGD stand for in neural network training?
Soft Gradient Descent
Stochastic Gradient Descent
Strong Graph Derivative
Semi-Gain Depth

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the main idea behind Stochastic Gradient Descent (SGD)?
Using the full dataset for every update
Adding randomness to initialization
Updating weights using small random batches
Freezing weights during training

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What problem does the Momentum method solve in SGD?
Overfitting
Vanishing gradient
Oscillations in gradient updates
Data imbalance

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

How does AdaGrad adjust the learning rate?
Keeps it constant
Increases it exponentially
Adapts it for each parameter based on past gradients
Resets it every epoch

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the main feature of the Adam optimizer?
Ignores momentum
Uses only recent gradients
Combines Momentum and AdaGrad
Requires no tuning

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Why is initializing weights with a standard deviation of 0.01 sometimes problematic?
It slows down learning
It may cause vanishing gradients
It improves generalization
It speeds up convergence too much

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the Xavier initialization designed for?
ReLU activations
Linear regression
Layers with sigmoid/tanh activations
Binary classification

Create a free account and access millions of resources

Create resources
Host any resource
Get auto-graded reports
or continue with
Microsoft
Apple
Others
By signing up, you agree to our Terms of Service & Privacy Policy
Already have an account?