mi1_13_RL-Q

Authored by MI Team

Science

University

Used 49+ times

AI Actions

Add similar questions

Adjust reading levels

Convert to real-world scenario

Translate activity

More...

Content View

Student View

10 questions

Show all answers

MULTIPLE CHOICE QUESTION

10 sec • Ungraded

Did you attempt the last exercise sheet?

Yes, I finally did

No )-:

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

Which Q-value is highest for the tile in the center (the smurf is the agent)?

Q(x, →)

Q(x, ←)

Q(x, ↓)

Q(x, ↑)

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

An MDP implies...

I know all possible states beforehand

model-based learning

model-free evaluation

policy iteration

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

Which provides the most direct way for extracting the optimal policy π*?

V* = argmax V^π

Q* = argmax Q^π

neither

what does the * mean?

MULTIPLE CHOICE QUESTION

20 sec • 1 pt

SARSA stands for

some small constant

state action reward state action

such a small reward so awful

StAte Reward StAte

MULTIPLE CHOICE QUESTION

10 sec • 1 pt

The optimal policy π* is unique (T/F)

True

False

MULTIPLE CHOICE QUESTION

10 sec • 1 pt

With Q-values, the optimal policy π* becomes unique (T/F)

True

False

Access all questions and much more by creating a free account

Create resources

Host any resource

Get auto-graded reports

Continue with Google

Continue with Email

Continue with Microsoft

or continue with

Facebook

Apple

Others

Already have an account?

Similar Resources on Wayground

15 questions

C1. Biochemistry and Monosaccharides

Quiz

•

University

15 questions

Geology Quiz

Quiz

•

4th Grade - University

15 questions

Quiz on Stars and Astronomy

Quiz

•

8th Grade - University

10 questions

Black Inventors

Quiz

•

3rd Grade - University

14 questions

Water Use, Pollution, and Conservation

Quiz

•

University

14 questions

Properties of Matter Quiz

Quiz

•

3rd Grade - University

15 questions

Magnetism Quiz

Quiz

•

7th Grade - University

14 questions

AC DRIVES

Quiz

•

University

Popular Resources on Wayground

5 questions

A Home on the Shore

Quiz

•

3rd Grade

28 questions

US History Regents Review

Quiz

•

11th Grade

6 questions

A Horse Tale

Quiz

•

3rd Grade

20 questions

Math Review

Quiz

•

3rd Grade

10 questions

Juneteenth History and Significance

Interactive video

•

5th - 8th Grade

20 questions

Dividing Fractions

Quiz

•

5th Grade

55 questions

A Long Walk to Water Final Review

Quiz

•

6th - 8th Grade

10 questions

Equation Word Problems

Quiz

•

7th Grade

Discover more resources for Science

40 questions

Flags of the World

Quiz

•

KG - Professional Dev...

mi1_13_RL-Q

Did you attempt the last exercise sheet?

Which Q-value is highest for the tile in the center (the smurf is the agent)?

An MDP implies...

Which provides the most direct way for extracting the optimal policy π*?

SARSA stands for

The optimal policy π* is unique (T/F)

With Q-values, the optimal policy π* becomes unique (T/F)

"contracting" means

For a learning rate η>0, TD Learning...

Which describes "off policy"?

Access all questions and much more by creating a free account

Similar Resources on Wayground

Popular Resources on Wayground

Discover more resources for Science