Quiz RL - Temporal Difference Algorithm

Quiz
•
Computers
•
University
•
Hard
meilana siswanto
Used 2+ times
FREE Resource
10 questions
Show all answers
1.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Dalam lingkup kajian Reinforcement Learning, Temporal Difference
Learning termasuk ...
Model-based algorithm
Model free algorithm
Reward based algorithm
Environment-based algorithm
2.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Berikut pernyataan yang benar tentang Temporal Difference
Learning adalah...
Model-based environment
Agent belajar dari lingkungan melalui pemodelan lengkap
Kombinasi dari Monte Carlo dan Dynamic Programming
Tidak ada jawaban yang benar
3.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Mengapa dikatakan bahwa Monte Carlo adalah ide dasar dari Temporal Difference Learning?
Karena dalam Monte Carlo, value-nya dievaluasi tiap episode
Karena pada algoritma Monte Carlo tidak perlu ada termination
Karena Monte Carlo merupakan model free algorithm
Karena setiap episode dalam Monte Carlo tidak independent
4.
MULTIPLE CHOICE QUESTION
45 sec • 1 pt
Berikut merupakan pernyataan yang benar tentang Temporal Difference Learning adalah...
Bersifat episodik dalam melakukan evaluasi value-nya
Bersifat non-episodik dalam melakukan evaluasi value-nya
Tidak memiliki learning rate
Bersifat independent, tidak bootstrapping
5.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Apa yang menyebabkan Dynamic Programming (DP) merupakan ide dari Temporal Difference Learning (TDL)?
DP dalam meng-update value-state harus menyelesaikan 1 episode
DP dapat meng-update value-state per-step dari episode
Semua kemungkinan transisi state tidak dipertimbangkan pada setiap step
TDL tidak bersifat bootstrapping sebagaimana DP
6.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Dua diantara pilihan berikut mana yang merupakan Temporal Difference Control adalah...
Monte Carlo dan Dynamic Programming
Markov Decision Process dan Monte Carlo
SARSA dan Q-Learning
SARSA dan Monte Carlo
7.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Apa yang dimaksud dengan SARSA pada Temporal Difference Learning?
Merupakan Action-Value function
Off policy
Update value secara episodik
Semua jawaban benar
Create a free account and access millions of resources
Similar Resources on Wayground
10 questions
Beneficios de los algoritmos probabilísticos

Quiz
•
University
5 questions
Quiz 2 for Rennes

Quiz
•
University
8 questions
Analysis of Algorithm Chapter 11 Randomized algorithm

Quiz
•
University
15 questions
MACHINE LEARNING

Quiz
•
University
10 questions
Topic 3 Cybersec Quiz

Quiz
•
University
14 questions
LMS

Quiz
•
University
10 questions
Aprendizaje Móvil

Quiz
•
University
10 questions
GSI DP-100 Day 1

Quiz
•
University - Professi...
Popular Resources on Wayground
10 questions
Lab Safety Procedures and Guidelines

Interactive video
•
6th - 10th Grade
10 questions
Nouns, nouns, nouns

Quiz
•
3rd Grade
10 questions
9/11 Experience and Reflections

Interactive video
•
10th - 12th Grade
25 questions
Multiplication Facts

Quiz
•
5th Grade
11 questions
All about me

Quiz
•
Professional Development
22 questions
Adding Integers

Quiz
•
6th Grade
15 questions
Subtracting Integers

Quiz
•
7th Grade
9 questions
Tips & Tricks

Lesson
•
6th - 8th Grade
Discover more resources for Computers
21 questions
Spanish-Speaking Countries

Quiz
•
6th Grade - University
20 questions
Levels of Measurements

Quiz
•
11th Grade - University
7 questions
Common and Proper Nouns

Interactive video
•
4th Grade - University
12 questions
Los numeros en español.

Lesson
•
6th Grade - University
7 questions
PC: Unit 1 Quiz Review

Quiz
•
11th Grade - University
7 questions
Supporting the Main Idea –Informational

Interactive video
•
4th Grade - University
12 questions
Hurricane or Tornado

Quiz
•
3rd Grade - University
7 questions
Enzymes (Updated)

Interactive video
•
11th Grade - University