
Quiz RL - Temporal Difference Algorithm
Authored by meilana siswanto
Computers
University
Used 2+ times

AI Actions
Add similar questions
Adjust reading levels
Convert to real-world scenario
Translate activity
More...
Content View
Student View
10 questions
Show all answers
1.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Dalam lingkup kajian Reinforcement Learning, Temporal Difference
Learning termasuk ...
Model-based algorithm
Model free algorithm
Reward based algorithm
Environment-based algorithm
2.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Berikut pernyataan yang benar tentang Temporal Difference
Learning adalah...
Model-based environment
Agent belajar dari lingkungan melalui pemodelan lengkap
Kombinasi dari Monte Carlo dan Dynamic Programming
Tidak ada jawaban yang benar
3.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Mengapa dikatakan bahwa Monte Carlo adalah ide dasar dari Temporal Difference Learning?
Karena dalam Monte Carlo, value-nya dievaluasi tiap episode
Karena pada algoritma Monte Carlo tidak perlu ada termination
Karena Monte Carlo merupakan model free algorithm
Karena setiap episode dalam Monte Carlo tidak independent
4.
MULTIPLE CHOICE QUESTION
45 sec • 1 pt
Berikut merupakan pernyataan yang benar tentang Temporal Difference Learning adalah...
Bersifat episodik dalam melakukan evaluasi value-nya
Bersifat non-episodik dalam melakukan evaluasi value-nya
Tidak memiliki learning rate
Bersifat independent, tidak bootstrapping
5.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Apa yang menyebabkan Dynamic Programming (DP) merupakan ide dari Temporal Difference Learning (TDL)?
DP dalam meng-update value-state harus menyelesaikan 1 episode
DP dapat meng-update value-state per-step dari episode
Semua kemungkinan transisi state tidak dipertimbangkan pada setiap step
TDL tidak bersifat bootstrapping sebagaimana DP
6.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Dua diantara pilihan berikut mana yang merupakan Temporal Difference Control adalah...
Monte Carlo dan Dynamic Programming
Markov Decision Process dan Monte Carlo
SARSA dan Q-Learning
SARSA dan Monte Carlo
7.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Apa yang dimaksud dengan SARSA pada Temporal Difference Learning?
Merupakan Action-Value function
Off policy
Update value secara episodik
Semua jawaban benar
Access all questions and much more by creating a free account
Create resources
Host any resource
Get auto-graded reports

Continue with Google

Continue with Email

Continue with Classlink

Continue with Clever
or continue with

Microsoft
%20(1).png)
Apple
Others
Already have an account?
Similar Resources on Wayground
10 questions
UIT2221 - THE CONCEPT OF SIMULATION (Mod 2)
Quiz
•
University
10 questions
Machine Learning (Introduction)
Quiz
•
University
10 questions
INTO Artificial Intelligence
Quiz
•
University - Professi...
10 questions
Clustering_Pertemuan2_Quiz_Ceria
Quiz
•
University
10 questions
Aula Virtual 5
Quiz
•
University
10 questions
Predictive analytics
Quiz
•
University
10 questions
Data analytics basics
Quiz
•
University
10 questions
Chapter 7 - System Implementation
Quiz
•
University
Popular Resources on Wayground
15 questions
Fractions on a Number Line
Quiz
•
3rd Grade
20 questions
Equivalent Fractions
Quiz
•
3rd Grade
25 questions
Multiplication Facts
Quiz
•
5th Grade
22 questions
fractions
Quiz
•
3rd Grade
20 questions
Main Idea and Details
Quiz
•
5th Grade
20 questions
Context Clues
Quiz
•
6th Grade
15 questions
Equivalent Fractions
Quiz
•
4th Grade
20 questions
Figurative Language Review
Quiz
•
6th Grade