
Intro to ML: Decision Trees - 2024
Authored by Josiah Wang
Computers
University
Used 10+ times

AI Actions
Add similar questions
Adjust reading levels
Convert to real-world scenario
Translate activity
More...
Content View
Student View
10 questions
Show all answers
1.
MULTIPLE CHOICE QUESTION
1 min • 1 pt
Which system requires more entropy to be described?
Tossing a fair coin
Tossing a biased coin
Answer explanation
Tossing a fair coin requires more entropy to be described because it has a higher level of uncertainty. In a fair coin toss, both outcomes (heads and tails) have an equal probability of occurring, making it more difficult to predict the outcome. In contrast, a biased coin has a higher probability for one outcome, reducing the overall uncertainty and entropy.
2.
MULTIPLE CHOICE QUESTION
1 min • 1 pt
Is it possible to test the same attribute twice along the same path on a decision tree for a categorical problem?
Yes
No
Answer explanation
No, it is not possible to test the same attribute twice along the same path on a decision tree for a categorical problem. Once an attribute is used for a split, it should not be used again in the same path, as it would not provide any additional information or improve the classification.
3.
MULTIPLE CHOICE QUESTION
1 min • 1 pt
Is it possible that the same attribute will get selected twice in an ordinal or real-valued problem?
Yes
No
Answer explanation
In ordinal or real-valued problems, it is indeed possible for the same attribute to be selected twice. This can occur due to the nature of these problems, where the same attribute may have different impacts at different stages or conditions. Therefore, the answer to the question is 'Yes'.
4.
MULTIPLE CHOICE QUESTION
1 min • 1 pt
Decision trees are an algorithm for which machine learning task?
Clustering
Classification
Classification and Regression
Dimensionality reduction
Regression
Answer explanation
Decision trees are a versatile machine learning algorithm that can be used for both classification and regression tasks. They work by creating a model that predicts the value of a target variable by learning simple decision rules inferred from the data features. Therefore, the correct answer is 'Classification and Regression'.
5.
MULTIPLE CHOICE QUESTION
1 min • 1 pt
Oliver is analysing a decision tree model for predicting the types of plants in a garden. He notices that the tree is significantly deep. What does this indicate?
The samples have a large number of attributes
The dataset is possibly noisy
The tree under-fits the training data
None of these
Answer explanation
A significantly deep tree indicates that the dataset is possibly noisy. This is because a deep tree tends to overfit the data, capturing noise and making it less generalizable to new data. A large number of attributes or underfitting the training data are not directly related to the depth of the tree.
6.
MULTIPLE CHOICE QUESTION
1 min • 1 pt
Which of the following is true:
Deeper trees will always improve performance on the training data
Deeper trees will always improve performance when testing the model on unseen data
If a deeper tree improves performance on the training data, then it will also improve performance on new unseen data
Answer explanation
Deeper trees will always improve performance on the training data because they can capture more complex patterns and relationships in the data. However, this does not guarantee better performance on unseen data, as deeper trees can also lead to overfitting, which reduces the model's ability to generalize to new data.
7.
MULTIPLE CHOICE QUESTION
1 min • 1 pt
Kiara is working on a binary classification problem to predict whether a customer will make a purchase or not. She has a balanced dataset, and she finds that the customer's age completely determines the class for each observation. What is the information gain from using the age feature as the first node in a decision tree?
0
0.5
1
Not enough information
Answer explanation
In a binary classification problem with a balanced dataset, if one feature completely determines the class for each observation, the information gain from using this feature as the first node in a tree is 1. This is because the feature perfectly partitions the data, resulting in pure child nodes with no uncertainty remaining.
Access all questions and much more by creating a free account
Create resources
Host any resource
Get auto-graded reports

Continue with Google

Continue with Email

Continue with Classlink

Continue with Clever
or continue with

Microsoft
%20(1).png)
Apple
Others
Already have an account?