Search Header Logo

Intro to ML: Decision Trees - 2024

Authored by Josiah Wang

Computers

University

Used 10+ times

Intro to ML: Decision Trees - 2024
AI

AI Actions

Add similar questions

Adjust reading levels

Convert to real-world scenario

Translate activity

More...

    Content View

    Student View

10 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

Which system requires more entropy to be described?

Tossing a fair coin

Tossing a biased coin

Answer explanation

Tossing a fair coin requires more entropy to be described because it has a higher level of uncertainty. In a fair coin toss, both outcomes (heads and tails) have an equal probability of occurring, making it more difficult to predict the outcome. In contrast, a biased coin has a higher probability for one outcome, reducing the overall uncertainty and entropy.

2.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

Is it possible to test the same attribute twice along the same path on a decision tree for a categorical problem?

Yes

No

Answer explanation

No, it is not possible to test the same attribute twice along the same path on a decision tree for a categorical problem. Once an attribute is used for a split, it should not be used again in the same path, as it would not provide any additional information or improve the classification.

3.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

Is it possible that the same attribute will get selected twice in an ordinal or real-valued problem?

Yes

No

Answer explanation

In ordinal or real-valued problems, it is indeed possible for the same attribute to be selected twice. This can occur due to the nature of these problems, where the same attribute may have different impacts at different stages or conditions. Therefore, the answer to the question is 'Yes'.

4.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

Decision trees are an algorithm for which machine learning task?

Clustering

Classification

Classification and Regression

Dimensionality reduction

Regression

Answer explanation

Decision trees are a versatile machine learning algorithm that can be used for both classification and regression tasks. They work by creating a model that predicts the value of a target variable by learning simple decision rules inferred from the data features. Therefore, the correct answer is 'Classification and Regression'.

5.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

Oliver is analysing a decision tree model for predicting the types of plants in a garden. He notices that the tree is significantly deep. What does this indicate?

The samples have a large number of attributes

The dataset is possibly noisy

The tree under-fits the training data

None of these

Answer explanation

A significantly deep tree indicates that the dataset is possibly noisy. This is because a deep tree tends to overfit the data, capturing noise and making it less generalizable to new data. A large number of attributes or underfitting the training data are not directly related to the depth of the tree.

6.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

Which of the following is true:

Deeper trees will always improve performance on the training data

Deeper trees will always improve performance when testing the model on unseen data

If a deeper tree improves performance on the training data, then it will also improve performance on new unseen data

Answer explanation

Deeper trees will always improve performance on the training data because they can capture more complex patterns and relationships in the data. However, this does not guarantee better performance on unseen data, as deeper trees can also lead to overfitting, which reduces the model's ability to generalize to new data.

7.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

Kiara is working on a binary classification problem to predict whether a customer will make a purchase or not. She has a balanced dataset, and she finds that the customer's age completely determines the class for each observation. What is the information gain from using the age feature as the first node in a decision tree?

0

0.5

1

Not enough information

Answer explanation

In a binary classification problem with a balanced dataset, if one feature completely determines the class for each observation, the information gain from using this feature as the first node in a tree is 1. This is because the feature perfectly partitions the data, resulting in pure child nodes with no uncertainty remaining.

Access all questions and much more by creating a free account

Create resources

Host any resource

Get auto-graded reports

Google

Continue with Google

Email

Continue with Email

Classlink

Continue with Classlink

Clever

Continue with Clever

or continue with

Microsoft

Microsoft

Apple

Apple

Others

Others

Already have an account?