
ML B2 CH4

Authored by Jhonston Benjumea

Computers

University


10 questions


1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is a major bottleneck of the original CBOW model?

The size of the input vector
One-hot encoding of the context
Large matrix calculations with softmax over huge vocabulary
Too few context words

Answer explanation

The original CBOW model uses softmax across a large vocabulary, which is computationally expensive.
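
A minimal NumPy sketch of where that cost comes from (the vocabulary size V and hidden size H below are illustrative assumptions, not values from the quiz): the softmax must produce and normalize a score for every word in the vocabulary.

```python
import numpy as np

# Illustrative sizes (assumed): 100k-word vocabulary, 100-dim hidden layer.
V, H = 100_000, 100
rng = np.random.default_rng(0)

h = rng.normal(size=H)            # hidden vector (averaged context embeddings)
W_out = rng.normal(size=(H, V))   # output-side weight matrix

# Softmax needs a score for EVERY vocabulary word: O(H * V) multiply-adds
# per training example, plus a normalization that touches all V entries.
scores = h @ W_out
probs = np.exp(scores - scores.max())
probs /= probs.sum()
```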

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the function of the embedding layer in word2vec?

Compress the entire matrix into a scalar
Skip one-hot encoding and extract a word's vector directly
Generate audio features
Apply dropout to context words

Answer explanation

An embedding layer removes the need for one-hot vectors by extracting a word's vector directly from the weight matrix by its row index.
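
A toy illustration (the vocabulary size, embedding size, and word id are all invented): multiplying a one-hot vector by the weight matrix merely selects one row, so an embedding layer indexes that row directly and skips the wasted arithmetic.

```python
import numpy as np

rng = np.random.default_rng(0)
V, H = 7, 3                     # toy vocabulary and embedding sizes (assumed)
W_in = rng.normal(size=(V, H))  # input-side weight matrix
word_id = 2

# Naive route: build a one-hot vector and multiply (mostly multiplying by zero).
one_hot = np.zeros(V)
one_hot[word_id] = 1.0
via_matmul = one_hot @ W_in     # O(V * H) work

# Embedding layer: index the row directly -- same result in O(H) work.
via_lookup = W_in[word_id]

assert np.allclose(via_matmul, via_lookup)
```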

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the main goal of negative sampling in word2vec?

To sample only frequent words
To train the model faster using binary classification
To filter out correct labels
To convert softmax into sigmoid

Answer explanation

Negative sampling simplifies training by converting the multi-class problem into several binary decisions.
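
A rough sketch of that reformulation, with assumed sizes and uniformly drawn stand-in negatives (real word2vec draws negatives from a frequency-based distribution; see question 7): one positive pair and k negative pairs are each scored with a sigmoid, replacing the V-way softmax.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)
V, H, k = 10_000, 100, 5          # assumed sizes; k negatives per positive

W_in = rng.normal(scale=0.01, size=(V, H))   # input embeddings
W_out = rng.normal(scale=0.01, size=(V, H))  # output embeddings

context_id, target_id = 42, 7     # invented ids for one training pair
neg_ids = rng.integers(0, V, size=k)         # stand-in negative samples

h = W_in[context_id]

# Score 1 positive and k negatives instead of all V words:
pos_prob = sigmoid(h @ W_out[target_id])     # should be pushed toward 1
neg_probs = sigmoid(W_out[neg_ids] @ h)      # should be pushed toward 0

loss = -np.log(pos_prob) - np.log(1.0 - neg_probs).sum()
```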

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

In negative sampling, what kind of output do we expect for negative examples?

Close to 1
Exactly 1
Close to 0
Negative values

Answer explanation

Negative examples should result in output values close to 0, indicating they are not the correct context.
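
A tiny numeric illustration with made-up scores: a trained model pushes the dot product for a genuine (target, context) pair up and for a sampled negative down, so the sigmoid outputs land near 1 and near 0 respectively.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Hypothetical post-training dot products (invented values):
positive_score = 5.0
negative_score = -5.0

print(sigmoid(positive_score))   # ~0.993 -> close to 1: "is a real context"
print(sigmoid(negative_score))   # ~0.007 -> close to 0: "is not a real context"
```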

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Why is the sigmoid function used in binary classification for word2vec?

It simplifies matrix multiplication
It outputs discrete values only
It provides probabilities between 0 and 1
It reduces the training data size

Answer explanation

The sigmoid function outputs values between 0 and 1, which makes its outputs directly interpretable as binary classification probabilities.
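
A quick demonstration of that squashing behavior (the sample inputs are arbitrary):

```python
import numpy as np

def sigmoid(x):
    # Maps any real number into the open interval (0, 1).
    return 1.0 / (1.0 + np.exp(-x))

for x in (-10.0, -1.0, 0.0, 1.0, 10.0):
    print(f"sigmoid({x:+5.1f}) = {sigmoid(x):.4f}")
# Large negative inputs approach 0, large positive inputs approach 1,
# and sigmoid(0) = 0.5 -- exactly the range needed for a probability.
```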

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What does the cross-entropy error measure in binary classification?

How long training takes
The difference between output probability and the correct label
The number of samples per class
The angle between word vectors

Answer explanation

Cross-entropy measures the loss as the discrepancy between the predicted probability and the actual label.
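
In formula form, the binary cross-entropy is L = -(t log y + (1 - t) log(1 - y)), where y is the predicted probability and t the correct label. A small sketch with invented sample values:

```python
import numpy as np

def binary_cross_entropy(y, t, eps=1e-7):
    # y: predicted probability (sigmoid output); t: correct label (0 or 1).
    # eps guards against log(0).
    y = np.clip(y, eps, 1.0 - eps)
    return -(t * np.log(y) + (1.0 - t) * np.log(1.0 - y))

print(binary_cross_entropy(0.9, 1))  # ~0.105: prediction close to the label
print(binary_cross_entropy(0.9, 0))  # ~2.303: prediction far from the label
```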

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What technique helps select which negative examples to use?

Random dropout
Context padding
Probability-based sampling
Gradient descent

Answer explanation

Negative examples are sampled based on their frequency using probability-based techniques.
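
A sketch of that sampling on an invented toy corpus; raising the counts to the 0.75 power is the damping used in the original word2vec work, which keeps rare words from being ignored entirely.

```python
import numpy as np

corpus = "you say goodbye and i say hello".split()
vocab, counts = np.unique(corpus, return_counts=True)

p = counts.astype(float) ** 0.75   # damp the dominance of frequent words
p /= p.sum()                       # renormalize into a probability distribution

rng = np.random.default_rng(0)
negatives = rng.choice(vocab, size=5, p=p)  # draw 5 negative samples
print(negatives)
```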
