Search Header Logo

NLP Lecture II

Authored by Hazem Abdelazim

others

Used 13+ times

NLP Lecture II
AI

AI Actions

Add similar questions

Adjust reading levels

Convert to real-world scenario

Translate activity

More...

    Content View

    Student View

9 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is a binary bag of words?

A) A bag used to store words
B) A technique for text vectorization
C) A type of vocabulary list
D) A specific type of document

2.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which vectorization technique takes word importance and frequency into account?

A) Bag of Words (BoW)
B) Binary Bag of Words (BBoW)
C) TF-IDF
D) N-grams

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the main advantage of using TF-IDF over simple BoW?

A) Simplicity
B) Higher dimensionality
C) Word importance and frequency consideration
D) Larger corpus size

4.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

In the context of text vectorization, when might using n-grams be more advantageous than simple word tokenization?

A) When dealing with small text corpora
B) When you need to preserve the order of words
C) N-grams are never more advantageous
D) When working with images, not text

5.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Consider a document-term matrix for text vectorization, where rows represent documents and columns represent terms (words). How could we extract feature vectors for each word ?

rows can be used as feature vectors

columns can be used as feature vectors

We need first to convert the matrix to a BoW matrix

use countvectorizer(Binary=True)

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Suppose you have a text corpus with hundreds of thousands of documents. You're using TF-IDF for vectorization. What is the potential issue you might encounter with such a large corpus when computing the TF-IDF matrix?

A) The matrix will be too small to handle efficiently
B) The dimensionality of the matrix becomes very high
C) TF-IDF is not suitable for large corpora
D) The computation time is reduced

7.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the primary objective of Named Entity Recognition (NER) in natural language processing?

A) Identifying and classifying specific entities in text, such as names of people, places, and organizations.
B) Analyzing sentence structure and grammar to determine overall text sentiment.

C)Identifying Names of persons in the documents

D) Counting the frequency of stop words in a text corpus.

Access all questions and much more by creating a free account

Create resources

Host any resource

Get auto-graded reports

Google

Continue with Google

Email

Continue with Email

Classlink

Continue with Classlink

Clever

Continue with Clever

or continue with

Microsoft

Microsoft

Apple

Apple

Others

Others

Already have an account?