Text Mining with Python (week 9)

University

7 Qs

Assessment

Quiz

Other

University

Hard

Created by

Mikhail Bukhtoyarov

7 questions

1.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Natural language processing (NLP) is a subfield of ... and ... that uses machine learning to enable computers to understand and communicate with human language.

corpus linguistics

computer science

statistics

2.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

What is the correct order of steps when preprocessing text for further mining?

tokenization, lemmatization, and removing stop words

lemmatization, tokenization, and removing stop words

lemmatization, removing stop words, and tokenization

removing stop words, lemmatization, and tokenization
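The three steps named in the options above can be illustrated with a minimal sketch. It uses only the standard library; `STOP_WORDS`, `LEMMAS`, and the function names are hypothetical toy stand-ins for what a library such as NLTK would provide, and the order shown is one commonly used arrangement rather than the only workable one.

```python
import re

# Hypothetical toy resources; real projects usually load these from a library.
STOP_WORDS = {"the", "a", "an", "be", "on", "in", "and", "of"}
LEMMAS = {"cats": "cat", "sitting": "sit", "mats": "mat",
          "were": "be", "are": "be", "is": "be"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def lemmatize(tokens):
    """Map each token to its dictionary form using the toy lookup table."""
    return [LEMMAS.get(tok, tok) for tok in tokens]


def remove_stop_words(tokens):
    """Drop tokens that carry little content on their own."""
    return [tok for tok in tokens if tok not in STOP_WORDS]


def preprocess(text):
    """Run the three steps in sequence on a raw string."""
    return remove_stop_words(lemmatize(tokenize(text)))


print(preprocess("The cats were sitting on the mats."))
```

Because lemmatization here runs before stop-word removal, inflected forms such as "were" are first reduced to "be" and then filtered by a single stop-list entry.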

3.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

What does this piece of code do?

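import string
from collections import Counter

def most_common(df, top_n=10):
    # Join all processed comments, split on whitespace, and skip bare punctuation tokens
    all_words = [word for word in ' '.join(df['processed_comments']).split() if word not in string.punctuation]
    word_counts = Counter(all_words)
    # Return the top_n (word, count) pairs, most frequent first
    return word_counts.most_common(top_n)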

identifies the most recurrent themes or issues

tokenizes the textual data

categorizes comments

scrapes the html files to csv
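To see what the function in this question returns on concrete input, here is a small usage sketch. The sample comments are invented for illustration, and a plain dict of lists stands in for the pandas DataFrame (an assumption made so the sketch runs without pandas; indexing by the `processed_comments` key behaves the same way for this function).

```python
import string
from collections import Counter


def most_common(df, top_n=10):
    # Join all comments, split on whitespace, skip bare punctuation tokens.
    all_words = [word for word in " ".join(df["processed_comments"]).split()
                 if word not in string.punctuation]
    return Counter(all_words).most_common(top_n)


# Hypothetical sample data; a dict of lists stands in for a DataFrame.
sample = {
    "processed_comments": [
        "battery drain fast",
        "battery life short",
        "screen bright , battery drain",
    ]
}

print(most_common(sample, top_n=2))
# [('battery', 3), ('drain', 2)]
```

The result is a list of `(word, count)` pairs, so it reports which words occur most often across all comments rather than transforming or labelling individual comments.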

4.

MULTIPLE SELECT QUESTION

1 min • 1 pt

Which Python libraries are used for text mining?

NLTK

Pandas

BeautifulSoup

TextBlob

5.

MULTIPLE SELECT QUESTION

1 min • 1 pt

What can NLTK (Natural Language Toolkit) be used for?

stemming and lemmatization

tokenization

sentiment analysis

speech recognition

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the Pandas library used for?

working with MS Office files

building neural networks

data manipulation and analysis

speech recognition

7.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

The goal of sentiment analysis is to determine the overall sentiment polarity of a piece of text, which can be ...

positive, negative, or neutral

positive, negative, emotional, or neutral

sentimental, rational

poetical, prosaic, academic, legal, etc.
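As a rough illustration of what determining sentiment polarity means in practice, the sketch below scores a text by counting words from two tiny word lists. The `POSITIVE` and `NEGATIVE` sets and the `polarity` function are hypothetical; libraries such as TextBlob or NLTK use far richer lexicons or trained models, so this is only a toy under those assumptions.

```python
import string

# Hypothetical miniature lexicons, for illustration only.
POSITIVE = {"good", "great", "love", "excellent"}
NEGATIVE = {"bad", "poor", "hate", "terrible"}


def polarity(text):
    """Label a text by comparing counts of positive and negative words."""
    tokens = [tok.strip(string.punctuation) for tok in text.lower().split()]
    score = sum(tok in POSITIVE for tok in tokens) \
        - sum(tok in NEGATIVE for tok in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


print(polarity("I love this phone, the camera is great!"))
print(polarity("Terrible battery and poor screen."))
print(polarity("The phone arrived on Monday."))
```

A text with no lexicon hits, or with equal positive and negative counts, falls into the third label in this sketch.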