Text Mining with Python (week 9)

Text Mining with Python (week 9)

University

7 Qs

quiz-placeholder

Similar activities

The Medicine Bag Vocabulary

The Medicine Bag Vocabulary

5th Grade - Professional Development

12 Qs

POU

POU

1st Grade - Professional Development

10 Qs

Memory

Memory

University

10 Qs

Vocabulary for Renters

Vocabulary for Renters

9th Grade - University

12 Qs

Unit- II Survey Quiz on Research Design

Unit- II Survey Quiz on Research Design

University

10 Qs

Qualities of a Good Speech

Qualities of a Good Speech

7th Grade - University

10 Qs

Q010. E-MARKETING: EMAIL MARKETING

Q010. E-MARKETING: EMAIL MARKETING

University - Professional Development

10 Qs

SOLUTIONs: Communication Competence

SOLUTIONs: Communication Competence

University

10 Qs

Text Mining with Python (week 9)

Text Mining with Python (week 9)

Assessment

Quiz

Other

University

Practice Problem

Hard

Created by

Mikhail Bukhtoyarov

Used 1+ times

FREE Resource

AI

Enhance your content in a minute

Add similar questions
Adjust reading levels
Convert to real-world scenario
Translate activity
More...

7 questions

Show all answers

1.

MULTIPLE SELECT QUESTION

30 sec • 1 pt

Natural language processing (NLP) is a subfield of ... and ... that uses machine learning to enable computers to understand and communicate with human language.

corpus linguistics

computer science

statistics

2.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

What is the correct order of preprocessing the text for further mining?

tokenization, lemmatization, and removing stop words

lemmatization, tokenization, and removing stop words

lemmatization, removing stop words, and tokenization

removing stop words, lemmatization, and tokenization

3.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

What does this piece of code do?

def most_common(df, top_n=10):
all_words = [word for word in ' '.join(df['processed_comments']).split() if word not in string.punctuation]
word_counts = Counter(all_words)
top_10_words = word_counts.most_common(top_n)
return top_10_words

identifies the most recurrent themes or issues

tokenizes the textual data

categorizes comments

scrapes the html files to csv

4.

MULTIPLE SELECT QUESTION

1 min • 1 pt

What Python libraries are used for text mining?

NLTK

Pandas

BeutifulSoup

TextBlob

5.

MULTIPLE SELECT QUESTION

1 min • 1 pt

What can NLTK (natural language toolkit) be used for?

stemming and lemmatization

tokenization

sentiment analysis

speech recognition

6.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is Pandas library used for?

working with MS Office files

building neural networks

data manipulation and analysis

speech recognition

7.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

The goal of sentiment analysis is to determine the overall sentiment polarity of a piece of text, which can be ...

positive, negative, or neutral

positive, negative, emotional, or neutral

sentimental, rational

poetical, prosaic, academic, legal, etc.