Search Header Logo

IST2334 - Lecture 3

Authored by Yoke Yong

Used 4+ times

IST2334 - Lecture 3
AI

AI Actions

Add similar questions

Adjust reading levels

Convert to real-world scenario

Translate activity

More...

    Content View

    Student View

8 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

What is web crawling?

Extracting data from a website

Indexing web pages for search engines

Creating a new website

Sending emails to website owners

2.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

Which of the following ethical considerations should be kept in mind while web scraping?

Scraping any data without permission is always ethical.

It's not necessary to give credit to the source website.

Scraping personal and sensitive data is acceptable.

Respect website terms of service and robots.txt.

3.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

What is the purpose of robots.txt file on a website?

To collect analytics data

To tell search engine crawlers which URLs the crawler can access on your site.

To display a custom 404 error page

To improve website security

4.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

What is the primary difference between web crawling and web scraping?

Web crawling is used to extract structured data, while web scraping is used to follow links.

Web crawling is focused on indexing, while web scraping is focused on extracting data.

Web crawling requires human interaction, while web scraping is automated.

Web crawling and web scraping are synonymous terms.

5.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

Which of the following is NOT a type of web crawler?

Focused Web Crawler

Parallel Web Crawler

Incremental Web Crawler

Convolutional Web Crawler

6.

MULTIPLE SELECT QUESTION

45 sec • 1 pt

Select the three principles followed by all web scraping bots when scraping web data.

Making an HTTP request to a server

Add URL links to frontier

Extracting and parsing the website’s code

Saving the relevant data locally

7.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

What is the term for automatically following links from one web page to another during web crawling?

Link forwarding

Link traversal

Link indexing

Link extraction

Access all questions and much more by creating a free account

Create resources

Host any resource

Get auto-graded reports

Google

Continue with Google

Email

Continue with Email

Classlink

Continue with Classlink

Clever

Continue with Clever

or continue with

Microsoft

Microsoft

Apple

Apple

Others

Others

Already have an account?