Search Header Logo
Big Data- Lesson 4

Big Data- Lesson 4

Assessment

Presentation

Computers

9th - 12th Grade

Practice Problem

Easy

Created by

Jennifer Tuttle

Used 2+ times

FREE Resource

4 Slides • 2 Questions

1

Big Data- Lesson 4

By Jennifer Tuttle

2

Open Ended

What do you think "clean" data means?

3

Open Ended

Why do you think it is important to make sure your data is "clean"?

4

Cleaning Data- More Efficiently

For this lesson you will learn some tips and tricks to making sure your data is cleaned up and ready to be analyzed.


There will be multiple videos attached to the assignment and to this slide show. We will go over the videos and then you will have time to use these new Google Sheets strategies to work on another movie database.

5

Directions:

  • Remove duplicate rows- keep the row with the highest number of votes

  • Change date range to only the last year, with no dashes or parentheses

  • Delete any rating with less than 100 votes

  • Remove anything that is not a movie

  • Create 2 new rows for Genre

    • If more than one genre listed, move those entries to the new row

  • Look up the RunTime for any movie where this is blank

  • Create a new column called Language and indicate movies that do NOT have an English title.

Use the movies2 data and your new skills to clean the new set of data.

This is found attached with this assignment in Google class room.

6

​You will use these with the Movie Spreadsheet attached to Google Classroom.

Big Data- Lesson 4

By Jennifer Tuttle

Show answer

Auto Play

Slide 1 / 6

SLIDE