Search Header Logo

Mastering Your Data - Post Test

Authored by Irsyad Firsandi Wahyudi

Information Technology (IT)

Professional Development

Used 2+ times

Mastering Your Data - Post Test
AI

AI Actions

Add similar questions

Adjust reading levels

Convert to real-world scenario

Translate activity

More...

    Content View

    Student View

9 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

You are working with a dataset of customer support tickets where each row represents a single interaction (e.g., a phone call, an email, a chat message) a customer had with support. What is the granularity of this dataset?

Per unique customer

Per customer support agent

Per customer interaction

Per issue resolution

2.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

A column named 'RegistrationDate' in your dataset contains values like "01/20/2023" or "March 15, 2024". Your data analysis tool initially identifies this column as a 'string' or 'object' type. You need to calculate the average time customers remain active since registration. What is the most crucial step you must take with this column before performing such a calculation?

Remove all rows with missing 'RegistrationDate' values.

Convert the column to an integer type.

Standardize the date format and convert the column to a datetime object.

Calculate the mean of the string values.

3.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

After loading a new dataset into your analytical environment, you execute a command (e.g., df.isnull().sum() in Pandas) and observe that the 'Email' column has a count of 50 missing values. What does this observation primarily tell you about your dataset?

All 50 missing emails belong to the same customer.

The 'Email' column contains 50 unique email addresses.

50 records in your dataset lack email information, indicating a data completeness issue for that column.

The 'Email' column has been incorrectly imported as a numeric type.

4.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

You are analyzing a dataset of product reviews. The 'Rating' column contains numbers from 1 to 5. However, you discover that some users accidentally submitted a rating of '10' which was then truncated to '1' by the system during data entry. This issue is not visible by simply checking df.dtypes or df.isnull().sum(). What kind of hidden data quality issue is this?

Duplicate records

Inconsistent data entry leading to data corruption

High cardinality

Outdated information

5.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

What is the primary benefit of using methods like df.head() and df.tail() during the initial data inspection phase?

They perform complex statistical analysis on the entire dataset.

They automatically clean and preprocess the data for analysis.

They provide a quick visual overview of the dataset's beginning and end, helping to spot immediate structural issues or unexpected values.

They are used to calculate the mean and median of all numerical columns.

6.

OPEN ENDED QUESTION

10 mins • 1 pt

Imagine you have two datasets related to customer orders: Dataset A: Each row represents a single order placed by a customer. Dataset B: Each row represents a single item within an order (so one order might have multiple rows). If you want to calculate the average number of items per order, which dataset's granularity is more suitable for direct calculation, and why? If the other dataset were used, what initial step would be required?

Evaluate responses using AI:

OFF

7.

OPEN ENDED QUESTION

10 mins • 1 pt

You load a CSV file, and when you check the data types, a column named 'Customer_Age' is identified as an 'object' (string) type, even though you know it should contain numbers. a) What is the most likely reason for 'Customer_Age' being read as an 'object' instead of an integer or float? b) What problem would this incorrect data type cause if you tried to calculate the average customer age?

Evaluate responses using AI:

OFF

Access all questions and much more by creating a free account

Create resources

Host any resource

Get auto-graded reports

Google

Continue with Google

Email

Continue with Email

Classlink

Continue with Classlink

Clever

Continue with Clever

or continue with

Microsoft

Microsoft

Apple

Apple

Others

Others

Already have an account?