
Associate Data Practitioner Part 1
Authored by Nikko (xWF)
Mathematics
KG
Used 2+ times

AI Actions
Add similar questions
Adjust reading levels
Convert to real-world scenario
Translate activity
More...
Content View
Student View
36 questions
Show all answers
1.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Your retail company wants to predict customer churn using historical purchase data stored in BigQuery. The dataset includes customer demographics, purchase history, and a label indicating whether the customer churned or not. You want to build a machine learning model to identify customers at risk of churning. You need to create and train a logistic regression model for predicting customer churn, using the customer_data table with the churned column as the target label. Which BigQuery ML query should you use?
A.
-------------------------
B
You can use SELECT * EXCEPT (foo) and SELECT foo AS bar.
C.
-------------------------
D.
-------------------------
2.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Another team in your organization is requesting access to a BigQuery dataset. You need to share the dataset with the team while minimizing the risk of unauthorized copying of data. You also want to create a reusable framework in case you need to share this data with other teams in the future. What should you do?
A.
Create authorized views in the team’s Google Cloud project that is only accessible by the team.
B.
Create a private exchange using Analytics Hub with data egress restriction, and grant access to the team members.
C.
Enable domain restricted sharing on the project. Grant the team members the BigQuery Data Viewer IAM role on the dataset.
D.
Export the dataset to a Cloud Storage bucket in the team’s Google Cloud project that is only accessible by the team.
3.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Your company has developed a website that allows users to upload and share video files. These files are most frequently accessed and shared when they are initially uploaded. Over time, the files are accessed and shared less frequently, although some old video files may remain very popular.
You need to design a storage system that is simple and cost-effective. What should you do?
A.
Create a single-region bucket with Autoclass enabled.
B.
Create a single-region bucket. Configure a Cloud Scheduler job that runs every 24 hours and changes the storage class based on upload date.
C.
Create a single-region bucket with custom Object Lifecycle Management policies based on upload date.
D.
Create a single-region bucket with Archive as the default storage class.
4.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
You recently inherited a task for managing Dataflow streaming pipelines in your organization and noticed that proper access had not been provisioned to you. You need to request a Google-provided IAM role so you can restart the pipelines. You need to follow the principle of least privilege. What should you do?
A.
Request the Dataflow Developer role.
B.
Request the Dataflow Viewer role.
C.
Request the Dataflow Worker role.
D.
Request the Dataflow Admin role.
5.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
You need to create a new data pipeline. You want a serverless solution that meets the following requirements:
• Data is streamed from Pub/Sub and is processed in real-time.
• Data is transformed before being stored.
• Data is stored in a location that will allow it to be analyzed with SQL using Looker.
Which Google Cloud services should you recommend for the pipeline?
A.
1. Dataproc Serverless 2. Bigtable
B.
1. Cloud Composer 2. Cloud SQL for MySQL
C.
1. BigQuery 2. Analytics Hub
D.
1. Dataflow 2. BigQuery
6.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
Your team wants to create a monthly report to analyze inventory data that is updated daily. You need to aggregate the inventory counts by using only the most recent month of data, and save the results to be used in a Looker Studio dashboard. What should you do?
A.
Create a materialized view in BigQuery that uses the SUM( ) function and the DATE_SUB( ) function.
B.
Create a saved query in the BigQuery console that uses the SUM( ) function and the DATE_SUB( ) function. Re-run the saved query every month, and save the results to a BigQuery table.
C.
Create a BigQuery table that uses the SUM( ) function and the _PARTITIONDATE filter.
D.
Create a BigQuery table that uses the SUM( ) function and the DATE_DIFF( ) function.
7.
MULTIPLE CHOICE QUESTION
30 sec • 1 pt
You have a BigQuery dataset containing sales data. This data is actively queried for the first 6 months. After that, the data is not queried but needs to be retained for 3 years for compliance reasons. You need to implement a data management strategy that meets access and compliance requirements, while keeping cost and administrative overhead to a minimum. What should you do?
A.
Use BigQuery long-term storage for the entire dataset. Set up a Cloud Run function to delete the data from BigQuery after 3 years.
B.
Partition a BigQuery table by month. After 6 months, export the data to Coldline storage. Implement a lifecycle policy to delete the data from Cloud Storage after 3 years.
C.
Set up a scheduled query to export the data to Cloud Storage after 6 months. Write a stored procedure to delete the data from BigQuery after 3 years.
D.
Store all data in a single BigQuery table without partitioning or lifecycle policies.
Access all questions and much more by creating a free account
Create resources
Host any resource
Get auto-graded reports

Continue with Google

Continue with Email

Continue with Classlink

Continue with Clever
or continue with

Microsoft
%20(1).png)
Apple
Others
Already have an account?