QUESTION: 41 Exam Topic: 3, MJTelco Case Study

MJTelco needs you to create a schema in Google Bigtable that will allow for the historical analysis of the last 2 years of records. Each record that comes in is sent every 15 minutes, and contains a unique identifier of the device and a data record. The most common query is for all the data for a given device for a given day.
Which schema should you use?

Rowkey: date#device_idColumn data: data_point
Rowkey: dateColumn data: device_id, data_point
Rowkey: device_idColumn data: date, data_point
Rowkey: data_pointColumn data: device_id, date
Rowkey: date#data_pointColumn data: device_id

Answer(s): D

Show Answer Next Question

QUESTION: 42 Exam Topic: 3, MJTelco Case Study

View Related Case Study

MJTelco is building a custom interface to share dat

They have these requirements:
They need to do aggregations over their petabyte-scale datasets.
They need to scan specific time range rows with a very fast response time (milliseconds).
Which combination of Google Cloud Platform products should you recommend?
Cloud Datastore and Cloud Bigtable
Cloud Bigtable and Cloud SQL
BigQuery and Cloud Bigtable
BigQuery and Cloud Storage

Answer(s): C

Show Answer Next Question

QUESTION: 43 Exam Topic: 3, MJTelco Case Study

View Related Case Study

You need to compose visualization for operations teams with the following requirements:

Telemetry must include data from all 50,000 installations for the most recent 6 weeks (sampling once every minute)

The report must not be more than 3 hours delayed from live data.

The actionable report should only show suboptimal links.

Most suboptimal links should be sorted to the top.

Suboptimal links can be grouped and filtered by regional geography.

User response time to load the report must be <5 seconds.

You create a data source to store the last 6 weeks of data, and create visualizations that allow viewers to see multiple date ranges, distinct geographic regions, and unique installation types. You always show the latest data without any changes to your visualizations. You want to avoid creating and updating new visualizations each month.
What should you do?

Look through the current data and compose a series of charts and tables, one for each possible combination of criteria.
Look through the current data and compose a small set of generalized charts and tables bound to criteria filters that allow value selection.
Export the data to a spreadsheet, compose a series of charts and tables, one for each possible combination of criteria, and spread them across multiple tabs.
Load the data into relational database tables, write a Google App Engine application that queries all rows, summarizes the data across each criteria, and then renders results using the Google Charts and visualization API.

Answer(s): B

Show Answer Next Question

QUESTION: 44 Exam Topic: 3, MJTelco Case Study

View Related Case Study

Given the record streams MJTelco is interested in ingesting per day, they are concerned about the cost of Google BigQuery increasing. MJTelco asks you to provide a design solution. They require a single large data table called tracking_table. Additionally, they want to minimize the cost of daily queries while performing fine-grained analysis of each day's events. They also want to use streaming ingestion.
What should you do?

Create a table called tracking_table and include a DATE column.
Create a partitioned table called tracking_table and include a TIMESTAMP column.
Create sharded tables for each day following the pattern tracking_table_YYYYMMDD.
Create a table called tracking_table with a TIMESTAMP column to represent the day.

Answer(s): B

Explanation:

Show Answer Next Question

Free Google Professional Data Engineer Exam Questions (page: 12)

QUESTION: 41 Exam Topic: 3, MJTelco Case Study

QUESTION: 42 Exam Topic: 3, MJTelco Case Study

QUESTION: 43 Exam Topic: 3, MJTelco Case Study

QUESTION: 44 Exam Topic: 3, MJTelco Case Study

Explanation:

Professional Data Engineer Exam Discussions & Posts