Free Professional Data Engineer Exam Braindumps (page: 25)


You are planning to use Google's Dataflow SDK to analyze customer data such as the records displayed below. Your project requirement is to extract only the customer name from the data source and then write it to an output PCollection.

Tom,555 X street

Tim,553 Y street

Sam, 111 Z street

Which operation is best suited for the above data processing requirement?

  A. ParDo
  B. Sink API
  C. Source API
  D. Data extraction

Answer(s): A

Explanation:

In the Google Cloud Dataflow SDK, you can use a ParDo transform to extract only the customer name from each element in your PCollection.


Reference:

https://cloud.google.com/dataflow/model/par-do
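For illustration, here is a minimal Beam Java sketch of this pattern (the class name and the inline sample records are hypothetical, chosen to mirror the data above): a ParDo applies a DoFn to each comma-separated record and keeps only the name field.

    import org.apache.beam.sdk.Pipeline;
    import org.apache.beam.sdk.options.PipelineOptionsFactory;
    import org.apache.beam.sdk.transforms.Create;
    import org.apache.beam.sdk.transforms.DoFn;
    import org.apache.beam.sdk.transforms.ParDo;
    import org.apache.beam.sdk.values.PCollection;

    public class ExtractNames {
      public static void main(String[] args) {
        Pipeline p = Pipeline.create(PipelineOptionsFactory.fromArgs(args).create());

        // Sample input records, matching the data shown in the question.
        PCollection<String> records =
            p.apply(Create.of("Tom,555 X street", "Tim,553 Y street", "Sam, 111 Z street"));

        // ParDo runs the DoFn on every element independently; here it emits only
        // the name field of each comma-separated record.
        PCollection<String> names =
            records.apply(ParDo.of(new DoFn<String, String>() {
              @ProcessElement
              public void processElement(@Element String line, OutputReceiver<String> out) {
                out.output(line.split(",")[0].trim());
              }
            }));

        p.run().waitUntilFinish();
      }
    }

Because each element is handled independently, ParDo is the natural fit for this kind of element-wise extraction, as opposed to the Source and Sink APIs, which deal with reading from and writing to external storage.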



Which Cloud Dataflow / Beam feature should you use to aggregate data in an unbounded data source every hour based on the time when the data entered the pipeline?

  A. An hourly watermark
  B. An event time trigger
  C. The withAllowedLateness method
  D. A processing time trigger

Answer(s): D

Explanation:

When collecting and grouping data into windows, Beam uses triggers to determine when to emit the aggregated results of each window.

Processing time triggers. These triggers operate on the processing time: the time when the data element is processed at any given stage in the pipeline.

Event time triggers. These triggers operate on the event time, as indicated by the timestamp on each data element. Beam's default trigger is event time-based.


Reference:

https://beam.apache.org/documentation/programming-guide/#triggers
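As a sketch of the idea (the events PCollection, class name, and method name are hypothetical), the Beam Java snippet below uses a processing-time trigger so that a pane is emitted roughly one hour after the first element arrives at this stage, regardless of the elements' event timestamps.

    import org.apache.beam.sdk.transforms.Count;
    import org.apache.beam.sdk.transforms.windowing.AfterProcessingTime;
    import org.apache.beam.sdk.transforms.windowing.GlobalWindows;
    import org.apache.beam.sdk.transforms.windowing.Repeatedly;
    import org.apache.beam.sdk.transforms.windowing.Window;
    import org.apache.beam.sdk.values.KV;
    import org.apache.beam.sdk.values.PCollection;
    import org.joda.time.Duration;

    public class HourlyProcessingTimeCounts {
      // Aggregates an unbounded stream by processing time: the trigger fires one hour
      // after the first element of a pane is seen, then repeats indefinitely.
      static PCollection<KV<String, Long>> countHourly(PCollection<String> events) {
        return events
            .apply(Window.<String>into(new GlobalWindows())
                .triggering(Repeatedly.forever(
                    AfterProcessingTime.pastFirstElementInPane()
                        .plusDelayOf(Duration.standardHours(1))))
                .withAllowedLateness(Duration.ZERO)   // event-time lateness is irrelevant here
                .discardingFiredPanes())
            .apply(Count.perElement());
      }
    }

An event time trigger, by contrast, would fire based on the timestamps carried by the elements themselves rather than on when the data entered the pipeline.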



Which of the following is NOT true about Dataflow pipelines?

  A. Dataflow pipelines are tied to Dataflow, and cannot be run on any other runner
  B. Dataflow pipelines can consume data from other Google Cloud services
  C. Dataflow pipelines can be programmed in Java
  D. Dataflow pipelines use a unified programming model, so can work both with streaming and batch data sources

Answer(s): A

Explanation:

Dataflow pipelines can also run on alternative runners such as Apache Spark and Apache Flink, because they are built using the Apache Beam SDK.


Reference:

https://cloud.google.com/dataflow/
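To sketch this portability (the class name is hypothetical, and the corresponding runner dependency must be on the classpath), the same Beam pipeline code can target different runners purely through pipeline options passed at launch time.

    import org.apache.beam.sdk.Pipeline;
    import org.apache.beam.sdk.options.PipelineOptions;
    import org.apache.beam.sdk.options.PipelineOptionsFactory;

    public class RunnerChoice {
      public static void main(String[] args) {
        // The runner is selected at launch time, e.g.:
        //   --runner=DirectRunner (local), --runner=DataflowRunner,
        //   --runner=FlinkRunner, --runner=SparkRunner
        PipelineOptions options =
            PipelineOptionsFactory.fromArgs(args).withValidation().create();
        Pipeline p = Pipeline.create(options);
        // ... the same transforms are applied here regardless of the chosen runner ...
        p.run().waitUntilFinish();
      }
    }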



You are developing a software application using Google's Dataflow SDK, and want to use conditionals, for loops, and other complex programming structures to create a branching pipeline.
Which component will be used for the data processing operation?

  A. PCollection
  B. Transform
  C. Pipeline
  D. Sink API

Answer(s): B

Explanation:

In the Dataflow SDK, a transform represents a data processing operation. You can use conditionals, for loops, and other complex programming structures to build a branching pipeline.


Reference:

https://cloud.google.com/dataflow/model/programming-model
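As a minimal Beam Java sketch (the lines input, the prefixes, and the class name are hypothetical), the snippet below shows how ordinary Java control flow at pipeline-construction time attaches several transform branches to the same PCollection.

    import java.util.Arrays;
    import java.util.List;
    import org.apache.beam.sdk.transforms.Count;
    import org.apache.beam.sdk.transforms.Filter;
    import org.apache.beam.sdk.values.PCollection;

    public class BranchingSketch {
      // Control flow runs while the pipeline graph is being built; each apply() call
      // adds a node, so the loop below creates one independent branch per prefix.
      static void addBranches(PCollection<String> lines, boolean includeCounts) {
        List<String> prefixes = Arrays.asList("ERROR", "WARN");
        for (String prefix : prefixes) {
          PCollection<String> branch =
              lines.apply("Filter" + prefix, Filter.by(line -> line.startsWith(prefix)));
          if (includeCounts) {
            branch.apply("Count" + prefix, Count.globally());
          }
        }
      }
    }

The branching itself comes from applying multiple transforms to the same PCollection; the conditionals and loops only decide which transforms get applied.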





