A Data Scientist is developing a machine learning model to classify whether a financial transaction is fraudulent. The labeled data available for training consists of 100,000 non-fraudulent observations and 1,000 fraudulent observations.
The Data Scientist applies the XGBoost algorithm to the data, resulting in the following confusion matrix when the trained model is applied to a previously unseen validation dataset. The accuracy of the model is 99.1%, but the Data Scientist needs to reduce the number of false negatives.
Which combination of steps should the Data Scientist take to reduce the number of false negative predictions by the model? (Choose two.)
- Change the XGBoost eval_metric parameter to optimize based on Root Mean Square Error (RMSE).
- Increase the XGBoost scale_pos_weight parameter to adjust the balance of positive and negative weights.
- Increase the XGBoost max_depth parameter because the model is currently underfitting the data.
- Change the XGBoost eval_metric parameter to optimize based on Area Under the ROC Curve (AUC).
- Decrease the XGBoost max_depth parameter because the model is currently overfitting the data.
Reveal Solution Next Question