What metric is BEST suited to score the model?

A Machine Learning Specialist is working for a credit card processing company and receives an unbalanced dataset containing credit card transactions. It contains 99,000 valid transactions and 1,000 fraudulent transactions The Specialist is asked to score a model that was run against the dataset The Specialist has been advised that...

January 13, 2024 No Comments READ MORE +

What actions would allow the Specialist to get relevant numerical representations?

A Machine Learning Specialist is developing recommendation engine for a photography blog Given a picture, the recommendation engine should show a picture that captures similar objects The Specialist would like to create a numerical representation feature to perform nearest-neighbor searches What actions would allow the Specialist to get relevant numerical...

January 13, 2024 No Comments READ MORE +

What model evaluation technique should the Specialist use to understand how different classification thresholds will impact the model's performance?

A Machine Learning Specialist is building a logistic regression model that will predict whether or not a person will order a pizza. The Specialist is trying to build the optimal model with an ideal classification threshold. What model evaluation technique should the Specialist use to understand how different classification thresholds...

January 13, 2024 No Comments READ MORE +

Which approach should the Specialist use to continue working?

A Machine Learning Specialist is assigned a TensorFlow project using Amazon SageMaker for training, and needs to continue working for an extended period with no Wi-Fi access. Which approach should the Specialist use to continue working?A . Install Python 3 and boto3 on their laptop and continue the code development...

January 13, 2024 No Comments READ MORE +

Which of the following should be used to resolve this issue?

An insurance company is developing a new device for vehicles that uses a camera to observe drivers' behavior and alert them when they appear distracted. The company created approximately 10,000 training images in a controlled environment that a Machine Learning Specialist will use to train and evaluate machine learning models...

January 13, 2024 No Comments READ MORE +

What model should be used to complete this work?

A Machine Learning Specialist was given a dataset consisting of unlabeled data The Specialist must create a model that can help the team classify the data into different buckets. What model should be used to complete this work?A . K-means clusteringB . Random Cut Forest (RCF)C . XGBoostD . BlazingTextView...

January 12, 2024 No Comments READ MORE +

What is the MOST likely cause of this issue?

During mini-batch training of a neural network for a classification problem, a Data Scientist notices that training accuracy oscillates. What is the MOST likely cause of this issue?A . The class distribution in the dataset is imbalancedB . Dataset shuffling is disabledC . The batch size is too bigD ....

January 12, 2024 No Comments READ MORE +

How should a Machine Learning Specialist address this issue for future documents?

A company is using Amazon Polly to translate plaintext documents to speech for automated company announcements However company acronyms are being mispronounced in the current documents How should a Machine Learning Specialist address this issue for future documents?A . Convert current documents to SSML with pronunciation tagsB . Create an...

January 11, 2024 No Comments READ MORE +

How should the Specialist frame this business problem?

A Machine Learning Specialist works for a credit card processing company and needs to predict which transactions may be fraudulent in near-real time. Specifically, the Specialist must train a model that returns the probability that a given transaction may be fraudulent. How should the Specialist frame this business problem?A ....

January 11, 2024 No Comments READ MORE +

Which storage scheme is MOST adapted to this scenario?

A Data Science team is designing a dataset repository where it will store a large amount of training data commonly used in its machine learning models. As Data Scientists may create an arbitrary number of new datasets every day the solution has to scale automatically and be cost-effective. Also, it...

January 11, 2024 No Comments READ MORE +