What should be done to reduce the impact of having such a large number of features?

A Machine Learning Specialist is building a prediction model for a large number of features using linear models, such as linear regression and logistic regression During exploratory data analysis the Specialist observes that many features are highly correlated with each other This may make the model unstable What should be...

January 15, 2024 No Comments READ MORE +

Which built-in Amazon SageMaker machine learning algorithm should be used for modeling this problem?

A bank's Machine Learning team is developing an approach for credit card fraud detection The company has a large dataset of historical data labeled as fraudulent. The goal is to build a model to take the information from new transactions and predict whether each transaction is fraudulent or not Which...

January 15, 2024 No Comments READ MORE +

Which of the following methods should the Specialist consider using to correct this?

A Machine Learning Specialist has created a deep learning neural network model that performs well on the training data but performs poorly on the test data. Which of the following methods should the Specialist consider using to correct this? (Select THREE.)A . Decrease regularization.B . Increase regularization.C . Increase dropout.D...

January 15, 2024 No Comments READ MORE +

How should the records be stored in Amazon S3 to improve query performance?

A monitoring service generates 1 TB of scale metrics record data every minute A Research team performs queries on this data using Amazon Athena The queries run slowly due to the large volume of data, and the team requires better performance How should the records be stored in Amazon S3...

January 15, 2024 No Comments READ MORE +

Which solution requires the LEAST effort to be able to query this data?

A manufacturing company has structured and unstructured data stored in an Amazon S3 bucket A Machine Learning Specialist wants to use SQL to run queries on this data. Which solution requires the LEAST effort to be able to query this data?A . Use AWS Data Pipeline to transform the data...

January 15, 2024 No Comments READ MORE +

Based on the stated parameters and given that the invocations per instance setting is measured on a per-minute basis, what should the Specialist set as the sageMaker variant invocations Per instance setting?

A Machine Learning Specialist wants to determine the appropriate SageMaker Variant Invocations Per Instance setting for an endpoint automatic scaling configuration. The Specialist has performed a load test on a single instance and determined that peak requests per second (RPS) without service degradation is about 20 RPS As this is...

January 14, 2024 No Comments READ MORE +

Which type of data repository is the MOST cost-effective solution?

A Machine Learning Specialist needs to create a data repository to hold a large amount of time-based training data for a new model. In the source system, new files are added every hour Throughout a single 24-hour period, the volume of hourly updates will change significantly. The Specialist always wants...

January 14, 2024 No Comments READ MORE +

Which of the following will accomplish this?

A company is observing low accuracy while training on the default built-in image classification algorithm in Amazon SageMaker. The Data Science team wants to use an Inception neural network architecture instead of a ResNet architecture. Which of the following will accomplish this? (Select TWO.)A . Customize the built-in image classification...

January 14, 2024 No Comments READ MORE +

How should the company set up the job?

A company is running an Amazon SageMaker training job that will access data stored in its Amazon S3 bucket A compliance policy requires that the data never be transmitted across the internet. How should the company set up the job?A . Launch the notebook instances in a public subnet and...

January 14, 2024 No Comments READ MORE +

Which combination of steps should the Data Scientist take to reduce the number of false positive predictions by the model?

A Data Scientist is developing a machine learning model to classify whether a financial transaction is fraudulent. The labeled data available for training consists of 100,000 non-fraudulent observations and 1,000 fraudulent observations. The Data Scientist applies the XGBoost algorithm to the data, resulting in the following confusion matrix when the...

January 14, 2024 No Comments READ MORE +