Which solution meets these requirements?

An airline has .csv-formatted data stored in Amazon S3 with an AWS Glue Data Catalog. Data analysts want to join this data with call center data stored in Amazon Redshift as part of a dally batch process. The Amazon Redshift cluster is already under a heavy load. The solution must...

January 8, 2024 No Comments READ MORE +

Which combination of solutions cost-effectively meets the company’s requirements for transforming the data?

A media company wants to perform machine learning and analytics on the data residing in its Amazon S3 data lake. There are two data transformation requirements that will enable the consumers within the company to create reports: Daily transformations of 300 GB of data with different file formats landing in...

January 8, 2024 No Comments READ MORE +

What is an explanation for this behavior and what is the solution?

A company is streaming its high-volume billing data (100 MBps) to Amazon Kinesis Data Streams. A data analyst partitioned the data on account_id to ensure that all records belonging to an account go to the same Kinesis shard and order is maintained. While building a custom consumer using the Kinesis...

January 7, 2024 No Comments READ MORE +

Which solution achieves these required access patterns to minimize costs and administrative tasks?

A large company has a central data lake to run analytics across different departments. Each department uses a separate AWS account and stores its data in an Amazon S3 bucket in that account. Each AWS account uses the AWS Glue Data Catalog as its data catalog. There are different data...

January 7, 2024 No Comments READ MORE +

What is the MOST cost-effective solution?

A company has a data warehouse in Amazon Redshift that is approximately 500 TB in size. New data is imported every few hours and read-only queries are run throughout the day and evening. There is a particularly heavy load with no writes for several hours each morning on business days....

January 7, 2024 No Comments READ MORE +

Which solution meets these requirements?

A regional energy company collects voltage data from sensors attached to buildings. To address any known dangerous conditions, the company wants to be alerted when a sequence of two voltage drops is detected within 10 minutes of a voltage spike at the same building. It is important to ensure that...

January 7, 2024 No Comments READ MORE +

Which solution meets these requirements?

A financial company uses Amazon S3 as its data lake and has set up a data warehouse using a multi-node Amazon Redshift cluster. The data files in the data lake are organized in folders based on the data source of each data file. All the data files are loaded to...

January 6, 2024 No Comments READ MORE +

Which action would MOST likely increase the performance of accessing log data in Amazon S3?

A media company has been performing analytics on log data generated by its applications. There has been a recent increase in the number of concurrent analytics jobs running, and the overall performance of existing jobs is decreasing as the number of new jobs is increasing. The partitioned data is stored...

January 6, 2024 No Comments READ MORE +

Which solution will provide the MOST up-to-date results?

A data analyst is designing a solution to interactively query datasets with SQL using a JDBC connection. Users will join data stored in Amazon S3 in Apache ORC format with data stored in Amazon Elasticsearch Service (Amazon ES) and Amazon Aurora MySQL. Which solution will provide the MOST up-to-date results?A...

January 5, 2024 No Comments READ MORE +

Which program modification will accelerate the COPY process?

A large company receives files from external parties in Amazon EC2 throughout the day. At the end of the day, the files are combined into a single file, compressed into a gzip file, and uploaded to Amazon S3. The total size of all the files is close to 100 GB...

January 5, 2024 No Comments READ MORE +