Which application on the cluster should the data engineer use?

Thousands of text files are stored on Amazon S3, with a total size of 1 PB. The files contain retail order information for the past 2 years. A data engineer needs to run multiple interactive queries to manipulate the data. The data engineer has AWS access to spin up an Amazon EMR cluster and needs to use an application on the cluster to process this data and return the results within an interactive time frame.

Which application on the cluster should the data engineer use?
A. Oozie
B. Apache Pig with Tachyon
C. Apache Hive
D. Presto

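Answer: D

Presto is a distributed SQL query engine built for low-latency, interactive queries and can read data directly from Amazon S3. Oozie is a workflow scheduler rather than a query engine, and Pig and Hive are oriented toward batch processing.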

