Which application on the cluster should be the data engineer use?
There are thousands of text files on Amazon S3. The total size of the files is 1 PB. The files contain retail order information for the past 2 years. A data engineer needs to run multiple interactive queries to manipulate the data. The data Engineer has AWS access to spin up an Amazon EMR cluster. The data Engineer needs to use an application on the cluster to process this data and return the results in interactive time frame.
Which application on the cluster should be the data engineer use?
A . Oozie
B . Apache Pig with Tachyon
C . Apache Hive
D . Presto
Answer: D
Latest BDS-C00 Dumps Valid Version with 264 Q&As
Latest And Valid Q&A | Instant Download | Once Fail, Full Refund
Subscribe
Login
0 Comments
Inline Feedbacks
View all comments