Which of the following methods can be used to ingest the data incrementally?

You are ingesting millions of files that are uploaded to cloud object storage for consumption, and you have been asked to build a process to ingest this data. The schema of the files is expected to change over time, and the ingestion process should handle these changes automatically.

Which of the following methods can be used to ingest the data incrementally?
A. AUTO APPEND
B. AUTO LOADER
C. COPY INTO
D. Structured Streaming
E. Checkpoint

Answer: B

Explanation:

The answer is AUTO LOADER.

Use Auto Loader instead of the COPY INTO SQL command when:

• You want to load data from a file location that contains files on the order of millions or higher. Auto Loader can discover files more efficiently than the COPY INTO SQL command and can split file processing into multiple batches.

• COPY INTO discovers files only through directory listing, whereas Auto Loader also supports a file notification mode, in which it continues to ingest files as they arrive in cloud object storage by leveraging cloud provider services (queues and triggers) and Spark's Structured Streaming.

• Your data schema evolves frequently. Auto Loader provides better support for schema inference and evolution. See Configuring schema inference and evolution in Auto Loader.
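
The points above can be illustrated with a minimal sketch of an Auto Loader stream in Python. It assumes a Databricks runtime where the `cloudFiles` source is available; the helper names `auto_loader_options` and `start_ingest`, and all paths and table names, are hypothetical and not part of the question. The `spark` session is passed in rather than created here.

```python
def auto_loader_options(file_format, schema_location, use_notifications=False):
    """Build the option map for an Auto Loader (cloudFiles) stream.

    Hypothetical helper for illustration; the option keys are Auto Loader
    settings, the function itself is an assumption.
    """
    return {
        # Format of the incoming files, e.g. "json", "csv", "parquet".
        "cloudFiles.format": file_format,
        # Where Auto Loader stores the inferred schema and tracks its changes.
        "cloudFiles.schemaLocation": schema_location,
        # Evolve the schema by adding columns that appear in new files.
        "cloudFiles.schemaEvolutionMode": "addNewColumns",
        # "true" selects file notification mode; "false" uses directory listing.
        "cloudFiles.useNotifications": str(use_notifications).lower(),
    }


def start_ingest(spark, source_path, target_table, checkpoint_path, options):
    """Start an incremental ingest from source_path into target_table.

    Assumes `spark` is an active session on a runtime that provides the
    cloudFiles source (it is not available in open-source Spark).
    """
    stream = (
        spark.readStream
        .format("cloudFiles")
        .options(**options)
        .load(source_path)
    )
    return (
        stream.writeStream
        # The checkpoint records which files have already been processed.
        .option("checkpointLocation", checkpoint_path)
        # Let the target Delta table accept newly added columns.
        .option("mergeSchema", "true")
        # Process everything currently available, then stop.
        .trigger(availableNow=True)
        .toTable(target_table)
    )
```

A call might look like `start_ingest(spark, "/mnt/landing/orders", "bronze.orders", "/mnt/chk/orders", auto_loader_options("json", "/mnt/schema/orders"))`, where every path and table name is a placeholder. With the `addNewColumns` mode, the stream is expected to stop when a new column is first detected and to pick up the updated schema on restart, so in practice such a job would typically be scheduled with automatic retries.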
