You have configured AUTO LOADER to process incoming IOT data from cloud object storage every 15 mins, recently a change was made to the notebook code to update the processing logic but the team later realized that the notebook was failing for the last 24 hours, what steps team needs to take to reprocess the data that was not loaded after the notebook was corrected?
You have configured AUTO LOADER to process incoming IOT data from cloud object storage every 15 mins, recently a change was made to the notebook code to update the processing logic but the team later realized that the notebook was failing for the last 24 hours, what steps team needs...
You are currently asked to work on building a data pipeline, you have noticed that you are currently working on a very large scale ETL many data dependencies, which of the following tools can be used to address this problem?
You are currently asked to work on building a data pipeline, you have noticed that you are currently working on a very large scale ETL many data dependencies, which of the following tools can be used to address this problem?A . AUTO LOADER B. JOBS and TASKS C. SQL Endpoints...
What is the purpose of a silver layer in Multi hop architecture?
What is the purpose of a silver layer in Multi hop architecture?A . Replaces a traditional data lake B. Efficient storage and querying of full and unprocessed history of data C. A schema is enforced, with data quality checks. D. Refined views with aggregated data E. Optimized query performance for...
You are noticing job cluster is taking 6 to 8 mins to start which is delaying your job to finish on time, what steps you can take to reduce the amount of time cluster startup time
You are noticing job cluster is taking 6 to 8 mins to start which is delaying your job to finish on time, what steps you can take to reduce the amount of time cluster startup timeA . Setup a second job ahead of first job to start the cluster, so...
Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?
Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?A . Databricks Repos can facilitate the pull request, review, and approval process before merging branches B. Databricks Repos can merge changes from a secondary Git branch into a main Git branch C....
table("uncleanedSales")
table("uncleanedSales")View AnswerAnswer: B Explanation: The answer is
table(table_name))
table(table_name))A . format, checkpointlocation, schemalocation, overwrite B. cloudfiles.format, checkpointlocation, cloudfiles.schemalocation, overwrite C. cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, mergeSchema D. cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, overwrite E. cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, appendView AnswerAnswer: C Explanation: The answer is cloudfiles.format, cloudfiles.schemalocation, checkpointlocation, mergeSchema. Here is the end to end syntax of streaming ELT, below link contains complete...
Data science team members are using a single cluster to perform data analysis, although cluster size was chosen to handle multiple users and auto-scaling was enabled, the team realized queries are still running slow, what would be the suggested fix for this?
Data science team members are using a single cluster to perform data analysis, although cluster size was chosen to handle multiple users and auto-scaling was enabled, the team realized queries are still running slow, what would be the suggested fix for this?A . Setup multiple clusters so each team member...
Which of the following techniques structured streaming uses to ensure recovery of failures during stream processing?
Which of the following techniques structured streaming uses to ensure recovery of failures during stream processing?A . Checkpointing and Watermarking B. Write ahead logging and watermarking C. Checkpointing and write-ahead logging D. Delta time travel E. The stream will failover to available nodes in the cluster F. Checkpointing and Idempotent...
Which of the following SQL command can be used to insert or update or delete rows based on a condition to check if a row(s) exists?
Which of the following SQL command can be used to insert or update or delete rows based on a condition to check if a row(s) exists?A . MERGE INTO table_name B. COPY INTO table_name C. UPDATE table_name D. INSERT INTO OVERWRITE table_name E. INSERT IF EXISTS table_nameView AnswerAnswer: A Explanation:...