Which of the following approaches can enable the data engineering team to be notified if the ELT job has not been run in an hour?
A data engineer is using a Databricks SQL query to monitor the performance of an ELT job. The ELT job is triggered by a specific number of input records being ready to process. The Databricks SQL query returns the number of minutes since the job’s most recent runtime. Which of...
You notice that the job cluster takes 6 to 8 minutes to start, which delays your job from finishing on time. What steps can you take to reduce the cluster startup time?
A. Set up a second job ahead of the first job to start the cluster, so...
Which of the following SQL statements can be used to query a table while eliminating duplicate rows from the query results?
A. SELECT DISTINCT * FROM table_name
B. SELECT DISTINCT * FROM table_name HAVING COUNT(*) > 1
C. SELECT DISTINCT_ROWS (*) FROM table_name
D. SELECT * FROM table_name GROUP BY...
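As a minimal sketch of how SELECT DISTINCT behaves, the snippet below runs the statement from option A against an in-memory SQLite database rather than a Databricks table. The column names (id, city) and the sample rows are assumptions made for illustration only; they do not come from the question.

```python
import sqlite3


def distinct_rows(rows):
    """Load rows into a throwaway table and return them without duplicates."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema; the question only names the table, not its columns.
    conn.execute("CREATE TABLE table_name (id INTEGER, city TEXT)")
    conn.executemany("INSERT INTO table_name VALUES (?, ?)", rows)
    # DISTINCT collapses rows whose values match in every selected column.
    result = conn.execute(
        "SELECT DISTINCT * FROM table_name ORDER BY id"
    ).fetchall()
    conn.close()
    return result


sample = [(1, "Oslo"), (1, "Oslo"), (2, "Lima")]
deduplicated = distinct_rows(sample)
print(deduplicated)
```

The ORDER BY clause is there only to make the output order predictable; it is not needed for the de-duplication itself.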
Which of the following SQL statements can be used to update a transactions table to set a flag on the table from Y to N?
A. MODIFY transactions SET active_flag = 'N' WHERE active_flag = 'Y'
B. MERGE transactions SET active_flag = 'N' WHERE active_flag = 'Y'
C. UPDATE transactions...
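The following is a small sketch of a standard SQL UPDATE that flips a flag from Y to N, run against an in-memory SQLite table instead of a Databricks table. Option C is truncated in the listing, so the statement here is an assumption based on the standard UPDATE ... SET ... WHERE form and the column name used in options A and B, not a reconstruction of that option. The id column and sample rows are hypothetical.

```python
import sqlite3


def deactivate_flags(rows):
    """Set active_flag to 'N' wherever it is 'Y'; return (rows changed, table contents)."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema; only active_flag appears in the question's options.
    conn.execute("CREATE TABLE transactions (id INTEGER, active_flag TEXT)")
    conn.executemany("INSERT INTO transactions VALUES (?, ?)", rows)
    # Assumed statement: standard UPDATE with a WHERE filter on the old value.
    cursor = conn.execute(
        "UPDATE transactions SET active_flag = 'N' WHERE active_flag = 'Y'"
    )
    changed = cursor.rowcount
    result = conn.execute(
        "SELECT id, active_flag FROM transactions ORDER BY id"
    ).fetchall()
    conn.close()
    return changed, result


changed, after = deactivate_flags([(1, "Y"), (2, "N"), (3, "Y")])
print(changed, after)
```

Rows that already hold N are left alone because the WHERE clause filters them out before the update is applied.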
Which of the following describes how Databricks Repos can help facilitate CI/CD workflows on the Databricks Lakehouse Platform?
A. Databricks Repos can facilitate the pull request, review, and approval process before merging branches
B. Databricks Repos can merge changes from a secondary Git branch into a main Git branch
C. ...
table("uncleanedSales")
Answer: B
Explanation: The answer is
table(table_name))
A. format, checkpointLocation, schemaLocation, overwrite
B. cloudFiles.format, checkpointLocation, cloudFiles.schemaLocation, overwrite
C. cloudFiles.format, cloudFiles.schemaLocation, checkpointLocation, mergeSchema
D. cloudFiles.format, cloudFiles.schemaLocation, checkpointLocation, overwrite
E. cloudFiles.format, cloudFiles.schemaLocation, checkpointLocation, append
Answer: C
Explanation: The answer is cloudFiles.format, cloudFiles.schemaLocation, checkpointLocation, mergeSchema. Here is the end-to-end syntax of streaming ELT; the link below contains complete...
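To make the four option names in the answer concrete, here is a hedged sketch that only assembles them into two dictionaries, one for the Auto Loader reader and one for the stream writer. The question's code is truncated in the listing, so the split between reader and writer, the helper name auto_loader_options, and the example paths are all assumptions for illustration. The commented lines show how the dictionaries might be passed on a Databricks cluster; they are not executed here.

```python
def auto_loader_options(file_format, schema_location, checkpoint_location):
    """Return (reader_options, writer_options) for an Auto Loader stream."""
    # Reader side: options prefixed with cloudFiles configure Auto Loader itself.
    reader_options = {
        "cloudFiles.format": file_format,
        "cloudFiles.schemaLocation": schema_location,
    }
    # Writer side: the checkpoint tracks progress; mergeSchema lets the
    # target table pick up newly inferred columns.
    writer_options = {
        "checkpointLocation": checkpoint_location,
        "mergeSchema": "true",
    }
    return reader_options, writer_options


# Hypothetical paths, chosen only for this example.
reader_options, writer_options = auto_loader_options(
    "json", "/tmp/example/schema", "/tmp/example/checkpoint"
)
print(reader_options)
print(writer_options)

# On a cluster with an active SparkSession, usage might look like this
# (data_source and table_name are placeholders taken from the question):
#   (spark.readStream.format("cloudFiles")
#         .options(**reader_options)
#         .load(data_source)
#         .writeStream
#         .options(**writer_options)
#         .table(table_name))
```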
Which of the following developer operations in a CI/CD workflow can only be implemented through a Git provider when using Databricks Repos?
A. Trigger Databricks Repos pull API to update the latest version
B. Commit and push code
C. Create and edit code
D. Create a new branch
E. Pull request...
You have written a notebook to generate a summary data set for reporting. The notebook was scheduled using a job cluster, but you realized it takes 8 minutes to start the cluster. What feature can be used to start the cluster in a timely fashion so your job can run immediately?
A. ...