Which method should you invoke on the DataFrame?

exams DP-500 V1 DP-500 exam 0 Comments

You are using a Python notebook in an Apache Spark pool in Azure Synapse Analytics.

You need to present the data distribution statistics from a DataFrame in a tabular view.

Which method should you invoke on the DataFrame?
A . rollup
B. cov
C. explain
D. describe

Answer: D

Explanation:

The aggregating statistic can be calculated for multiple columns at the same time with the describe function.

Example:

titanic[["Age", "Fare"]].describe()

Out[6]:

Age Fare

count 714.000000 891.000000

mean 29.699118 32.204208

std 14.526497 49.693429

min 0.420000 0.000000

25% 20.125000 7.910400

50% 28.000000 14.454200

75% 38.000000 31.000000

max 80.000000 512.329200

Reference: https://pandas.pydata.org/docs/getting_started/intro_tutorials/06_calculate_statistics.html