BlackFriday 2024! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Microsoft Exam DP-500 Topic 1 Question 15 Discussion

Actual exam question for Microsoft's DP-500 exam
Question #: 15
Topic #: 1
[All DP-500 Questions]

You are using a Python notebook in an Apache Spark pool in Azure Synapse Analytics.

You need to present the data distribution statistics from a DataFrame in a tabular view.

Which method should you invoke on the DataFrame?

Show Suggested Answer Hide Answer
Suggested Answer: B

pandas.DataFrame.corr computes pairwise correlation of columns, excluding NA/null values.

Incorrect:

* freqItems

pyspark.sql.DataFrame.freqItems

Finding frequent items for columns, possibly with false positives. Using the frequent element count algorithm described in https://doi.org/10.1145/762471.762473, proposed by Karp, Schenker, and Papadimitriou.'

* summary is used for index.

* There is no panda method for rollup. Rollup would not be correct anyway.


Contribute your Thoughts:

Currently there are no comments in this discussion, be the first to comment!


Save Cancel