Databricks Exam Databricks Certified Associate Developer for Apache Spark 3.0 Topic 1 Question 81 Discussion

Question #: 81
Topic #: 1

The code block displayed below contains an error. The code block should produce a DataFrame with color as the only column and three rows with color values of red, blue, and green, respectively.

Find the error.

Code block:

spark.createDataFrame([("red",), ("blue",), ("green",)], "color")

A. Instead of calling spark.createDataFrame, just DataFrame should be called.

B. The commas in the tuples with the colors should be eliminated.

C. The colors red, blue, and green should be expressed as a simple Python list, and not a list of tuples.

D. Instead of color, a data type should be specified.

E. The 'color' expression needs to be wrapped in brackets, so it reads ['color'].

Suggested Answer: E (the 'color' expression needs to be wrapped in brackets)

Correct code block:

spark.createDataFrame([('red',), ('blue',), ('green',)], ['color'])

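The createDataFrame syntax is not exactly straightforward, but luckily the documentation (linked below) provides several examples of how to use it, including one very similar to the code block presented here, which should help you answer this question correctly. The second argument is the schema: a list such as ['color'] is read as column names, whereas a bare string is parsed as a data type string, and 'color' on its own is not a valid type.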
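As a rough illustration of the two schema forms, here is a minimal sketch. The helper names build_color_df, build_color_df_ddl and main, the app name, and the local[1] master are assumptions made for this example and do not come from the exam question. The functions take an existing SparkSession, so PySpark is only imported when main() is actually called.

```python
# Each row is a one-element tuple; the trailing comma is what makes it a tuple.
ROWS = [("red",), ("blue",), ("green",)]


def build_color_df(spark):
    """Return a one-column DataFrame; `spark` is an active SparkSession."""
    # A list of strings is treated as column names; types are inferred from the data.
    return spark.createDataFrame(ROWS, ["color"])


def build_color_df_ddl(spark):
    """Same data, but with the schema given as a data type string."""
    # A string schema has to include a type, e.g. "color: string".
    return spark.createDataFrame(ROWS, "color: string")


def main():
    # Imported lazily so the helpers above can be loaded without PySpark installed.
    from pyspark.sql import SparkSession

    # Hypothetical local session, for demonstration only.
    spark = (
        SparkSession.builder.master("local[1]").appName("color-demo").getOrCreate()
    )
    try:
        build_color_df(spark).show()
        build_color_df_ddl(spark).printSchema()
    finally:
        spark.stop()
```

Calling main() in an environment with PySpark installed prints the three rows and the schema of the single color column. The list form is the one the question expects; the string form is shown only to make clear why a bare "color" string fails.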

More info: pyspark.sql.SparkSession.createDataFrame --- PySpark 3.1.2 documentation

Static notebook | Dynamic notebook: See test 2, question 23 (Databricks import instructions)

by Carin at Jul 07, 2025, 12:54 AM
