Free Databricks Certified Data Engineer Associate Actual Exam Questions - Question 6 Discussion

Question No. 6
A data engineer is using the following code block as part of a batch ingestion pipeline to read from a
composable table:
Databricks-Certified-Data-Engineer-Associate practice exam questions
Which of the following changes needs to be made so this code block will work when the transactions
table is a stream source?
Select one option, then reveal solution.
US
AI
Adeel I.
2026-02-09

Also thinking E makes the most sense because streaming requires a different reader method. Options like B or D don’t actually switch from batch to streaming, so they seem off here. Does anyone see a scenario where C would matter?

0
AQ
Ahmed Q.
2026-02-04

Maybe E, since switching to streaming usually means using spark.readStream instead of spark.read. The other options don’t really enable streaming directly.

0
BT
Brian T.
2026-02-02

I’m with the group on option E. When switching from batch to streaming in Spark, you have to switch from spark.read to spark.readStream. The other options don’t really fit what’s needed here. For example, maxFilesPerTrigger is for controlling file ingestion rate but doesn’t replace the read method itself, and format("stream") isn’t a valid format in Spark. Anyone think option C could be relevant if the table path differs for streaming? Or is it mostly about how you read the data?

0
BT
Brian T.
2026-01-25

Option E makes sense since when dealing with a streaming source like a Delta table stream, you have to use spark.readStream instead of spark.read. The rest don’t seem right—like option B’s maxFilesPerTrigger is just an option, not a replacement for schema. Also, format("stream") isn’t a valid format in Spark; it’s always "delta" for Delta Lake. So switching the reader to streaming mode with spark.readStream seems like the key change here.

0
BT
Brian T.
2026-01-16

Not seeing the actual code snippet here makes it tricky-do we know if the current code is using spark.read or spark.readStream? Also, is the schema already defined properly for streaming? Without that, it’s hard to tell which change fits best. Anyone got the code handy?

0