Home/databricks/Free Databricks Machine Learning Associate Actual Exam Questions/Question 1

Free Databricks Machine Learning Associate Actual Exam Questions - Question 1 Discussion

Question No. 1

A data scientist has developed a linear regression model using Spark ML and computed the
predictions in a Spark DataFrame preds_df with the following schema:
prediction DOUBLE
actual DOUBLE
Which of the following code blocks can be used to compute the root mean-squared-error of the
model according to the data in preds_df and assign it to the rmse variable?
A)
Machine Learning Associate practice exam questions

Machine Learning Associate practice exam questions

Machine Learning Associate real exam questions

Machine Learning Associate actual exam questions

Select one option, then reveal solution.

Usman I.

2026-02-21

Maybe D, it explicitly computes mean and sqrt clearly, looks correct.

Rayan S.

2026-02-19

Maybe C works better here since it uses built-in Spark functions to calculate squared error and then takes the sqrt over the average, which is the exact RMSE formula. D looks similar but a bit more manual.

Imran P.

2026-02-19

B tbh looks off since it misses taking the square root, so it’s actually MSE not RMSE. A also doesn’t aggregate properly. C and D both compute mean squared error, but D’s final sqrt step is clearer and cleaner.

Osama J.

2026-02-18

C/D? C uses Spark functions straightforwardly for squared errors and sqrt, but D explicitly calculates mean and then sqrt too. Both look close, but D’s aggregation looks simpler and cleaner to me.

Osama J.

2026-02-13

C imo, because it uses the built-in Spark functions to compute squared error and then applies sqrt correctly. A and B miss either the sqrt or proper aggregation, so C fits the RMSE calculation better.

Ash X.

2026-02-09

I’m going with option D here. It looks like it computes the squared difference between prediction and actual correctly, then takes the mean and applies sqrt, which is exactly how you calculate RMSE. Option B also seems close but might be missing the sqrt part at the end, so it’s probably MSE instead of RMSE.

Haris U.

2026-01-28

B (calculates squared errors and averages correctly)

Luke M.

2026-01-17

Maybe D