Free Google Professional Data Engineer Actual Exam Questions - Question 9 Discussion

Question No. 9

An organization maintains a Google BigQuery dataset that contains tables with user-level data. They want to expose aggregates of this data to other Google Cloud projects, while still controlling access to the user-level data. Additionally, they need to minimize their overall storage cost and ensure the analysis cost for other projects is assigned to those projects. What should they do?

Fahad L.

2026-02-20

It’s A. Pig simplifies coding and optimizes MapReduce without extra cluster costs.

Shah C.

2026-02-17

Maybe A. Pig scripts are generally simpler and can optimize MapReduce jobs without extra hardware or cluster changes, so it might improve speed without cost hikes.

Shah C.

2026-02-15

It’s B, since Spark speeds up processing a lot without needing more hardware.

Shah C.

2026-02-13

It’s B because Spark handles large-scale data faster by keeping data in memory, which cuts down processing time without needing more servers or cluster expansion.

Shah C.

2026-02-12

B, since Spark can run on the same cluster and usually outperforms MapReduce.

Zain T.

2026-01-21

It’s B; Spark reduces I/O overhead without adding hardware costs.

Michael D.

2026-01-16

It’s B because Spark’s in-memory processing is way faster for iterative tasks compared to classic MapReduce, which is disk-heavy and slower. Also, Spark can run on the existing cluster, so no extra hardware costs. Pig in option A is just a scripting layer on top of MapReduce, so it won’t speed things up much. C means more hardware, which breaks the no-cost-increase rule. D makes no sense since shrinking the cluster usually slows things down, regardless of Hive rewriting. So B hits that sweet spot between speed and cost.

Michael D.

2026-01-15

B. Spark handles big data faster than MapReduce without costing more.