Free Google Professional-Machine-Learning-Engineer Actual Exam Questions - Question 4 Discussion

Question No. 4
Your organization's call center has asked you to develop a model that analyzes customer sentiments
in each call. The call center receives over one million calls daily, and data is stored in Cloud Storage.
The data collected must not leave the region in which the call originated, and no Personally
Identifiable Information (Pll) can be stored or analyzed. The data science team has a third-party tool
for visualization and access which requires a SQL ANSI-2011 compliant interface. You need to select
components for data processing and for analytics. How should the data pipeline be designed?
Professional Machine Learning Engineer practice exam questions
Select one option, then reveal solution.
US
PR
Paul R.
2026-02-21

Good point on Cloud SQL handling regional data and ANSI-2011 compliance. I’d say C fits best since Pub/Sub or Datastore won’t provide the SQL interface needed, and Cloud Functions might struggle with scaling to that volume. C it is.

0
PR
Paul R.
2026-02-17

Maybe C since Dataflow can scrub PII well and Cloud SQL keeps data regional.

0
PR
Paul R.
2026-02-17

C Dataflow is great for processing and anonymizing data fast, plus Cloud SQL supports ANSI-2011 and regional instances—better for keeping data local compared to BigQuery's multi-region setups.

0
PR
Paul R.
2026-02-17

D imo, Cloud Functions can handle quick PII filtering, Cloud SQL fits regional and ANSI needs.

0
AX
Ali X.
2026-02-15

Option C makes sense too. Dataflow is solid for processing and scrubbing PII before storing, and Cloud SQL still supports ANSI-2011 SQL with regional instances available. Cloud SQL could be easier to control data residency since you can spin up instances strictly in the needed region, while BigQuery's multi-region handling might risk crossing boundaries unintentionally. Plus, Cloud SQL handles structured query needs well for the data science team’s third-party tool. The main tradeoff is scalability at this volume, but with proper instance sizing, it should work fine.

0
JI
Jason I.
2026-02-10

C/D? Cloud SQL definitely supports ANSI-2011 SQL, which the third-party tool needs, and Cloud Functions can handle event-driven processing at scale without managing servers. But not sure if Cloud Functions alone can handle the volume or PII filtering as well as Dataflow. Meanwhile, Dataflow’s strong for ETL and regional compliance, but pairing it with Cloud SQL might limit scaling with a million calls daily. So it’s a trade-off between scalability (Dataflow + BigQuery) and strict SQL compliance (Cloud SQL).

0
SW
Shoaib W.
2026-02-09

Probably C here since Cloud SQL supports ANSI-2011 SQL and can keep data regional. Dataflow can handle the processing and filtering before loading into Cloud SQL, though scalability might be a concern.

0
SM
Shah M.
2026-01-27

Maybe A makes the most sense since Dataflow can do real-time processing and PII filtering before pushing data to BigQuery, which easily handles huge volumes and supports ANSI-2011 SQL for the visualization tool.

0
HN
Hassan N.
2026-01-26

C/D? Cloud SQL supports ANSI-2011 SQL, which fits the third-party tool’s need, and processing with Cloud Functions or Dataflow can handle PII removal. But Cloud SQL might struggle with the scale of over a million calls daily, and Cloud Functions can be limited for streaming data. Pub/Sub and Datastore don’t fit well for SQL querying or large-scale analytics. So it’s more about balancing scale versus SQL compatibility here. BigQuery scales better than Cloud SQL, but if strict SQL ANSI-2011 compliance is needed, Cloud SQL options (C or D) might be considered despite scaling challenges.

0
MH
Mohammad H.
2026-01-24

Maybe A because Dataflow handles large-scale streaming well and can do preprocessing like PII removal before loading into BigQuery, which supports ANSI SQL and regional data residency. Cloud SQL won’t scale as well here.

0
SE
Sarah E.
2026-01-22

It’s A because Dataflow can filter out PII in real-time and keep data regional, while BigQuery scales for huge datasets and offers full ANSI-2011 SQL support, unlike Cloud SQL or Datastore.

0
RB
Rizwan B.
2026-01-20

It’s A because BigQuery can easily handle massive datasets with ANSI SQL support and strict regional controls, while Dataflow manages huge streaming input better than Cloud SQL or Datastore.

0
ST
Shoaib T.
2026-01-19

A. Dataflow handles large-scale streaming well, and BigQuery supports ANSI SQL and regional data residency. Cloud SQL might struggle with the volume and doesn't guarantee regional data isolation as strictly.

0
DV
Daniel V.
2026-01-16

A, but explanation is too vague here. Can someone clarify?

0