Your company receives both batch- and stream-based event data. You want to process the data using Google Cloud Dataflow over a predictable time period.
However, you realize that in some instances data can arrive late or out of order. How should you design your Cloud Dataflow pipeline to handle data that is late or out of order?
A. Set a single global window to capture all the data.
B. Set sliding windows to capture all the lagged data.
C. Use watermarks and timestamps to capture the lagged data. Most Voted
D. Ensure every datasource type (stream or batch) has a timestamp, and use the timestamps to define the logic for lagged data.
Your company receives both batch- and stream-based event data. You want to process the data using Google Cloud Dataflow
-
answerhappygod
- Site Admin
- Posts: 899604
- Joined: Mon Aug 02, 2021 8:13 am
Your company receives both batch- and stream-based event data. You want to process the data using Google Cloud Dataflow
Join a community of subject matter experts. Register for FREE to view solutions, replies, and use search function. Request answer by replying!