SIGN IN SIGN UP

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

0 0 49 Jupyter Notebook

Rewrite watermark explanation, add realtime producer, rename jobs

- Reframe watermark as "when to publish" instead of "when to drop"
- Add Mermaid diagrams: late event before/after publishing
- Move watermark from pass-through job to aggregation section
- Rename event_watermark -> event_timestamp, start_job -> pass_through_job
- Add producer_realtime.py with simulated delays and NYC zone annotations
- Add aggregation_job_demo.py with 10-second windows for experimentation
- Remove Step N: prefixes from section headings
- Change watermark interval to 5 seconds
A
Alexey Grigorev committed
2d11fa3dd2e5ae841f9543c4e7dbb61fbef46fbd
Parent: 2eb5c31