Eventual-Inc
Daft

Performance History

Latest Results

docs: fix remaining Slack links missed in first pass

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
everettVT/slack-link-audit
4 minutes ago
feat(checkpoint): distributed observability counters (#7026)

## What

Surfaces checkpoint progress on the dashboard for distributed (Flotilla) checkpoint writes, via three run-total counters:

- **`keys_staged`**: on the `StageCheckpointKeys` source operator (source keys recorded for dedup)
- **`files_staged`**: on the write sink (staged file-metadata blobs)
- **`checkpoints_sealed`**: on the write sink (successful seals)

## Why

A checkpointed write previously gave no observability: there was no way to see from the dashboard whether, or how much, a run staged or sealed. These counters make checkpoint activity visible alongside the existing `rows.written` / throughput stats.

## How

The counters ride the existing `StatSnapshot` aggregation path rather than introducing a new mechanism:

- Each worker's `RuntimeStats` records the counter and includes it in its `StatSnapshot` (`StageCheckpointKeysSnapshot`, and two new fields on `WriteSnapshot`).
- The distributed pipeline node's `handle_worker_node_stats` sums the snapshot fields into driver-side meter counters and re-emits them via `export_snapshot`.
- Non-checkpoint operators report zero via default no-op `RuntimeStats` trait methods, so nothing changes for ordinary writes.

The source op also now records `rows.in/out` (previously it ignored its stats handle, so the node showed empty throughput); row tallies are owned by the intermediate-op runner to avoid double-counting.

## Testing

Verified end-to-end on a local Flotilla run (Ray runner, mock S3): a paired `read_parquet(checkpoint=...)` → `write_deltalake(checkpoint=...)` job shows `keys_staged=200`, `files_staged=1`, `checkpoints_sealed=1` on the dashboard, cross-checked against the staged objects in the store. A recovery rerun (same key) correctly stages zero.

## Scope

Counters only. Driver-side lifecycle *events* (commit / recovery) and the dashboard-server UI for them are intentionally left as follow-ups.
🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
main
44 minutes ago
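
The aggregation path described under "How" in the checkpoint-counters commit above can be illustrated with a minimal Python sketch. This is not Daft's implementation: the class and function names here (`CheckpointSnapshot`, `DriverCounters`, `handle_worker_stats`, `aggregate`) are hypothetical, only the three counter names come from the commit message, and the sketch assumes each worker snapshot carries per-report deltas rather than cumulative totals.

```python
from dataclasses import dataclass
from typing import Dict, Iterable


@dataclass(frozen=True)
class CheckpointSnapshot:
    """Hypothetical per-worker snapshot. Every field defaults to zero,
    which models operators that do no checkpoint work."""
    keys_staged: int = 0
    files_staged: int = 0
    checkpoints_sealed: int = 0


@dataclass
class DriverCounters:
    """Hypothetical driver-side run totals, summed from worker snapshots."""
    keys_staged: int = 0
    files_staged: int = 0
    checkpoints_sealed: int = 0

    def handle_worker_stats(self, snap: CheckpointSnapshot) -> None:
        # Assumption: each snapshot is a delta, so plain addition is correct.
        self.keys_staged += snap.keys_staged
        self.files_staged += snap.files_staged
        self.checkpoints_sealed += snap.checkpoints_sealed

    def export(self) -> Dict[str, int]:
        # Re-emit the summed totals in a form a dashboard could consume.
        return {
            "keys_staged": self.keys_staged,
            "files_staged": self.files_staged,
            "checkpoints_sealed": self.checkpoints_sealed,
        }


def aggregate(snapshots: Iterable[CheckpointSnapshot]) -> Dict[str, int]:
    """Sum a stream of worker snapshots into run totals."""
    totals = DriverCounters()
    for snap in snapshots:
        totals.handle_worker_stats(snap)
    return totals.export()
```

For example, with made-up per-worker values, `aggregate([CheckpointSnapshot(keys_staged=120), CheckpointSnapshot(keys_staged=80), CheckpointSnapshot(files_staged=1, checkpoints_sealed=1), CheckpointSnapshot()])` yields totals of 200, 1, and 1. The all-default snapshot stands in for a non-checkpoint operator and leaves the totals unchanged.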
fixes
chris/query-runner-metadata
2 hours ago
ci: rerun PR checks
jackylee-ch:codex-feat-iceberg-branch-tag-reads
4 hours ago
fix(mcap): support http urls through fsspec handler
RitwijParmar:codex/daft-http-mcap-fsspec-handler
4 hours ago

Latest Branches

CodSpeed Performance Gauge: 0%
docs: standardize Slack links to use daft.ai/slack (#7066)
25 minutes ago
70e5f6b
everettVT/slack-link-audit
CodSpeed Performance Gauge: 0%
1 hour ago
3cb4e55
colin/flight-shuffle-coordinator-memory
CodSpeed Performance Gauge: 0%
feat(checkpoint): distributed observability counters (#7026)
5 days ago
06176aa
rohit/feature/checkpoint-metrics