Conversation
- Fix hatch_build.py: use force_include (underscore) not force-include (hyphen) so hatchling actually includes the compiled binary in the wheel - Set pure_python=False and platform tag (manylinux_2_17) so pip installs the correct architecture-specific wheel - Rewrite release.yml with matrix builds (x86_64 + aarch64) producing one wheel per arch, separate sdist, combined publish step - Attach arch-suffixed pwalk2 binaries to GitHub releases Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Switch from ThreadPoolExecutor to raw mp.Process workers to eliminate GIL contention from botocore XML parsing that was starving the main thread. Workers now use a work-stealing pattern with blocking queues and None sentinels for clean shutdown. Also adds --max-objects CLI option for benchmarking, progress display during prefix discovery, and per-worker timing instrumentation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Consolidate parquet files and meta.json under a dashboard_data/ subdirectory alongside dashboard.html, using _export_dashboard_data for both build and update commands. Update test helpers accordingly. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
The old approach used the shallowest file's parent directory as root, which breaks when files exist at different depths (e.g. S3 data). Use lexicographic min/max to compute the LCP in O(1) comparisons. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Folder chip sizes were computed from cell.folder (immediate parent dir only), giving values much lower than compute_top_folders which explodes all path components. Now accumulates chip sizes for every matching folder tag in the node's path, consistent with Python-side computation. Folder filtering now checks both node path parts and cell.folder, catching files in pruned subdirectories that the node-only approach missed. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Compares two .feather scan files and builds a diff dashboard showing where disk usage grew or shrank. Merges pruned trees via outer join, builds merged cubes with old/new/delta columns, and produces a standalone HTML dashboard with diverging red/green treemap coloring based on absolute delta, deep linking support, and top changes panel. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.