Implement image-6 text removal and align generation defaults#5
Conversation
…r defaults. This adds the text removal task pipeline and ReMOVE/PSNR metric wiring, removes forced masked composition behavior, updates typography-8 to run without mask metadata, and switches default diffusion runs/docs to FLUX.2 klein 9B for consistent baseline behavior. Made-with: Cursor
This updates the smoke assertion to the current 40 benchmark registry size and applies Ruff-compliant import sorting in remove_metric so lint passes. Made-with: Cursor
Keep a compatibility shim in tasks/image.py and align README/HELM metadata so image-6 remains discoverable as a typography-domain benchmark with consistent benchmark counts. Made-with: Cursor
Apply Ruff-compatible import grouping in typography.py only so the PR passes lint without changing runtime behavior. Made-with: Cursor
Co-authored-by: Cursor <cursoragent@cursor.com>
Use a non-degenerate edit mask for image-conditioned layout adaptation, extend typography bbox evaluation to count missed detections, ignore local model artifacts, and remove the obsolete image task shim. Co-authored-by: Cursor <cursoragent@cursor.com>
|
Thanks for the implementation. One blocker before merge: While resolving that, please also update the |
|
Functional review follow-up after exercising
Issues I think should be fixed before merge:
|
Resolve documentation conflicts, align diffusion defaults, materialize image-6 auxiliary assets from HuggingFace rows, and avoid downloading ReMOVE checkpoints before optional dependencies are available. Co-authored-by: Cursor <cursoragent@cursor.com>
Summary
image-6(Text Removal & Background Inpainting) pipeline end-to-end in GDB, including data loading, model I/O handling, and evaluation integration.src/gdb/metrics/remove_metric.py) and PSNR metric wiring (src/gdb/metrics/core.py,src/gdb/metrics/__init__.py), and classifyimage-6under typography while preserving benchmark ID compatibility.typography-8mask-free at model-input time, and updating diffusion defaults/docs fromflux.2-klein-4btoflux.2-klein-9b.Test plan
python scripts/run_benchmarks.py --listlayout-8andtypography-7.typography-8outputs after removing mask metadata and verify summary/log updates.typography-8ModelInputno longer includesmaskmetadata.Made with Cursor