Add CC-scale MinerU-HTML layout-clustering + propagation pipeline (91% fewer LLM calls, F1=0.91) #2075

Draft
VibhuJawa wants to merge 118 commits into NVIDIA-NeMo:main from VibhuJawa:feat/mineru-html-layout-clustering-pipeline

Commits

Commits on Jun 13, 2026

Commits on Jun 14, 2026

Commits on Jun 15, 2026