Skip to content

Add CC-scale MinerU-HTML layout-clustering + propagation pipeline (91% fewer LLM calls, F1=0.91)#2075

Draft
VibhuJawa wants to merge 118 commits into
NVIDIA-NeMo:mainfrom
VibhuJawa:feat/mineru-html-layout-clustering-pipeline
Draft

Add CC-scale MinerU-HTML layout-clustering + propagation pipeline (91% fewer LLM calls, F1=0.91)#2075
VibhuJawa wants to merge 118 commits into
NVIDIA-NeMo:mainfrom
VibhuJawa:feat/mineru-html-layout-clustering-pipeline

Add single-command run_pipeline.py; fix DripperHTMLWorkflow._build_st…

5786aa1
Select commit
Loading
Failed to load commit list.