Skip to content

fix: order number and text of section headers by polygon.x_start, ens…#1004

Open
jcs-zfc wants to merge 1 commit intodatalab-to:masterfrom
jcs-zfc:orderSectionNrTxt
Open

fix: order number and text of section headers by polygon.x_start, ens…#1004
jcs-zfc wants to merge 1 commit intodatalab-to:masterfrom
jcs-zfc:orderSectionNrTxt

Conversation

@jcs-zfc
Copy link

@jcs-zfc jcs-zfc commented Mar 4, 2026

Description

In some PDFs (especially those with OCR layers), section numbers were positioned after the heading text in the output. This PR ensures that header elements are ordered by polygon.x_start to maintain the correct reading order.

(Note: This is the first of three independent PRs to improve marker.)

@github-actions
Copy link
Contributor

github-actions bot commented Mar 4, 2026

CLA Assistant Lite bot All contributors have signed the CLA ✍️ ✅

@jcs-zfc
Copy link
Author

jcs-zfc commented Mar 4, 2026

I have read the CLA Document and I hereby sign the CLA

github-actions bot added a commit that referenced this pull request Mar 4, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant