Detect and repair visually-baked Arabic text from PDFs, OCR, and legacy sources. Fixes what NFKC can't.
python nlp pdf ocr rtl bidi arabic data-cleaning arabic-nlp rag camel-tools presentation-forms text-repair
-
Updated
Jun 8, 2026 - Python