You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Prism is a next-generation document processing SDK built in Rust, designed to view, convert, and extract content from 600+ file formats. It's the modern, developer-friendly alternative to Oracle Outside In.
Comprehensive PDF parser focused on metadata-rich, layout-aware extraction. Combines PyMuPDF/pdfplumber text analysis, Camelot/Tabula tables, image and formula capture, plus column detection to preserve reading order. Ships with TOON export + token comparisons, CLI examples, and utilities for visual debug + dataset generation.