PDFStract - The Extraction and Chunking Layer in Your RAG Pipeline - Available as CLI - WEBUI - API
-
Updated
Jan 27, 2026 - Python
PDFStract - The Extraction and Chunking Layer in Your RAG Pipeline - Available as CLI - WEBUI - API
A Node.js service for converting DOCX files to PDF using LibreOffice. It supports both synchronous and asynchronous processing with webhook notifications, error handling, and health checks. The service can be deployed on platforms like Render and Fly.io, and offers Docker support.
Add a description, image, and links to the pdfconversion topic page so that developers can more easily learn about it.
To associate your repository with the pdfconversion topic, visit your repo's landing page and select "manage topics."