DocuPipe
self-hosted, document management

DocuPipe is an API-first intelligent document processing platform that turns messy PDFs, scans, images, tables, and text into clean, structured data. Our parsing engine performs OCR, extracts tables into rows and columns, detects checkboxes and signatures, reads handwriting, and surfaces embedded images and text. On top of the parsed output, DocuPipe provides LLM-powered services: apply or auto-create schemas to standardize data, refine schemas with feedback, classify documents with a taxonomy, analyze collections with natural language, and intelligently split multi-doc files.

DocuPipe logo

Join Our Mailing List

Stay in the loop with our monthly newsletter and be the first to know about new self-hosted software. We promise, no spam, just valuable updates.

Error. Your form has not been submittedEmoji
This is what the server says:
There must be an @ at the beginning.
I will retry
Reply
We respect your privacy and take protecting it seriously.
Built on Unicorn Platform