🐼 ocr

👇 3 Elemente

OCRmyPDF

18.2k Python MPL-2.0

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

1 vor 11 Jahr(en) vor 22 Tag(en)

MinerU

27.5k Python AGPL-3.0

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

1 vor 1 Jahr(en) vor 7 Tag(en)

paperless-ngx

25.6k Python GPL-3.0

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

1 vor 3 Jahr(en) vor 7 Tag(en)