🐼 ocr

👇 3 items

MinerU

41.8k Python AGPL-3.0

A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.

1 1 year ago 5 months ago

paperless-ngx

30.8k Python GPL-3.0

A community-supported supercharged document management system: scan, index and archive all your documents

1 3 years ago 4 months ago

OCRmyPDF

31.7k Python MPL-2.0

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

1 12 years ago 2 months ago