🐼 pdf

👇 6 items

PDFMathTranslate

13.3k Python AGPL-3.0

PDF scientific paper translation with preserved formats. AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/Docker.

1 8 months ago 4 months ago

MinerU

30.9k Python AGPL-3.0

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.

1 1 year ago 23 days ago

paperless-ngx

26.7k Python GPL-3.0

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

1 3 years ago 15 days ago

OCRmyPDF

27.9k Python MPL-2.0

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

1 11 years ago 15 days ago

markitdown

55.7k Python MIT

Python tool for converting files and office documents to Markdown.

1 5 months ago 6 days ago

kkFileView

12.7k Java

Universal online file preview project based on Spring Boot

1 7 years ago 5 days ago