OCRmyPDF OCRmyPDF

OCRmyPDF was created by ocrmypdf 11 year(s) ago, and last updated 19 day(s) ago.

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Python 65.34MB MPL-2.0 Github
Stars
29.3k
Fork
2k
Watch
188
Open Issues
129

kkFileView

12.9k Java

Universal File Online Preview Project based on Spring-Boot

1 7 year(s) ago 2 day(s) ago

OpenBB

41k Python NOASSERTION

Investment Research for Everyone, Everywhere.

1 4 year(s) ago 18 day(s) ago

markitdown

59.6k Python MIT

Python tool for converting files and office documents to Markdown.

1 7 month(s) ago 2 day(s) ago

OCRmyPDF

29.3k Python MPL-2.0

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

1 11 year(s) ago 19 day(s) ago

browser-use

61.2k Python MIT

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

1 8 month(s) ago 1 month(s) ago

instructor

10.4k Python MIT

structured outputs for llms

1 2 year(s) ago 1 month(s) ago

kokoro-onnx

2k Python MIT

TTS with kokoro and onnx runtime

1 5 month(s) ago 15 day(s) ago

Jobs_Applier_AI_Agent_AIHawk

28.3k Python AGPL-3.0

AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

1 11 month(s) ago 6 day(s) ago

MoneyPrinterTurbo

19.9k Python MIT

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

1 1 year(s) ago 5 month(s) ago

tensorflow

190.5k C++ Apache-2.0

An Open Source Machine Learning Framework for Everyone

1 9 year(s) ago 1 day(s) ago

PDFMathTranslate

24.6k Python AGPL-3.0

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

1 9 month(s) ago 19 day(s) ago

LLMs-from-scratch

56k Jupyter Notebook NOASSERTION

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

1 1 year(s) ago 3 day(s) ago

ChatTTS

36.9k Python AGPL-3.0

A generative speech model for daily dialogue.

1 1 year(s) ago 12 day(s) ago

dify

87.8k TypeScript NOASSERTION

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

1 2 year(s) ago 3 month(s) ago

paperless-ngx

28.3k Python GPL-3.0

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

1 3 year(s) ago 11 day(s) ago

MinerU

34.9k Python AGPL-3.0

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

1 1 year(s) ago 20 day(s) ago

yt-dlp

116.6k Python Unlicense

A feature-rich command-line audio/video downloader

1 4 year(s) ago 3 day(s) ago

fastapi

86.4k Python MIT

FastAPI framework, high performance, easy to learn, fast to code, ready for production

1 6 year(s) ago 12 day(s) ago