OCRmyPDF OCRmyPDF

OCRmyPDF was created by ocrmypdf 11 year(s) ago, and last updated 23 hour(s) ago.

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Python 65.59MB MPL-2.0 Github
Stars
30.9k
Fork
2.1k
Watch
190
Open Issues
137

kkFileView

13.2k Java

Universal File Online Preview Project based on Spring-Boot

1 7 year(s) ago 7 day(s) ago

OpenBB

44.1k Python NOASSERTION

Investment Research for Everyone, Everywhere.

1 4 year(s) ago 1 month(s) ago

markitdown

71.4k Python MIT

Python tool for converting files and office documents to Markdown.

1 9 month(s) ago 7 day(s) ago

OCRmyPDF

30.9k Python MPL-2.0

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

1 11 year(s) ago 23 hour(s) ago

browser-use

66.3k Python MIT

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

1 9 month(s) ago 27 day(s) ago

instructor

11.2k Python MIT

structured outputs for llms

1 2 year(s) ago 11 day(s) ago

kokoro-onnx

2k Python MIT

TTS with kokoro and onnx runtime

1 7 month(s) ago 2 month(s) ago

Jobs_Applier_AI_Agent_AIHawk

28.5k Python AGPL-3.0

AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

1 1 year(s) ago 28 day(s) ago

MoneyPrinterTurbo

19.9k Python MIT

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

1 1 year(s) ago 7 month(s) ago

tensorflow

191.2k C++ Apache-2.0

An Open Source Machine Learning Framework for Everyone

1 9 year(s) ago 7 day(s) ago

PDFMathTranslate

24.6k Python AGPL-3.0

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

1 11 month(s) ago 2 month(s) ago

LLMs-from-scratch

60.2k Jupyter Notebook NOASSERTION

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

1 2 year(s) ago 25 day(s) ago

ChatTTS

37.6k Python AGPL-3.0

A generative speech model for daily dialogue.

1 1 year(s) ago 1 day(s) ago

dify

110.8k TypeScript NOASSERTION

Production-ready platform for agentic workflow development.

1 2 year(s) ago 9 day(s) ago

paperless-ngx

30.8k Python GPL-3.0

A community-supported supercharged document management system: scan, index and archive all your documents

1 3 year(s) ago 1 day(s) ago

MinerU

41.8k Python AGPL-3.0

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

1 1 year(s) ago 8 day(s) ago

yt-dlp

118.6k Python Unlicense

A feature-rich command-line audio/video downloader

1 4 year(s) ago 1 month(s) ago

fastapi

88k Python MIT

FastAPI framework, high performance, easy to learn, fast to code, ready for production

1 6 year(s) ago 17 day(s) ago