OCRmyPDF

OCRmyPDF was created by ocrmypdf 11 years ago and last updated 22 days ago.

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Python 64.87MB MPL-2.0 GitHub
Stars: 18.2k
Forks: 1.2k
Watchers: 151
Open Issues: 119

OpenBB

36.6k Python NOASSERTION

Investment Research for Everyone, Everywhere.

1 4 year(s) ago 13 day(s) ago

markitdown

39.6k Python MIT

Python tool for converting files and office documents to Markdown.

1 4 month(s) ago 6 day(s) ago

OCRmyPDF

18.2k Python MPL-2.0

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

1 11 year(s) ago 22 day(s) ago

browser-use

25.4k Python MIT

Make websites accessible for AI agents

1 4 month(s) ago 1 month(s) ago

instructor

8.9k Python MIT

structured outputs for llms

1 1 year(s) ago 1 month(s) ago

kokoro-onnx

1.7k Python MIT

TTS with kokoro and onnx runtime

1 2 month(s) ago 2 day(s) ago

Jobs_Applier_AI_Agent_AIHawk

27.5k Python AGPL-3.0

Jobs_Applier_AI_Agent_AIHawk aims to ease the job hunt by automating the job application process. Using artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

1 7 month(s) ago 2 day(s) ago

MoneyPrinterTurbo

19.9k Python MIT

Generate high-definition short videos with one click using AI large language models.

1 1 year(s) ago 1 month(s) ago

tensorflow

188.5k C++ Apache-2.0

An Open Source Machine Learning Framework for Everyone

1 9 year(s) ago 5 day(s) ago

PDFMathTranslate

13.3k Python AGPL-3.0

PDF scientific paper translation with preserved formats: AI-based full-text bilingual translation of PDF documents that fully preserves the layout, supports services such as Google/DeepL/Ollama/OpenAI, and provides CLI/GUI/Docker.

1 6 month(s) ago 2 month(s) ago

LLMs-from-scratch

36.7k Jupyter Notebook NOASSERTION

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

1 1 year(s) ago 2 month(s) ago

ChatTTS

34.9k Python AGPL-3.0

A generative speech model for daily dialogue.

1 9 month(s) ago 7 day(s) ago

dify

78k TypeScript NOASSERTION

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

1 1 year(s) ago 7 day(s) ago

paperless-ngx

25.6k Python GPL-3.0

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

1 3 year(s) ago 7 day(s) ago

MinerU

27.5k Python AGPL-3.0

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool.

1 1 year(s) ago 7 day(s) ago

yt-dlp

102.1k Python Unlicense

A feature-rich command-line audio/video downloader

1 4 year(s) ago 15 day(s) ago

fastapi

81.7k Python MIT

FastAPI framework, high performance, easy to learn, fast to code, ready for production

1 6 year(s) ago 7 day(s) ago