paperless-ngx paperless-ngx

paperless-ngxpaperless-ngx创建于 3 年前, 最后更新于 5 天前

A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Python 153.38MB GPL-3.0 Github
Stars
28.3k
Fork
1.7k
Watch
123
Open Issues
11

kkFileView

12.9k Java

Universal File Online Preview Project based on Spring-Boot

1 7 年前 4 天前

OpenBB

41k Python NOASSERTION

Investment Research for Everyone, Everywhere.

1 4 年前 11 天前

markitdown

59.3k Python MIT

Python tool for converting files and office documents to Markdown.

1 7 个月前 4 天前

vectordb-recipes

784 Jupyter Notebook Apache-2.0

High quality resources & applications for LLMs, multi-modal models and VectorDBs

1 2 年前 6 天前

OCRmyPDF

29.3k Python MPL-2.0

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

1 11 年前 13 天前

swarms

4.9k Python Apache-2.0

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

1 2 年前 1 天前

tensorflow

190.4k C++ Apache-2.0

An Open Source Machine Learning Framework for Everyone

1 9 年前 3 天前

PDFMathTranslate

24.6k Python AGPL-3.0

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

1 9 个月前 13 天前

llm-course

52.7k Jupyter Notebook Apache-2.0

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

1 2 年前 29 天前

paperless-ngx

28.3k Python GPL-3.0

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

1 3 年前 5 天前

MinerU

34.9k Python AGPL-3.0

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

1 1 年前 13 天前