markitdown

markitdown was created by microsoft 9 month(s) ago, and last updated 4 day(s) ago.

Python tool for converting files and office documents to Markdown.
Python 3.16MB MIT Github
Stars: 70.5k
Fork: 3.8k
Watch: 249
Open Issues: 344

kkFileView

13.1k Java

Universal File Online Preview Project based on Spring-Boot

1 7 year(s) ago 11 day(s) ago

markdown-to-image

1.4k TypeScript Apache-2.0

This React component is used to render Markdown into a beautiful poster image, with support for copying as an image. Md to Poster/Image/Quote/Card/Instagram/Twitter/Facebook...

1 1 year(s) ago 4 month(s) ago

markitdown

70.5k Python MIT

Python tool for converting files and office documents to Markdown.

1 9 month(s) ago 4 day(s) ago

cherry-studio

30.4k TypeScript NOASSERTION

🍒 Cherry Studio is a desktop client that supports multiple LLM providers.

1 1 year(s) ago 20 day(s) ago

chatbox

35.1k TypeScript GPL-3.0

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

1 2 year(s) ago 2 month(s) ago

vectordb-recipes

808 Jupyter Notebook Apache-2.0

High quality resources & applications for LLMs, multi-modal models and VectorDBs

1 2 year(s) ago 12 day(s) ago

OCRmyPDF

30.5k Python MPL-2.0

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

1 11 year(s) ago 13 day(s) ago

AFFiNE

54k TypeScript NOASSERTION

There can be more than Notion and Miro. AFFiNE (pronounced [ə'fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable and ready to use.

1 3 year(s) ago 5 day(s) ago

shippie

2.2k TypeScript MIT

extendable code review and QA agent 🚢

1 2 year(s) ago 13 day(s) ago

ragflow

61.5k Python Apache-2.0

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

1 1 year(s) ago 7 day(s) ago

instructor

11.1k Python MIT

structured outputs for llms

1 2 year(s) ago 15 day(s) ago

swarms

5.1k Python Apache-2.0

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

1 2 year(s) ago 18 hour(s) ago

scira

10.4k TypeScript Apache-2.0

Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. Powered by Vercel AI SDK! Search with models like xAI's Grok 3.

1 1 year(s) ago 19 hour(s) ago

firecrawl

43.2k TypeScript AGPL-3.0

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

1 1 year(s) ago 21 day(s) ago

wiki

26.5k Vue AGPL-3.0

Wiki.js | A modern and powerful wiki app built on Node.js

1 8 year(s) ago 1 month(s) ago

WrenAI

9.7k TypeScript AGPL-3.0

⚡️ AI-powered Business Intelligence (GenBI - Generative BI) query any database in natural language, generate accurate SQL (Text-to-SQL), charts (Text-to-chart), and insights in seconds.

1 1 year(s) ago 2 day(s) ago

gitbook

28.2k TypeScript GPL-3.0

The open source frontend for GitBook doc sites

1 11 year(s) ago 5 day(s) ago

PDFMathTranslate

24.6k Python AGPL-3.0

PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with layout fully preserved; supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/MCP/Docker/Zotero

1 11 month(s) ago 2 month(s) ago

quivr

38.1k Python NOASSERTION

Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.

1 2 year(s) ago 21 day(s) ago

Flowise

41.7k TypeScript NOASSERTION

Build AI Agents, Visually

1 2 year(s) ago 21 day(s) ago