Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Code review powered by LLMs (OpenAI GPT4, Sonnet 3.5) & Embeddings ⚡️ Improve code quality and catch bugs before you break production 🚀 Lives in your Github/GitLab/Azure DevOps CI
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
🍒 Cherry Studio is a desktop client that supports for multiple LLM providers. Support deepseek-r1
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
A fast multimodal LLM for real-time voice
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
A generative speech model for daily dialogue.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
🤖 Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, and BI. 📈📊📋🧑💻
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)