llama.cpp llama.cpp

llama.cppggml-org创建于 2 年前, 最后更新于 24 天前

LLM inference in C/C++
C++ 91.30MB MIT Github
Stars
74.7k
Fork
10.8k
Watch
581
Open Issues
718

LocalAI

30k Go MIT

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

1 1 年前 3 天前

LLaMA-Factory

43.2k Python Apache-2.0

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

1 1 年前 8 天前

vllm

41.5k Python Apache-2.0

A high-throughput and memory-efficient inference and serving engine for LLMs

1 2 年前 39 分钟前

Langchain-Chatchat

34.1k TypeScript Apache-2.0

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

1 1 年前 20 分钟前

llama.cpp

74.7k C++ MIT

LLM inference in C/C++

1 2 年前 24 天前

ollama

132k Go MIT

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.

1 1 年前 20 分钟前