| 1 |
ollama |
166264 |
15184 |
Go |
2032 |
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. |
2026-03-27T01:06:55Z |
| 2 |
llama.cpp |
99510 |
15865 |
C++ |
483 |
LLM inference in C/C++ |
2026-03-27T01:44:05Z |
| 3 |
vllm |
74460 |
14830 |
Python |
1762 |
A high-throughput and memory-efficient inference and serving engine for LLMs |
2026-03-27T05:17:00Z |
| 4 |
LlamaFactory |
69131 |
8434 |
Python |
914 |
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) |
2026-03-27T02:04:16Z |
| 5 |
llama |
59268 |
9826 |
Python |
460 |
Inference code for Llama models |
2025-01-26T21:42:26Z |
| 6 |
unsloth |
58354 |
4925 |
Python |
938 |
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally. |
2026-03-27T05:16:21Z |
| 7 |
llama_index |
48037 |
7094 |
Python |
186 |
LlamaIndex is the leading document agent and OCR platform |
2026-03-27T04:09:27Z |
| 8 |
LocalAI |
44450 |
3800 |
Go |
120 |
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required. |
2026-03-27T01:02:49Z |
| 9 |
aider |
42423 |
4075 |
Python |
1187 |
aider is AI pair programming in your terminal |
2026-03-17T01:21:34Z |
| 10 |
quivr |
39066 |
3735 |
Python |
5 |
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want. |
2025-07-09T12:55:23Z |
| 11 |
Langchain-Chatchat |
37651 |
6186 |
Python |
3 |
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain |
2025-11-10T09:27:42Z |
| 12 |
khoj |
33649 |
2086 |
Python |
78 |
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free. |
2026-03-26T03:35:43Z |
| 13 |
llama3 |
29295 |
3527 |
Python |
178 |
The official Meta Llama 3 GitHub site |
2025-01-26T21:39:06Z |
| 14 |
fish-speech |
28829 |
2420 |
Python |
27 |
SOTA Open Source TTS |
2026-03-23T08:18:33Z |
| 15 |
AstrBot |
27661 |
1868 |
Python |
764 |
Agentic IM Chatbot infrastructure that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨ |
2026-03-27T03:34:18Z |
| 16 |
sglang |
25083 |
5020 |
Python |
600 |
SGLang is a high-performance serving framework for large language models and multimodal models. |
2026-03-27T05:11:22Z |
| 17 |
LLaVA |
24618 |
2747 |
Python |
1095 |
[NeurIPS’23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. |
2024-08-12T09:52:38Z |
| 18 |
llamafile |
23895 |
1284 |
C++ |
197 |
Distribute and run LLMs with a single file. |
2026-03-26T19:53:42Z |
| 19 |
repomix |
22677 |
1055 |
TypeScript |
126 |
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more. |
2026-03-27T01:40:31Z |
| 20 |
Awesome-Chinese-LLM |
22479 |
2116 |
None |
7 |
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。 |
2025-05-19T06:11:57Z |
| 21 |
llama2.c |
19322 |
2475 |
C |
125 |
Inference Llama 2 in one file of pure C |
2024-08-06T09:44:40Z |
| 22 |
Chinese-LLaMA-Alpaca |
18964 |
1864 |
Python |
1 |
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs) |
2025-07-15T00:53:02Z |
| 23 |
alpaca-lora |
18959 |
2199 |
Jupyter Notebook |
333 |
Instruct-tune LLaMA on consumer hardware |
2024-07-29T13:37:49Z |
| 24 |
promptfoo |
18597 |
1592 |
TypeScript |
95 |
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic. |
2026-03-27T04:37:45Z |
| 25 |
llama-cookbook |
18260 |
2716 |
Jupyter Notebook |
25 |
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services |
2026-03-26T13:11:11Z |
| 26 |
free-llm-api-resources |
17156 |
1702 |
Python |
14 |
A list of free LLM inference resources accessible via API. |
2026-03-10T00:26:45Z |
| 27 |
ChuanhuChatGPT |
15367 |
2252 |
Python |
125 |
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. |
2026-02-28T13:47:36Z |
| 28 |
llama3-from-scratch |
15253 |
1287 |
Jupyter Notebook |
16 |
llama3 implementation one matrix multiplication at a time |
2024-05-23T14:34:05Z |
| 29 |
Llama-Chinese |
14741 |
1307 |
Python |
194 |
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用 |
2025-04-06T09:16:55Z |
| 30 |
airllm |
14439 |
1451 |
Jupyter Notebook |
131 |
AirLLM 70B inference with single 4GB GPU |
2026-03-10T11:42:34Z |
| 31 |
ms-swift |
13378 |
1300 |
Python |
914 |
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, …) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, …) (AAAI 2025). |
2026-03-26T09:08:38Z |
| 32 |
dalai |
12976 |
1350 |
CSS |
291 |
The simplest way to run LLaMA on your local machine |
2024-06-18T20:29:46Z |
| 33 |
PaddleNLP |
12935 |
3053 |
Python |
122 |
Easy-to-use and powerful LLM and SLM library with awesome model zoo. |
2025-12-17T09:19:22Z |
| 34 |
OpenLLM |
12214 |
805 |
Python |
4 |
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. |
2026-03-23T16:54:29Z |
| 35 |
h2ogpt |
12001 |
1318 |
Python |
292 |
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/ |
2025-10-09T23:30:01Z |
| 36 |
shell_gpt |
11924 |
949 |
Python |
85 |
A command-line productivity tool powered by AI large language models like GPT-5, will help you accomplish your tasks faster and more efficiently. |
2026-01-28T04:07:23Z |
| 37 |
ludwig |
11665 |
1212 |
Python |
0 |
Low-code framework for building custom LLMs, neural networks, and other AI models |
2026-03-16T21:29:15Z |
| 38 |
langchain4j |
11318 |
2074 |
Java |
577 |
LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes implementing RAG, tool calling (including support for MCP), and agents easy. LangChain4j integrates seamlessly with various enterprise Java frameworks. |
2026-03-26T18:58:29Z |
| 39 |
bisheng |
11245 |
1834 |
TypeScript |
98 |
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more. |
2026-03-27T03:55:57Z |
| 40 |
tensorzero |
11153 |
797 |
Rust |
322 |
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation. |
2026-03-26T22:26:32Z |
| 41 |
llama-gpt |
10985 |
712 |
TypeScript |
84 |
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support! |
2024-04-23T18:56:06Z |
| 42 |
llama-cpp-python |
10097 |
1338 |
Python |
608 |
Python bindings for llama.cpp |
2026-03-25T23:43:36Z |
| 43 |
petals |
10026 |
597 |
Python |
92 |
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading |
2024-09-07T11:54:28Z |
| 44 |
inference |
9176 |
809 |
Python |
104 |
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API. |
2026-03-27T02:30:58Z |
| 45 |
PowerInfer |
9134 |
538 |
C++ |
123 |
High-speed Large Language Model Serving for Local Deployment |
2026-01-24T06:33:35Z |
| 46 |
ART |
9076 |
776 |
Python |
59 |
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more! |
2026-03-26T00:36:23Z |
| 47 |
TinyLlama |
8926 |
605 |
Python |
45 |
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. |
2024-05-03T20:21:20Z |
| 48 |
oumi |
8920 |
707 |
Python |
4 |
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM! |
2026-03-26T23:11:52Z |
| 49 |
ipex-llm |
8741 |
1411 |
Python |
1213 |
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc. |
2026-01-28T17:32:40Z |
| 50 |
reor |
8544 |
519 |
JavaScript |
113 |
Private & local AI personal knowledge management app for high entropy people. |
2025-05-13T21:28:59Z |
| 51 |
llama-stack |
8302 |
1290 |
Python |
153 |
Composable building blocks to build LLM Apps |
2026-03-27T01:37:00Z |
| 52 |
BELLE |
8287 |
767 |
HTML |
106 |
BELLE: Be Everyone’s Large Language model Engine(开源中文对话大模型) |
2024-10-16T11:38:59Z |
| 53 |
GPTCache |
7968 |
571 |
Python |
74 |
Semantic cache for LLMs. Fully integrated with LangChain and llama_index. |
2025-07-11T09:04:36Z |
| 54 |
awesome-LLM-resources |
7914 |
801 |
None |
3 |
🧑🚀 全世界最好的LLM资料总结(多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world’s best LLM resources. |
2026-03-27T03:04:26Z |
| 55 |
nexa-sdk |
7884 |
972 |
Kotlin |
34 |
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Supporting OpenAI GPT-OSS, IBM Granite-4, Qwen-3-VL, Gemma-3n, Ministral-3, and more. |
2026-02-26T21:13:41Z |
| 56 |
lmdeploy |
7722 |
674 |
Python |
518 |
LMDeploy is a toolkit for compressing, deploying, and serving LLMs. |
2026-03-27T04:50:21Z |
| 57 |
k8sgpt |
7594 |
963 |
Go |
87 |
Giving Kubernetes Superpowers to everyone |
2026-03-27T04:59:15Z |
| 58 |
open_llama |
7538 |
406 |
None |
37 |
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset |
2023-07-16T13:42:13Z |
| 59 |
llama-models |
7536 |
1351 |
Python |
166 |
Utilities intended for use with Llama models. |
2026-02-11T16:38:31Z |
| 60 |
Chinese-LLaMA-Alpaca-2 |
7163 |
569 |
Python |
1 |
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models) |
2025-07-15T00:47:22Z |
| 61 |
mergekit |
6915 |
680 |
Python |
236 |
Tools for merging pretrained large language models. |
2026-03-15T10:49:47Z |
| 62 |
llamacoder |
6890 |
1643 |
TypeScript |
2 |
Open source Claude Artifacts – built with Llama 3.1 405B |
2026-03-02T16:38:03Z |
| 63 |
Firefly |
6653 |
587 |
Python |
203 |
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型 |
2024-10-24T02:27:42Z |
| 64 |
llm-scraper |
6246 |
372 |
TypeScript |
3 |
Turn any webpage into structured data using LLMs |
2026-03-26T08:12:20Z |
| 65 |
Liger-Kernel |
6240 |
506 |
Python |
91 |
Efficient Triton Kernels for LLM Training |
2026-03-27T01:02:16Z |
| 66 |
YuE |
6111 |
719 |
Python |
83 |
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open |
2025-06-04T13:08:48Z |
| 67 |
lit-llama |
6083 |
521 |
Python |
100 |
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed. |
2025-07-01T16:31:39Z |
| 68 |
LaWGPT |
6037 |
546 |
Python |
86 |
🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型 |
2024-06-11T07:20:19Z |
| 69 |
LLaMA-Adapter |
5933 |
381 |
Python |
108 |
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters |
2024-03-14T08:12:53Z |
| 70 |
enchanted |
5856 |
404 |
Swift |
101 |
Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama. |
2025-03-19T20:19:21Z |
| 71 |
serge |
5737 |
396 |
Svelte |
18 |
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API. |
2025-11-21T08:07:36Z |
| 72 |
llama-fs |
5721 |
382 |
TypeScript |
53 |
A self-organizing file system with llama 3 |
2025-08-08T02:27:02Z |
| 73 |
Baichuan-7B |
5678 |
507 |
Python |
85 |
A large-scale 7B pretraining language model developed by BaiChuan-Inc. |
2024-07-18T14:23:01Z |
| 74 |
smolvlm-realtime-webcam |
5541 |
892 |
HTML |
12 |
Real-time webcam demo with SmolVLM and llama.cpp server |
2025-05-12T17:24:39Z |
| 75 |
paperless-ai |
5500 |
287 |
JavaScript |
57 |
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents. |
2026-03-21T11:24:07Z |
| 76 |
sdk-python |
5387 |
739 |
Python |
325 |
A model-driven approach to building AI agents in just a few lines of code. |
2026-03-26T20:46:48Z |
| 77 |
MedicalGPT |
5104 |
718 |
Python |
56 |
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。 |
2026-03-26T08:13:28Z |
| 78 |
Huatuo-Llama-Med-Chinese |
4951 |
500 |
Python |
28 |
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调 |
2025-02-21T02:04:37Z |
| 79 |
h2o-llmstudio |
4905 |
519 |
Python |
39 |
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/ |
2026-03-26T12:44:41Z |
| 80 |
transformerlab-app |
4840 |
506 |
Python |
36 |
The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters. |
2026-03-26T23:57:35Z |
| 81 |
gpustack |
4736 |
487 |
Python |
504 |
A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment. |
2026-03-26T06:30:46Z |
| 82 |
obsidian-smart-connections |
4720 |
292 |
JavaScript |
463 |
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3 |
2026-03-23T18:21:21Z |
| 83 |
tiny-universe |
4658 |
454 |
Jupyter Notebook |
10 |
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe |
2026-02-12T02:00:08Z |
| 84 |
casibase |
4481 |
527 |
Go |
52 |
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com |
2026-03-23T13:36:19Z |
| 85 |
GPT-4-LLM |
4337 |
311 |
HTML |
13 |
Instruction Tuning with GPT-4 |
2023-06-11T13:40:30Z |
| 86 |
llama-stack-apps |
4300 |
641 |
None |
24 |
Agentic components of the Llama Stack APIs |
2025-08-05T14:49:58Z |
| 87 |
llama_cloud_services |
4250 |
476 |
TypeScript |
351 |
Knowledge Agents and Management in the Cloud |
2026-03-25T01:27:52Z |
| 88 |
g1 |
4207 |
360 |
Python |
0 |
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains |
2025-12-30T22:28:50Z |
| 89 |
llama3-Chinese-chat |
4160 |
335 |
Python |
30 |
Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。 |
2026-02-21T08:22:18Z |
| 90 |
llama-dl |
4142 |
403 |
Shell |
9 |
High-speed download of LLaMA, Facebook’s 65B parameter GPT model |
2023-06-28T16:56:55Z |
| 91 |
Chinese-Vicuna |
4132 |
409 |
C |
65 |
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca |
2025-04-18T02:41:35Z |
| 92 |
PurpleLlama |
4086 |
710 |
Python |
23 |
Set of tools to assess and improve LLM security. |
2026-03-26T23:04:27Z |
| 93 |
llms-from-scratch-cn |
4043 |
557 |
Jupyter Notebook |
11 |
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理 |
2026-03-26T04:32:32Z |
| 94 |
LightLLM |
3975 |
313 |
Python |
83 |
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance. |
2026-03-27T03:14:09Z |
| 95 |
langroid |
3942 |
365 |
Python |
49 |
Harness LLMs with Multi-Agent Programming |
2026-03-25T21:27:35Z |
| 96 |
shimmy |
3867 |
320 |
Rust |
31 |
⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever. |
2026-03-26T03:00:53Z |
| 97 |
Operit |
3798 |
293 |
Kotlin |
51 |
The most powerful AI agent and AI chat software on Android/Operit是一款Android上能力最为强大的AI Agent |
2026-03-26T15:18:28Z |
| 98 |
zero_nlp |
3792 |
447 |
Jupyter Notebook |
102 |
中文nlp解决方案(大模型、数据、模型、训练、推理) |
2025-08-05T01:26:45Z |
| 99 |
lorax |
3741 |
312 |
Python |
150 |
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs |
2025-05-21T17:40:25Z |
| 100 |
SwanLab |
3736 |
192 |
Python |
66 |
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / Ultralytics / MMEngine / Keras etc. |
2026-03-26T12:44:46Z |