| 1 |
ollama |
157511 |
13903 |
Go |
1930 |
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models. |
2025-12-13T03:29:00Z |
| 2 |
llama.cpp |
91232 |
14085 |
C++ |
335 |
LLM inference in C/C++ |
2025-12-13T03:08:37Z |
| 3 |
vllm |
65271 |
11926 |
Python |
1868 |
A high-throughput and memory-efficient inference and serving engine for LLMs |
2025-12-13T03:34:24Z |
| 4 |
LLaMA-Factory |
63892 |
7739 |
Python |
812 |
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) |
2025-12-12T17:44:29Z |
| 5 |
llama |
58979 |
9810 |
Python |
454 |
Inference code for Llama models |
2025-01-26T21:42:26Z |
| 6 |
unsloth |
49360 |
4079 |
Python |
805 |
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM. |
2025-12-12T13:53:46Z |
| 7 |
llama_index |
45785 |
6623 |
Python |
233 |
LlamaIndex is the leading framework for building LLM-powered agents over your data. |
2025-12-12T13:05:09Z |
| 8 |
LocalAI |
40070 |
3206 |
Go |
199 |
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, MCP, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference |
2025-12-12T23:37:41Z |
| 9 |
aider |
38916 |
3726 |
Python |
1097 |
aider is AI pair programming in your terminal |
2025-12-11T19:28:27Z |
| 10 |
quivr |
38676 |
3693 |
Python |
1 |
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want. |
2025-07-09T12:55:23Z |
| 11 |
Langchain-Chatchat |
36781 |
6079 |
Python |
17 |
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain |
2025-11-10T09:27:42Z |
| 12 |
khoj |
31909 |
1893 |
Python |
75 |
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free. |
2025-12-08T04:36:04Z |
| 13 |
llama3 |
29129 |
3497 |
Python |
176 |
The official Meta Llama 3 GitHub site |
2025-01-26T21:39:06Z |
| 14 |
fish-speech |
24323 |
1995 |
Python |
16 |
SOTA Open Source TTS |
2025-12-01T19:10:07Z |
| 15 |
LLaVA |
24159 |
2670 |
Python |
1089 |
[NeurIPS’23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. |
2024-08-12T09:52:38Z |
| 16 |
llamafile |
23466 |
1247 |
C |
186 |
Distribute and run LLMs with a single file. |
2025-12-12T17:56:48Z |
| 17 |
Awesome-Chinese-LLM |
21884 |
2078 |
None |
5 |
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。 |
2025-05-19T06:11:57Z |
| 18 |
sglang |
21241 |
3732 |
Python |
633 |
SGLang is a fast serving framework for large language models and vision language models. |
2025-12-13T03:41:38Z |
| 19 |
repomix |
20623 |
947 |
TypeScript |
119 |
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more. |
2025-12-13T00:59:27Z |
| 20 |
llama2.c |
19020 |
2425 |
C |
125 |
Inference Llama 2 in one file of pure C |
2024-08-06T09:44:40Z |
| 21 |
alpaca-lora |
18981 |
2215 |
Jupyter Notebook |
333 |
Instruct-tune LLaMA on consumer hardware |
2024-07-29T13:37:49Z |
| 22 |
Chinese-LLaMA-Alpaca |
18964 |
1874 |
Python |
1 |
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs) |
2025-07-15T00:53:02Z |
| 23 |
mastra |
18788 |
1344 |
TypeScript |
177 |
The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama. |
2025-12-13T02:14:16Z |
| 24 |
llama-cookbook |
18075 |
2667 |
Jupyter Notebook |
22 |
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services |
2025-11-03T16:25:20Z |
| 25 |
ChuanhuChatGPT |
15418 |
2266 |
Python |
124 |
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. |
2025-08-15T02:25:50Z |
| 26 |
llama3-from-scratch |
15199 |
1291 |
Jupyter Notebook |
14 |
llama3 implementation one matrix multiplication at a time |
2024-05-23T14:34:05Z |
| 27 |
Llama-Chinese |
14751 |
1302 |
Python |
195 |
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用 |
2025-04-06T09:16:55Z |
| 28 |
AstrBot |
14112 |
1087 |
Python |
325 |
✨ Agentic IM ChatBot Infrastructure — 聊天智能体基础设施 ✨ 多消息平台集成(QQ / Telegram / 企微 / 飞书 / 钉钉等),强大易用的插件系统,支持 OpenAI / Gemini / Anthropic / Dify / Coze / 阿里云百炼 / 知识库 / Agent 智能体 |
2025-12-12T15:15:11Z |
| 29 |
dalai |
13020 |
1374 |
CSS |
291 |
The simplest way to run LLaMA on your local machine |
2024-06-18T20:29:46Z |
| 30 |
PaddleNLP |
12875 |
3075 |
Python |
122 |
Easy-to-use and powerful LLM and SLM library with awesome model zoo. |
2025-12-12T09:16:21Z |
| 31 |
OpenLLM |
11993 |
798 |
Python |
3 |
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. |
2025-12-08T16:56:47Z |
| 32 |
h2ogpt |
11973 |
1313 |
Python |
292 |
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/ |
2025-10-09T23:30:01Z |
| 33 |
ludwig |
11627 |
1219 |
Python |
42 |
Low-code framework for building custom LLMs, neural networks, and other AI models |
2025-12-08T21:22:04Z |
| 34 |
ms-swift |
11608 |
1049 |
Python |
806 |
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, …) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, …) (AAAI 2025). |
2025-12-12T10:06:38Z |
| 35 |
shell_gpt |
11583 |
938 |
Python |
91 |
A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently. |
2025-10-30T03:33:26Z |
| 36 |
llama-gpt |
10991 |
711 |
TypeScript |
84 |
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support! |
2024-04-23T18:56:06Z |
| 37 |
tensorzero |
10675 |
745 |
Rust |
315 |
TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation. |
2025-12-13T03:41:55Z |
| 38 |
bisheng |
10605 |
1721 |
TypeScript |
89 |
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more. |
2025-12-13T01:44:04Z |
| 39 |
langchain4j |
9932 |
1823 |
Java |
539 |
LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes implementing RAG, tool calling (including support for MCP), and agents easy. LangChain4j integrates seamlessly with various enterprise Java frameworks. |
2025-12-12T14:32:23Z |
| 40 |
petals |
9852 |
586 |
Python |
92 |
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading |
2024-09-07T11:54:28Z |
| 41 |
llama-cpp-python |
9820 |
1258 |
Python |
586 |
Python bindings for llama.cpp |
2025-08-15T06:23:26Z |
| 42 |
promptfoo |
9406 |
814 |
TypeScript |
126 |
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. |
2025-12-13T01:48:47Z |
| 43 |
inference |
8838 |
775 |
Python |
136 |
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API. |
2025-12-12T09:04:21Z |
| 44 |
TinyLlama |
8828 |
581 |
Python |
45 |
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. |
2024-05-03T20:21:20Z |
| 45 |
oumi |
8784 |
685 |
Python |
6 |
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM! |
2025-12-13T02:57:56Z |
| 46 |
ipex-llm |
8534 |
1387 |
Python |
1216 |
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc. |
2025-10-14T06:04:12Z |
| 47 |
PowerInfer |
8455 |
458 |
C++ |
120 |
High-speed Large Language Model Serving for Local Deployment |
2025-08-02T03:36:13Z |
| 48 |
reor |
8420 |
516 |
JavaScript |
110 |
Private & local AI personal knowledge management app for high entropy people. |
2025-05-13T21:28:59Z |
| 49 |
BELLE |
8277 |
768 |
HTML |
106 |
BELLE: Be Everyone’s Large Language model Engine(开源中文对话大模型) |
2024-10-16T11:38:59Z |
| 50 |
llama-stack |
8191 |
1228 |
Python |
146 |
Composable building blocks to build LLM Apps |
2025-12-13T03:41:29Z |
| 51 |
ART |
8023 |
641 |
Python |
54 |
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more! |
2025-12-12T22:04:19Z |
| 52 |
GPTCache |
7871 |
566 |
Python |
73 |
Semantic cache for LLMs. Fully integrated with LangChain and llama_index. |
2025-07-11T09:04:36Z |
| 53 |
open_llama |
7526 |
405 |
None |
37 |
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset |
2023-07-16T13:42:13Z |
| 54 |
lmdeploy |
7377 |
634 |
Python |
510 |
LMDeploy is a toolkit for compressing, deploying, and serving LLMs. |
2025-12-12T09:17:04Z |
| 55 |
llama-models |
7374 |
1293 |
Python |
151 |
Utilities intended for use with Llama models. |
2025-10-10T00:22:06Z |
| 56 |
k8sgpt |
7223 |
906 |
Go |
88 |
Giving Kubernetes Superpowers to everyone |
2025-12-12T23:06:51Z |
| 57 |
free-llm-api-resources |
7198 |
682 |
Python |
13 |
A list of free LLM inference resources accessible via API. |
2025-12-13T00:22:22Z |
| 58 |
Chinese-LLaMA-Alpaca-2 |
7180 |
569 |
Python |
0 |
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models) |
2025-07-15T00:47:22Z |
| 59 |
awesome-LLM-resources |
6940 |
671 |
None |
0 |
🧑🚀 全世界最好的LLM资料总结(语音视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world’s best LLM resources. |
2025-12-11T01:46:23Z |
| 60 |
llamacoder |
6787 |
1637 |
TypeScript |
1 |
Open source Claude Artifacts – built with Llama 3.1 405B |
2025-12-12T14:31:58Z |
| 61 |
Firefly |
6610 |
588 |
Python |
204 |
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型 |
2024-10-24T02:27:42Z |
| 62 |
mergekit |
6576 |
645 |
Python |
231 |
Tools for merging pretrained large language models. |
2025-12-10T03:30:10Z |
| 63 |
airllm |
6456 |
508 |
Jupyter Notebook |
114 |
AirLLM 70B inference with single 4GB GPU |
2025-09-03T13:50:08Z |
| 64 |
nexa-sdk |
6165 |
810 |
Go |
35 |
Run the latest LLMs and VLMs across GPU, NPU, and CPU with PC (Python/C++) & mobile (Android & iOS) support, running quickly with OpenAI gpt-oss, Granite4, Qwen3VL, Gemma 3n and more. |
2025-12-12T07:44:27Z |
| 65 |
llm-scraper |
6129 |
368 |
TypeScript |
9 |
Turn any webpage into structured data using LLMs |
2025-12-06T22:12:24Z |
| 66 |
lit-llama |
6089 |
527 |
Python |
108 |
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed. |
2025-07-01T16:31:39Z |
| 67 |
LaWGPT |
6024 |
553 |
Python |
86 |
🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型 |
2024-06-11T07:20:19Z |
| 68 |
Liger-Kernel |
5940 |
446 |
Python |
84 |
Efficient Triton Kernels for LLM Training |
2025-12-12T10:36:07Z |
| 69 |
LLaMA-Adapter |
5926 |
384 |
Python |
107 |
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters |
2024-03-14T08:12:53Z |
| 70 |
YuE |
5814 |
676 |
Python |
79 |
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open |
2025-06-04T13:08:48Z |
| 71 |
serge |
5757 |
403 |
Svelte |
18 |
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API. |
2025-11-21T08:07:36Z |
| 72 |
enchanted |
5742 |
387 |
Swift |
99 |
Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama. |
2025-03-19T20:19:21Z |
| 73 |
llama-fs |
5697 |
385 |
TypeScript |
53 |
A self-organizing file system with llama 3 |
2025-08-08T02:27:02Z |
| 74 |
Baichuan-7B |
5685 |
505 |
Python |
85 |
A large-scale 7B pretraining language model developed by BaiChuan-Inc. |
2024-07-18T14:23:01Z |
| 75 |
Huatuo-Llama-Med-Chinese |
4911 |
499 |
Python |
28 |
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调 |
2025-02-21T02:04:37Z |
| 76 |
smolvlm-realtime-webcam |
4839 |
771 |
HTML |
11 |
Real-time webcam demo with SmolVLM and llama.cpp server |
2025-05-12T17:24:39Z |
| 77 |
h2o-llmstudio |
4753 |
508 |
Python |
42 |
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/ |
2025-12-12T10:33:07Z |
| 78 |
paperless-ai |
4721 |
217 |
JavaScript |
31 |
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents. |
2025-12-02T20:05:07Z |
| 79 |
transformerlab-app |
4585 |
466 |
Python |
66 |
Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your own computer. |
2025-12-12T22:26:37Z |
| 80 |
sdk-python |
4434 |
538 |
Python |
259 |
A model-driven approach to building AI agents in just a few lines of code. |
2025-12-12T07:16:41Z |
| 81 |
MedicalGPT |
4421 |
645 |
Python |
52 |
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。 |
2025-08-30T12:31:36Z |
| 82 |
obsidian-smart-connections |
4347 |
260 |
JavaScript |
438 |
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3 |
2025-12-12T02:44:14Z |
| 83 |
GPT-4-LLM |
4340 |
306 |
HTML |
13 |
Instruction Tuning with GPT-4 |
2023-06-11T13:40:30Z |
| 84 |
casibase |
4328 |
518 |
Go |
51 |
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com |
2025-12-12T13:32:56Z |
| 85 |
llama-stack-apps |
4278 |
637 |
None |
20 |
Agentic components of the Llama Stack APIs |
2025-08-05T14:49:58Z |
| 86 |
llama_cloud_services |
4220 |
468 |
TypeScript |
325 |
Knowledge Agents and Management in the Cloud |
2025-12-12T14:24:00Z |
| 87 |
g1 |
4220 |
372 |
Python |
0 |
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains |
2025-09-11T16:29:17Z |
| 88 |
gpustack |
4196 |
425 |
Python |
349 |
GPU cluster manager for optimized AI model deployment |
2025-12-13T03:25:05Z |
| 89 |
llama3-Chinese-chat |
4163 |
336 |
Python |
29 |
Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。 |
2025-05-07T06:09:40Z |
| 90 |
llama-dl |
4155 |
407 |
Shell |
9 |
High-speed download of LLaMA, Facebook’s 65B parameter GPT model |
2023-06-28T16:56:55Z |
| 91 |
Chinese-Vicuna |
4145 |
413 |
C |
65 |
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca |
2025-04-18T02:41:35Z |
| 92 |
tiny-universe |
4129 |
410 |
Jupyter Notebook |
9 |
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe |
2025-12-02T06:12:39Z |
| 93 |
PurpleLlama |
3923 |
681 |
Python |
17 |
Set of tools to assess and improve LLM security. |
2025-11-20T21:53:42Z |
| 94 |
llms-from-scratch-cn |
3800 |
520 |
Jupyter Notebook |
13 |
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理 |
2024-08-15T02:19:06Z |
| 95 |
langroid |
3796 |
350 |
Python |
48 |
Harness LLMs with Multi-Agent Programming |
2025-11-24T21:02:31Z |
| 96 |
LightLLM |
3786 |
288 |
Python |
81 |
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance. |
2025-12-12T10:19:33Z |
| 97 |
zero_nlp |
3723 |
444 |
Jupyter Notebook |
102 |
中文nlp解决方案(大模型、数据、模型、训练、推理) |
2025-08-05T01:26:45Z |
| 98 |
ClaraVerse |
3596 |
405 |
TypeScript |
59 |
ClaraVerse is a privacy-first, fully local AI workspace featuring a Local LLM chat powered by LLama.cpp, along with support for any provider, tool calling, agent building, Stable Diffusion, and n8n-style automation. It requires no backend or API keys—just your stack and machine. |
2025-11-10T17:08:26Z |
| 99 |
lorax |
3567 |
289 |
Python |
148 |
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs |
2025-05-21T17:40:25Z |
| 100 |
higgsfield |
3510 |
585 |
Jupyter Notebook |
2 |
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters |
2024-05-25T17:43:07Z |