1 |
ollama |
147147 |
12473 |
Go |
1611 |
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models. |
2025-07-21T18:25:32Z |
2 |
llama.cpp |
83292 |
12412 |
C++ |
275 |
LLM inference in C/C++ |
2025-07-22T02:14:33Z |
3 |
llama |
58539 |
9786 |
Python |
439 |
Inference code for Llama models |
2025-01-26T21:42:26Z |
4 |
LLaMA-Factory |
54690 |
6714 |
Python |
510 |
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) |
2025-07-21T06:15:36Z |
5 |
vllm |
52827 |
8835 |
Python |
1822 |
A high-throughput and memory-efficient inference and serving engine for LLMs |
2025-07-21T20:47:47Z |
6 |
llama_index |
43248 |
6211 |
Python |
220 |
LlamaIndex is the leading framework for building LLM-powered agents over your data. |
2025-07-21T22:29:35Z |
7 |
unsloth |
42403 |
3392 |
Python |
666 |
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM. |
2025-07-21T12:35:35Z |
8 |
quivr |
38159 |
3654 |
Python |
2 |
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want. |
2025-07-09T12:55:23Z |
9 |
aider |
35806 |
3285 |
Python |
953 |
aider is AI pair programming in your terminal |
2025-07-18T11:05:54Z |
10 |
Langchain-Chatchat |
35656 |
5965 |
TypeScript |
130 |
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain |
2025-03-25T15:45:51Z |
11 |
LocalAI |
34008 |
2647 |
Go |
423 |
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference |
2025-07-21T21:37:31Z |
12 |
khoj |
30581 |
1747 |
Python |
75 |
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free. |
2025-07-20T02:32:45Z |
13 |
llama3 |
28855 |
3422 |
Python |
174 |
The official Meta Llama 3 GitHub site |
2025-01-26T21:39:06Z |
14 |
LLaVA |
23097 |
2552 |
Python |
1082 |
[NeurIPS’23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. |
2024-08-12T09:52:38Z |
15 |
llamafile |
22817 |
1199 |
C++ |
155 |
Distribute and run LLMs with a single file. |
2025-06-30T19:03:06Z |
16 |
fish-speech |
22438 |
1835 |
Python |
33 |
SOTA Open Source TTS |
2025-07-02T10:11:05Z |
17 |
Awesome-Chinese-LLM |
20701 |
1986 |
None |
5 |
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。 |
2025-05-19T06:11:57Z |
18 |
alpaca-lora |
18933 |
2226 |
Jupyter Notebook |
333 |
Instruct-tune LLaMA on consumer hardware |
2024-07-29T13:37:49Z |
19 |
Chinese-LLaMA-Alpaca |
18887 |
1887 |
Python |
1 |
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs) |
2025-07-15T00:53:02Z |
20 |
llama2.c |
18574 |
2302 |
C |
124 |
Inference Llama 2 in one file of pure C |
2024-08-06T09:44:40Z |
21 |
repomix |
18008 |
778 |
TypeScript |
96 |
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more. |
2025-07-21T16:11:21Z |
22 |
llama-cookbook |
17654 |
2555 |
Jupyter Notebook |
20 |
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services |
2025-07-22T01:27:18Z |
23 |
sglang |
16191 |
2408 |
Python |
517 |
SGLang is a fast serving framework for large language models and vision language models. |
2025-07-22T03:18:34Z |
24 |
ChuanhuChatGPT |
15400 |
2275 |
Python |
122 |
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. |
2025-03-13T09:36:38Z |
25 |
mastra |
15170 |
937 |
TypeScript |
178 |
The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama. |
2025-07-22T03:55:35Z |
26 |
llama3-from-scratch |
15051 |
1278 |
Jupyter Notebook |
13 |
llama3 implementation one matrix multiplication at a time |
2024-05-23T14:34:05Z |
27 |
Llama-Chinese |
14647 |
1309 |
Python |
196 |
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用 |
2025-04-06T09:16:55Z |
28 |
dalai |
13069 |
1388 |
CSS |
293 |
The simplest way to run LLaMA on your local machine |
2024-06-18T20:29:46Z |
29 |
PaddleNLP |
12693 |
3060 |
Python |
142 |
Easy-to-use and powerful LLM and SLM library with awesome model zoo. |
2025-07-21T11:47:13Z |
30 |
h2ogpt |
11877 |
1296 |
Python |
291 |
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/ |
2025-05-25T19:02:29Z |
31 |
OpenLLM |
11590 |
752 |
Python |
3 |
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. |
2025-07-21T16:51:05Z |
32 |
ludwig |
11531 |
1219 |
Python |
42 |
Low-code framework for building custom LLMs, neural networks, and other AI models |
2025-06-23T20:14:15Z |
33 |
shell_gpt |
11147 |
897 |
Python |
88 |
A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently. |
2025-07-17T04:13:14Z |
34 |
llama-gpt |
10992 |
710 |
TypeScript |
84 |
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support! |
2024-04-23T18:56:06Z |
35 |
AstrBot |
10703 |
746 |
Python |
245 |
✨ 易上手的多平台 LLM 聊天机器人及开发框架 ✨ 支持 QQ、QQ频道、Telegram、微信、企微、飞书、钉钉 | 知识库、MCP 服务器、OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify |
2025-07-21T10:28:43Z |
36 |
petals |
9729 |
566 |
Python |
92 |
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading |
2024-09-07T11:54:28Z |
37 |
llama-cpp-python |
9359 |
1172 |
Python |
557 |
Python bindings for llama.cpp |
2025-07-18T18:00:29Z |
38 |
bisheng |
9142 |
1493 |
TypeScript |
124 |
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more. |
2025-07-22T03:30:26Z |
39 |
ms-swift |
8794 |
756 |
Python |
740 |
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, …) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4v, Phi4, …) (AAAI 2025). |
2025-07-22T02:40:46Z |
40 |
tensorzero |
8666 |
557 |
Rust |
240 |
TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation. |
2025-07-22T00:23:29Z |
41 |
TinyLlama |
8650 |
551 |
Python |
45 |
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. |
2024-05-03T20:21:20Z |
42 |
langchain4j |
8426 |
1545 |
Java |
454 |
Java version of LangChain |
2025-07-21T20:04:58Z |
43 |
oumi |
8330 |
626 |
Python |
31 |
Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM! |
2025-07-22T03:43:43Z |
44 |
inference |
8259 |
706 |
Python |
136 |
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you’re empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. |
2025-07-21T09:00:05Z |
45 |
PowerInfer |
8237 |
433 |
C++ |
115 |
High-speed Large Language Model Serving for Local Deployment |
2025-02-19T08:15:55Z |
46 |
BELLE |
8198 |
769 |
HTML |
105 |
BELLE: Be Everyone’s Large Language model Engine(开源中文对话大模型) |
2024-10-16T11:38:59Z |
47 |
reor |
8134 |
494 |
JavaScript |
109 |
Private & local AI personal knowledge management app for high entropy people. |
2025-05-13T21:28:59Z |
48 |
ipex-llm |
8132 |
1361 |
Python |
1194 |
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc. |
2025-07-21T08:20:20Z |
49 |
llama-stack |
7918 |
1106 |
Python |
153 |
Composable building blocks to build Llama Apps |
2025-07-22T02:53:33Z |
50 |
GPTCache |
7643 |
544 |
Python |
72 |
Semantic cache for LLMs. Fully integrated with LangChain and llama_index. |
2025-07-11T09:04:36Z |
51 |
promptfoo |
7636 |
616 |
TypeScript |
179 |
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. |
2025-07-22T01:14:12Z |
52 |
open_llama |
7511 |
402 |
None |
37 |
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset |
2023-07-16T13:42:13Z |
53 |
Chinese-LLaMA-Alpaca-2 |
7169 |
572 |
Python |
3 |
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models) |
2025-07-15T00:47:22Z |
54 |
llama-models |
7155 |
1217 |
Python |
118 |
Utilities intended for use with Llama models. |
2025-07-15T21:34:35Z |
55 |
k8sgpt |
6784 |
841 |
Go |
79 |
Giving Kubernetes Superpowers to everyone |
2025-07-22T02:11:17Z |
56 |
lmdeploy |
6742 |
578 |
Python |
458 |
LMDeploy is a toolkit for compressing, deploying, and serving LLMs. |
2025-07-21T09:57:43Z |
57 |
Firefly |
6491 |
583 |
Python |
204 |
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型 |
2024-10-24T02:27:42Z |
58 |
llamacoder |
6290 |
1490 |
TypeScript |
45 |
Open source Claude Artifacts – built with Llama 3.1 405B |
2025-07-15T17:39:40Z |
59 |
lit-llama |
6080 |
521 |
Python |
108 |
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed. |
2025-07-01T16:31:39Z |
60 |
mergekit |
6069 |
581 |
Python |
222 |
Tools for merging pretrained large language models. |
2025-07-16T21:43:01Z |
61 |
LaWGPT |
5993 |
554 |
Python |
86 |
🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型 |
2024-06-11T07:20:19Z |
62 |
LLaMA-Adapter |
5886 |
381 |
Python |
107 |
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters |
2024-03-14T08:12:53Z |
63 |
airllm |
5853 |
461 |
Jupyter Notebook |
112 |
AirLLM 70B inference with single 4GB GPU |
2025-05-06T13:11:40Z |
64 |
llm-scraper |
5746 |
334 |
TypeScript |
6 |
Turn any webpage into structured data using LLMs |
2025-05-18T11:11:18Z |
65 |
awesome-LLM-resources |
5743 |
557 |
None |
0 |
🧑🚀 全世界最好的LLM资料总结(语音视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world’s best LLM resources. |
2025-07-20T02:32:56Z |
66 |
serge |
5738 |
400 |
Svelte |
18 |
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API. |
2025-07-18T08:56:45Z |
67 |
Baichuan-7B |
5686 |
504 |
Python |
85 |
A large-scale 7B pretraining language model developed by BaiChuan-Inc. |
2024-07-18T14:23:01Z |
68 |
enchanted |
5481 |
360 |
Swift |
96 |
Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama. |
2025-03-19T20:19:21Z |
69 |
Liger-Kernel |
5392 |
371 |
Python |
67 |
Efficient Triton Kernels for LLM Training |
2025-07-21T10:00:50Z |
70 |
llama-fs |
5312 |
356 |
TypeScript |
51 |
A self-organizing file system with llama 3 |
2025-02-18T01:58:14Z |
71 |
YuE |
5228 |
596 |
Python |
70 |
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open |
2025-06-04T13:08:48Z |
72 |
llm-answer-engine |
4900 |
765 |
TypeScript |
25 |
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper |
2025-06-27T03:35:14Z |
73 |
Huatuo-Llama-Med-Chinese |
4829 |
485 |
Python |
28 |
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调 |
2025-02-21T02:04:37Z |
74 |
h2o-llmstudio |
4353 |
445 |
Python |
37 |
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/ |
2025-07-15T06:23:04Z |
75 |
GPT-4-LLM |
4316 |
305 |
HTML |
13 |
Instruction Tuning with GPT-4 |
2023-06-11T13:40:30Z |
76 |
llama-stack-apps |
4267 |
631 |
None |
20 |
Agentic components of the Llama Stack APIs |
2025-04-30T18:01:33Z |
77 |
free-llm-api-resources |
4243 |
374 |
Python |
5 |
A list of free LLM inference resources accessible via API. |
2025-07-22T01:44:42Z |
78 |
g1 |
4227 |
379 |
Python |
1 |
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains |
2025-01-27T18:36:13Z |
79 |
llama-dl |
4163 |
414 |
Shell |
9 |
High-speed download of LLaMA, Facebook’s 65B parameter GPT model |
2023-06-28T16:56:55Z |
80 |
llama3-Chinese-chat |
4155 |
339 |
Python |
29 |
Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。 |
2025-05-07T06:09:40Z |
81 |
Chinese-Vicuna |
4153 |
414 |
C |
65 |
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca |
2025-04-18T02:41:35Z |
82 |
llama_cloud_services |
4065 |
434 |
Python |
258 |
Knowledge Agents and Management in the Cloud |
2025-07-21T22:49:44Z |
83 |
smolvlm-realtime-webcam |
4047 |
577 |
HTML |
11 |
Real-time webcam demo with SmolVLM and llama.cpp server |
2025-05-12T17:24:39Z |
84 |
MedicalGPT |
4003 |
585 |
Python |
46 |
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。 |
2025-07-09T11:39:44Z |
85 |
obsidian-smart-connections |
3912 |
230 |
JavaScript |
385 |
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3 |
2025-07-21T16:02:41Z |
86 |
casibase |
3866 |
456 |
Go |
41 |
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com |
2025-07-21T16:46:29Z |
87 |
paperless-ai |
3862 |
151 |
JavaScript |
11 |
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents. |
2025-07-17T20:37:04Z |
88 |
transformerlab-app |
3643 |
317 |
TypeScript |
54 |
Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your own computer. |
2025-07-21T20:33:52Z |
89 |
PurpleLlama |
3615 |
599 |
Python |
12 |
Set of tools to assess and improve LLM security. |
2025-07-03T12:34:14Z |
90 |
zero_nlp |
3557 |
421 |
Jupyter Notebook |
100 |
中文nlp解决方案(大模型、数据、模型、训练、推理) |
2025-07-20T23:59:47Z |
91 |
langroid |
3491 |
333 |
Python |
47 |
Harness LLMs with Multi-Agent Programming |
2025-07-20T17:29:23Z |
92 |
llama-hub |
3476 |
729 |
Jupyter Notebook |
82 |
A library of data loaders for LLMs made by the community – to be used with LlamaIndex and/or LangChain |
2024-03-01T15:17:16Z |
93 |
higgsfield |
3419 |
571 |
Jupyter Notebook |
1 |
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters |
2024-05-25T17:43:07Z |
94 |
LightLLM |
3396 |
269 |
Python |
78 |
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance. |
2025-07-22T03:32:35Z |
95 |
tiny-universe |
3356 |
334 |
Jupyter Notebook |
9 |
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe |
2025-04-30T06:22:05Z |
96 |
llms-from-scratch-cn |
3352 |
463 |
Jupyter Notebook |
12 |
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理 |
2024-08-15T02:19:06Z |
97 |
lorax |
3325 |
249 |
Python |
147 |
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs |
2025-05-21T17:40:25Z |
98 |
ART |
3320 |
198 |
Python |
16 |
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more! |
2025-07-22T01:42:48Z |
99 |
LLamaSharp |
3285 |
453 |
C# |
26 |
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently. |
2025-07-21T14:12:48Z |
100 |
LangChain-ChatGLM-Webui |
3273 |
494 |
Python |
45 |
基于LangChain和ChatGLM-6B等系列LLM的针对本地知识库的自动问答 |
2024-04-15T15:03:05Z |