1 |
ollama |
135548 |
11258 |
Go |
1460 |
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models. |
2025-04-01T02:48:30Z |
2 |
llama.cpp |
77451 |
11265 |
C++ |
343 |
LLM inference in C/C++ |
2025-03-31T21:09:49Z |
3 |
llama |
57968 |
9720 |
Python |
427 |
Inference code for Llama models |
2025-01-26T21:42:26Z |
4 |
LLaMA-Factory |
45767 |
5596 |
Python |
423 |
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) |
2025-03-31T16:15:16Z |
5 |
vllm |
43200 |
6573 |
Python |
1548 |
A high-throughput and memory-efficient inference and serving engine for LLMs |
2025-03-31T22:27:29Z |
6 |
llama_index |
40557 |
5768 |
Python |
735 |
LlamaIndex is the leading framework for building LLM-powered agents over your data. |
2025-04-01T00:42:01Z |
7 |
quivr |
37632 |
3634 |
Python |
22 |
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want. |
2025-03-31T16:29:26Z |
8 |
unsloth |
36283 |
2807 |
Python |
931 |
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥 |
2025-03-31T19:51:21Z |
9 |
Langchain-Chatchat |
34466 |
5824 |
TypeScript |
198 |
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain on language models such as ChatGLM, Qwen, and Llama |
2025-03-25T15:45:51Z |
10 |
LocalAI |
31344 |
2379 |
Go |
418 |
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference |
2025-03-31T22:01:34Z |
11 |
aider |
30368 |
2744 |
Python |
679 |
aider is AI pair programming in your terminal |
2025-04-01T03:14:06Z |
12 |
llama3 |
28561 |
3340 |
Python |
170 |
The official Meta Llama 3 GitHub site |
2025-01-26T21:39:06Z |
13 |
khoj |
28345 |
1568 |
Python |
71 |
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free. |
2025-03-31T20:07:27Z |
14 |
llamafile |
22088 |
1159 |
C++ |
141 |
Distribute and run LLMs with a single file. |
2025-03-24T16:36:01Z |
15 |
LLaVA |
22043 |
2419 |
Python |
1059 |
[NeurIPS’23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. |
2024-08-12T09:52:38Z |
16 |
fish-speech |
20387 |
1608 |
Python |
32 |
SOTA Open Source TTS |
2025-03-20T08:59:55Z |
17 |
Awesome-Chinese-LLM |
19278 |
1857 |
None |
5 |
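A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheap to train, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials. |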
2024-09-19T11:06:18Z |
18 |
alpaca-lora |
18872 |
2231 |
Jupyter Notebook |
333 |
Instruct-tune LLaMA on consumer hardware |
2024-07-29T13:37:49Z |
19 |
Chinese-LLaMA-Alpaca |
18775 |
1892 |
Python |
1 |
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment |
2024-04-30T04:28:38Z |
20 |
llama2.c |
18247 |
2225 |
C |
122 |
Inference Llama 2 in one file of pure C |
2024-08-06T09:44:40Z |
21 |
llama-cookbook |
16559 |
2398 |
Jupyter Notebook |
15 |
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family on various provider services |
2025-03-31T12:41:20Z |
22 |
ChuanhuChatGPT |
15399 |
2287 |
Python |
122 |
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. |
2025-03-13T09:36:38Z |
23 |
MaxKB |
15365 |
2019 |
Python |
143 |
💬 Ready-to-use & flexible RAG Chatbot, supporting mainstream large language models (LLMs) such as DeepSeek-R1, Llama 3.3, Qwen2, OpenAI and more. |
2025-04-01T02:57:43Z |
24 |
llama3-from-scratch |
14748 |
1241 |
Jupyter Notebook |
13 |
llama3 implementation one matrix multiplication at a time |
2024-05-23T14:34:05Z |
25 |
Llama-Chinese |
14516 |
1300 |
Python |
196 |
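Llama Chinese community. The Llama3 online demo and fine-tuned models are now available; the latest Llama3 learning resources are collected in real time; all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use |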
2024-09-05T13:50:43Z |
26 |
repomix |
13986 |
598 |
TypeScript |
65 |
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more. |
2025-03-31T14:59:20Z |
27 |
dalai |
13081 |
1403 |
CSS |
293 |
The simplest way to run LLaMA on your local machine |
2024-06-18T20:29:46Z |
28 |
sglang |
12714 |
1402 |
Python |
437 |
SGLang is a fast serving framework for large language models and vision language models. |
2025-04-01T03:45:07Z |
29 |
PaddleNLP |
12475 |
3007 |
Python |
268 |
Easy-to-use and powerful LLM and SLM library with awesome model zoo. |
2025-04-01T02:46:29Z |
30 |
h2ogpt |
11741 |
1292 |
Python |
285 |
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/ |
2025-03-26T15:07:15Z |
31 |
mastra |
11488 |
556 |
TypeScript |
70 |
The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama. |
2025-04-01T02:28:35Z |
32 |
ludwig |
11403 |
1205 |
Python |
39 |
Low-code framework for building custom LLMs, neural networks, and other AI models |
2025-03-31T20:00:41Z |
33 |
OpenLLM |
11057 |
705 |
Python |
0 |
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud. |
2025-03-31T19:44:12Z |
34 |
llama-gpt |
10956 |
716 |
TypeScript |
84 |
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support! |
2024-04-23T18:56:06Z |
35 |
shell_gpt |
10628 |
838 |
Python |
85 |
A command-line productivity tool powered by AI large language models like GPT-4 that helps you accomplish your tasks faster and more efficiently. |
2025-02-17T04:11:14Z |
36 |
petals |
9524 |
549 |
Python |
90 |
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading |
2024-09-07T11:54:28Z |
37 |
llama-cpp-python |
8884 |
1099 |
Python |
533 |
Python bindings for llama.cpp |
2025-03-24T23:24:33Z |
38 |
TinyLlama |
8356 |
524 |
Python |
44 |
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. |
2024-05-03T20:21:20Z |
39 |
PowerInfer |
8168 |
428 |
C++ |
107 |
High-speed Large Language Model Serving for Local Deployment |
2025-02-19T08:15:55Z |
40 |
BELLE |
8101 |
765 |
HTML |
104 |
BELLE: Be Everyone’s Large Language model Engine (open-source Chinese conversational LLM) |
2024-10-16T11:38:59Z |
41 |
bisheng |
7939 |
1334 |
Python |
84 |
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more. |
2025-03-31T11:15:32Z |
42 |
reor |
7770 |
465 |
JavaScript |
106 |
Private & local AI personal knowledge management app for high entropy people. |
2025-03-31T02:42:24Z |
43 |
ipex-llm |
7669 |
1339 |
Python |
1116 |
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc. |
2025-04-01T02:59:22Z |
44 |
llama-stack |
7595 |
955 |
Python |
173 |
Composable building blocks to build Llama Apps |
2025-03-31T20:38:47Z |
45 |
GPTCache |
7489 |
530 |
Python |
71 |
Semantic cache for LLMs. Fully integrated with LangChain and llama_index. |
2024-09-18T02:05:21Z |
46 |
open_llama |
7465 |
396 |
None |
36 |
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset |
2023-07-16T13:42:13Z |
47 |
inference |
7320 |
600 |
Python |
165 |
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you’re empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. |
2025-04-01T03:17:05Z |
48 |
Chinese-LLaMA-Alpaca-2 |
7155 |
575 |
Python |
1 |
Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models |
2024-09-23T02:52:19Z |
49 |
AstrBot |
6897 |
414 |
Python |
160 |
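✨ Easy-to-use multi-platform LLM chatbot and development framework ✨ Platforms: QQ, QQ Channels, Telegram, WeChat, WeCom, Feishu; integrations: MCP servers, OpenAI, DeepSeek, Gemini, SiliconFlow, Moonshot AI, Ollama, OneAPI, Dify, and more. Includes a WebUI. |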
2025-04-01T03:35:57Z |
50 |
langchain4j |
6756 |
1259 |
Java |
386 |
Java version of LangChain |
2025-03-31T19:48:45Z |
51 |
ms-swift |
6663 |
568 |
Python |
504 |
Use PEFT or Full-parameter to finetune 500+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, DeepSeek-R1, …) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, …). |
2025-04-01T03:18:04Z |
52 |
k8sgpt |
6399 |
769 |
Go |
72 |
Giving Kubernetes Superpowers to everyone |
2025-03-31T19:17:09Z |
53 |
Firefly |
6300 |
572 |
Python |
204 |
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models |
2024-10-24T02:27:42Z |
54 |
lit-llama |
6042 |
522 |
Python |
109 |
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed. |
2024-09-06T11:38:12Z |
55 |
promptfoo |
6027 |
498 |
TypeScript |
146 |
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. |
2025-04-01T03:14:24Z |
56 |
llama-models |
5968 |
1014 |
Python |
92 |
Utilities intended for use with Llama models. |
2025-03-31T21:09:14Z |
57 |
lmdeploy |
5963 |
517 |
Python |
399 |
LMDeploy is a toolkit for compressing, deploying, and serving LLMs. |
2025-03-31T16:39:44Z |
58 |
LaWGPT |
5955 |
551 |
Python |
86 |
🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese legal knowledge. A large language model based on Chinese legal knowledge |
2024-06-11T07:20:19Z |
59 |
LLaMA-Adapter |
5846 |
379 |
Python |
106 |
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters |
2024-03-14T08:12:53Z |
60 |
airllm |
5756 |
455 |
Jupyter Notebook |
112 |
AirLLM 70B inference with single 4GB GPU |
2024-11-24T23:32:29Z |
61 |
llamacoder |
5748 |
1279 |
TypeScript |
37 |
Open source Claude Artifacts – built with Llama 3.1 405B |
2025-01-22T11:28:23Z |
62 |
serge |
5707 |
404 |
Svelte |
18 |
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API. |
2025-03-29T13:44:09Z |
63 |
Baichuan-7B |
5691 |
507 |
Python |
85 |
A large-scale 7B pretraining language model developed by BaiChuan-Inc. |
2024-07-18T14:23:01Z |
64 |
mergekit |
5493 |
522 |
Python |
198 |
Tools for merging pretrained large language models. |
2025-03-31T23:48:53Z |
65 |
llama-fs |
5216 |
328 |
TypeScript |
44 |
A self-organizing file system with llama 3 |
2025-02-18T01:58:14Z |
66 |
enchanted |
5126 |
329 |
Swift |
89 |
Enchanted is an iOS and macOS app for chatting with private self-hosted language models such as Llama2, Mistral or Vicuna using Ollama. |
2025-03-19T20:19:21Z |
67 |
llm-answer-engine |
4876 |
766 |
TypeScript |
25 |
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper |
2024-09-28T16:41:53Z |
68 |
Liger-Kernel |
4754 |
291 |
Python |
53 |
Efficient Triton Kernels for LLM Training |
2025-03-31T19:41:53Z |
69 |
Huatuo-Llama-Med-Chinese |
4721 |
476 |
Python |
29 |
Repo for BenTsao [original name: HuaTuo (华驼)]: instruction-tuning large language models with Chinese medical knowledge. |
2025-02-21T02:04:37Z |
70 |
llm-scraper |
4678 |
272 |
TypeScript |
11 |
Turn any webpage into structured data using LLMs |
2024-08-30T17:36:16Z |
71 |
YuE |
4667 |
503 |
Python |
57 |
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open |
2025-03-30T04:12:01Z |
72 |
awesome-LLM-resourses |
4561 |
474 |
None |
0 |
🧑‍🚀 Summary of the world’s best LLM resources (data processing, model training, model deployment, o1 models, MCP, small language models, vision-language models). |
2025-03-31T05:51:56Z |
73 |
GPT-4-LLM |
4289 |
305 |
HTML |
13 |
Instruction Tuning with GPT-4 |
2023-06-11T13:40:30Z |
74 |
h2o-llmstudio |
4249 |
442 |
Python |
36 |
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/ |
2025-03-07T08:32:53Z |
75 |
g1 |
4202 |
379 |
Python |
1 |
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains |
2025-01-27T18:36:13Z |
76 |
llama-stack-apps |
4185 |
614 |
None |
19 |
Agentic components of the Llama Stack APIs |
2025-04-01T02:43:53Z |
77 |
llama-dl |
4162 |
418 |
Shell |
9 |
High-speed download of LLaMA, Facebook’s 65B parameter GPT model |
2023-06-28T16:56:55Z |
78 |
Chinese-Vicuna |
4152 |
419 |
C |
65 |
Chinese-Vicuna: a Chinese instruction-following LLaMA-based model; a low-resource Chinese llama+lora approach, with a structure modeled on alpaca |
2024-11-14T12:37:47Z |
79 |
llama3-Chinese-chat |
4136 |
340 |
Python |
29 |
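Repository of Chinese post-trained versions of Llama3 and Llama3.1 (interesting weights from fine-tuned and modified versions, plus tutorial videos and docs for training, inference, evaluation, and deployment) |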
2024-09-16T10:05:58Z |
80 |
data-juicer |
4089 |
225 |
Python |
31 |
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷 |
2025-04-01T03:07:26Z |
81 |
llama_cloud_services |
3843 |
379 |
Python |
228 |
Knowledge Agents and Management in the Cloud |
2025-04-01T00:49:38Z |
82 |
MedicalGPT |
3763 |
550 |
Python |
41 |
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO. |
2025-03-20T02:32:43Z |
83 |
obsidian-smart-connections |
3478 |
201 |
JavaScript |
345 |
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3 |
2025-03-26T13:59:21Z |
84 |
llama-hub |
3474 |
731 |
Jupyter Notebook |
82 |
A library of data loaders for LLMs made by the community – to be used with LlamaIndex and/or LangChain |
2024-03-01T15:17:16Z |
85 |
casibase |
3419 |
402 |
Go |
35 |
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and Manus-like agent management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, DeepSeek R1, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com |
2025-03-31T12:58:26Z |
86 |
zero_nlp |
3337 |
397 |
Jupyter Notebook |
100 |
Chinese NLP solutions (large models, data, models, training, inference) |
2025-02-12T13:56:56Z |
87 |
higgsfield |
3327 |
558 |
Jupyter Notebook |
1 |
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters |
2024-05-25T17:43:07Z |
88 |
tensorzero |
3268 |
207 |
Rust |
119 |
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models. |
2025-04-01T02:47:10Z |
89 |
YAYI |
3264 |
44 |
Python |
0 |
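YaYi large models: secure, reliable, customized large models for customers; LlaMA 2 & BLOOM series models trained on large-scale Chinese and English multi-domain instruction data, developed by the Zhongke Wenge (中科闻歌) algorithm team. (Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM) |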
2024-01-17T07:37:16Z |
90 |
LangChain-ChatGLM-Webui |
3248 |
490 |
Python |
45 |
Automatic question answering over a local knowledge base, built on LangChain and LLMs such as the ChatGLM-6B series |
2024-04-15T15:03:05Z |
91 |
InternGPT |
3215 |
230 |
Python |
19 |
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM) |
2024-08-20T12:51:03Z |
92 |
langroid |
3202 |
307 |
Python |
55 |
Harness LLMs with Multi-Agent Programming |
2025-03-31T14:11:08Z |
93 |
LLamaSharp |
3077 |
405 |
C# |
157 |
A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently. |
2025-03-30T14:27:14Z |
94 |
lightllm |
3069 |
244 |
Python |
72 |
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance. |
2025-04-01T03:21:39Z |
95 |
Linly |
3049 |
234 |
Python |
109 |
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese conversational model; Chinese OpenLLaMA model; NLP pretraining and instruction-tuning datasets |
2024-04-14T05:19:19Z |
96 |
GPTQ-for-LLaMa |
3046 |
461 |
Python |
61 |
4 bits quantization of LLaMA using GPTQ |
2024-07-13T04:45:28Z |
97 |
PurpleLlama |
2991 |
501 |
Python |
6 |
Set of tools to assess and improve LLM security. |
2025-02-14T21:34:34Z |
98 |
Video-LLaMA |
2973 |
270 |
Python |
62 |
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding |
2024-06-04T07:06:41Z |
99 |
AGiXT |
2956 |
395 |
Python |
5 |
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions. |
2025-03-31T23:13:11Z |
100 |
lorax |
2893 |
207 |
Python |
143 |
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs |
2025-03-07T06:20:21Z |