1 |
ollama |
135548 |
11258 |
Go |
1460 |
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models. |
2025-04-01T02:48:30Z |
2 |
LLaMA-Factory |
45767 |
5596 |
Python |
423 |
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) |
2025-03-31T16:15:16Z |
3 |
unsloth |
36283 |
2807 |
Python |
931 |
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥 |
2025-03-31T19:51:21Z |
4 |
LocalAI |
31344 |
2379 |
Go |
418 |
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference |
2025-03-31T22:01:34Z |
5 |
khoj |
28345 |
1568 |
Python |
71 |
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free. |
2025-03-31T20:07:27Z |
6 |
LibreChat |
23924 |
4006 |
TypeScript |
137 |
Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project. |
2025-04-01T02:44:21Z |
7 |
ludwig |
11403 |
1205 |
Python |
39 |
Low-code framework for building custom LLMs, neural networks, and other AI models |
2025-03-31T20:00:41Z |
8 |
OpenLLM |
11057 |
705 |
Python |
0 |
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. |
2025-03-31T19:44:12Z |
9 |
mistral-inference |
10138 |
908 |
Jupyter Notebook |
121 |
Official inference library for Mistral models |
2025-03-20T15:03:08Z |
10 |
ipex-llm |
7669 |
1339 |
Python |
1116 |
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc. |
2025-04-01T02:59:22Z |
11 |
inference |
7320 |
600 |
Python |
165 |
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you’re empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. |
2025-04-01T03:17:05Z |
12 |
ms-swift |
6663 |
568 |
Python |
504 |
Use PEFT or Full-parameter to finetune 500+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, DeepSeek-R1, …) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, …). |
2025-04-01T03:18:04Z |
13 |
Firefly |
6300 |
572 |
Python |
204 |
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models |
2024-10-24T02:27:42Z |
14 |
big-AGI |
6276 |
1447 |
TypeScript |
234 |
AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud. |
2025-03-30T18:18:53Z |
15 |
mistral.rs |
5385 |
391 |
Rust |
109 |
Blazingly fast LLM inference. |
2025-03-30T01:33:26Z |
16 |
enchanted |
5126 |
329 |
Swift |
89 |
Enchanted is an iOS and macOS app for chatting with private self-hosted language models such as Llama2, Mistral or Vicuna using Ollama. |
2025-03-19T20:19:21Z |
17 |
opencompass |
5065 |
529 |
Python |
291 |
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets. |
2025-03-31T11:08:55Z |
18 |
Liger-Kernel |
4754 |
291 |
Python |
53 |
Efficient Triton Kernels for LLM Training |
2025-03-31T19:41:53Z |
19 |
awesome-LLM-resourses |
4561 |
474 |
None |
0 |
🧑‍🚀 Summary of the world’s best LLM resources (data processing, model training, model deployment, o1 models, MCP, small language models, vision-language models). |
2025-03-31T05:51:56Z |
20 |
xtuner |
4435 |
335 |
Python |
214 |
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, …) |
2025-03-31T10:35:50Z |
21 |
agentops |
4154 |
374 |
Python |
90 |
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including OpenAI Agents SDK, CrewAI, Langchain, Autogen, AG2, and CamelAI |
2025-03-31T20:40:42Z |
22 |
chinese-llm-benchmark |
3908 |
166 |
None |
28 |
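Currently covers 203 large models, including commercial models such as chatgpt, gpt-4o, o3-mini, Google gemini, Claude3.5, Zhipu GLM-Zero, ERNIE Bot, qwen-max, Baichuan, iFlytek Spark, SenseTime senseChat, and minimax, as well as open-source models such as DeepSeek-R1, qwq-32b, deepseek-v3, qwen2.5, llama3.3, phi-4, glm4, gemma3, mistral, and InternLM2.5. Provides not only a capability-score leaderboard but also the raw outputs of all models! |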
2025-03-31T10:56:06Z |
23 |
mistral-finetune |
2895 |
259 |
Python |
32 |
None |
2024-09-13T09:53:13Z |
24 |
AI-Infra-from-Zero-to-Hero |
2841 |
320 |
None |
12 |
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials. |
2024-08-14T05:12:47Z |
25 |
paperless-ai |
2804 |
103 |
JavaScript |
5 |
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents. |
2025-03-21T19:24:53Z |
26 |
lsp-ai |
2645 |
92 |
Rust |
29 |
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them. |
2025-01-07T22:17:38Z |
27 |
xTuring |
2641 |
206 |
Python |
10 |
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6 |
2024-09-23T09:40:48Z |
28 |
secret-llama |
2604 |
164 |
TypeScript |
18 |
Fully private LLM chatbot that runs entirely in the browser with no server needed. Supports Mistral and Llama 3. |
2024-06-05T02:04:17Z |
29 |
elia |
2092 |
131 |
Python |
12 |
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more. |
2024-10-10T19:12:52Z |
30 |
OnnxStream |
1928 |
89 |
C++ |
55 |
Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but also Mistral 7B on desktops and servers. ARM, x86, WASM, RISC-V supported. Accelerated by XNNPACK. |
2025-03-29T09:51:04Z |
31 |
maid |
1916 |
208 |
Dart |
10 |
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely. |
2025-04-01T01:02:29Z |
32 |
floneum |
1807 |
93 |
Rust |
38 |
Instant, controllable, local pre-trained AI models in Rust |
2025-03-31T18:16:28Z |
33 |
Ollamac |
1753 |
95 |
Swift |
36 |
Mac app for Ollama |
2025-03-12T22:28:22Z |
34 |
dialoqbase |
1744 |
275 |
TypeScript |
39 |
Create chatbots with ease |
2024-10-15T14:24:20Z |
35 |
json_repair |
1661 |
81 |
Python |
0 |
A python module to repair invalid JSON from LLMs |
2025-03-30T15:01:17Z |
36 |
papersgpt-for-zotero |
1442 |
48 |
JavaScript |
39 |
Zotero chat PDF with AI, DeepSeek, GPT 4.5, ChatGPT, Claude, Gemini |
2025-04-01T01:07:00Z |
37 |
search2ai |
1260 |
193 |
JavaScript |
17 |
Help your LLMs online |
2025-02-19T16:26:01Z |
38 |
modelfusion |
1244 |
88 |
TypeScript |
33 |
The TypeScript library for building AI applications. |
2024-07-19T15:17:19Z |
39 |
aws-genai-llm-chatbot |
1207 |
367 |
TypeScript |
23 |
A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS |
2025-04-01T01:58:02Z |
40 |
nextjs-ollama-llm-ui |
1169 |
282 |
TypeScript |
13 |
Fully-featured web interface for Ollama LLMs |
2025-02-04T19:07:06Z |
41 |
gp.nvim |
1105 |
93 |
Lua |
41 |
Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..] |
2024-09-23T12:32:50Z |
42 |
bedrock-claude-chat |
1074 |
399 |
TypeScript |
116 |
AWS-native chatbot using Bedrock + Claude (+Nova and Mistral) |
2025-04-01T02:16:55Z |
43 |
LLM-Prompt-Library |
1072 |
113 |
Python |
0 |
My personal prompt library for various LLMs + scripts & tools. Suitable for models from Deepseek, OpenAI, Claude, Meta, Mistral, Google, Grok, and others. |
2025-03-18T17:04:23Z |
44 |
poe-api-wrapper |
1070 |
139 |
Python |
27 |
👾 A Python API wrapper for Poe.com. With this, you will have free access to GPT-4, Claude, Llama, Gemini, Mistral and more! 🚀 |
2025-03-07T20:07:31Z |
45 |
chatd |
1020 |
71 |
JavaScript |
26 |
Chat with your documents using local AI |
2024-07-06T01:21:36Z |
46 |
BaseAI |
993 |
83 |
TypeScript |
4 |
BaseAI — The Web AI Framework. The easiest way to build serverless autonomous AI agents with memory. Start building local-first, agentic pipes, tools, and memory. Deploy serverless with one command. |
2025-02-25T11:30:28Z |
47 |
RisuAI |
955 |
164 |
TypeScript |
61 |
Make your own story. User-friendly software for LLM roleplaying |
2025-03-28T09:47:53Z |
48 |
graphrag-local-ollama |
944 |
152 |
Python |
43 |
Local model support for Microsoft’s graphrag using ollama (llama3, mistral, gemma2, phi3) - LLM & Embedding extraction |
2024-09-30T02:43:30Z |
49 |
ai-dev-gallery |
929 |
113 |
C# |
41 |
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps. |
2025-03-31T21:55:32Z |
50 |
generative-ai-use-cases-jp |
876 |
209 |
TypeScript |
86 |
Application implementation with business use cases for safely utilizing generative AI in business operations |
2025-04-01T03:46:59Z |
51 |
witsy |
846 |
61 |
TypeScript |
4 |
Witsy: desktop AI assistant |
2025-04-01T03:11:44Z |
52 |
MixtralKit |
767 |
80 |
Python |
12 |
A toolkit for inference and evaluation of ‘mixtral-8x7b-32kseqlen’ from Mistral AI |
2023-12-15T19:10:55Z |
53 |
fine-tune-mistral |
709 |
64 |
Python |
3 |
Fine-tune mistral-7B on 3090s, a100s, h100s |
2023-10-11T17:25:59Z |
54 |
mistral-common |
706 |
78 |
Python |
17 |
None |
2025-03-19T22:27:53Z |
55 |
web-llm-chat |
699 |
118 |
TypeScript |
9 |
Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations. |
2025-01-29T19:23:34Z |
56 |
tt-metal |
684 |
124 |
C++ |
2190 |
:metal: TT-NN operator library, and TT-Metalium low level kernel programming model. |
2025-04-01T03:40:58Z |
57 |
Hexabot |
683 |
120 |
TypeScript |
124 |
Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease. |
2025-03-28T09:11:42Z |
58 |
ComfyUI-IF_AI_tools |
619 |
47 |
Python |
50 |
ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models. |
2025-03-09T09:11:32Z |
59 |
llm-finetuning |
573 |
89 |
Python |
3 |
Guide for fine-tuning Llama/Mistral/CodeLlama models and more |
2024-08-28T10:44:08Z |
60 |
client-python |
571 |
119 |
Python |
13 |
Python client library for Mistral AI platform |
2025-03-26T15:19:38Z |
61 |
Owl |
571 |
56 |
Python |
6 |
A personal wearable AI that runs locally |
2024-03-17T06:37:26Z |
62 |
mistral |
570 |
52 |
Python |
18 |
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers. |
2023-11-10T02:55:18Z |
63 |
parrot.nvim |
555 |
36 |
Lua |
6 |
parrot.nvim 🦜 - the plugin that brings stochastic parrots to Neovim. |
2025-03-31T16:35:23Z |
64 |
BambooAI |
541 |
53 |
Python |
11 |
A Python library powered by Language Models (LLMs) for conversational data discovery and analysis. |
2025-03-02T07:52:21Z |
65 |
ai-commits-intellij-plugin |
532 |
41 |
Kotlin |
23 |
AI Commits for IntelliJ based IDEs/Android Studio. |
2025-03-28T17:49:42Z |
66 |
llmcord |
507 |
101 |
Python |
2 |
Make Discord your LLM frontend ● Supports any OpenAI compatible API (Ollama, LM Studio, vLLM, OpenRouter, xAI, Mistral, Groq and more) |
2025-03-31T14:53:18Z |
67 |
rag-chatbot |
496 |
74 |
Python |
6 |
Chat with multiple PDFs locally |
2024-10-11T04:30:01Z |
68 |
embedJs |
482 |
53 |
TypeScript |
24 |
A NodeJS RAG framework to easily work with LLMs and embeddings |
2025-02-14T10:53:44Z |
69 |
helix |
476 |
48 |
Go |
124 |
🧬 Helix is a private GenAI stack for building AI applications with declarative pipelines, knowledge (RAG), API bindings, and first-class testing. |
2025-03-31T19:13:16Z |
70 |
ollama-voice-mac |
469 |
54 |
Python |
8 |
Mac compatible Ollama Voice |
2024-03-26T14:49:04Z |
71 |
aikit |
437 |
37 |
Go |
20 |
🏗️ Fine-tune, build, and deploy open-source LLMs easily! |
2025-03-31T02:22:20Z |
72 |
obsidian-bmo-chatbot |
434 |
60 |
TypeScript |
45 |
Generate and brainstorm ideas while creating your notes using Large Language Models (LLMs) from Ollama, LM Studio, Anthropic, Google Gemini, Mistral AI, OpenAI, and more for Obsidian. |
2024-09-12T04:07:29Z |
73 |
mlx-llm |
432 |
30 |
Python |
0 |
Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX. |
2025-01-29T07:13:07Z |
74 |
LESS |
424 |
42 |
Jupyter Notebook |
15 |
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning |
2024-10-20T03:11:58Z |
75 |
bolna |
416 |
112 |
Python |
28 |
End-to-end platform for building voice first multimodal agents |
2024-10-28T05:40:38Z |
76 |
DevoxxGenieIDEAPlugin |
413 |
47 |
Java |
40 |
DevoxxGenie is a plugin for IntelliJ IDEA that uses local LLMs (Ollama, LMStudio, GPT4All, Jan and Llama.cpp) and cloud-based LLMs to help review, test, and explain your project code. |
2025-03-31T17:57:15Z |
77 |
xllm |
401 |
21 |
Python |
6 |
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning |
2024-01-17T16:43:39Z |
78 |
fltr |
380 |
8 |
Rust |
1 |
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B. |
2024-03-13T11:39:01Z |
79 |
GPTPortal |
367 |
66 |
JavaScript |
2 |
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files. |
2025-03-07T19:37:35Z |
80 |
edgen |
357 |
16 |
Rust |
23 |
⚡ Edgen: Local, private GenAI server alternative to OpenAI. No GPU required. Run AI models locally: LLMs (Llama2, Mistral, Mixtral…), Speech-to-text (whisper) and many others. |
2024-05-23T14:21:38Z |
81 |
NeuralFlow |
347 |
15 |
Python |
4 |
Visualize the intermediate output of Mistral 7B |
2025-01-22T11:25:17Z |
82 |
ai_automation_suggester |
347 |
12 |
Python |
7 |
This custom Home Assistant integration automatically scans your entities, detects new devices, and uses AI (via cloud and local APIs) to suggest tailored automations. It supports multiple AI providers, including OpenAI, Anthropic, Google, Groq, LocalAI, Mistral and Ollama. The integration provides automation suggestions via HASS notifications |
2025-03-29T15:39:20Z |
83 |
KVQuant |
338 |
30 |
Python |
14 |
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization |
2024-08-13T11:19:28Z |
84 |
airunner |
302 |
25 |
Python |
24 |
Stable Diffusion and LLMs offline on your own hardware |
2025-04-01T02:54:51Z |
85 |
LLaMa2lang |
300 |
34 |
Python |
0 |
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language |
2024-06-17T14:00:13Z |
86 |
mistral |
291 |
118 |
Python |
0 |
Workflow Service for OpenStack. Mirror of code maintained at opendev.org. |
2025-03-18T23:37:58Z |
87 |
OllamaKit |
284 |
30 |
Swift |
4 |
Ollama client for Swift |
2025-03-09T22:20:34Z |
88 |
nanodl |
284 |
10 |
Python |
2 |
A Jax-based library for designing and training transformer models from scratch. |
2024-08-28T21:24:22Z |
89 |
simple-openai |
283 |
33 |
Java |
6 |
A Java library to use the OpenAI Api in the simplest possible way. |
2025-03-22T20:52:57Z |
90 |
yalm |
276 |
28 |
C++ |
1 |
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O |
2025-01-15T07:22:42Z |
91 |
llm-mistral-invoice-cpu |
266 |
62 |
Python |
0 |
Data extraction with LLM on CPU |
2024-03-26T05:44:59Z |
92 |
Heat |
261 |
17 |
Swift |
4 |
An LLM agnostic desktop and mobile client. |
2025-03-31T17:51:52Z |
93 |
aicommit2 |
259 |
21 |
TypeScript |
6 |
A Reactive CLI that generates git commit messages with Ollama, ChatGPT, Gemini, Claude, Mistral and other AI |
2025-03-27T05:44:37Z |
94 |
unsaged |
255 |
78 |
TypeScript |
15 |
Open source chat kit engineered for seamless interaction with AI models. |
2025-02-25T18:02:25Z |
95 |
ai-playground |
243 |
57 |
Python |
0 |
Code from tutorials presented on the “Code AI with Rok” YouTube channel |
2025-03-25T12:11:24Z |
96 |
inferflow |
239 |
25 |
C++ |
8 |
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs). |
2024-03-15T06:52:33Z |
97 |
companion-vscode |
232 |
12 |
TypeScript |
3 |
VSCode extension of Quack Companion 💻 Turn your team insights into a portable plug-and-play context for code generation. Alternative to GitHub Copilot powered by OSS LLMs (Mistral, Gemma, etc.), served with Ollama. |
2024-10-01T04:06:14Z |
98 |
TPU-Alignment |
230 |
25 |
Jupyter Notebook |
0 |
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free |
2024-10-31T20:34:59Z |
99 |
ProX |
230 |
18 |
Python |
2 |
Official Repo for “Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale” |
2025-02-16T07:59:43Z |
100 |
END-TO-END-GENERATIVE-AI-PROJECTS |
229 |
65 |
None |
0 |
End to End Generative AI Industry Projects on LLM Models with Deployment_Awesome LLM Projects |
2025-01-24T07:20:37Z |