1 |
ollama |
147147 |
12473 |
Go |
1611 |
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models. |
2025-07-21T18:25:32Z |
2 |
unsloth |
42403 |
3392 |
Python |
666 |
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM. |
2025-07-21T12:35:35Z |
3 |
LocalAI |
34008 |
2647 |
Go |
423 |
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference |
2025-07-21T21:37:31Z |
4 |
khoj |
30581 |
1747 |
Python |
75 |
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free. |
2025-07-20T02:32:45Z |
5 |
LibreChat |
28294 |
5095 |
TypeScript |
163 |
Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project. |
2025-07-21T22:26:46Z |
6 |
OpenLLM |
11590 |
752 |
Python |
3 |
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. |
2025-07-21T16:51:05Z |
7 |
ludwig |
11531 |
1219 |
Python |
42 |
Low-code framework for building custom LLMs, neural networks, and other AI models |
2025-06-23T20:14:15Z |
8 |
mistral-inference |
10373 |
937 |
Jupyter Notebook |
127 |
Official inference library for Mistral models |
2025-03-20T15:03:08Z |
9 |
inference |
8259 |
706 |
Python |
136 |
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you’re empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. |
2025-07-21T09:00:05Z |
10 |
ipex-llm |
8132 |
1361 |
Python |
1194 |
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc. |
2025-07-21T08:20:20Z |
11 |
big-AGI |
6545 |
1531 |
TypeScript |
248 |
AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud. |
2025-07-21T21:15:01Z |
12 |
Firefly |
6491 |
583 |
Python |
204 |
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models |
2024-10-24T02:27:42Z |
13 |
mistral.rs |
5913 |
432 |
Rust |
166 |
Blazingly fast LLM inference. |
2025-07-22T01:00:15Z |
14 |
awesome-LLM-resources |
5743 |
557 |
None |
0 |
🧑🚀 Summary of the world’s best LLM resources (speech and video generation, agents, coding assistance, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models). |
2025-07-20T02:32:56Z |
15 |
opencompass |
5711 |
626 |
Python |
321 |
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) across 100+ datasets. |
2025-07-21T05:31:27Z |
16 |
enchanted |
5481 |
360 |
Swift |
96 |
Enchanted is an iOS and macOS app for chatting with private, self-hosted language models such as Llama2, Mistral or Vicuna using Ollama. |
2025-03-19T20:19:21Z |
17 |
Liger-Kernel |
5392 |
371 |
Python |
67 |
Efficient Triton Kernels for LLM Training |
2025-07-21T10:00:50Z |
18 |
agentops |
4668 |
442 |
Python |
55 |
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including CrewAI, Agno, OpenAI Agents SDK, Langchain, Autogen, AG2, and CamelAI |
2025-07-21T21:34:21Z |
19 |
xtuner |
4655 |
352 |
Python |
219 |
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, …) |
2025-07-11T05:43:37Z |
20 |
chinese-llm-benchmark |
4544 |
186 |
None |
27 |
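ReLE Chinese LLM capability benchmark (continuously updated): currently covers 257 large models, including commercial models such as chatgpt, gpt-4.1, o4-mini, Google gemini-2.5, Claude, Zhipu GLM-Z1, Wenxin Yiyan (ERNIE Bot), qwen-max, Baichuan, iFlytek Spark, SenseTime senseChat, and minimax, as well as open-source models such as DeepSeek-R1-0528, qwq-32b, deepseek-v3, qwen3, llama4, phi-4, glm4, gemma3, mistral, and InternLM2.5. It provides not only leaderboards but also a defect library of more than 2 million LLM failure cases, so the community can study, analyze, and improve large models. |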
2025-07-18T07:54:13Z |
21 |
paperless-ai |
3862 |
151 |
JavaScript |
11 |
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents. |
2025-07-17T20:37:04Z |
22 |
local-deep-research |
3178 |
317 |
Python |
50 |
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini) and includes benchmark tools to test on your own setup. Searches 10+ sources - arXiv, PubMed, GitHub, web, and your private documents. Everything Local. |
2025-07-22T00:32:12Z |
23 |
AI-Infra-from-Zero-to-Hero |
3118 |
331 |
None |
13 |
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials. |
2025-05-24T18:29:12Z |
24 |
mistral-finetune |
2986 |
282 |
Python |
34 |
None |
2024-09-13T09:53:13Z |
25 |
lsp-ai |
2887 |
99 |
Rust |
36 |
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them. |
2025-01-07T22:17:38Z |
26 |
xTuring |
2661 |
202 |
Python |
10 |
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6 |
2024-09-23T09:40:48Z |
27 |
secret-llama |
2623 |
167 |
TypeScript |
19 |
Fully private LLM chatbot that runs entirely in the browser with no server needed. Supports Mistral and Llama 3. |
2024-06-05T02:04:17Z |
28 |
json_repair |
2496 |
107 |
Python |
0 |
A python module to repair invalid JSON from LLMs |
2025-07-17T08:47:40Z |
29 |
elia |
2225 |
136 |
Python |
13 |
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more. |
2024-10-10T19:12:52Z |
30 |
maid |
2082 |
216 |
Dart |
13 |
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely. |
2025-05-28T04:49:58Z |
31 |
OnnxStream |
1970 |
89 |
C++ |
61 |
Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but also Mistral 7B on desktops and servers. ARM, x86, WASM, RISC-V supported. Accelerated by XNNPACK. |
2025-07-17T05:14:03Z |
32 |
floneum |
1950 |
105 |
Rust |
43 |
Instant, controllable, local pre-trained AI models in Rust |
2025-07-22T00:47:54Z |
33 |
Ollamac |
1867 |
102 |
Swift |
41 |
Mac app for Ollama |
2025-03-12T22:28:22Z |
34 |
maxtext |
1842 |
387 |
Python |
56 |
A simple, performant and scalable Jax LLM! |
2025-07-22T03:34:34Z |
35 |
papersgpt-for-zotero |
1787 |
54 |
JavaScript |
43 |
Chat Multiple PDFs in Zotero AI with Gemini, Grok 4, DeepSeek, GPT, ChatGPT, Claude, OpenRouter, Gemma 3, Qwen 3 |
2025-07-10T17:02:38Z |
36 |
dialoqbase |
1769 |
279 |
TypeScript |
40 |
Create chatbots with ease |
2024-10-15T14:24:20Z |
37 |
LLM-Prompt-Library |
1370 |
138 |
Jinja |
0 |
A playground of highly experimental prompts, Jinja2 templates & scripts for machine intelligence models from OpenAI, Anthropic, DeepSeek, Meta, Mistral, Google, xAI & others. Alex Bilzerian (2022-2025). |
2025-07-12T00:11:54Z |
38 |
witsy |
1329 |
106 |
TypeScript |
14 |
Witsy: desktop AI assistant / universal MCP client |
2025-07-22T02:17:45Z |
39 |
nextjs-ollama-llm-ui |
1298 |
307 |
TypeScript |
16 |
Fully-featured web interface for Ollama LLMs |
2025-06-05T13:13:19Z |
40 |
modelfusion |
1293 |
89 |
TypeScript |
33 |
The TypeScript library for building AI applications. |
2024-07-19T15:17:19Z |
41 |
aws-genai-llm-chatbot |
1290 |
405 |
TypeScript |
13 |
A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS |
2025-07-21T17:31:37Z |
42 |
search2ai |
1276 |
190 |
JavaScript |
18 |
Help your LLMs online |
2025-02-19T16:26:01Z |
43 |
paperless-gpt |
1263 |
66 |
Go |
70 |
Use LLMs and LLM Vision (OCR) to handle paperless-ngx - Document Digitalization powered by AI |
2025-07-17T13:48:59Z |
44 |
gp.nvim |
1223 |
102 |
Lua |
46 |
Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..] |
2025-04-08T21:18:30Z |
45 |
airunner |
1216 |
96 |
Python |
19 |
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows |
2025-07-15T14:33:02Z |
46 |
ai-dev-gallery |
1148 |
148 |
C# |
57 |
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps. |
2025-07-01T18:45:03Z |
47 |
BaseAI |
1116 |
98 |
TypeScript |
4 |
BaseAI — The Web AI Framework. The easiest way to build serverless autonomous AI agents with memory. Start building local-first, agentic pipes, tools, and memory. Deploy serverless with one command. |
2025-02-25T11:30:28Z |
48 |
generative-ai-use-cases |
1110 |
296 |
TypeScript |
59 |
Application implementation with business use cases for safely utilizing generative AI in business operations |
2025-07-22T03:51:38Z |
49 |
poe-api-wrapper |
1107 |
150 |
Python |
27 |
👾 A Python API wrapper for Poe.com. With this, you will have free access to GPT-4, Claude, Llama, Gemini, Mistral and more! 🚀 |
2025-03-07T20:07:31Z |
50 |
RisuAI |
1083 |
208 |
TypeScript |
79 |
Make your own story. User-friendly software for LLM roleplaying |
2025-07-21T05:24:12Z |
51 |
chatd |
1050 |
74 |
JavaScript |
26 |
Chat with your documents using local AI |
2024-07-06T01:21:36Z |
52 |
graphrag-local-ollama |
1035 |
158 |
Python |
48 |
Local models support for Microsoft’s graphrag using ollama (llama3, mistral, gemma2, phi3) - LLM & Embedding extraction |
2024-09-30T02:43:30Z |
53 |
tt-metal |
1011 |
231 |
C++ |
2795 |
:metal: TT-NN operator library, and TT-Metalium low level kernel programming model. |
2025-07-22T04:04:53Z |
54 |
web-llm-chat |
795 |
138 |
TypeScript |
12 |
Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations. |
2025-05-05T08:21:15Z |
55 |
MixtralKit |
767 |
79 |
Python |
12 |
A toolkit for inference and evaluation of ‘mixtral-8x7b-32kseqlen’ from Mistral AI |
2023-12-15T19:10:55Z |
56 |
mistral-common |
764 |
96 |
Python |
16 |
Official inference library for pre-processing of Mistral models |
2025-07-21T21:01:52Z |
57 |
Hexabot |
761 |
137 |
TypeScript |
152 |
Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease. |
2025-07-21T19:11:59Z |
58 |
fine-tune-mistral |
715 |
64 |
Python |
3 |
Fine-tune mistral-7B on 3090s, a100s, h100s |
2023-10-11T17:25:59Z |
59 |
ComfyUI-IF_AI_tools |
657 |
49 |
Python |
52 |
ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models. |
2025-03-09T09:11:32Z |
60 |
BambooAI |
639 |
64 |
Python |
12 |
A Python library powered by Language Models (LLMs) for conversational data discovery and analysis. |
2025-07-07T06:37:54Z |
61 |
ai-commits-intellij-plugin |
629 |
46 |
Kotlin |
21 |
AI Commits for IntelliJ based IDEs/Android Studio. |
2025-07-22T04:05:18Z |
62 |
client-python |
623 |
130 |
Python |
17 |
Python client library for Mistral AI platform |
2025-07-10T13:06:04Z |
63 |
llmcord |
612 |
140 |
Python |
1 |
Make Discord your LLM frontend - Supports any OpenAI compatible API (Ollama, xAI, Gemini, OpenRouter and more) |
2025-07-18T15:25:18Z |
64 |
llm-finetuning |
611 |
100 |
Python |
1 |
Guide for fine-tuning Llama/Mistral/CodeLlama models and more |
2025-05-07T01:11:58Z |
65 |
Owl |
602 |
57 |
Python |
6 |
A personal wearable AI that runs locally |
2024-03-17T06:37:26Z |
66 |
mistral |
575 |
52 |
Python |
18 |
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers. |
2023-11-10T02:55:18Z |
67 |
rag-chatbot |
559 |
90 |
Python |
8 |
Chat with multiple PDFs locally |
2024-10-11T04:30:01Z |
68 |
embedJs |
539 |
65 |
TypeScript |
15 |
A NodeJS RAG framework to easily work with LLMs and embeddings |
2025-06-16T12:56:17Z |
69 |
DevoxxGenieIDEAPlugin |
538 |
75 |
Java |
57 |
DevoxxGenie is a plugin for IntelliJ IDEA that uses local LLMs (Ollama, LMStudio, GPT4All, Jan and Llama.cpp) and cloud-based LLMs to help review, test, and explain your project code. |
2025-07-18T10:34:36Z |
70 |
helix |
508 |
53 |
Go |
75 |
♾️ Helix is a private GenAI stack for building AI agents with declarative pipelines, knowledge (RAG), API bindings, and first-class testing. |
2025-07-21T22:25:56Z |
71 |
ollama-voice-mac |
491 |
57 |
Python |
8 |
Mac compatible Ollama Voice |
2024-03-26T14:49:04Z |
72 |
obsidian-bmo-chatbot |
472 |
59 |
TypeScript |
46 |
Generate and brainstorm ideas while creating your notes using Large Language Models (LLMs) from Ollama, LM Studio, Anthropic, Google Gemini, Mistral AI, OpenAI, and more for Obsidian. |
2024-09-12T04:07:29Z |
73 |
LESS |
469 |
46 |
Jupyter Notebook |
16 |
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning |
2024-10-20T03:11:58Z |
74 |
aikit |
459 |
43 |
Go |
22 |
🏗️ Fine-tune, build, and deploy open-source LLMs easily! |
2025-07-21T03:38:46Z |
75 |
mlx-llm |
448 |
30 |
Python |
0 |
Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX. |
2025-01-29T07:13:07Z |
76 |
bolna |
421 |
113 |
Python |
28 |
End-to-end platform for building voice first multimodal agents |
2024-10-28T05:40:38Z |
77 |
WorkflowAI |
419 |
47 |
Python |
0 |
WorkflowAI is an open-source platform where product and engineering teams collaborate to build and iterate on AI features. |
2025-07-21T02:03:08Z |
78 |
xllm |
403 |
21 |
Python |
6 |
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning |
2024-01-17T16:43:39Z |
79 |
yalm |
392 |
36 |
C++ |
3 |
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O |
2025-06-07T01:32:14Z |
80 |
GPTPortal |
388 |
72 |
JavaScript |
2 |
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files. |
2025-07-18T02:32:19Z |
81 |
fltr |
381 |
8 |
Rust |
1 |
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B. |
2024-03-13T11:39:01Z |
82 |
NeuralFlow |
366 |
16 |
Python |
4 |
Visualize the intermediate output of Mistral 7B |
2025-01-22T11:25:17Z |
83 |
edgen |
362 |
20 |
Rust |
23 |
⚡ Edgen: Local, private GenAI server alternative to OpenAI. No GPU required. Run AI models locally: LLMs (Llama2, Mistral, Mixtral…), Speech-to-text (whisper) and many others. |
2024-05-23T14:21:38Z |
84 |
KVQuant |
362 |
31 |
Python |
15 |
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization |
2024-08-13T11:19:28Z |
85 |
END-TO-END-GENERATIVE-AI-PROJECTS |
337 |
97 |
None |
0 |
End to End Generative AI Industry Projects on LLM Models with Deployment_Awesome LLM Projects |
2025-01-24T07:20:37Z |
86 |
simple-openai |
327 |
42 |
Java |
5 |
A Java library to use the OpenAI Api in the simplest possible way. |
2025-07-15T19:54:10Z |
87 |
aicommit2 |
321 |
25 |
TypeScript |
8 |
A Reactive CLI that generates git commit messages with Ollama, ChatGPT, Gemini, Claude, Mistral and other AI |
2025-07-21T11:36:01Z |
88 |
OllamaKit |
314 |
37 |
Swift |
5 |
Ollama client for Swift |
2025-03-09T22:20:34Z |
89 |
LLaMa2lang |
310 |
35 |
Python |
0 |
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language |
2024-06-17T14:00:13Z |
90 |
mistral |
294 |
122 |
Python |
0 |
Workflow Service for OpenStack. Mirror of code maintained at opendev.org. |
2025-07-20T12:01:33Z |
91 |
nanodl |
291 |
11 |
Python |
2 |
A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. |
2024-08-28T21:24:22Z |
92 |
voice-chat-ai |
288 |
61 |
Python |
0 |
🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses XTTS, OpenAI, ElevenLabs or Kokoro |
2025-07-16T06:48:39Z |
93 |
Kolosal |
288 |
20 |
C++ |
12 |
Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device. |
2025-05-22T06:29:28Z |
94 |
ai-playground |
287 |
70 |
Python |
0 |
Code from tutorials presented on the “Code AI with Rok” YouTube channel |
2025-06-16T10:09:21Z |
95 |
Heat |
286 |
17 |
Swift |
5 |
An LLM agnostic desktop and mobile client. |
2025-05-08T20:34:10Z |
96 |
lemonade |
277 |
23 |
Python |
32 |
Local LLM Server with GPU and NPU Acceleration |
2025-07-17T17:22:42Z |
97 |
llm-mistral-invoice-cpu |
268 |
64 |
Python |
0 |
Data extraction with LLM on CPU |
2024-03-26T05:44:59Z |
98 |
unsaged |
258 |
80 |
TypeScript |
15 |
Open source chat kit engineered for seamless interaction with AI models. |
2025-02-25T18:02:25Z |
99 |
picollm |
258 |
14 |
Python |
0 |
On-device LLM Inference Powered by X-Bit Quantization |
2025-06-11T20:16:28Z |
100 |
ProX |
255 |
19 |
Python |
1 |
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale |
2025-07-08T05:24:48Z |