1 |
ollama |
140598 |
11754 |
Go |
1550 |
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models. |
2025-05-15T23:35:59Z |
2 |
LLaMA-Factory |
48978 |
5967 |
Python |
462 |
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) |
2025-05-15T02:54:36Z |
3 |
unsloth |
38737 |
3035 |
Python |
930 |
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥 |
2025-05-15T22:15:53Z |
4 |
LocalAI |
32604 |
2484 |
Go |
442 |
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference |
2025-05-15T21:17:11Z |
5 |
khoj |
30052 |
1672 |
Python |
68 |
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free. |
2025-05-13T19:05:09Z |
6 |
LibreChat |
25564 |
4410 |
TypeScript |
141 |
Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project. |
2025-05-15T22:05:56Z |
7 |
ludwig |
11449 |
1207 |
Python |
40 |
Low-code framework for building custom LLMs, neural networks, and other AI models |
2025-05-12T20:02:02Z |
8 |
OpenLLM |
11258 |
719 |
Python |
2 |
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. |
2025-05-12T23:11:32Z |
9 |
mistral-inference |
10217 |
914 |
Jupyter Notebook |
124 |
Official inference library for Mistral models |
2025-03-20T15:03:08Z |
10 |
ipex-llm |
7868 |
1348 |
Python |
1148 |
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc. |
2025-05-15T08:46:52Z |
11 |
inference |
7823 |
666 |
Python |
179 |
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you’re empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. |
2025-05-15T11:53:55Z |
12 |
ms-swift |
7574 |
642 |
Python |
698 |
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, …) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, …). |
2025-05-16T03:14:47Z |
13 |
big-AGI |
6411 |
1483 |
TypeScript |
242 |
AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud. |
2025-05-14T23:19:05Z |
14 |
Firefly |
6390 |
576 |
Python |
204 |
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models |
2024-10-24T02:27:42Z |
15 |
mistral.rs |
5607 |
401 |
Rust |
119 |
Blazingly fast LLM inference. |
2025-05-16T02:38:26Z |
16 |
opencompass |
5343 |
568 |
Python |
304 |
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets. |
2025-05-14T02:25:03Z |
17 |
enchanted |
5312 |
342 |
Swift |
92 |
Enchanted is an iOS and macOS app for chatting with private self-hosted language models such as Llama2, Mistral or Vicuna using Ollama. |
2025-03-19T20:19:21Z |
18 |
awesome-LLM-resources |
5195 |
512 |
None |
0 |
🧑‍🚀 Summary of the world’s best LLM resources (agent frameworks, coding assistants, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models). |
2025-05-13T13:11:02Z |
19 |
Liger-Kernel |
5019 |
324 |
Python |
59 |
Efficient Triton Kernels for LLM Training |
2025-05-15T16:35:49Z |
20 |
xtuner |
4542 |
341 |
Python |
218 |
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, …) |
2025-05-07T10:15:05Z |
21 |
agentops |
4396 |
397 |
Python |
90 |
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including OpenAI Agents SDK, CrewAI, Langchain, Autogen, AG2, and CamelAI |
2025-05-15T19:59:00Z |
22 |
chinese-llm-benchmark |
4216 |
175 |
None |
29 |
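Currently covers 232 large models, including commercial models such as chatgpt, gpt-4o, o3-mini, Google gemini, Claude3.5, Zhipu GLM-Zero, ERNIE Bot, qwen-max, Baichuan, iFlytek Spark, SenseTime senseChat, and minimax, as well as open-source models such as DeepSeek-R1, qwq-32b, deepseek-v3, qwen2.5, llama3.3, phi-4, glm4, gemma3, mistral, and InternLM2.5. Provides not only a capability-score leaderboard but also the raw outputs of every model! |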
2025-05-15T21:07:01Z |
23 |
paperless-ai |
3122 |
115 |
JavaScript |
15 |
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents. |
2025-05-15T19:08:27Z |
24 |
mistral-finetune |
2940 |
267 |
Python |
33 |
None |
2024-09-13T09:53:13Z |
25 |
AI-Infra-from-Zero-to-Hero |
2920 |
322 |
None |
13 |
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials. |
2024-08-14T05:12:47Z |
26 |
lsp-ai |
2756 |
95 |
Rust |
32 |
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them. |
2025-01-07T22:17:38Z |
27 |
local-deep-research |
2658 |
267 |
Python |
24 |
Local Deep Research is an AI-powered assistant that transforms complex questions into comprehensive, cited reports by conducting iterative analysis using any LLM across diverse knowledge sources including academic databases, scientific repositories, web content, and private document collections. |
2025-05-15T23:33:52Z |
28 |
xTuring |
2647 |
203 |
Python |
10 |
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6 |
2024-09-23T09:40:48Z |
29 |
secret-llama |
2617 |
164 |
TypeScript |
19 |
Fully private LLM chatbot that runs entirely within a browser with no server needed. Supports Mistral and Llama 3. |
2024-06-05T02:04:17Z |
30 |
elia |
2152 |
132 |
Python |
12 |
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more. |
2024-10-10T19:12:52Z |
31 |
json_repair |
1968 |
89 |
Python |
0 |
A python module to repair invalid JSON from LLMs |
2025-05-09T08:35:20Z |
32 |
maid |
1956 |
214 |
Dart |
12 |
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely. |
2025-04-29T04:47:37Z |
33 |
OnnxStream |
1943 |
88 |
C++ |
58 |
Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but also Mistral 7B on desktops and servers. ARM, x86, WASM, RISC-V supported. Accelerated by XNNPACK. |
2025-05-13T19:34:45Z |
34 |
floneum |
1874 |
100 |
Rust |
40 |
Instant, controllable, local pre-trained AI models in Rust |
2025-05-15T21:53:51Z |
35 |
Ollamac |
1820 |
101 |
Swift |
40 |
Mac app for Ollama |
2025-03-12T22:28:22Z |
36 |
dialoqbase |
1755 |
280 |
TypeScript |
39 |
Create chatbots with ease |
2024-10-15T14:24:20Z |
37 |
maxtext |
1724 |
346 |
Python |
46 |
A simple, performant and scalable Jax LLM! |
2025-05-16T03:45:59Z |
38 |
papersgpt-for-zotero |
1599 |
48 |
JavaScript |
40 |
Zotero chat PDF with AI, DeepSeek, GPT 4.1, ChatGPT, Claude, Gemini, Qwen3 |
2025-04-29T11:15:02Z |
39 |
search2ai |
1269 |
190 |
JavaScript |
18 |
Help your LLMs online |
2025-02-19T16:26:01Z |
40 |
modelfusion |
1263 |
91 |
TypeScript |
33 |
The TypeScript library for building AI applications. |
2024-07-19T15:17:19Z |
41 |
aws-genai-llm-chatbot |
1244 |
377 |
TypeScript |
27 |
A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS |
2025-05-02T08:29:10Z |
42 |
nextjs-ollama-llm-ui |
1227 |
286 |
TypeScript |
13 |
Fully-featured web interface for Ollama LLMs |
2025-02-04T19:07:06Z |
43 |
LLM-Prompt-Library |
1183 |
122 |
Python |
0 |
A playground of highly experimental prompts, tools & scripts for machine intelligence models from Apple, DeepSeek, OpenAI, Anthropic, Meta, Mistral, Google, xAI & others. Created & maintained by Alex Bilzerian. |
2025-04-23T01:01:30Z |
44 |
gp.nvim |
1168 |
98 |
Lua |
42 |
Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..] |
2025-04-08T21:18:30Z |
45 |
poe-api-wrapper |
1084 |
145 |
Python |
27 |
👾 A Python API wrapper for Poe.com. With this, you will have free access to GPT-4, Claude, Llama, Gemini, Mistral and more! 🚀 |
2025-03-07T20:07:31Z |
46 |
BaseAI |
1046 |
92 |
TypeScript |
4 |
BaseAI — The Web AI Framework. The easiest way to build serverless autonomous AI agents with memory. Start building local-first, agentic pipes, tools, and memory. Deploy serverless with one command. |
2025-02-25T11:30:28Z |
47 |
witsy |
1037 |
79 |
TypeScript |
10 |
Witsy: desktop AI assistant / universal MCP client |
2025-05-16T03:37:22Z |
48 |
chatd |
1036 |
72 |
JavaScript |
26 |
Chat with your documents using local AI |
2024-07-06T01:21:36Z |
49 |
ai-dev-gallery |
1029 |
125 |
C# |
60 |
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps. |
2025-05-14T22:19:20Z |
50 |
RisuAI |
1021 |
178 |
TypeScript |
69 |
Make your own story. User-friendly software for LLM roleplaying |
2025-05-10T11:17:29Z |
51 |
generative-ai-use-cases |
1020 |
243 |
TypeScript |
45 |
Application implementation with business use cases for safely utilizing generative AI in business operations |
2025-05-16T03:31:21Z |
52 |
graphrag-local-ollama |
984 |
156 |
Python |
48 |
Local model support for Microsoft’s graphrag using ollama (llama3, mistral, gemma2, phi3) - LLM & Embedding extraction |
2024-09-30T02:43:30Z |
53 |
paperless-gpt |
910 |
44 |
Go |
43 |
Use LLMs and LLM Vision (OCR) to handle paperless-ngx - Document Digitalization powered by AI |
2025-05-14T08:23:05Z |
54 |
tt-metal |
857 |
168 |
C++ |
2398 |
:metal: TT-NN operator library, and TT-Metalium low level kernel programming model. |
2025-05-16T03:42:07Z |
55 |
airunner |
786 |
59 |
Python |
32 |
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows |
2025-05-16T03:44:18Z |
56 |
MixtralKit |
767 |
79 |
Python |
12 |
A toolkit for inference and evaluation of ‘mixtral-8x7b-32kseqlen’ from Mistral AI |
2023-12-15T19:10:55Z |
57 |
web-llm-chat |
744 |
127 |
TypeScript |
10 |
Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations. |
2025-05-05T08:21:15Z |
58 |
Hexabot |
722 |
129 |
TypeScript |
129 |
Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease. |
2025-05-15T14:10:20Z |
59 |
mistral-common |
714 |
79 |
Python |
17 |
None |
2025-05-07T12:03:19Z |
60 |
fine-tune-mistral |
711 |
65 |
Python |
3 |
Fine-tune mistral-7B on 3090s, a100s, h100s |
2023-10-11T17:25:59Z |
61 |
ComfyUI-IF_AI_tools |
638 |
48 |
Python |
50 |
ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models. |
2025-03-09T09:11:32Z |
62 |
parrot.nvim |
616 |
38 |
Lua |
2 |
parrot.nvim 🦜 - the plugin that brings stochastic parrots to Neovim. |
2025-05-09T18:37:43Z |
63 |
client-python |
592 |
120 |
Python |
17 |
Python client library for Mistral AI platform |
2025-04-16T19:41:52Z |
64 |
llm-finetuning |
592 |
92 |
Python |
1 |
Guide for fine-tuning Llama/Mistral/CodeLlama models and more |
2025-05-07T01:11:58Z |
65 |
Owl |
586 |
56 |
Python |
6 |
A personal wearable AI that runs locally |
2024-03-17T06:37:26Z |
66 |
ai-commits-intellij-plugin |
582 |
43 |
Kotlin |
18 |
AI Commits for IntelliJ based IDEs/Android Studio. |
2025-05-15T01:29:02Z |
67 |
BambooAI |
581 |
58 |
Python |
11 |
A Python library powered by Language Models (LLMs) for conversational data discovery and analysis. |
2025-05-13T07:04:26Z |
68 |
mistral |
572 |
52 |
Python |
18 |
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers. |
2023-11-10T02:55:18Z |
69 |
llmcord |
553 |
117 |
Python |
3 |
Make Discord your LLM frontend ● Supports any OpenAI compatible API (Ollama, LM Studio, vLLM, OpenRouter, xAI, Mistral, Groq and more) |
2025-05-13T17:24:23Z |
70 |
rag-chatbot |
528 |
81 |
Python |
8 |
Chat with multiple PDFs locally |
2024-10-11T04:30:01Z |
71 |
embedJs |
502 |
59 |
TypeScript |
16 |
A NodeJS RAG framework to easily work with LLMs and embeddings |
2025-05-09T20:39:14Z |
72 |
helix |
497 |
49 |
Go |
126 |
♾️ Helix is a private GenAI stack for building AI applications with declarative pipelines, knowledge (RAG), API bindings, and first-class testing. |
2025-05-15T21:31:55Z |
73 |
ollama-voice-mac |
484 |
54 |
Python |
8 |
Mac compatible Ollama Voice |
2024-03-26T14:49:04Z |
74 |
DevoxxGenieIDEAPlugin |
477 |
57 |
Java |
47 |
DevoxxGenie is a plugin for IntelliJ IDEA that uses local LLMs (Ollama, LMStudio, GPT4All, Jan and Llama.cpp) and cloud-based LLMs to help review, test, and explain your project code. |
2025-05-15T12:44:03Z |
75 |
obsidian-bmo-chatbot |
451 |
61 |
TypeScript |
46 |
Generate and brainstorm ideas while creating your notes using Large Language Models (LLMs) from Ollama, LM Studio, Anthropic, Google Gemini, Mistral AI, OpenAI, and more for Obsidian. |
2024-09-12T04:07:29Z |
76 |
aikit |
451 |
39 |
Go |
20 |
🏗️ Fine-tune, build, and deploy open-source LLMs easily! |
2025-05-12T02:38:51Z |
77 |
LESS |
446 |
45 |
Jupyter Notebook |
16 |
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning |
2024-10-20T03:11:58Z |
78 |
mlx-llm |
442 |
30 |
Python |
0 |
Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX. |
2025-01-29T07:13:07Z |
79 |
bolna |
418 |
111 |
Python |
28 |
End-to-end platform for building voice first multimodal agents |
2024-10-28T05:40:38Z |
80 |
xllm |
398 |
21 |
Python |
6 |
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning |
2024-01-17T16:43:39Z |
81 |
WorkflowAI |
389 |
42 |
Python |
1 |
WorkflowAI is an open-source platform where product and engineering teams collaborate to build and iterate on AI features. |
2025-05-15T20:36:16Z |
82 |
fltr |
376 |
8 |
Rust |
1 |
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B. |
2024-03-13T11:39:01Z |
83 |
GPTPortal |
375 |
68 |
JavaScript |
2 |
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files. |
2025-04-17T06:26:52Z |
84 |
NeuralFlow |
362 |
16 |
Python |
4 |
Visualize the intermediate output of Mistral 7B |
2025-01-22T11:25:17Z |
85 |
yalm |
359 |
33 |
C++ |
1 |
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O |
2025-01-15T07:22:42Z |
86 |
edgen |
357 |
18 |
Rust |
23 |
⚡ Edgen: Local, private GenAI server alternative to OpenAI. No GPU required. Run AI models locally: LLMs (Llama2, Mistral, Mixtral…), Speech-to-text (whisper) and many others. |
2024-05-23T14:21:38Z |
87 |
KVQuant |
352 |
30 |
Python |
14 |
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization |
2024-08-13T11:19:28Z |
88 |
simple-openai |
309 |
39 |
Java |
9 |
A Java library to use the OpenAI Api in the simplest possible way. |
2025-05-08T23:48:46Z |
89 |
LLaMa2lang |
306 |
35 |
Python |
0 |
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language |
2024-06-17T14:00:13Z |
90 |
OllamaKit |
303 |
34 |
Swift |
5 |
Ollama client for Swift |
2025-03-09T22:20:34Z |
91 |
mistral |
292 |
120 |
Python |
0 |
Workflow Service for OpenStack. Mirror of code maintained at opendev.org. |
2025-05-09T19:11:01Z |
92 |
nanodl |
287 |
10 |
Python |
2 |
A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. |
2024-08-28T21:24:22Z |
93 |
aicommit2 |
286 |
25 |
TypeScript |
4 |
A Reactive CLI that generates git commit messages with Ollama, ChatGPT, Gemini, Claude, Mistral and other AI |
2025-05-06T04:54:07Z |
94 |
Heat |
275 |
17 |
Swift |
4 |
An LLM agnostic desktop and mobile client. |
2025-05-08T20:34:10Z |
95 |
ai-playground |
270 |
69 |
Python |
0 |
Code from tutorials presented on the “Code AI with Rok” YouTube channel |
2025-05-08T09:21:17Z |
96 |
END-TO-END-GENERATIVE-AI-PROJECTS |
269 |
83 |
None |
0 |
End to End Generative AI Industry Projects on LLM Models with Deployment_Awesome LLM Projects |
2025-01-24T07:20:37Z |
97 |
llm-mistral-invoice-cpu |
266 |
62 |
Python |
0 |
Data extraction with LLM on CPU |
2024-03-26T05:44:59Z |
98 |
unsaged |
256 |
79 |
TypeScript |
15 |
Open source chat kit engineered for seamless interaction with AI models. |
2025-02-25T18:02:25Z |
99 |
Kolosal |
248 |
19 |
C++ |
11 |
Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device. |
2025-05-02T20:40:29Z |
100 |
ProX |
246 |
17 |
Python |
1 |
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale |
2025-05-14T13:01:52Z |