Dev

GitHub repos gaining traction - what high-signal users are starring and what's climbing the board, captured daily and enriched from GitHub. Raw material for spotting new tech and patterns worth building on.

653

repos tracked

153

surfaced this week

141

created < 30d

Python

top language

Python 271 TypeScript 84 HTML 36 JavaScript 33 Swift 32 Rust 29 Go 22 Jupyter Notebook 21 C++ 19 Shell 8 C 6 Elixir 6

Section: Language: Created: Sort: i

20 repos

★ 84.1k vllm-project/vllm Python

A high-throughput and memory-efficient inference and serving engine for LLMs

amd blackwell cuda deepseek deepseek-v3 gpt gpt-oss homepage ↗
★ 118.1k ggml-org/llama.cpp C++

LLM inference in C/C++

ggml homepage ↗
★ 5 pcuenca/LlamaLanguageModels Swift new · 11d old

Foundation Models API for llama.cpp
★ 3.1k raullenchai/Rapid-MLX Python

The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI replacement. Works with Claude Code, Cursor, Aider.

apple-silicon claude-code cursor deepseek fastapi hacktoberfest inference homepage ↗
★ 12.6k StarTrail-org/LEANN Python

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

ai faiss gpt-oss langchain llama-index llm localstorage homepage ↗
★ 3 ggml-org/llama.cpp-dev Shell new · 7d old
★ 2.1k ggml-org/llama.vim Vim Script

Vim plugin for LLM-assisted code/text completion

copilot developer-tool llama llm vim vim-plugin
★ 11.7k tensorzero/tensorzero Rust

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai homepage ↗
★ 67.3k unslothai/unsloth Python

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

agent deepseek fine-tuning gemma gemma3 gpt-oss llama homepage ↗
★ 5 ruixiang63/llama.cpp C++

LLM inference in C/C++
★ 29.6k sgl-project/sglang Python

SGLang is a high-performance serving framework for large language models and multimodal models.

attention blackwell cuda deepseek diffusion glm gpt-oss homepage ↗
★ 0 danielhanchen/llamacpp-cuda133-staging Shell new · 19d old

Throwaway staging: validate CUDA 13.3 prebuilt build leg
★ 123 unslothai/llama.cpp C++

LLM inference in C/C++
★ 3 oobabooga/llama.cpp C++

Port of Facebook's LLaMA model in C/C++
★ 0 fiesh/llama.cpp C++

LLM inference in C/C++
★ 0 devYRPauli/llama.cpp new · 29d old

LLM inference in C/C++
★ 1.3k FunAudioLLM/Fun-ASR C

End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.

31-languages asr audio audio-language-model chinese-dialects edge fun-asr homepage ↗
★ 2 danielhanchen/unsloth-staging-2 Python

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

homepage ↗
★ 56 am17an/llama.cpp C++

LLM inference in C/C++
★ 11.1k run-llama/liteparse Rust

A fast, helpful, and open-source document parser

document-ocr document-processing ocr ocr-recognition pdf pdf-parser text-extraction homepage ↗