Dev

GitHub repos gaining traction - what high-signal users are starring and what's climbing the board, captured daily and enriched from GitHub. Raw material for spotting new tech and patterns worth building on.

653

repos tracked

153

surfaced this week

141

created < 30d

Python

top language

Python 271 TypeScript 84 HTML 36 JavaScript 33 Swift 32 Rust 29 Go 22 Jupyter Notebook 21 C++ 19 Shell 8 C 6 Elixir 6

Section: Language: Created: Sort: i

25 repos

★ 6.8k THUDM/slime Python

slime is an LLM post-training framework for RL Scaling.

homepage ↗
★ 1.6k PrimeIntellect-ai/prime-rl Python

Agentic RL Training at Scale
★ 21 PhoneBuddyAI/phonebuddy Python new · 15d old

Training open models for agentic phone use with real-app and mock-app environments.

homepage ↗
★ 2.4k huggingface/OpenEnv Python

An interface library for RL post training with environments.

homepage ↗
★ 0 yobibyte/Megatron-LM Python

Ongoing research training transformer models at scale

homepage ↗
★ 193 amazon-far/abc Python new · 14d old

ABC: Scalable Behavior Cloning with Open Data, Training, and Evaluation

bc diffusion-policy robotics vla homepage ↗
★ 299 vllm-project/vime Python

An LLM post-training framework with vLLM for RL Scaling
★ 7 iamtrask/abcGPT Python

Experimental fork of Karpathy's nanoGPT modifying the architecture and training loop to make per-source influence over predictions identifiable and controllable.
★ 2 Snowflake-AI-Research/Arctic-Platform Python new · 18d old

Arctic Training and Inference Platform

llms post-training reinforcement-learning rl training
★ 3.8k allenai/open-instruct Python

AllenAI's post-training codebase

homepage ↗
★ 67.3k unslothai/unsloth Python

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

agent deepseek fine-tuning gemma gemma3 gpt-oss llama homepage ↗
★ 1 uiuctml/convex_data_valuation Python

[ICML '26] Code repo for the paper entitled "Convex Dataset Valuation for Post-Training" at ICML 2026.

data-selection llm homepage ↗
★ 5.5k pytorch/torchtitan Python

A PyTorch native platform for training generative AI models
★ 0 meefs/Lens new · 29d old

Lens is a 3.8B-parameter text-to-image diffusion model that achieves quality competitive with and in several cases surpassing models like FLUX and SD3, while requiring significantly less training compute. Key ideas include maximizing data information density per batch and accelerating convergence.
★ 161.9k huggingface/transformers Python

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

audio deep-learning deepseek gemma glm hacktoberfest llm homepage ↗
★ 22 simeon-ned/predictive-style-matching Python

Predictive Style Matching (PSM), is method in which an offline predictor maps the robot’s lower-body state history and velocity commands to interpretable upper-body joint and gait targets that shape the rewards during training.

humanoid-robotics mjlab mujoco mujoco-warp reinforcement-learning robotics unitree-g1 homepage ↗
★ 0 Sneakr/unsloth Python

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

homepage ↗
★ 2 mvanhorn/unsloth

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

homepage ↗
★ 3.5k thinking-machines-lab/tinker-cookbook Python

Post-training with Tinker
★ 623 visinf/INSID3 Python

[CVPR 2026 Oral] "INSID3: Training-Free In-Context Segmentation with DINOv3"

homepage ↗
★ 769 pat-jj/harness-1 Python

🚀 Ultra Recipe for Training Long-Horizon Search Agents - matching frontier AI's search capability with a 20B model + stateful harness
★ 1 msrasys/nnscaler Python

nnScaler: Compiling DNN models for Parallel Training

homepage ↗
★ 181 FeynRL-project/FeynRL Python

Post-training framework for large models, from new objectives to new rollout systems.

homepage ↗
★ 1.6k radixark/miles Python

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

homepage ↗
★ 210 mlc-ai/pith-train Python

Compact and Agent-Native MoE Training System