Dev

GitHub repos gaining traction - what high-signal users are starring and what's climbing the board, captured daily and enriched from GitHub. Raw material for spotting new tech and patterns worth building on.

653 repos tracked

153 surfaced this week

141 created < 30d

Top language: Python

Python 271 · TypeScript 84 · HTML 36 · JavaScript 33 · Swift 32 · Rust 29 · Go 22 · Jupyter Notebook 21 · C++ 19 · Shell 8 · C 6 · Elixir 6


50 repos

★ 5 johanobandoc/ScaleMRL Python new · 13h old

Code for "Representation Learning Enables Scalable Multitask Deep Reinforcement Learning"

mrq multitask-learning reinforcement-learning representation-learning tdmpc2 world-models
★ 26 nicklashansen/mmbench2 Python new · 1d old

Official code repository for the paper "Hallucination in World Models is Predictable and Preventable".

dreamer4 hallucination-detection robotics world-models
★ 25 xlang-ai/OSWorld-V2 Python

OSWorld 2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks

agent artificial-intelligence benchmark computer-use-agent cua gui language-model
★ 6.8k THUDM/slime Python

slime is an LLM post-training framework for RL Scaling.

★ 1.6k PrimeIntellect-ai/prime-rl Python

Agentic RL Training at Scale
★ 3.6k borglab/gtsam Jupyter Notebook

GTSAM is a library of C++ classes that implement smoothing and mapping (SAM) in robotics and vision, using factor graphs and Bayes networks as the underlying computing paradigm rather than sparse matrices.

estimation perception robotics sensorfusion
★ 2k natolambert/rlhf-book Python

Textbook on reinforcement learning from human feedback

ai alignment rlhf
★ 78 MetaAgentX/OpenCaptchaWorld JavaScript

[NeurIPS 2025] The first web-based benchmark and platform to evaluate visual reasoning and interaction capabilities of MLLM powered agents through diverse and dynamic CAPTCHA puzzles.
★ 2.6k mujocolab/mjlab Python

Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research

isaaclab mujoco mujoco-warp reinforcement-learning robotics-simulation
★ 6.1k PufferAI/PufferLib C

Puffing up reinforcement learning

reinforcement-learning
★ 2.4k huggingface/OpenEnv Python

An interface library for RL post training with environments.

★ 8 Metta-AI/coworld Python

Public Coworld CLI, Python helpers, manifest schemas, runner tooling, and reference worlds.
★ 23.7k calesthio/OpenMontage Python

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

agent agentic-ai ai claude copilot cursor elevenlabs
★ 6.3k steipete/summarize TypeScript

Point at any URL/YouTube/Podcast or file. Get the gist. CLI and Chrome Extension.

ai cli summarize typescript
★ 0 bcharleson/Peekaboo Swift

Peekaboo is a macOS CLI & optional MCP server that enables AI agents to capture screenshots of applications, or the entire system, with optional visual question answering through local or remote AI models.

★ 1.1k hexpm/hex Elixir

Package manager for the Erlang ecosystem.

elixir erlang hacktoberfest package-manager
★ 0 wangwllu/summarize new · 23d old

Point at any URL/YouTube/Podcast or file. Get the gist. CLI and Chrome Extension.

★ 4.2k PrimeIntellect-ai/verifiers Python

Our library for RL environments + evals
★ 2.7k harbor-framework/harbor Python

Framework for evaluating and improving agents

evals rl-environments terminal-bench
★ 309 PriorLabs/tabpfn-extensions Python

Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗

data-science machine-learning tabpfn tabular-data
★ 10.6k NVIDIA/cosmos Jupyter Notebook

NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

★ 1 kevinzakka/mjlab Python

Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research.
★ 0 bcharleson/codexbar Swift

Show usage stats for OpenAI Codex and Claude Code, without having to login.

★ 167 UT-Austin-RPL/amago Python

off-policy RL on long sequences

★ 178 amazon-far/deltatok Python

[CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

computer-vision cvpr2026 deep-learning depth-estimation generative-model pytorch semantic-segmentation
★ 84.6k Stirling-Tools/Stirling-PDF Java

#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere

docker hacktoberfest java pdf pdf-converter pdf-editor pdf-manipulation
★ 299 vllm-project/vime Python

An LLM post-training framework with vLLM for RL Scaling
★ 1 charliermarsh/salsa new · 13d old

A generic framework for on-demand, incrementalized computation. Inspired by adapton, glimmer, and rustc's query system.

★ 277 haoz19/world-tracing Python

Official inference code for World Tracing (object/scene/dynamic). Live demos: https://haoz19.github.io/world-tracing-page/
★ 2 Snowflake-AI-Research/Arctic-Platform Python new · 18d old

Arctic Training and Inference Platform

llms post-training reinforcement-learning rl training
★ 71 MaureenZOU/worldstring Python new · 23d old

[Highlight] Official implementation for Actionable World Representation
★ 115 nv-tlabs/wosx C++ new · 24d old

Grid-Free Monte Carlo Solvers for Physics Simulations Involving Partial Differential Equations
★ 234 nv-tlabs/omni-dreams Python

NVIDIA OmniDreams is a world model that generates photorealistic video for autonomous-driving simulation in real time.

★ 345 NVIDIA/flashdreams Python

high-performance inference and serving library for interactive autoregressive video and world models

efficiency interactive video-models world-models
★ 49 steipete/Markdansi TypeScript

Markdown to ANSI in TypeScript based on Micro-Mark, with support for URLs, tables, lists and more.

ansii markdown typescript
★ 10 ljang0/iOSWorld Swift new · 18d old
★ 22 OliverSieberling/dynamic-conv1d Python new · 29d old

Triton kernels for dynamic causal short convolutions.

★ 12 diego-lima/rlmy Python new · 24d old

An interactive RLM (Recursive Language Model) agent built on DSPy.
★ 8 parlance-labs/website CSS

★ 4.6k rasbt/reasoning-from-scratch Jupyter Notebook

Implement a reasoning LLM in PyTorch from scratch, step by step

ai artificial-intelligence chain-of-thought deep-learning distillation grpo inference-time-scaling
★ 15 CharlyCst/spadebox Rust

Sandboxed tools and JS runtime for AI agents

★ 125 ethanhe42/nanoRL Python
★ 173 wuji-technology/wuji-mjlab Python

Wuji Hand in-hand reorientation RL with sim-to-real deployment, built on mjlab
★ 3.1k walkinglabs/hands-on-modern-rl Python

🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

agent agentic agentic-ai agentic-rl dpo grpo llm
★ 11 togethercomputer/xorl Python

XoRL

★ 181 FeynRL-project/FeynRL Python

Post-training framework for large models, from new objectives to new rollout systems.

★ 371 nick7nlp/Awesome-LLM-On-Policy-Distillation Python

A curated collection of papers and resources on On-Policy Distillation for Large Language Models.

awesome-list awesome-opd awesomeopd github-pages knowledge-distillation large-language-models llm
★ 7.4k PriorLabs/TabPFN Python

⚡ TabPFN: Foundation Model for Tabular Data ⚡

data-science foundation-models machine-learning tabpfn tabular-data
★ 23 seandavi/lifeos-template Python

An opinionated, agentic life-OS for Claude Code — a markdown vault with daily/weekly/quarterly review skills baked in. Includes the feedback layer most templates skip.

agentic claude-agent-skills claude-code decision-making journaling life-os obsidian-vault
★ 169 ShirleyMaxx/REST3D Python new · 29d old

From a single casual image to a visually consistent and physically stable interactive 3D scene.