Dev
GitHub repos gaining traction - what high-signal users are starring and what's climbing the board, captured daily and enriched from GitHub. Raw material for spotting new tech and patterns worth building on.
653
repos tracked
153
surfaced this week
141
created < 30d
Python
top language
50 repos
-
Code for "Representation Learning Enables Scalable Multitask Deep Reinforcement Learning"
-
Official code repository for the paper "Hallucination in World Models is Predictable and Preventable".
-
OSWorld 2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks
-
slime is an LLM post-training framework for RL Scaling.
-
Agentic RL Training at Scale
-
GTSAM is a library of C++ classes that implement smoothing and mapping (SAM) in robotics and vision, using factor graphs and Bayes networks as the underlying computing paradigm rather than sparse matrices.
-
Textbook on reinforcement learning from human feedback
-
[NeurIPS 2025] The first web-based benchmark and platform to evaluate visual reasoning and interaction capabilities of MLLM powered agents through diverse and dynamic CAPTCHA puzzles.
-
Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research
-
Puffing up reinforcement learning
-
An interface library for RL post training with environments.
-
Public Coworld CLI, Python helpers, manifest schemas, runner tooling, and reference worlds.
-
World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.
-
Point at any URL/YouTube/Podcast or file. Get the gist. CLI and Chrome Extension.
-
Peekaboo is a macOS CLI & optional MCP server that enables AI agents to capture screenshots of applications, or the entire system, with optional visual question answering through local or remote AI models.
-
Package manager for the Erlang ecosystem.
-
Point at any URL/YouTube/Podcast or file. Get the gist. CLI and Chrome Extension.
-
Our library for RL environments + evals
-
Framework for evaluating and improving agents
-
Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗
-
NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.
-
Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research.
-
Show usage stats for OpenAI Codex and Claude Code, without having to login.
-
off-policy RL on long sequences
-
[CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
-
#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere
-
An LLM post-training framework with vLLM for RL Scaling
-
A generic framework for on-demand, incrementalized computation. Inspired by adapton, glimmer, and rustc's query system.
-
Office inference code for World Tracing (object/scene/dynamic). Live demos: https://haoz19.github.io/world-tracing-page/
-
Arctic Training and Inference Platform
-
[Highlight] Official implementation for Actionable World Representation
-
Grid-Free Monte Carlo Solvers for Physics Simulations Involving Partial Differential Equations
-
NVIDIA OmniDreams is a world model that generates photorealistic video for autonomous-driving simulation in real time.
-
high-performance inference and serving library for interactive autoregressive video and world models
-
Markdown to ANSII in TypeScript based on Micro-Mark, with support for URLs, tables, lists and more.
-
Triton kernels for dynamic causal short convolutions.
-
An interactive RLM (Recursive Language Model) agent built on DSPy.
-
Implement a reasoning LLM in PyTorch from scratch, step by step
-
Sandboxed tools and JS runtime for AI agents
-
Wuji Hand in-hand reorientation RL with sim-to-real deployment, built on mjlab
-
🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.
-
XoRL
-
Post-training framework for large models, from new objectives to new rollout systems.
-
A curated collection of papers and resources on On-Policy Distillation for Large Language Models.
-
⚡ TabPFN: Foundation Model for Tabular Data ⚡
-
An opinionated, agentic life-OS for Claude Code — a markdown vault with daily/weekly/quarterly review skills baked in. Includes the feedback layer most templates skip.
-
From a single casual image to a visually consistent and physically stable interactive 3D scene.