Dev

GitHub repos gaining traction - what high-signal users are starring and what's climbing the board, captured daily and enriched from GitHub. Raw material for spotting new tech and patterns worth building on.

653

repos tracked

153

surfaced this week

141

created < 30d

Python

top language

Python 271 TypeScript 84 HTML 36 JavaScript 33 Swift 32 Rust 29 Go 22 Jupyter Notebook 21 C++ 19 Shell 8 C 6 Elixir 6

Section: Language: Created: Sort: i

45 repos

★ 45 CherYou/AutoResearchBench Python

Official Repo: AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery
★ 25 xlang-ai/OSWorld-V2 Python

OSWorld 2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks

agent artificial-intelligence benchmark computer-use-agent cua gui language-model homepage ↗
★ 201.6k NousResearch/hermes-agent Python

The agent that grows with you

ai ai-agent ai-agents anthropic chatgpt claude claude-code homepage ↗
★ 840 OpenHands/software-agent-sdk Python

A clean, modular SDK for building AI agents with OpenHands V1.

agent sdk
★ 78 MetaAgentX/OpenCaptchaWorld JavaScript

[NeurIPS 2025] The first web-based benchmark and platform to evaluate visual reasoning and interaction capabilities of MLLM powered agents through diverse and dynamic CAPTCHA puzzles.
★ 29 zlab-princeton/ceobench-src Python

CEO-Bench: Can Agents Play the Long Game?

homepage ↗
★ 4.8k openclaw/Peekaboo Swift

Peekaboo is a macOS CLI & optional MCP server that enables AI agents to capture screenshots of applications, or the entire system, with optional visual question answering through local or remote AI models.

ai macos mcp screenshots swift homepage ↗
★ 0 bcharleson/Peekaboo Swift

Peekaboo is a macOS CLI & optional MCP server that enables AI agents to capture screenshots of applications, or the entire system, with optional visual question answering through local or remote AI models.

homepage ↗
★ 140 swyxio/skills TypeScript

Agent skills for Claude Code and other AI agents
★ 73 shreyashankar/error-discovery-skill new · 3d old

Interactive error analysis skill for AI agents. Studies LLM trace datasets, builds a review UI, monitors annotations, categorizes failure modes, proposes new samples.
★ 4.9k omnigent-ai/omnigent Python new · 15d old

Omnigent is an open-source AI agent framework and meta-harness: orchestrate Claude Code, Codex, Cursor, Pi, and custom agents — swap harnesses without rewriting, enforce policies and sandboxing, and collaborate in real time from any device.

agent-framework agent-governance agent-orchestration agents ai ai-agent ai-agents homepage ↗
★ 2.7k harbor-framework/harbor Python

Framework for evaluating and improving agents

evals rl-environments terminal-bench homepage ↗
★ 1.3k steipete/birdclaw TypeScript

Stores all your tweets nicely claw-able for agents.

homepage ↗
★ 3k steipete/oracle TypeScript

Ask the oracle when you're stuck. Invoke GPT-5 Pro with a custom context and files.

agents ai anthropic gemini-pro gpt-5-pro openai opus homepage ↗
★ 5.2k steipete/agent-scripts Shell

Scripts for agents, shared between my repositories.

ai-agents homepage ↗
★ 177 shreyashankar/plain-writing-skill HTML new · 11d old

A plain-language writing skill for AI agents, with a revision view that shows what changed.
★ 149 marimo-team/skills Python

skills for coding agents related to marimo
★ 4.6k amantus-ai/vibetunnel TypeScript

Turn any browser into your terminal & command your agents on the go.

remote terminal vibecoding homepage ↗
★ 844 openclaw/agent-skills Python

Useful skills for agents and claws.
★ 29 steipete/mcp-agentify TypeScript

MCP orchestrator that converts MPC servers to agents.
★ 59.9k DietrichGebert/ponytail JavaScript new · 15d old

Makes your AI agent think like the laziest senior dev in the room. The best code is the code you never wrote.

agent-skills ai-agents claude claude-code claude-code-plugin cursor-rules developer-tools homepage ↗
★ 744 rdi-berkeley/agents-last-exam Python

Agents' Last Exam

homepage ↗
★ 930 datacurve-ai/deep-swe Python

Measuring frontier coding agents on original, long-horizon engineering tasks

homepage ↗
★ 462 maddada/Ghostex Rust

Native Agent CLIs manager for macOS. Ghostty Terminals + Codex App Features/UX = Ghostex! Embedded browser & IDE. Strong agents support.

homepage ↗
★ 1.4k dust-tt/dust TypeScript

Custom AI agent platform to speed up your work.

agents large-language-models llm rust homepage ↗
★ 3.8k ucbepic/docetl Python

A system for agentic LLM-powered data processing and ETL

agents data data-pipelines document-analysis document-processing elt etl homepage ↗
★ 4.6k entireio/cli Go

📜 Entire CLI hooks into your Git workflow to capture AI agent sessions as you work. Sessions are indexed alongside commits, creating a searchable record of how code was written in your repo.

agents ai claude developer developer-platform gemini homepage ↗
★ 370 banteg/agents Python

my workflows for ai agents like codex and claude
★ 0 Nachx639/vibetunnel

Turn any browser into your terminal & command your agents on the go.

homepage ↗
★ 0 coygeek/agent-skills Python new · 17d old

Useful skills for agents and claws.
★ 14 SWE-bench/swe-bench.github.io JavaScript

Landing page + leaderboard for SWE-Bench benchmark

ai ai-agents benchmark homepage ↗
★ 64 steipete/bslog TypeScript

cli for Better Stack to fetch logs, ClickHouse SQL style. Made for humans and agents.
★ 549 mvanhorn/agentcookie Go

Your agent runs on a Mac that isn't your daily driver. agentcookie keeps its sessions in sync with the Mac you actually browse on, continuously, encrypted over Tailscale, so OpenClaw, Hermes, or any other agent runtime wakes up authenticated. macOS, peer-to-peer, no cloud middleman.

ai-agents automation chrome cli cookies golang macos
★ 0 peetzweg/birdclaw

Stores all your tweets nicely claw-able for agents.

homepage ↗
★ 28.6k chroma-core/chroma Rust

Search infrastructure for AI

agents ai ai-agents database rust rust-lang homepage ↗
★ 3.8k vercel-labs/just-bash TypeScript

Bash for Agents

homepage ↗
★ 102 zapier/AutomationBench Python

A benchmark for evaluating AI agents on realistic business workflows

benchmarks evals llm primeintellect homepage ↗
★ 19.1k trycua/cua HTML

Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).

agent ai-agent apple computer-use computer-use-agent containerization cua homepage ↗
★ 769 pat-jj/harness-1 Python

🚀 Ultra Recipe for Training Long-Horizon Search Agents - matching frontier AI's search capability with a 20B model + stateful harness
★ 15 CharlyCst/spadebox Rust

Sandboxed tools and JS runtime for AI agents

homepage ↗
★ 7k ogulcancelik/herdr Rust

agent multiplexer that lives in your terminal.

agent agent-orchestration ai ai-agents claude-code cli codex homepage ↗
★ 419 huawei-csl/KVarN Python new · 28d old

KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.

agentic-ai kv-cache llm llm-inference long-context quantization vllm homepage ↗
★ 37 probabl-ai/skills Python

Data Science Skills for AI agents like Claude Code

skills homepage ↗
★ 434 bryanyzhu/agentic-ai-system-course JavaScript

Use agent to learn agent - A skeleton course on how to design, build, and operate production AI agents

agentic-ai agentic-workflow ai-agents course system-design tutorial
★ 23k manaflow-ai/cmux Swift

Open source Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents. Built for multitasking, organization, and programmability.

amp claude-code codex gemini ghostty opencode terminal homepage ↗