Dev
GitHub repos gaining traction: what high-signal users are starring and what's climbing the board, captured daily and enriched from GitHub. Raw material for spotting new tech and patterns worth building on.
653 repos tracked
153 surfaced this week
141 created < 30d
Top language: Python
8 repos
- Evaluation harness for OpenHands V1.
- the LLM vulnerability scanner
- Evaluation tools shared across anserini, pyserini, and pygaggle
- ABC: Scalable Behavior Cloning with Open Data, Training, and Evaluation
- TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
- VirtueBench V2: Multi-dimensional virtue evaluation benchmark for LLMs with tripartite and Ignatian temptation models
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
- Rubric compiler and judge engine for LLM evaluation