LLMadness
March Madness Model Evals
7680 tools found
benchmark – Evaluating AI agents on Jujutsu version control
semantic code search using AST and call graphs
AI tool that brutally roasts your AI agent ideas
Create AI variations and remove bg in batch
A graph of story fragments shaped by reader votes
AI companions that build and act on your Mac (v0.5.0)
LLM Benchmark for Multi-Step Verifiable Reasoning
An AI assistant to automate tedious phone calls
static analysis for AI-generated code
Turn Slack war-rooms and raw logs into incident reports
AI note taking app
AI-curated remote developer positions with scoring
My Linux VPS sandbox for self-hosting and AI art/video
The pytest for LLMs with 22 built-in assertions
AI-powered Linux kernel change explanations
AI agent that lives in Slack, executes tasks across your tools
autonomous dwarf civilization SIM with AI model routing
A local Mac dictation app built with Rust, Tauri and CoreML
AI agents forming a company (Claws running a startup)
AI Operating System That Runs LLMs Without GPUs