A 630-line Python script that lets AI agents autonomously run, evaluate, and iterate on ML experiments overnight. No human needed.
AutoResearch flips the research paradigm: instead of a human manually tweaking parameters and running experiments one by one, an AI agent reads its own source code, forms hypotheses, rewrites the training logic, runs experiments, and evaluates the outcomes.
You write instructions in plain English (program.md), point the agent at a training script, and go to sleep. By morning, you wake up to a full log of automated experiments and an optimized model. Every experiment is git-committed and logged.
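The cycle described above can be pictured as a keep-or-discard loop. This is a minimal sketch, not the project's actual code: `Result`, `research_loop`, and `run_experiment` are hypothetical names, and the experiment runner is injected as a plain callable so the example runs without a GPU, an agent, or git. It assumes a lower val_bpb is better and that a crashed run reports no metric.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Result:
    description: str
    val_bpb: Optional[float]  # None means the run crashed (hypothetical convention)
    status: str


def research_loop(
    ideas: List[str],
    run_experiment: Callable[[str], Optional[float]],
    baseline: float,
) -> List[Result]:
    """Try each idea once; keep it only if it beats the best score so far."""
    best = baseline
    log: List[Result] = []
    for idea in ideas:
        score = run_experiment(idea)  # stand-in for "edit code, train, evaluate"
        if score is None:
            status = "crash"
        elif score < best:  # assumption: lower val_bpb is better
            status = "keep"
            best = score
        else:
            status = "discard"  # a real loop would revert the change here
        log.append(Result(idea, score, status))
    return log


# Toy runner with canned scores so the sketch is runnable anywhere.
fake_scores = {"deeper model": 0.9891, "double batch size": None, "worse idea": 0.9950}
log = research_loop(list(fake_scores), fake_scores.get, baseline=0.9979)
for entry in log:
    print(entry.status, entry.description, entry.val_bpb)
```

In a real setup the callable would wrap the agent's code edit, the training run, and the git commit; here it is only a dictionary lookup.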
Released March 6, 2026 under MIT license. One of the fastest-growing repositories in GitHub history, reaching 30k stars in its first week.
uv package manager. Works with Claude Code or any AI coding agent.

uv run train.py with a fixed 5-min wall-clock budget. Every experiment gets identical time, making results directly comparable.

| Commit | val_bpb | VRAM (GB) | Status | Description |
|---|---|---|---|---|
| a3f7c21 | 0.9979 | 38.2 | keep | Baseline run |
| b8e2d44 | 0.9891 | 39.1 | keep | Increase depth 8→10 |
| c1a9f67 | 0.0000 | 0.0 | crash | OOM on batch_size 2x |
| d5b3e89 | 0.9812 | 37.8 | keep | RoPE embeddings |
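A log in this shape is easy to post-process. The sketch below is an illustration only: it parses the four rows shown above and picks the best kept run. `Run`, `parse_rows`, and `best_kept` are hypothetical helpers, and the rows are pasted inline rather than read from whatever file the project actually writes. Note that the crashed run records a val_bpb of 0.0000, so a naive minimum would pick it; filtering on status avoids that.

```python
from dataclasses import dataclass
from typing import List

# Rows copied from the table above; a real script would read them from a log file.
TABLE = """\
| a3f7c21 | 0.9979 | 38.2 | keep | Baseline run |
| b8e2d44 | 0.9891 | 39.1 | keep | Increase depth 8→10 |
| c1a9f67 | 0.0000 | 0.0 | crash | OOM on batch_size 2x |
| d5b3e89 | 0.9812 | 37.8 | keep | RoPE embeddings |
"""


@dataclass
class Run:
    commit: str
    val_bpb: float
    vram_gb: float
    status: str
    description: str


def parse_rows(table: str) -> List[Run]:
    """Turn pipe-delimited rows into Run records."""
    runs = []
    for line in table.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        commit, val_bpb, vram, status, description = cells
        runs.append(Run(commit, float(val_bpb), float(vram), status, description))
    return runs


def best_kept(runs: List[Run]) -> Run:
    """Lowest val_bpb among runs marked 'keep'; crashed runs are ignored."""
    kept = [run for run in runs if run.status == "keep"]
    return min(kept, key=lambda run: run.val_bpb)


runs = parse_rows(TABLE)
best = best_kept(runs)
gain = runs[0].val_bpb - best.val_bpb  # improvement over the baseline row
print(best.commit, best.val_bpb, round(gain, 4))
```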

| Aspect | Manual Research | AutoResearch |
|---|---|---|
| Experiments / Night | 1-3 (if researcher stays late) | ~100 autonomous |
| Human Involvement | Constant: design, run, analyze each | Write program.md, then sleep |
| Consistency | Variable (fatigue, cognitive bias) | Identical 5-min budget each |
| Hypothesis Gen | Limited by time & human creativity | LLM generates continuously |
| Documentation | Often incomplete or missing | Every experiment git-committed |
| Cost | Researcher salary + GPU time | Just GPU (agent is free) |
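The "~100 autonomous" figure follows from the fixed budget. As a back-of-the-envelope check, assuming an 8-hour night (an assumption, not a number from the project), 5-minute runs give about 96 experiments. The helper below is a hypothetical sketch; the `overhead_minutes` parameter is also an assumption, standing in for time the agent spends editing code and committing between runs.

```python
def experiments_per_night(
    hours: float,
    budget_minutes: float = 5.0,
    overhead_minutes: float = 0.0,
) -> int:
    """Whole experiments that fit in `hours` at a fixed per-run budget."""
    minutes_per_run = budget_minutes + overhead_minutes
    return int(hours * 60 // minutes_per_run)


# Assumed 8-hour night, no overhead between runs.
print(experiments_per_night(8))
# Same night with an assumed 1 minute of agent overhead per run.
print(experiments_per_night(8, overhead_minutes=1))
```

With zero overhead this yields 96 runs; with one minute of overhead per run it drops to 80, so the real count depends on how quickly the agent turns experiments around.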
curl -LsSf https://astral.sh/uv/install.sh | sh
git clone https://github.com/karpathy/autoresearch
cd autoresearch && uv sync
uv run prepare.py # ~2 min, one-time
uv run train.py # establishes baseline
# In your AI agent, say:
"Have a look at program.md and let's kick off a new experiment!"
# The agent handles everything from here.
by Rizvi Haider