Fix agent crash blindspot by forcing it to read traceback by dumko2001 · Pull Request #17 · karpathy/autoresearch

dumko2001 · 2026-03-07T09:16:48Z

Currently, program.md instructs the agent to evaluate runs via grep "^val_bpb:\|^\peak_vram_mb:" run.log. When a syntax error or OOM crash occurs, this grep outputs nothing. Because the agent never explicitly reads the raw log, it never actually sees the Python stack trace. It just hallucinates what the bug might have been, crippling its ability to 'fix it and re-run.'

This PR adds a one-sentence instruction for the agent to run tail -n 50 run.log if the grep is empty, ensuring it safely acquires the stack trace to successfully self-heal trivial crashes.

Allow agent to diagnose crashes by reading the python stack trace

bdf0c0d

karpathy merged commit bd75534 into karpathy:master Mar 7, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix agent crash blindspot by forcing it to read traceback#17

Fix agent crash blindspot by forcing it to read traceback#17
karpathy merged 1 commit intokarpathy:masterfrom
dumko2001:fix-agent-crash-blindspot

dumko2001 commented Mar 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dumko2001 commented Mar 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants