None defined yet.
Toward Autonomous Long-Horizon Engineering for ML Research
BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?