COMMITS
January 22, 2026
B
Update citing us section in README (#1402)
Boxuan Li committed
A
Rename tasks folder to original-tasks (#1382)
Alex Shaw committed
December 4, 2025
A
Update the readme to point to harbor.
Alex Shaw committed
November 19, 2025
H
complete the additional notes prompt (#1360)
Harsh Raj committed
November 7, 2025
A
Fix a yaml.
Alex Shaw committed
November 4, 2025
T
Update goose_agent.py to enable built in todo extension (#1344)
tlongwell-block committed
November 3, 2025
T
Update goose_agent.py to explicitly include needed developer extension (#1341)
tlongwell-block committed
November 2, 2025
I
3d-file-format-task (#1343)
Ivgeni "Iv" Segal committed
October 28, 2025
B
New task: Debug memory crash (#1335)
Boxuan Li committed
B
Port enhancements and fixes from terminal-bench-1.5 (#1322)
Boxuan Li committed
October 24, 2025
N
Chess in a regex (#1330)
Nicholas Carlini committed
N
fix zork (#1329)
Nicholas Carlini committed
N
Gcode (#1328)
Nicholas Carlini committed
October 23, 2025
I
Increase max agent timeout from 360 to 750 seconds (#1321)
Ivan Bercovich committed
A
Remove a silly constraint.
Alex Shaw committed
October 21, 2025
S
Revert "Revert "Add Cybench adapter (#775)" (#1313)" (#1315)
Slimshilin committed
I
Fix/remove custom docker compose intall win 3.11 (#1309)
Ivan Bercovich committed
B
Fix task install-klee-minimal (#1304)
Boxuan Li committed
B
Fix test sanity for three tasks (#1305)
Boxuan Li committed
H
Revert "Add Cybench adapter (#775)" (#1313)
Harsh Raj committed
October 20, 2025
G
Add Cybench adapter (#775)
gary committed
October 16, 2025
A
Remove sketchy test.
Alex Shaw committed
E
Improve cprofiling-python and png-generation tasks (#1077)
EtashGuha committed
October 15, 2025
H
Update test to check for hello.txt instead of hello-world.txt (#1294)
Haowei Lin committed
B
Fixes to move-helper yaml and rare-mineral-allocation yaml (#1051)
BardiaKoopah committed
October 13, 2025
J
Fixed #1165: Updated correctness boundry to be percentage based (#1168)
Jan-Lucas Uslu committed
M
Improve spec
Mike Merrill committed
J
add modernize-scientific-stack task (#966)
Jianbo Wu committed
October 11, 2025
I
Update styleguide.md (#1290)
Ivan Bercovich committed
October 8, 2025
Y
Add new task: Portfolio optimization (#942)
Yanhao Li committed