DCAgent2/DCAgent_dev_set_71_tasks_laion_rl_r2egym-nl2bash-stack-bugsseq-fixthink-again_lea079497 • Updated about 2 hours ago
DCAgent2/swebench_verified_random_100_folders_exp_tas_timeout_multiplier_1_0_traces_2026ded1ad06 Viewer • Updated about 7 hours ago • 300
DCAgent2/terminal_bench_2_exp_psu_stackoverflow_316_glm_4_7_traces_20260312_194955 Viewer • Updated about 10 hours ago • 267
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n507e7ebe Viewer • Updated about 10 hours ago • 267
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nb63a57d0 Viewer • Updated about 13 hours ago • 267
DCAgent2/terminal_bench_2_exp_psu_stackoverflow_1K_glm_4_7_traces_20260312_210803 Viewer • Updated about 14 hours ago • 267
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n85f4232a Viewer • Updated about 14 hours ago • 267
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n8cdafffb Viewer • Updated about 15 hours ago • 267
DCAgent2/terminal_bench_2_exp_psu_stackoverflow_3K_glm_4_7_traces_20260312_210804 Viewer • Updated about 15 hours ago • 267