WPRM
community
models 53
WPRM/qwen2.5-ar-reward-rejected-action-ablation-1
3B • Updated • 1
WPRM/llama-3.1-8b-ar-rm-mtl
8B • Updated
WPRM/qwen3-8b-ar-reward-cot-mtl-checklist-enhanced
8B • Updated • 1
WPRM/qwen-3b-ar-reward-cot-mtl-checklist-enhanced
3B • Updated
WPRM/qwen3-8b-checklist-enhanced
8B • Updated
WPRM/qwen3-ar-reward-cot-mtl-same-ratio-epoch2
8B • Updated • 1
WPRM/qwen3-ar-reward-cot-mtl
8B • Updated
WPRM/qwen3-ar-reward-cot-mtl-epoch1
8B • Updated
WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal_mtl
4B • Updated • 1
WPRM/qwen2.5-ar-reward-cot-mtl
3B • Updated
datasets 119
WPRM/gitlab_failed_data
Viewer • Updated • 16 • 22
WPRM/ours_8b_mtl_enhanced_annotated_workarena_checklist
Viewer • Updated • 334 • 9
WPRM/ours_3b_mtl_enhanced_annotated_workarena_checklist
Viewer • Updated • 334 • 21
WPRM/4omini_obs_annotated_workarena_checklist
Viewer • Updated • 334 • 26
WPRM/ours_llama_8b_annotated_walite_combined_checklist
Viewer • Updated • 812 • 10
WPRM/workarena_checklist_raw
Viewer • Updated • 334 • 15
WPRM/human_dataset_sample_50
Viewer • Updated • 50 • 15
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-3
Viewer • Updated • 21.8k • 21
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-2
Viewer • Updated • 18.1k • 20
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-1
Viewer • Updated • 12.1k • 24