Fang's picture

Fang

missing12

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

liked a dataset about 1 year ago

liked a dataset about 1 year ago

cardiffnlp/tweet_eval

View all activity

Organizations

None yet

upvoted a paper 16 days ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published 20 days ago • 134

liked 2 datasets about 1 year ago

SetFit/ag_news

Viewer • Updated Jan 19, 2022 • 128k • 3.07k • 7

cardiffnlp/tweet_eval

Viewer • Updated Jan 4, 2024 • 201k • 38.3k • 144

liked a model about 1 year ago

mrm8488/ddpm-ema-pokemon-v2-64

Updated Aug 11, 2022 • 1 • 1

upvoted a collection over 1 year ago

Synthetic Data and Self-Improvement

113 items • Updated Sep 26, 2025 • 9

liked 9 datasets over 1 year ago

prometheus-eval/Feedback-Collection

Viewer • Updated Oct 14, 2023 • 100k • 823 • 120

prometheus-eval/Preference-Collection

Viewer • Updated May 3, 2024 • 200k • 267 • 40

GAIR/preference-dissection

Viewer • Updated Feb 20, 2024 • 5.24k • 26 • 9

lmarena-ai/arena-hard-auto-v0.1

Viewer • Updated Sep 4, 2024 • 500 • 542 • 6

allenai/WildBench

Viewer • Updated Mar 4, 2025 • 2.3k • 2.77k • 39

lmarena-ai/webdev-arena-preference-10k

Viewer • Updated Mar 10, 2025 • 10.5k • 363 • 16

lmsys/lmsys-chat-1m

Viewer • Updated Jul 27, 2024 • 1M • 8.05k • 916

lmarena-ai/arena-human-preference-55k

Viewer • Updated May 17, 2024 • 57.5k • 1.28k • 159

lmsys/mt_bench_human_judgments

Viewer • Updated Jul 20, 2023 • 5.76k • 2.5k • 144