Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sumuk Shashidhar's picture
10 9 17

Sumuk Shashidhar PRO

sumuks
Fishtiks's profile picture hshetty's profile picture adamm-hf's profile picture
·
https://sumuk.org
  • sumukx
  • sumukshashidhar
  • sumuks

AI & ML interests

Evaluations, Reasoning, Long Term Planning

Recent Activity

updated a dataset 12 days ago
sumuks/preference-atlas-rewards
published a dataset 12 days ago
sumuks/preference-atlas-rewards
liked a dataset 13 days ago
sumuks/preference-atlas
View all activity

Organizations

Blog-explorers's profile picture Verifiers For Code's profile picture Preference Agents's profile picture Sumuk's Archived Content's profile picture UIUC Conversational AI Lab's profile picture self-planner's profile picture Nerdy Face's profile picture Sumuk's Testing Grounds!'s profile picture Spiral Works's profile picture Your Bench's profile picture Sumuk's Second Set of Archived Content's profile picture InfoHunt's profile picture TextCleanLM's profile picture Sumuk's First Archival Storage Volume's profile picture popper's profile picture Sumuk's Archival Storage 2's profile picture Sumuk's Archival Storage 3's profile picture

Articles 1

Article
4

Getting Started with YourBench

Papers 5

arxiv:2505.01592
arxiv:2504.20090
arxiv:2504.01833
arxiv:2410.03731

models 0

None public yet

datasets 28

sumuks/preference-atlas-rewards

Viewer • Updated 12 days ago • 5.03k • 29

sumuks/preference-atlas

Viewer • Updated 13 days ago • 329k • 102 • 1

sumuks/reward-bench-2

Viewer • Updated 13 days ago • 1.87k • 43

sumuks/helpsteer3

Viewer • Updated 14 days ago • 49.1k • 248

sumuks/helpsteer3-easy

Viewer • Updated 20 days ago • 7.93k • 29

sumuks/helpsteer-pairwise-grading

Viewer • Updated 25 days ago • 51.8k • 19

sumuks/rupo-eval-logs-helpsteer3-1

Viewer • Updated 26 days ago • 1.43k • 35

sumuks/helpsteer3-rupo

Viewer • Updated 27 days ago • 38.2k • 167

sumuks/persuasiveness_detection

Viewer • Updated Feb 6 • 3.94k • 16

sumuks/rupo-eval-humanlike-dpo-dataset-lbhr-2

Preview • Updated Feb 6 • 14
View 28 datasets
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs