Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
87
EvalEval Bot
EvalEvalBot
Follow
evijit's profile picture
1 follower
·
2 following
AI & ML interests
None yet
Recent Activity
new
activity
about 5 hours ago
evaleval/EEE_datastore:
[Submission] TAB Error Recovery - 9 models, third-party evaluation
updated
a dataset
2 days ago
EvalEvalBot/eee-submission-index
new
activity
2 days ago
Qwen/Qwen2.5-VL-72B-Instruct:
Add EvalEval community eval results (mmlu_pro.yaml)
View all activity
Organizations
EvalEvalBot
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
evaleval/EEE_datastore
about 5 hours ago
[Submission] TAB Error Recovery - 9 models, third-party evaluation
1
#140 opened about 5 hours ago by
RodTAB
updated
a dataset
2 days ago
EvalEvalBot/eee-submission-index
Viewer
•
Updated
2 days ago
•
1
•
20
New activity in
Qwen/Qwen2.5-VL-72B-Instruct
2 days ago
Add EvalEval community eval results (mmlu_pro.yaml)
#35 opened 2 days ago by
EvalEvalBot
published
a dataset
2 days ago
EvalEvalBot/eee-submission-index
Viewer
•
Updated
2 days ago
•
1
•
20
published
a bucket
4 days ago
EvalEvalBot/EEE_datastore_bucket
0 Bytes
updated
a dataset
4 days ago
evaleval/EEE_datastore
Updated
4 days ago
•
81.1k
•
27
New activity in
evaleval/EEE_datastore
7 days ago
[Submission] Latest LiveBench Data
2
#138 opened 7 days ago by
reuank
Fix LLM Stats provenance relationships
2
#137 opened 8 days ago by
Cerru02
New activity in
evaleval/EEE_datastore
8 days ago
[ACL Shared Task] wmt25_bhojpuri_maasai: Low-resource MT evaluation (Bhojpuri & Maasai)
3
#133 opened 23 days ago by
jboat
New activity in
evaleval/EEE_datastore
13 days ago
Shared Task - Submission
1
#136 opened 13 days ago by
UsmanGohar
[ACL Shared Task] Add OpenAI MRCR v2 (8-needle) leaderboard results
5
#119 opened 24 days ago by
bwingenroth
New activity in
evaleval/EEE_datastore
18 days ago
[ACL Shared Task] Add PACEBench evaluation results
4
#77 opened about 1 month ago by
mrpfisher
New activity in
evaleval/EEE_datastore
19 days ago
[ACL Shared Task] Add Chatbot Arena
16
#110 opened 26 days ago by
muhammadravi251001
[ACL Shared Task] Add AlpacaEval
7
#129 opened 23 days ago by
muhammadravi251001
New activity in
evaleval/EEE_datastore
20 days ago
[Submission] Journalistic-Bias Revised
1
#135 opened 20 days ago by
WanderingIsle
New activity in
evaleval/EEE_datastore
22 days ago
Parquet for dataset viewer
#134 opened 22 days ago by
EvalEvalBot
Generating Parquets
6
#58 opened about 2 months ago by
EvalEvalBot
[ACL Shared Task] Add LingOly benchmark results
5
#78 opened about 1 month ago by
ambean
[ACL Shared Task] Contribute MT-Bench results
4
#124 opened 24 days ago by
ameek
[ACL Shared Task] Contribute Humanity's Last Exam results
7
#125 opened 24 days ago by
ameek
Load more