Scale AI
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Agentic Rubrics as Contextual Verifiers for SWE Agents
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
datasets 25
ScaleAI/SWE-Atlas-QnA
Viewer
• Updated
• 124 • 137 • 12
ScaleAI/RaR-Medicine
Viewer
• Updated
• 22.4k • 21 • 1
ScaleAI/RaR-Science
Viewer
• Updated
• 22.9k • 24 • 1
ScaleAI/SWE-bench_Pro
Benchmark
• Updated
• 731 • 165k • 54
ScaleAI/mrt
Updated
• 14.2k • 4
ScaleAI/audiomc
Viewer
• Updated
• 452 • 1.52k • 13
ScaleAI/lhaw
Viewer
• Updated
• 285 • 186 • 4
ScaleAI/SciPredict
Viewer
• Updated
• 405 • 27 • 1
ScaleAI/PRBench
Viewer
• Updated
• 1.65k • 369 • 6
ScaleAI/MCP-Atlas
Viewer
• Updated
• 500 • 1.53k • 10