SkillClaw: Let Skills Evolve Collectively with Agentic Evolver • Paper • 2604.08377 • Published 3 days ago • 221
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability • Paper • 2604.06628 • Published 4 days ago • 228
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping • Paper • 2604.08364 • Published 3 days ago • 88
MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios • Paper • 2603.28130 • Published 13 days ago • 10
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise • Paper • 2602.12783 • Published Feb 13 • 216
Meta-Harness: End-to-End Optimization of Model Harnesses • Paper • 2603.28052 • Published 13 days ago • 16