EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions Paper • 2606.23654 • Published 10 days ago • 79
PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems Paper • 2606.22388 • Published 11 days ago • 96
ShutterMuse: Capture-Time Photography Guidance with MLLMs Paper • 2606.25763 • Published 8 days ago • 46
Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation Paper • 2606.26907 • Published 7 days ago • 48
KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking Paper • 2606.22807 • Published 10 days ago • 49
Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models Paper • 2606.25041 • Published 9 days ago • 111
MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation with Multi-turn Local Revision Paper • 2606.17162 • Published 17 days ago • 175
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 9 days ago • 144
MinerU-Popo: Universal Post-Processing Model for Structured Document Parsing Paper • 2605.24973 • Published May 24 • 1
view article Article Introducing the FFASR Leaderboard: Benchmarking ASR in the Real World +3 daniel-treble, whojavumusic, alessia-treble, georg-goetz, bezzam • 8 days ago • 7
view article Article PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters PaddlePaddle • 10 days ago • 26