Adaptive Preference Optimization with Uncertainty-aware Utility Anchor Paper • 2509.10515 • Published Sep 3, 2025 • 1
Adaptive Preference Optimization with Uncertainty-aware Utility Anchor Paper • 2509.10515 • Published Sep 3, 2025 • 1
TongSIM: A General Platform for Simulating Intelligent Machines Paper • 2512.20206 • Published Dec 23, 2025 • 28
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published Dec 8, 2025 • 78
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6, 2025 • 215