Collections

Discover the best community collections!

Collections including paper arxiv:2601.09088

Alibaba-Apsara/DASD-4B-Thinking

Text Generation • 4B • Updated 6 days ago • 470 • 83
Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview

Text Generation • 31B • Updated 6 days ago • 122 • 39
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b

Viewer • Updated 6 days ago • 306k • 5.3k • 170
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob

Viewer • Updated 6 days ago • 435k • 1.07k • 27

about 10 hours ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published 21 days ago • 138
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published 20 days ago • 101
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents

Paper • 2512.23343 • Published 23 days ago • 27
Scaling Open-Ended Reasoning to Predict the Future

Paper • 2512.25070 • Published 20 days ago • 15

Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models

Paper • 2504.13626 • Published Apr 18, 2025 • 7
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20, 2025 • 62
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents

Paper • 2509.06917 • Published Sep 8, 2025 • 41
hongliu9903/stack_edu_python

Viewer • Updated Jul 31, 2025 • 25.3M • 27 • 1

Reasoning Papers

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Paper • 2508.07629 • Published Aug 11, 2025 • 43
Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning

Paper • 2508.07101 • Published Aug 9, 2025 • 14
Compressing Chain-of-Thought in LLMs via Step Entropy

Paper • 2508.03346 • Published Aug 5, 2025 • 8
Train Long, Think Short: Curriculum Learning for Efficient Reasoning

Paper • 2508.08940 • Published Aug 12, 2025 • 27

reasoning_model

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 92
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 102
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute

Paper • 2509.04475 • Published Aug 30, 2025 • 3
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 100
