-
Alibaba-Apsara/DASD-4B-Thinking
Text Generation • 4B • Updated • 470 • 83 -
Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview
Text Generation • 31B • Updated • 122 • 39 -
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b
Viewer • Updated • 306k • 5.3k • 170 -
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob
Viewer • Updated • 435k • 1.07k • 27
Collections
Discover the best community collections!
Collections including paper arxiv:2601.09088
-
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
Paper • 2512.24618 • Published • 138 -
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
Paper • 2512.24873 • Published • 101 -
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
Paper • 2512.23343 • Published • 27 -
Scaling Open-Ended Reasoning to Predict the Future
Paper • 2512.25070 • Published • 15
-
Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models
Paper • 2504.13626 • Published • 7 -
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models
Paper • 2505.14810 • Published • 62 -
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents
Paper • 2509.06917 • Published • 41 -
hongliu9903/stack_edu_python
Viewer • Updated • 25.3M • 27 • 1
-
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
Paper • 2508.07629 • Published • 43 -
Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
Paper • 2508.07101 • Published • 14 -
Compressing Chain-of-Thought in LLMs via Step Entropy
Paper • 2508.03346 • Published • 8 -
Train Long, Think Short: Curriculum Learning for Efficient Reasoning
Paper • 2508.08940 • Published • 27
-
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
Paper • 2511.16334 • Published • 92 -
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper • 2509.07980 • Published • 102 -
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute
Paper • 2509.04475 • Published • 3 -
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
Paper • 2512.01374 • Published • 100
-
Alibaba-Apsara/DASD-4B-Thinking
Text Generation • 4B • Updated • 470 • 83 -
Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview
Text Generation • 31B • Updated • 122 • 39 -
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b
Viewer • Updated • 306k • 5.3k • 170 -
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob
Viewer • Updated • 435k • 1.07k • 27
-
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
Paper • 2508.07629 • Published • 43 -
Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
Paper • 2508.07101 • Published • 14 -
Compressing Chain-of-Thought in LLMs via Step Entropy
Paper • 2508.03346 • Published • 8 -
Train Long, Think Short: Curriculum Learning for Efficient Reasoning
Paper • 2508.08940 • Published • 27
-
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
Paper • 2512.24618 • Published • 138 -
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
Paper • 2512.24873 • Published • 101 -
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
Paper • 2512.23343 • Published • 27 -
Scaling Open-Ended Reasoning to Predict the Future
Paper • 2512.25070 • Published • 15
-
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
Paper • 2511.16334 • Published • 92 -
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper • 2509.07980 • Published • 102 -
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute
Paper • 2509.04475 • Published • 3 -
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
Paper • 2512.01374 • Published • 100
-
Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models
Paper • 2504.13626 • Published • 7 -
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models
Paper • 2505.14810 • Published • 62 -
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents
Paper • 2509.06917 • Published • 41 -
hongliu9903/stack_edu_python
Viewer • Updated • 25.3M • 27 • 1