9 24 2

Jiahang Xu

Jiahang

JiahangXu

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

upvoted a paper 9 months ago

rStar2-Agent: Agentic Reasoning Technical Report

upvoted a paper 9 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

View all activity

Organizations

upvoted a paper 6 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 9 days ago • 206

upvoted 2 papers 9 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28, 2025 • 118

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238

authored a paper 9 months ago

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19, 2025 • 48

upvoted a paper 9 months ago

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19, 2025 • 48

upvoted 5 papers 10 months ago

Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models

Paper • 2508.10751 • Published Aug 14, 2025 • 29

upvoted 2 papers 12 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

upvoted 2 papers about 1 year ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3, 2025 • 91

upvoted a paper over 1 year ago

LongRoPE2: Near-Lossless LLM Context Window Scaling

Paper • 2502.20082 • Published Feb 27, 2025 • 36

published a model over 1 year ago

Jiahang/Qwen2.5-1.5B-Open-R1-Distill

Updated Feb 24, 2025

upvoted 2 papers over 1 year ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20, 2025 • 47

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization

Paper • 2502.04295 • Published Feb 6, 2025 • 13

commented a paper over 1 year ago

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization

Paper • 2502.04295 • Published Feb 6, 2025 • 13 •

upvoted a paper over 1 year ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13, 2025 • 101

Jiahang Xu

AI & ML interests

Recent Activity

Organizations

Jiahang's activity