Submitted by
zihan
AI & ML interests
None defined yet.
Papers
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning
From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space