AI & ML interests
None defined yet.
Recent Activity
Papers
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning
From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space
CASIA's models
None public yet