AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning
From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space
models 0
None public yet
datasets 0
None public yet