submission19025
AI & ML interests
None yet
Recent Activity
upvoted a paper 12 days ago
Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning
upvoted an article about 1 month ago
Re-understanding KL Approximation from an RL-for-LLM Lens: Notes on “Approximating KL Divergence”
liked a dataset 4 months ago
ftajwar/deduplicated_dapo_dataset
Organizations
None yet