arxiv:2407.01790
Yumeng Li
Yumeng
AI & ML interests
Generative Models, Vision-Language Models, Out-of-Distribution Generalization
Recent Activity
upvoted an article 11 days ago
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond
liked a Space 6 months ago
nanotron/ultrascale-playbook
liked a dataset 9 months ago
TIGER-Lab/OmniEdit-Filtered-1.2M
Organizations
None yet