arxiv:2605.06638
Eric Lan
Eric-Lan
AI & ML interests
Reinforcement Fine-Tuning, Reinforcement Learning, RLHF/VR, LLM Alignment, Reasoning, Diffusion Model, Speculative Decoding, Federated Learning
Recent Activity
authored a paper 1 day ago
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
upvoted a paper 4 days ago
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key
liked a model 5 months ago
huseyinatahaninan/Qwen2.5-7B-Instruct-CI