Self-Hinting Language Models Enhance Reinforcement Learning Paper • 2602.03143 • Published 13 days ago • 29
SAGE Collection Self-Hinting Language Models Enhance Reinforcement Learning • 19 items • Updated 7 days ago • 2
Qwen3-MoE Collection Compressed Qwen3 MoE models with a reduced number of experts. See additional models at https://huggingface.co/bknyaz. • 9 items • Updated 5 days ago • 2
ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought Paper • 2601.23184 • Published 17 days ago • 35