Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
CL Yu's picture
2 9 13

CL Yu

clyu
21world's profile picture
·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago
Qwen/Qwen3-Coder-Next
submitted a paper 2 days ago
Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training
upvoted an article 4 days ago
We Got Claude to Build CUDA Kernels and teach open models!
View all activity

Organizations

DuckAI's profile picture n-alignment's profile picture

submitted a paper to Daily Papers 2 days ago

Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training

Paper • 2602.05933 • Published 3 days ago • 5
authored 2 papers 6 months ago

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Paper • 2505.16421 • Published May 22, 2025 • 19

Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models

Paper • 2505.16265 • Published May 22, 2025 • 8
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs