Submitted by
Yuting Ning
AI & ML interests
Natural language processing, language models, language agents
Recent Activity
View all activity
Papers
When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents
Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation