What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Recent Activity
upvoted
a
paper
about 4 hours ago
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning
submitted
a paper
about 4 hours ago
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning
authored
a paper
13 days ago
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning
and Online Reinforcement Learning