AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning
-
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora
Reinforcement Learning • 8B • Updated • 281 -
RLinf/RLinf-OpenVLAOFT-ManiSkill-Base-Main
8B • Updated • 325 -
RLinf/RLinf-OpenVLAOFT-LIBERO-90-Base-Lora
Reinforcement Learning • 8B • Updated • 64 -
RLinf/RLinf-OpenVLAOFT-LIBERO-130
Reinforcement Learning • 8B • Updated • 59 • 2
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning
-
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora
Reinforcement Learning • 8B • Updated • 281 -
RLinf/RLinf-OpenVLAOFT-ManiSkill-Base-Main
8B • Updated • 325 -
RLinf/RLinf-OpenVLAOFT-LIBERO-90-Base-Lora
Reinforcement Learning • 8B • Updated • 64 -
RLinf/RLinf-OpenVLAOFT-LIBERO-130
Reinforcement Learning • 8B • Updated • 59 • 2
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning