rednote-hilab

company

https://github.com/rednote-hilab

AI & ML interests

None defined yet.

Recent Activity

floyed submitted a paper 4 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

ygfrancois new activity about 1 month ago

rednote-hilab/dots.mocr:Add ParseBench evaluation results

ygfrancois new activity about 1 month ago

rednote-hilab/dots.mocr:Add MDPBench evaluation results

View all activity

Papers

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

View all Papers

rednote-hilab 's datasets

None public yet