Self-Hinting Language Models Enhance Reinforcement Learning
Baohao Liao
baohao
AI & ML interests
NLP
Recent Activity
updated
a model about 6 hours ago
baohao/byt5-base-optim_clean-final_fold5-3_ep20bs1x8lr8e-5_bestavg3 published
a model about 6 hours ago
baohao/byt5-base-optim_clean-final_fold5-3_ep20bs1x8lr8e-5_bestavg3 updated
a model about 6 hours ago
baohao/byt5-base-optim_clean-final_fold5-2_ep20bs2x16lr1e-4_bestavg3