·
AI & ML interests
None yet
Organizations
None yet
models
14
IDoNotHaveAName/aug_model
Updated
IDoNotHaveAName/Hint-informed-env
Updated
IDoNotHaveAName/GRPO-800-reproduction
Updated
IDoNotHaveAName/PRM-GRPO-800-1.5B
Updated
IDoNotHaveAName/Hint-Informed-grpo
2B
•
Updated
IDoNotHaveAName/reproduce-grpo-1.5B
Updated
IDoNotHaveAName/model-trainby-mistake
Text Generation
•
2B
•
Updated
•
1
IDoNotHaveAName/2epoch-experiment
Text Generation
•
2B
•
Updated
•
1
IDoNotHaveAName/X-R1-3epoch
Text Generation
•
2B
•
Updated
•
1
IDoNotHaveAName/GRPO-1epoch-train-by-mistake-collections-without-hint
Text Generation
•
2B
•
Updated
•
2
datasets
29
IDoNotHaveAName/example_question
Viewer
•
Updated
•
1
IDoNotHaveAName/big_dataset
Viewer
•
Updated
•
1.65k
•
1
Viewer
•
Updated
•
1.01k
•
3
Viewer
•
Updated
•
1.03k
IDoNotHaveAName/new_dataset
Viewer
•
Updated
•
1.06k
•
2
IDoNotHaveAName/experiment_rag_X_R1
Viewer
•
Updated
•
1.03k
•
2
IDoNotHaveAName/small_data
Viewer
•
Updated
•
16
•
1
IDoNotHaveAName/boxed-amc
Viewer
•
Updated
•
40
•
6
IDoNotHaveAName/mmlu-qa-elementary-math
Viewer
•
Updated
•
378
•
2
•
1
IDoNotHaveAName/x-r1-random-800
Viewer
•
Updated
•
800
•
4