arxiv:2412.03123
Jinghan Jia
flyingbugs
models (146)
flyingbugs/bi_unlearn_wmdp • Text Generation • 7B • 3
flyingbugs/OpenR1-Qwen-math-7B-SFT-mid-only • Text Generation • 8B • 1
flyingbugs/qwen-65-open-r1 • Text Generation • 8B
flyingbugs/GeneralThought-195K-65-qwen7b • Text Generation • 8B • 3
flyingbugs/limo-solutions-deepseek-qwen-7b • Text Generation • 8B
flyingbugs/deepseek-distilled-qwen-7b-rl • Text Generation • 8B • 2
flyingbugs/Qwen2.5-Math-7B-limo-32b • Text Generation • 8B • 1
flyingbugs/Qwen2.5-math-1.5B-Open-R1-Distill-eos-new • Text Generation • 2B • 1
flyingbugs/Qwen2.5-1.5B-Open-R1-Distill-eos-epic-new • Text Generation • 2B
flyingbugs/Qwen2.5-math-1.5B-Open-R1-Distill-eos • Text Generation • 2B
datasets (83)
flyingbugs/OpenR1-Math-220k-pruned-mid • Viewer • 93.7k • 26
flyingbugs/GeneralThought-195K-65 • Viewer • 127k • 24
flyingbugs/limo-solutions-deepseek • Viewer • 817 • 11
flyingbugs/star1_rlhf_train • Viewer • 1k • 3
flyingbugs/limo-deepseek32b-responses • Viewer • 817 • 9
flyingbugs/OpenR1-Math-220k-random-0.65-subset • Viewer • 60.9k • 15
flyingbugs/pku_safe_rlhf_ • Viewer • 73.9k • 31
flyingbugs/aime_2024 • Viewer • 30 • 1
flyingbugs/pure_math • Viewer • 17.4k • 2
flyingbugs/pku_safe_rlhf_combined_math • Viewer • 91.3k • 6