AI & ML interests
Data-centric AI, LLM, MLLM
Recent Activity
Papers
Closing the Data Loop: Using OpenDataArena to Engineer Superior Training Datasets
OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
Organization Card
High-quality dataset for financial LLM post-training.
models 11
OpenDataArena/ODA-Fin-RL-8B
Reinforcement Learning β’ 8B β’ Updated
OpenDataArena/ODA-Fin-SFT-8B
Question Answering β’ 308k β’ Updated
β’ 11 β’ 1
OpenDataArena/MMFineReason-4B
Visual Question Answering β’ Updated
β’ 50 β’ 14
OpenDataArena/MMFineReason-2B
Visual Question Answering β’ 2B β’ Updated
β’ 10 β’ 8
OpenDataArena/MMFineReason-8B
Visual Question Answering β’ 9B β’ Updated
β’ 72 β’ 10
OpenDataArena/Qwen3-8B-ODA-Math-460k
Text Generation β’ 308k β’ Updated
β’ 7 β’ 1
OpenDataArena/Qwen2.5-7B-ODA-Math-460k
Text Generation β’ 8B β’ Updated
β’ 2
OpenDataArena/Qwen3-8B-ODA-Mixture-100k
Text Generation β’ 308k β’ Updated
β’ 50 β’ 1
OpenDataArena/Qwen3-8B-ODA-Mixture-500k
Text Generation β’ 308k β’ Updated
β’ 17
OpenDataArena/Qwen2.5-7B-ODA-Mixture-100k
Text Generation β’ 333k β’ Updated
β’ 9
datasets 12
OpenDataArena/ODA-Fin-SFT-318k
Viewer
β’ Updated
β’ 5 β’ 36
OpenDataArena/ODA-Fin-RL-12k
Viewer
β’ Updated
β’ 12.4k β’ 12
OpenDataArena/ODA-Mixture-500k
Viewer
β’ Updated
β’ 506k β’ 230 β’ 122
OpenDataArena/ODA-scored-data-2603
Viewer
β’ Updated
β’ 6.49M β’ 50 β’ 5
OpenDataArena/MMFineReason-1.8M-Qwen3-VL-235B-Thinking
Viewer
β’ Updated
β’ 1.81M β’ 2.15k β’ 118
OpenDataArena/MMFineReason-SFT-123K-Qwen3-VL-235B-Thinking
Viewer
β’ Updated
β’ 123k β’ 691 β’ 76
OpenDataArena/MMFineReason-SFT-586K-Qwen3-VL-235B-Thinking
Viewer
β’ Updated
β’ 586k β’ 396 β’ 6
OpenDataArena/MMFineReason-Full-2.3M-Qwen3-VL-235B-Thinking
Viewer
β’ Updated
β’ 2.29M β’ 4.92k β’ 64
OpenDataArena/ODA-Math-460k
Viewer
β’ Updated
β’ 460k β’ 493 β’ 104
OpenDataArena/ODA-Mixture-100k
Viewer
β’ Updated
β’ 101k β’ 229 β’ 97