Hyeongwon/P2-split2_bs256_prob_Qwen3-4B-Base_0317-01 Text Generation • 196k • Updated about 4 hours ago
Hyeongwon/PH_det_sft_FC_swap_labewise_data_oversampling_bf16_lr0.00002_context_12k-Qwen3-8B-Base Text Generation • 308k • Updated 20 days ago • 85
Hyeongwon/PH_prob_sft_FC_swap_labewise_data_oversampling_bf16_lr0.00002_context_12k-Qwen3-8B-Base Text Generation • 308k • Updated 21 days ago • 97