koutch/paper_llama_llama3.1-8b_train_sft_train_edit • Text Generation • 8B • Updated about 16 hours ago • 25
koutch/paper_llama_llama3.1-8b_train_sft_train_code • Text Generation • 8B • Updated about 17 hours ago • 20
koutch/paper_llama_llama3.1-8b_train_sft_train_para • Text Generation • 8B • Updated about 17 hours ago • 81
koutch/paper_llama_llama3.1-8b_train_sft_train_dual • Text Generation • 8B • Updated about 17 hours ago • 28
koutch/paper_llama_llama3.1-8b_train_sft_all_train_dual • Text Generation • 8B • Updated about 19 hours ago • 27
koutch/paper_qwen_qwen3-instruct-4b_train_sft_all_train_dual • Text Generation • 4B • Updated about 20 hours ago • 8
koutch/paper_smol_smol3-3B_train_sft_all_train_dual • Text Generation • 3B • Updated about 20 hours ago • 8
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_dual • Text Generation • 4B • Updated about 21 hours ago • 10
koutch/paper_smol_smol3-3B_train_sft_train_code • Text Generation • 3B • Updated about 22 hours ago • 10
koutch/paper_smol_smol3-3B_train_sft_train_para • Text Generation • 3B • Updated about 22 hours ago • 64
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_edit • Text Generation • 4B • Updated about 22 hours ago • 8
koutch/paper_smol_smol3-3B_train_sft_train_dual • Text Generation • 3B • Updated about 22 hours ago • 10
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_code • Text Generation • 4B • Updated about 22 hours ago • 9
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_think • Text Generation • 4B • Updated 6 days ago • 49
koutch/paper_llama_llama3.1-8b_train_sft_train_no_think • Text Generation • 8B • Updated 6 days ago • 51
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_no_think • Text Generation • 4B • Updated 6 days ago • 51
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_para • Text Generation • 4B • Updated 6 days ago • 58
koutch/paper_qwen_qwen3-instruct-4b_train_sft_all_train_think • Text Generation • 4B • Updated 6 days ago • 46
koutch/paper_llama_llama3.1-8b_train_sft_all_train_think • Text Generation • 8B • Updated 7 days ago • 41
koutch/short_paper_llama_2.json_train_dpo_v1_train_no_think • Text Generation • 8B • Updated 9 days ago • 52
koutch/short_paper_llama_2.json_train_dpo_v2_train_no_think • Text Generation • 8B • Updated 9 days ago • 44
koutch/short_paper_qwen_2.json_train_dpo_v2_train_no_think • Text Generation • 4B • Updated 9 days ago • 48
koutch/short_paper_qwen_2.json_train_dpo_v1_train_no_think • Text Generation • 4B • Updated 9 days ago • 43