AI & ML interests
None yet
Organizations
None yet
PicchiEv/M2_DPO_beta0.1_epochs5_NO_SFT
0.6B
•
Updated
PicchiEv/MNLP_M3_dpo_model
0.6B
•
Updated
PicchiEv/M3_DPO_beta0.1_5epochs_SFT
0.6B
•
Updated
PicchiEv/M2_SimPo_SFT_beta_2_gamma1.5_5epochs
0.6B
•
Updated
PicchiEv/M2_SimPO_beta2_gamma1_5epochs_5e-6
0.6B
•
Updated
PicchiEv/M2_SimPO_beta2_gamma15_5epochs_5e-6
0.6B
•
Updated
PicchiEv/M2_DPO_beta01_lr5e-6-5epochs
0.6B
•
Updated
PicchiEv/UF_M3_SimPO_NO_SFT
0.6B
•
Updated
PicchiEv/UF_M3_SIMPObeta2
0.6B
•
Updated
PicchiEv/UF_STEM-DPO_5e-6_beta01
0.6B
•
Updated
0.6B
•
Updated
0.6B
•
Updated
PicchiEv/M3_SimPo_beta2_gamma15_SFT
0.6B
•
Updated
PicchiEv/SimPo_beta2_gamma1_SFT
0.6B
•
Updated
PicchiEv/SimPo_beta_2_gamma15_No_SFT
0.6B
•
Updated
PicchiEv/M2_LNDPO_beta10_SFT_ON
0.6B
•
Updated
0.6B
•
Updated
PicchiEv/LNDPO_beta5_noSFT
0.6B
•
Updated
PicchiEv/SimPO_small_beta10_gamma3
0.6B
•
Updated
0.6B
•
Updated
•
2
0.6B
•
Updated
PicchiEv/M3_beta0.1lr5e-6
0.6B
•
Updated
0.6B
•
Updated
0.6B
•
Updated
PicchiEv/MNLP_M2_lre-5_beta03
0.6B
•
Updated
0.6B
•
Updated
0.6B
•
Updated
PicchiEv/MNLP_M2_dpo_model
0.6B
•
Updated
0.6B
•
Updated
PicchiEv/MNLP_M2_DPO_NO_SFT
0.6B
•
Updated