payelb/UltraFeedback_openbmb_Llama-3.2-1B_aligned_with_semantic_MARS_deberta_RM Updated 11 minutes ago
payelb/UltraFeedback_openbmb_Llama-3.2-1B_aligned_with_semantic_MARS_roberta_RM Updated about 6 hours ago
payelb/HHRLHF_roberta-base_1k_fixed_MARS_semantic_distance_synth Text Classification • 0.1B • Updated 3 days ago • 20
payelb/PKUSafeRLHF_reward-model-deberta-v3-base_1k_fixed_MARS_semantic_refined Text Classification • 0.2B • Updated 4 days ago • 102