wgcyeo/ci-feedback_asym_bi_kl_hybrid_fixed_ema_Qwen3-4B_bw0p25_fw0p75_ema0p999_ep30 Updated 2 minutes ago
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B-Instruct-2507_bw0p25_fw0p75_ema0p999_ep30 Text Generation • Updated about 14 hours ago
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B-Instruct-2507_bw0p25_fw0p75_ema0p999_ep30 Text Generation • Updated about 14 hours ago
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B-Instruct-2507_bw0p75_fw0p25_ema0p999_ep30 Text Generation • Updated about 17 hours ago
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B-Instruct-2507_bw0p75_fw0p25_ema0p999_ep30 Text Generation • Updated about 17 hours ago
wgcyeo/ci-feedback_weighted_asymmetric_bidirectional_kl_fixed_ema_Qwen3-4B_bw0p25_fw0p75_ema0p999_ep30 Text Generation • Updated about 18 hours ago
wgcyeo/ci-feedback_weighted_asymmetric_bidirectional_kl_fixed_ema_Qwen3-4B_bw0p25_fw0p75_ema0p999_ep30 Text Generation • Updated about 18 hours ago
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents Paper • 2604.14004 • Published 4 days ago • 28
wgcyeo/ci-feedback_verbal_both_ema_Qwen2.5-3B-Instruct_reverse_kl_ema0p999_ep30 Text Generation • Updated 3 days ago • 12
wgcyeo/ci-feedback_verbal_both_ema_Qwen2.5-3B-Instruct_reverse_kl_ema0p999_ep30 Text Generation • Updated 3 days ago • 12
wgcyeo/ci-grpo_Qwen3-8B_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30 Text Generation • Updated 3 days ago • 12
wgcyeo/ci-grpo_Qwen3-8B_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30 Text Generation • Updated 3 days ago • 12
wgcyeo/ci-feedback_weighted_asymmetric_bi_kl_fixed_ema_Qwen2.5-14B-Instruct_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 3 days ago • 12
wgcyeo/ci-feedback_weighted_asymmetric_bi_kl_fixed_ema_Qwen2.5-14B-Instruct_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 3 days ago • 12
wgcyeo/ci-feedback_weighted_asymmetric_bidirectional_kl_fixed_ema_Qwen3-8B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 5 days ago • 14
wgcyeo/ci-feedback_weighted_asymmetric_bidirectional_kl_fixed_ema_Qwen3-8B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 5 days ago • 14
wgcyeo/ci-grpo_Qwen3-14B_bs8_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30 Text Generation • Updated 5 days ago • 13
wgcyeo/ci-grpo_Qwen3-14B_bs8_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30 Text Generation • Updated 5 days ago • 13
wgcyeo/ci-feedback_weighted_asymmetric_bidirectional_kl_fixed_ema_Qwen3-1.7B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 6 days ago • 10
wgcyeo/ci-feedback_weighted_asymmetric_bidirectional_kl_fixed_ema_Qwen3-1.7B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 6 days ago • 10