tongliuphysics/qwen3-4b-loopmultiturn3k-4096-rollout16-bs256-1201-fastv2-step40 • 4B • Updated about 10 hours ago
tongliuphysics/qwen3-4b-loopmultiturn3k-4096-rollout16-bs256-1201-fastv2-step20 • 4B • Updated about 10 hours ago
tongliuphysics/qwen3-4b-loopmultiturn3k-4096-rollout8-bs256-1201-savingprompts-step30 • 4B • Updated about 10 hours ago
tongliuphysics/qwen3-4b-loopmultiturn3k-4096-rollout8-bs256-1201-savingprompts-step40 • 4B • Updated about 10 hours ago
tongliuphysics/qwen3-4b-loopmultiturn3k-4096-rollout8-bs256-1201-savingprompts-step20 • 4B • Updated about 10 hours ago
tongliuphysics/qwen3-4b-loopmultiturn3k-4096-rollout8-bs256-1201-savingprompts-step10 • 4B • Updated about 10 hours ago
tongliuphysics/qwen3-4b-ins-normal-n1-singleturn666-binary-rollout8-bs256-0501 • 4B • Updated Jan 5 • 2
tongliuphysics/qwen-3binstruct-normal-0.5with5rollouts-granular-rollout5-340steps-addentropytoadvantage-alpha1 • 3B • Updated Nov 19, 2025
tongliuphysics/Mistral-7B-Base-SFT-FocalPO • Text Generation • 7B • Updated Nov 29, 2024 • 1 • 1