Delta Belief RL Collection of the models for our paper "Intrinsic Credit Assignment for Long Horizon Interaction" iaa01/CIA-1.7B 2B • Updated Feb 13 • 3 • 1 iaa01/CIA-4B 4B • Updated Feb 13 • 55 • 3 Klingspor/StarPO-1.7B Text Generation • 2B • Updated Feb 13 • 1 • Klingspor/StarPO-4B Text Generation • 4B • Updated Feb 13 • 6 • • 2
Delta Belief RL Collection of the models for our paper "Intrinsic Credit Assignment for Long Horizon Interaction" iaa01/CIA-1.7B 2B • Updated Feb 13 • 3 • 1 iaa01/CIA-4B 4B • Updated Feb 13 • 55 • 3 Klingspor/StarPO-1.7B Text Generation • 2B • Updated Feb 13 • 1 • Klingspor/StarPO-4B Text Generation • 4B • Updated Feb 13 • 6 • • 2