Eduardo Moura Almeida PRO
Dumoura
AI & ML interests
None yet
Organizations
models 15
Dumoura/smol-course-SmolVLM2-2.2B-Instruct-trl-sft-ChartQA
Updated
Dumoura/smollm3-dpo-aligned
Text Generation • 3B • Updated
• 1
Dumoura/LFM2-1.2B-job-sft
Text Generation • 1B • Updated
• 8 • 1
Dumoura/SmolLM3-Custom-SFT
Text Generation • 3B • Updated
• 1
Dumoura/chronotope-v2-shakespeare_char
22M • Updated
• 1
Dumoura/a2c-PandaPickAndPlace-v3
Reinforcement Learning • Updated
• 1
Dumoura/rl_vizdoom_health_gathering_supreme
Reinforcement Learning • Updated
Dumoura/Pixelcopter-PLE-v0
Reinforcement Learning • Updated
Dumoura/Reinforce-CartPole-v1
Reinforcement Learning • Updated
Dumoura/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning • Updated
• 1