·
AI & ML interests
None yet
Organizations
ibndias/qwen3-0.6b-reasoning-safeguard
1B • Updated • 1
ibndias/Anonymizer-0.6B-Q4_K_M-GGUF
0.6B • Updated • 2
ibndias/kanana-safeguard-8b-Q2_K-GGUF
Text Generation
• 8B • Updated • 4
ibndias/kanana-safeguard-8b-Q4_K_M-GGUF
Text Generation
• 8B • Updated • 1
ibndias/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 1
ibndias/gemma-3-1b-reasoning-grpo
Text Generation
• 1.0B • Updated • 3
ibndias/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
• 2B • Updated ibndias/Qwen-2.5-7B-Simple-RL
Updated
ibndias/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st
Text Generation
• 2B • Updated • 2
Reinforcement Learning
• Updated ibndias/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
• Updated ibndias/ppo-LunarLander-v2
Reinforcement Learning
• Updated • 2
ibndias/Nous-Hermes-2-MoE-2x34B
Text Generation
• 61B • Updated • 727
ibndias/NeuralHermes-MoE-2x7B
Text Generation
• 13B • Updated • 786
• 1
ibndias/mistral-7b-gtfobins-lora
Text Generation
• 7B • Updated • 6
ibndias/llama2-gtfobins-lora-3ep
ibndias/mistral-gtfobins-lora-3ep
ibndias/llama2-lora-gtfobins-1ep