Inference Optimization
community
models 102
inference-optimization/Qwen3-32B-Thinking-eagle3-ckpt5
2B • Updated • 4
inference-optimization/sarvam-105b-FP8-Dynamic
Text Generation • 106B • Updated • 44
inference-optimization/sarvam-30b-FP8-Dynamic
Text Generation • 32B • Updated • 87 • 1
inference-optimization/sarvam-30b-NVFP4
Text Generation • 19B • Updated • 49 • 1
inference-optimization/sarvam-105b-NVFP4
61B • Updated • 26 • 1
inference-optimization/Qwen3.5-35B-A3B-FP8-Dynamic
35B • Updated • 15
inference-optimization/Kimi-K2-Instruct-0905-BF16-FP8-BLOCK
Text Generation • 1T • Updated • 25
inference-optimization/MiniMax-M2.5-BF16
Text Generation • 229B • Updated • 98
inference-optimization/gpt-oss-20b-FP8-Dynamic
21B • Updated • 11
inference-optimization/Qwen3-Coder-Next.w8a8
80B • Updated • 28
datasets 0
None public yet