Can you also make zai-org/GLM-4.7-Flash same as you made this model?
🚀
1
#6 opened about 11 hours ago
by
dibu28
is it supposed to run this slow on a broadwell xeon?
1
#4 opened 9 days ago
by
PlatonicSkeptic
I use this quantized model for RAG, if context is long, always get empty output
7
#3 opened 10 days ago
by
gemlincong
Thinking, coder models
3
#1 opened 11 days ago
by
NIK2703