JunHowie

62 7 18

AI & ML interests

None yet

Recent Activity

new activity about 11 hours ago

QuantTrio/GLM-5.2-Int4-Int8Mix:GLM-5.2-Int4-Int8Mix generates only "!!!!!" on H100 8×80GB with vLLM 0.23.0/0.24.0 — IndexShare sparse indexer K-cache never populated during prefill

liked a model 7 days ago

QuantTrio/GLM-5.2-Int4-Int8Mix

new activity 7 days ago

QuantTrio/GLM-5.2-Int4-Int8Mix:Any chances for A100?

Organizations

New activity in QuantTrio/GLM-5.2-Int4-Int8Mix about 11 hours ago

GLM-5.2-Int4-Int8Mix generates only "!!!!!" on H100 8×80GB with vLLM 0.23.0/0.24.0 — IndexShare sparse indexer K-cache never populated during prefill

#3 opened about 14 hours ago by

kinggenguo

New activity in QuantTrio/GLM-5.2-Int4-Int8Mix 7 days ago

Any chances for A100?

#1 opened 7 days ago by

traphix

New activity in QuantTrio/GLM-5.1-AWQ 9 days ago

GLM-5.2-AWQ

#2 opened 14 days ago by

ag1988

New activity in QuantTrio/GLM-5-AWQ 2 months ago

[Request] Great work! Do you have plans to also create GLM-5.1-AWQ?

🤗 1

#6 opened 3 months ago by

ag1988

New activity in QuantTrio/Qwen3.6-35B-A3B-AWQ 2 months ago

Tested, very good quality. On par with Qwen3.5-27b-awq. Lots of thanks to QuantTrio

#2 opened 2 months ago by

New activity in QuantTrio/Qwopus3.5-27B-v3-AWQ-6Bit 2 months ago

Would be great to have 6bit AWQ with repaired tensors.

#1 opened 3 months ago by

slavap5

New activity in QuantTrio/Qwen3.5-27B-AWQ 2 months ago

Request for AWQ Quant of Qwen3.5-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-i1

#6 opened 3 months ago by

celikburak

New activity in QuantTrio/MiniMax-M2.5-AWQ 2 months ago

QuantTrio/MiniMax-M2.7-AWQ release?

👍 1

#3 opened 3 months ago by

sigbjobo

New activity in QuantTrio/Qwen3.5-27B-AWQ 2 months ago

This is the best quant version in the world, better than FP8

🚀 5

#2 opened 4 months ago by

New activity in QuantTrio/gemma-4-31B-it-AWQ-6Bit 2 months ago

Update chat_template.jinja according to upstream google official repository

#2 opened 3 months ago by

dayvidwelles

New activity in QuantTrio/GLM-5-AWQ 3 months ago

vLLM deployment failed

#3 opened 4 months ago by

Yuxin362

New activity in QuantTrio/sarvam-105b-AWQ 3 months ago

Do you take quant requests?

#1 opened 4 months ago by

pathosethoslogos

New activity in QuantTrio/Qwen3.5-9B-AWQ 3 months ago

why cuda12.8 needed?

#1 opened 3 months ago by

justplus

New activity in QuantTrio/Qwen3.5-27B-AWQ 4 months ago

--max-model-len 32768 seems a bit too small for agent use cases?

#3 opened 4 months ago by

edwarddukewu

New activity in arcee-ai/Trinity-Large-Preview 4 months ago

AWQ

🤝 3

#3 opened 5 months ago by

darkstar3537

New activity in QuantTrio/GLM-5-AWQ 4 months ago

Great work

#1 opened 4 months ago by

JoeyHwong

New activity in QuantTrio/Qwen3.5-397B-A17B-AWQ 4 months ago

Qwen3.5-397B-A17B-AWQ vs Qwen3.5-122B-A10B

#2 opened 4 months ago by

zuuky

New activity in QuantTrio/Kimi-K2.5-E304 4 months ago

Kimi-K2.5-E192 ?

#2 opened 5 months ago by

Rebis

New activity in QuantTrio/MiniMax-M2.5-AWQ 4 months ago

Qwen3.5 AWQ 4 Bit

#1 opened 4 months ago by

yuchenxie

New activity in QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ 4 months ago

Qwen3.5 AWQ

#3 opened 4 months ago by

timroethig
