- Qwen/Qwen2.5-0.5B-Instruct
  Text Generation • 0.5B • Updated • 6.74M • 478
- Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int4
  Text Generation • 0.5B • Updated • 1.04k • 9
- Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int8
  Text Generation • 0.5B • Updated • 672 • 10
- Qwen/Qwen2.5-1.5B-Instruct
  Text Generation • 2B • Updated • 8.42M • 634
Sree Harsha Nelaturu
deepmage121
AI & ML interests
Data and Compute Efficient Deep Learning.
Recent Activity
updated a dataset 7 days ago
deepmage121/drafter_split_training
published a dataset 7 days ago
deepmage121/drafter_split_training
new activity 20 days ago
evaleval/alphaxiv_datastore: Add alphaXiv SOTA raw scrape (5,069 files, 237MB)