AI & ML interests

None defined yet.

Recent Activity

AdinaY 
posted an update about 4 hours ago
AdinaY 
posted an update 1 day ago
view post
Post
1478
Z.ai just released a powerful lightweight option of GLM 4.7

✨ 30B total/3B active - MoE

zai-org/GLM-4.7-Flash
AdinaY 
posted an update 1 day ago
view post
Post
123
Another Chinese model fully trained on domestic chips, released by China Telecom 👀

Tele-AI/TeleChat3-36B-Thinking

TeleChat3-36B-Thinking:
✨ Native support for the Ascend + MindSpore ecosystem
✨ Inspired by DeepSeek’s architecture design, bringing training stability and efficiency gains.
  • 2 replies
·
AdinaY 
posted an update 4 days ago
view post
Post
979
After a VLM, StepFun dropped a new audio model: Step-Audio-R1.1, enabling thinking while speaking 🔥

stepfun-ai/Step-Audio-R1.1

✨ Apache 2.0
✨ Combines dual-brain architecture and acoustic-grounded reasoning to enable real-time dialogue with SOTA-level reasoning
  • 2 replies
·
AdinaY 
posted an update 5 days ago
view post
Post
1681
We have a new heatmap live on huggingface now🔥

woojun-jung/open-source-release-heatmap-ko

Korean community built their own version to track labs that actively publish open work, inspired by Chinese open source heat map!

This is the open source community at its best ♥️
  • 1 reply
·
AdinaY 
posted an update 6 days ago
view post
Post
657
More lightweight multimodal models are coming 👀

StepFun has been focused on multimodal AI from the very beginning. Their latest release a new foundational model: STEP3-VL🔥
https://huggingface.co/collections/stepfun-ai/step3-vl-10b
✨ 10B - Apache2.0
✨ Leads in the 10B class and competes with models 10–20× larger
AdinaY 
posted an update 6 days ago
view post
Post
309
Agentic capability is the new battleground🔥

LongCat-Flash-Thinking-2601, the latest reasoning model from Meituan- LongCat

✨ MoE - 560B total / 27B active
✨ MIT license
✨ Agentic tool use
✨ Multi-environment RL
✨ Parallel + iterative reasoning

meituan-longcat/LongCat-Flash-Thinking-2601
AdinaY 
posted an update 6 days ago
view post
Post
306
GLM-Image from Z.ai is out 🔥

It was fully trained on Ascend Atlas 800T A2 with MindSpore, probably the first SOTA multimodal model fully trained on domestic chips 👀

zai-org/GLM-Image

✨ Hybrid Architecture: combined autoregressive + diffusion design delivers strong semantic alignment with high-fidelity details
✨ Strong performance in long, dense, and multilingual text rendering
✨ MIT licensed (VQ tokenizer & ViT weights under Apache 2.0)
✨ Now live on Hugging Face inference provider 🤗