8 21

Yeha Kim PRO

yeha

yeha-777

AI & ML interests

Causality, Trustworthy AI, MAS, Multi-Modal Generative Models, etc.

Recent Activity

upvoted an article 10 days ago

Welcome Gemma 4: Frontier multimodal intelligence on device

upvoted a collection 10 days ago

Cosmos-Predict2.5

liked a model 10 days ago

google/gemma-4-31B-it

View all activity

Organizations

upvoted an article 10 days ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

16 days ago

•

854

upvoted a collection 10 days ago

Cosmos-Predict2.5

Collection

Improved World Simulation with Video Foundation Models for Physical AI • 2 items • Updated 2 days ago • 20

liked a model 10 days ago

google/gemma-4-31B-it

Image-Text-to-Text • 33B • Updated 7 days ago • 3.51M • • 2k

upvoted a paper 10 days ago

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Paper • 2509.19296 • Published Sep 23, 2025 • 27

liked a model 29 days ago

robbyant/lingbot-world-base-act-preview

Image-to-Video • Updated Mar 5 • 16

liked a dataset about 1 month ago

ropedia-ai/xperience-10m

Updated 28 days ago • 2.32M • 169

upvoted a paper about 1 month ago

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published Mar 4 • 186

liked 2 models about 2 months ago

zhuhz22/Causal-Forcing

Text-to-Video • Updated Feb 7 • 7

nvidia/DreamDojo

Updated Feb 23 • 42 • 32

liked 2 models 2 months ago

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated Mar 15 • 739k • • 1.46k

robbyant/lingbot-world-base-cam

Image-to-Video • Updated Feb 2 • 330

upvoted a collection 2 months ago

LingBot-World

Collection

3 items • Updated 15 days ago • 37

liked a dataset 2 months ago

phyworldbench/phyworldbench

Viewer • Updated May 16, 2025 • 350 • 11 • 4

liked a Space 3 months ago

Qwen3-TTS Demo

🎙

1.88k

Generate speech audio from text with custom or cloned voices

liked a model 4 months ago

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14, 2025 • 67.6k • 480

upvoted a paper 5 months ago

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

Paper • 2511.20714 • Published Nov 25, 2025 • 50

liked a Space 5 months ago

FLUX.2 [dev]

💻

833

Generate or edit images from text prompts with optional pictures

liked 2 models 5 months ago

MachineDelusions/Qwen-Edit-Loras

Updated Nov 22, 2025 • 36

facebook/sam3

Mask Generation • 0.9B • Updated Nov 20, 2025 • 2.09M • 1.89k

upvoted a collection 5 months ago

PS3: Scaling Vision Pre-Training to 4K Resolution

Collection

Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/ • 15 items • Updated 2 days ago • 9

Yeha Kim PRO

AI & ML interests

Recent Activity

Organizations

yeha's activity

Welcome Gemma 4: Frontier multimodal intelligence on device

Qwen3-TTS Demo

FLUX.2 [dev]