AI & ML interests

None yet

Recent Activity

liked 2 models 1 day ago

MiniT2I/MiniT2I

Text-to-Image • Updated 3 days ago • 45 • 4

BiliSakura/JiT-diffusers

Unconditional Image Generation • Updated 21 days ago • 1

upvoted a paper 1 day ago

RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

Paper • 2606.14700 • Published 8 days ago • 14

liked a model 21 days ago

nyu-visionx/RAEv2-models

Updated May 18 • 3

upvoted 2 papers about 1 month ago

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 114

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Paper • 2605.06747 • Published May 7 • 52

upvoted a paper about 2 months ago

Efficient Training on Multiple Consumer GPUs with RoundPipe

Paper • 2604.27085 • Published Apr 29 • 47

liked a Space about 2 months ago

The Smol Training Playbook

Space • 3.21k • The secrets to building world-class LLMs

liked a model about 2 months ago

deepseek-ai/DeepSeek-V4-Pro-Base

1.6T • Updated Apr 27 • 20.3k • 298

upvoted 2 papers 3 months ago

One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

Paper • 2512.07829 • Published Dec 8, 2025 • 25

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20, 2025 • 166

upvoted an article 4 months ago

Visualize and understand GPU memory in PyTorch

Article • qgallouedec • Dec 24, 2024 • 273

upvoted a paper 5 months ago

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published Jan 12 • 53

upvoted 2 articles 6 months ago

Mixture of Experts (MoE) Explained

Article • osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 86

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • ariG23498, merve, pcuenq, reach-vb • Mar 12, 2025 • 497

liked a model 7 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • 1.1T • Updated Jan 30 • 159k • 1.7k

GarvinRay
