dooho lee

BlueYellowGreen

https://leedooho.com

BlueYellowGreen

AI & ML interests

None yet

Recent Activity

upvoted an article about 18 hours ago

🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs

upvoted a paper about 18 hours ago

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

upvoted a paper about 18 hours ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

View all activity

Organizations

None yet

upvoted an article about 18 hours ago

Article

🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs

1 day ago

•

upvoted 4 papers about 18 hours ago

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Paper • 2602.08676 • Published 3 days ago • 58

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published 8 days ago • 295

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 1 day ago • 22

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published 1 day ago • 147

upvoted 5 articles 1 day ago

Article

Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp

14 days ago

•

Article

Fine-Tuning FunctionGemma on TPU to Create a Virtual Fitness Coach in 10 Minutes, $0.50

11 days ago

•

Article

CRAFT: Continuous Reasoning and Agentic Feedback Tuning

8 days ago

•

Article

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

5 days ago

•

Article

Training Qwen3 VL to label bbox : synthetic data, environment and training analysis

3 days ago

•

upvoted 4 papers 2 days ago

upvoted 4 papers 10 days ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published 14 days ago • 26

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published 15 days ago • 42

Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives

Paper • 2601.20833 • Published 15 days ago • 175

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 13 days ago • 96

upvoted 2 papers 14 days ago

Qwen3-ASR Technical Report

Paper • 2601.21337 • Published 15 days ago • 34

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

Paper • 2601.14243 • Published 23 days ago • 21

dooho lee

AI & ML interests

Recent Activity

Organizations

BlueYellowGreen's activity

🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs

Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp

Fine-Tuning FunctionGemma on TPU to Create a Virtual Fitness Coach in 10 Minutes, $0.50

CRAFT: Continuous Reasoning and Agentic Feedback Tuning

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

Training Qwen3 VL to label bbox : synthetic data, environment and training analysis