WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 136 items • Updated about 11 hours ago • 21
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning Paper • 2605.14386 • Published 3 days ago • 50
Long Context Pre-Training with Lighthouse Attention Paper • 2605.06554 • Published 10 days ago • 16
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 4 days ago • 135 • 2
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 136 items • Updated about 11 hours ago • 21
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 4 days ago • 135
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • 3 days ago • 36
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 136 items • Updated about 11 hours ago • 21
PreScam: A Benchmark for Predicting Scam Progression from Early Conversations Paper • 2605.12243 • Published 5 days ago • 1
LLM-based Detection of Manipulative Political Narratives Paper • 2605.14354 • Published 3 days ago • 2
PRISM: Prior Rectification and Uncertainty-Aware Structure Modeling for Diffusion-Based Text Image Super-Resolution Paper • 2605.13027 • Published 4 days ago • 4
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 136 items • Updated about 11 hours ago • 21
Adaptive Teacher Exposure for Self-Distillation in LLM Reasoning Paper • 2605.11458 • Published 5 days ago • 4
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid? Paper • 2605.06527 • Published 10 days ago • 37
From Pixels to Concepts: Do Segmentation Models Understand What They Segment? Paper • 2605.09591 • Published 7 days ago • 2
The Extrapolation Cliff in On-Policy Distillation of Near-Deterministic Structured Outputs Paper • 2605.08737 • Published 8 days ago • 2
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 136 items • Updated about 11 hours ago • 21
FeatCal: Feature Calibration for Post-Merging Models Paper • 2605.13030 • Published 4 days ago • 6