In a Training Loop 🔄

Urro PRO

urroxyz

https://urro.xyz/

urroxyz

AI & ML interests

computational linguistics major 🤖🔎🔠 i am autistic. if i come off rude, i probably didn't mean to. please feel free to ask me for clarification.

Recent Activity

upvoted a paper 3 days ago

PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models

upvoted a paper 3 days ago

Safety Alignment as Continual Learning: Mitigating the Alignment Tax via Orthogonal Gradient Projection

updated a collection 3 days ago

WTF GENIUS PAPERS

View all activity

Organizations

commented 2 papers 3 days ago

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

Paper • 2605.19660 • Published 5 days ago • 39 •

$δ$-mem: Efficient Online Memory for Large Language Models

Paper • 2605.12357 • Published 12 days ago • 120 •

commented a paper 4 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 12 days ago • 189 •

commented a paper 8 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 11 days ago • 156 •

commented a paper 10 days ago

Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion

Paper • 2605.12825 • Published 12 days ago • 12 •

commented 6 papers 11 days ago

Reliable Chain-of-Thought via Prefix Consistency

Paper • 2605.07654 • Published 16 days ago • 1 •

Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs

Paper • 2605.12460 • Published 12 days ago • 17 •

PASA: A Principled Embedding-Space Watermarking Approach for LLM-Generated Text under Semantic-Invariant Attacks

Paper • 2605.10977 • Published 15 days ago • 10 •

LoopUS: Recasting Pretrained LLMs into Looped Latent Refinement Models

Paper • 2605.11011 • Published 14 days ago • 9 •

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published 16 days ago • 16 •

$δ$-mem: Efficient Online Memory for Large Language Models

Paper • 2605.12357 • Published 12 days ago • 120 •

New activity in blog-explorers/README 12 days ago

[Support] Community Articles

🚀🤝 2

106

#5 opened about 2 years ago by

victor

commented 2 papers 13 days ago

SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting

Paper • 2605.07243 • Published 16 days ago • 4 •

What if AI systems weren't chatbots?

Paper • 2605.07896 • Published 16 days ago • 8 •

New activity in OwnedByDanes/Usenet-Corpus-1980-2013 19 days ago

Misleading README

#4 opened 19 days ago by

urroxyz

commented a paper 23 days ago

Large Language Models Explore by Latent Distilling

Paper • 2604.24927 • Published 27 days ago • 74 •

commented a paper 29 days ago

Sapiens2

Paper • 2604.21681 • Published Apr 23 • 19 •

commented 3 papers about 1 month ago