Submitted by taesiri 15 WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks Microsoft 2
Submitted by Jue Zhang 27 DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems Microsoft 4
Submitted by Xiao Liang 2 Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions Microsoft 13 2
Submitted by Chaoyun Zhang 14 GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents Microsoft 2
Submitted by Huanyu_Zhang 21 Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs Microsoft 62 1
Submitted by Jiayu Ding 1 Information-Preserving Reformulation of Reasoning Traces for Antidistillation Microsoft 2
Submitted by taesiri 12 SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs Microsoft 39 2
Submitted by Minki Kang 32 ACON: Optimizing Context Compression for Long-horizon LLM Agents Microsoft 48 2
Submitted by Pranjal A. Chitale 4 The role of synthetic data in Multilingual, Multi-cultural AI systems: Lessons from Indic Languages Microsoft 2
Submitted by Ruiyu Wang 3 CAD-Tokenizer: Towards Text-based CAD Prototyping via Modality-Specific Tokenization Microsoft 2
Submitted by Xiao Liu 8 Behind RoPE: How Does Causal Mask Encode Positional Information? Microsoft 4 2
Submitted by Eric Lan 4 Contextual Integrity in LLMs via Reasoning and Reinforcement Learning Microsoft 6 1
1 TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers' Guidance Microsoft
1 PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning Microsoft 5
Submitted by AK 259 Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Microsoft 42
Submitted by AK 32 WizardCoder: Empowering Code Large Language Models with Evol-Instruct Microsoft 9.48k 2
2 Table Meets LLM: Can Large Language Models Understand Structured Table Data? A Benchmark and Empirical Study Microsoft 7