Microsoft

company

Verified

https://www.microsoft.com/en-us/research/

AI & ML interests

None defined yet.

Recent Activity

shaipeerms new activity about 16 hours ago

microsoft/NOTSOFAR:Update Data License section to CC BY 4.0

igorab-msft new activity 3 days ago

microsoft/NOTSOFAR:Update Data License section to CC BY 4.0

Tej-a55 submitted a paper 4 days ago

A Benchmark and Framework for Evaluating Next Action Predictions in Spreadsheets

View all activity

Papers

FastContext: Training Efficient Repository Explorer for Coding Agents

A Benchmark and Framework for Evaluating Next Action Predictions in Spreadsheets

View all Papers

Articles

Differential Transformer V2

Introducing OptiMind, a research model designed for optimization

microsoft 's papers 54

Submitted by

Shaoqiu Zhang

FastContext: Training Efficient Repository Explorer for Coding Agents

microsoft

Submitted by

Tejas Agrawal

A Benchmark and Framework for Evaluating Next Action Predictions in Spreadsheets

microsoft

Submitted by

Wanli Li

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

microsoft

Submitted by

HAO BAI

AsyncWebRL: Efficient Multi-Step RL for Visual Web Agents

microsoft

Submitted by

Rui Yang

OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents

microsoft

Beyond Static Dialogues: Benchmarking Realistic, Heterogeneous, and Evolving Long-Term Memory

microsoft

Submitted by

Jinjing Zhao

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

microsoft

Submitted by

Miaosen Zhang

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

microsoft

Submitted by

Baolin Peng

Synthetic Computers at Scale for Long-Horizon Productivity Simulation

microsoft

Submitted by

Sebastian Ehlert

Accurate and scalable exchange-correlation with deep learning

microsoft

Submitted by

tan

Contrastive Attribution in the Wild: An Interpretability Analysis of LLM Failures on Realistic Benchmarks

microsoft

Submitted by

Aditya Kanade

Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization

microsoft

2

Submitted by

ZHOU

AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation

microsoft

Submitted by

Haozhe Qi

AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding

microsoft

Submitted by

Kevin Qu

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

microsoft

Submitted by

Furu Wei

VIBEVOICE-ASR Technical Report

microsoft

Submitted by

taesiri

Phi-4-reasoning-vision-15B Technical Report

microsoft

Submitted by

JeonghyeKim

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

microsoft

3

Submitted by

Taiwei Shi

Experiential Reinforcement Learning

microsoft

Submitted by

Sayan Deb Sarkar

CoPE-VideoLM: Codec Primitives For Efficient Video Language Models

microsoft

Submitted by

Zijie Chen

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

microsoft

2

Submitted by

Mingqian Feng

SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks

microsoft

Submitted by

Jialiang Zhu

RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

microsoft

Submitted by

Xiao Liu

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

microsoft

3

Submitted by

Minh-Quan Le

PISCES: Annotation-free Text-to-Video Post-Training via Optimal Transport-Aligned Rewards

microsoft

Submitted by

Mingqian Feng

Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

microsoft

Submitted by

Tianyi Chen

CUA-Skill: Develop Skills for Computer Using Agent

microsoft

Submitted by

taesiri

WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

microsoft

Sigma-Moe-Tiny Technical Report

microsoft

Native and Compact Structured Latents for 3D Generation

microsoft

SIGMA: An AI-Empowered Training Stack on Early-Life Hardware

microsoft

Submitted by

Jue Zhang

DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems

microsoft

Submitted by

Xiao Liang

Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions

microsoft

Submitted by

taesiri

Fara-7B: An Efficient Agentic Model for Computer Use

microsoft

Submitted by

Chaoyun Zhang

UFO^3: Weaving the Digital Agent Galaxy

microsoft

Submitted by

Chaoyun Zhang

GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents

microsoft

Submitted by

Huanyu_Zhang

Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs

microsoft

Submitted by

Jannis Vamvas

QueST: Incentivizing LLMs to Generate Difficult Problems

microsoft

3

Submitted by

Shun Zheng

Deep Self-Evolving Reasoning

microsoft

2

Submitted by

Jiayu Ding

Information-Preserving Reformulation of Reasoning Traces for Antidistillation

microsoft

Submitted by

taesiri

SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

microsoft

Submitted by

taesiri

Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness

microsoft

3

Submitted by

Minki Kang

ACON: Optimizing Context Compression for Long-horizon LLM Agents

microsoft

Submitted by

Miaosen Zhang

InfoAgent: Advancing Autonomous Information-Seeking Agents

microsoft

2

Submitted by

Pranjal A. Chitale

The role of synthetic data in Multilingual, Multi-cultural AI systems: Lessons from Indic Languages

microsoft

2

Submitted by

Ruiyu Wang

CAD-Tokenizer: Towards Text-based CAD Prototyping via Modality-Specific Tokenization

microsoft

3

Submitted by

Xiao Liu

Behind RoPE: How Does Causal Mask Encode Positional Information?

microsoft

Accurate Chemistry Collection: Coupled cluster atomization energies for broad chemical space

microsoft

Submitted by

Eric Lan

Contextual Integrity in LLMs via Reasoning and Reinforcement Learning

microsoft

TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers' Guidance

microsoft

PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning

microsoft

Submitted by

AK

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

microsoft

Submitted by

AK

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

microsoft

Table Meets LLM: Can Large Language Models Understand Structured Table Data? A Benchmark and Empirical Study

microsoft