AI and Microsoft Research header - abstract neural network pattern on dark spectrum background

AI Frontiers

AI Frontiers blog

Articles

AutoGen v0.4: Reimagining the foundation of agentic AI for scale, extensibility, and robustness

February 25, 2025

Gagan Bansal introduces a transformative update to the AutoGen framework that builds on user feedback and redefines modularity, stability, and flexibility to empower the next generation of agentic AI research and applications.

Research Forum Episode 5 | John Langford

Articles

Belief state transformers

February 25, 2025

John Langford talks about a new transformer architecture that generates compact belief states for goal-conditioned planning, enhancing planning algorithms’ efficiency and effectiveness.

Articles

OmniParser V2: Turning Any LLM into a Computer Use Agent

February 12, 2025

Yadong Lu, Senior Researcher; Thomas Dhome-Casanova, Software Engineer; Jianwei Yang, Principal Researcher; Ahmed Awadallah, Partner Research Manager Graphic User interface (GUI) automation requires agents with the ability to understand and interact with user screens. However, using general purpose LLM models…

An abstract image of the Magentic-One multi-agent team shown as a hierarchy of agents with the top node showing a gear icon to represent the Orchestrator agent, and four leaf nodes with icons representing the Coder, Computer Terminal, Web Surfer, FIle Surfer agents, respectively. The image shows a title with the text

Articles

Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks

November 4, 2024

By Adam Fourney, Principal Researcher; Gagan Bansal, Senior Researcher; Hussein Mozannar, Senior Researcher; Victor Dibia, Principal Research Software Engineer; Saleema Amershi, Partner Research Manager Contributors: Adam Fourney, Gagan Bansal, Hussein Mozannar, Cheng Tan, Eduardo Salinas, Erkang (Eric) Zhu, Friederike Niedtner,…

Articles

OmniParser for pure vision-based GUI agent

October 8, 2024

By Yadong Lu, Senior Researcher; Jianwei Yang, Principal Researcher; Yelong Shen, Principal Research Manager; Ahmed Awadallah, Partner Research Manager Recent advancements in large vision-language models (VLMs), such as GPT-4V and GPT-4o, have demonstrated considerable promise in driving intelligent agent systems…

Research Forum | Episode 4 Talk 2 | Corby Rosset

Articles

Direct Nash Optimization: Teaching language models to self-improve with general preferences

September 3, 2024

This talk discusses teaching language models to self-improve using a preference oracle like GPT-4, framing it as a two-player game to find an optimal policy at a Nash equilibrium, and achieving state-of-the-art win rates against GPT-4 Turbo on benchmarks such…

Microsoft Research Forum | Episode 3 | Adam Fourney

Articles

AutoGen Update: Complex Tasks and Agents

June 4, 2024

Adam Fourney discusses the effectiveness of using multiple agents, working together, to complete complex multi-step tasks. He will showcase their capability to outperform previous single-agent solutions on benchmarks like GAIA, utilizing customizable arrangements of agents that collaborate, reason, and utilize…

Research Forum January 2024 - Besmira Nushi

Articles

Evaluation and Understanding of Foundation Models

January 30, 2024

Besmira Nushi summarizes timely challenges and ongoing work on evaluating and in-depth understanding of large foundation models as well as agent platforms built upon such models at the Microsoft Research Forum.

Research Forum January 2024 - Dipendra Misra

Articles

Improving Reasoning in Language Models with LASER: Layer-Selective Rank Reduction

January 30, 2024

Dipendra Misra, Senior Researcher at Microsoft Research New York City and AI Frontiers lightning talk presentation at the Microsoft Research Forum.