Magma: A foundation model for multimodal AI Agents | Microsoft Research Forum
Jianwei Yang, Principal Researcher, Microsoft Research Redmond, introduces Magma, a new multimodal agentic foundation model designed for UI navigation in digital environments and robotics manipulation in physical settings. It covers two new techniques, Set-of-Mark and Trace-of-Mark, for action grounding and planning, and details the unified pretraining pipeline that learns agentic capabilities.
This session aired on February 25, 2025, at Microsoft Research Forum, Episode 5.
Register for the series: https://aka.ms/registerresearchforumYTe5 (opens in new tab)
Continue watching episode 5: https://aka.ms/researchforumYTe5 (opens in new tab)
Explore all previous episodes: https://aka.ms/researchforumYTplaylist (opens in new tab)
- Series:
- Microsoft Research Forum
- Date:
- Speakers:
- Jianwei Yang
- Affiliation:
- Microsoft Research Redmond
Series: Microsoft Research Forum
-
Using LLMs for safe low-level programming | Microsoft Research Forum
Speakers:- Aseem Rastogi,
- Pantazis Deligiannis
-
-
Belief state transformers | Microsoft Research Forum
Speakers:- John Langford
-
Magma: A foundation model for multimodal AI Agents | Microsoft Research Forum
Speakers:- Jianwei Yang
-
Chimera: Accurate synthesis prediction by ensembling models with... | Microsoft Research Forum
Speakers:- Marwin Segler
-
-
Keynote: Multimodal Generative AI for Precision Health | Microsoft Research Forum
Speakers:- Hoifung Poon
-
-
-
-
-
-
-
-
AutoGen Update: Complex Tasks and Agents
Speakers:- Adam Fourney
-
MatterGen: A Generative Model for Materials Design
Speakers:- Tian Xie
-
-
Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
Speakers:- Daniela Massiceti
-
Research Forum 3 | Panel: Generative AI for Global Impact: Challenges and Opportunities
Speakers:- Jacki O'Neill,
- Tanuja Ganu,
- Sunayana Sitaram
-
-
-
-
-
-
What's new in AutoGen?
Speakers:- Chi Wang