{"id":1123770,"date":"2025-02-25T11:31:20","date_gmt":"2025-02-25T19:31:20","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-blog-post&p=1123770"},"modified":"2025-02-25T15:06:54","modified_gmt":"2025-02-25T23:06:54","slug":"magma-a-foundation-model-for-multimodal-ai-agents","status":"publish","type":"msr-blog-post","link":"https:\/\/www.microsoft.com\/en-us\/research\/articles\/magma-a-foundation-model-for-multimodal-ai-agents\/","title":{"rendered":"Magma: A foundation model for multimodal AI agents"},"content":{"rendered":"\n

Presented by Jianwei Yang<\/a> at Microsoft Research Forum, February 2025<\/strong><\/em><\/p>\n\n\n\n

\"Jianwei<\/figure>
\n
\n

\u201cIn this project we developed the first agentic foundation model, Magma, that can understand multimodal input and also take action in both digital and physical environments.”<\/p>\n\u2013<\/em> Jianwei Yang, Principal Researcher, Microsoft Research Redmond<\/cite><\/blockquote>\n<\/div><\/div>\n\n\n\n

\n