{"id":1160129,"date":"2026-01-20T09:00:00","date_gmt":"2026-01-20T17:00:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=1160129"},"modified":"2026-03-18T18:07:06","modified_gmt":"2026-03-19T01:07:06","slug":"multimodal-reinforcement-learning-with-agentic-verifier-for-ai-agents","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/multimodal-reinforcement-learning-with-agentic-verifier-for-ai-agents\/","title":{"rendered":"Argos: Multimodal reinforcement learning with agentic verifier for AI agents"},"content":{"rendered":"\n
\"Diagram<\/figure>\n\n\n\n
At a glance<\/h2>\n\n\n\n