{"id":1011564,"date":"2024-03-08T16:53:36","date_gmt":"2024-03-09T00:53:36","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-blog-post&p=1011564"},"modified":"2024-03-11T06:12:35","modified_gmt":"2024-03-11T13:12:35","slug":"getting-modular-with-language-models-building-and-reusing-a-library-of-experts-for-task-generalization","status":"publish","type":"msr-blog-post","link":"https:\/\/www.microsoft.com\/en-us\/research\/articles\/getting-modular-with-language-models-building-and-reusing-a-library-of-experts-for-task-generalization\/","title":{"rendered":"Getting Modular with Language Models: Building and Reusing a Library of Experts for Task Generalization"},"content":{"rendered":"\n

Presented by Alessandro Sordoni<\/a> at Microsoft Research Forum, March 2024<\/strong><\/em><\/p>\n\n\n\n

\"Alessandro<\/figure>
\n
\n

\u201cWe have witnessed basically the wide adoption of large language models, such as GPT-4, that have very broad capabilities and can be used to solve a variety of tasks. But they are, kind of, expensive to serve, and somehow, we can actually think and ask ourselves, are they really necessary for most tasks that users \u2026 might need?\u201d<\/p>\n\u2013<\/em> Alessandro Sordoni, Principal Researcher, Microsoft Research Montreal<\/cite><\/blockquote>\n<\/div><\/div>\n\n\n\n

\n