{"id":1171602,"date":"2026-05-14T10:05:55","date_gmt":"2026-05-14T17:05:55","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-video&p=1171602"},"modified":"2026-05-14T10:05:57","modified_gmt":"2026-05-14T17:05:57","slug":"new-fine-tuning-of-language-models-match-meaning-not-tokens","status":"publish","type":"msr-video","link":"https:\/\/www.microsoft.com\/en-us\/research\/video\/new-fine-tuning-of-language-models-match-meaning-not-tokens\/","title":{"rendered":"New fine-tuning of language models: Match meaning, not tokens"},"content":{"rendered":"\n

Language models are usually trained to predict the next word, but that does not always lead to the best overall answers. We introduce energy-based fine-tuning, a new method that trains models to produce better full responses, leading to stronger results without the need for complex reward models or verifiers.<\/p>\n\n\n\n

Explore more<\/h3>\n\n\n\n