{"id":584854,"date":"2019-05-08T12:44:12","date_gmt":"2019-05-08T19:44:12","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=584854"},"modified":"2019-06-17T11:23:07","modified_gmt":"2019-06-17T18:23:07","slug":"reinforcement-learning-for-the-real-world-with-dr-john-langford-and-rafah-hosn","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/podcast\/reinforcement-learning-for-the-real-world-with-dr-john-langford-and-rafah-hosn\/","title":{"rendered":"Reinforcement learning for the real world with Dr. John Langford and Rafah Hosn"},"content":{"rendered":"

\"Dr.<\/a><\/p>\n

Episode 75, May 8, 2019<\/h3>\n

Dr. John Langford<\/a>, a partner researcher in the Machine Learning group at Microsoft Research New York City, is a reinforcement learning expert who is working, in his own words, to solve machine learning. Rafah Hosn<\/a>, also of MSR New York, is a principal program manager who\u2019s working to take that work to the world. If that sounds like big thinking in the Big Apple, well, New York City has always been a \u201cgo big, or go home\u201d kind of town, and MSR NYC is a \u201cgo big, or go home\u201d kind of lab.<\/p>\n

Today, Dr. Langford explains why online reinforcement learning is critical to solving machine learning and how moving from the current foundation of a Markov decision process toward a contextual bandit<\/a> future might be part of the solution. Rafah Hosn talks about why it\u2019s important, from a business perspective, to move RL agents out of simulated environments and into the open world, and gives us an under-the-hood look at the product side of MSR\u2019s \u201cresearch, incubate, transfer\u201d process, focusing on real world reinforcement learning which, at Microsoft, is now called Azure Cognitive Services Personalizer<\/a>.<\/p>\n

Related:<\/h3>\n