MSR theme: Reinforcement Learning Research

Return to Microsoft Research Lab – Montréal

Robust, adaptive, modular ML | Montréal

Projects

Theoretical foundations for Offline Reinforcement Learning

MSR contributions in the space of theoretical foundation for Offline RL Globally, MSR has made some recent advances in the space of the statistical foundations of Offline RL (opens in new tab), where a central question is to understand what…

Offline Reinforcement Learning Algorithms

In this page, we describe the algorithmic landscape of Offline RL and enumerate some algorithmic development efforts made by MSR in this space In a tutorial lecture (opens in new tab) on Offline RL (opens in new tab), we analyze its…

Offline Reinforcement Learning

This page introduces the research area of Offline Reinforcement Learning (also sometimes called Batch Reinforcement Learning). It consists in training a target policy from a fixed dataset of trajectories collected with a behavioral policy. In comparison to classic Reinforcement Learning…