Microsoft Research Blog
Finding the best learning targets automatically: Fully Parameterized Quantile Function for distributional RL
Li Zhao
Reinforcement learning has achieved great success in game scenarios, with RL agents beating human competitors in such games as Go and poker. Distributional reinforcement learning, in particular, has proven to be an effective approach for training an agent to maximize…