On the many dimensions of Dynamic Programming based Reinforcement Learning algorithms by Prof. Bruno Scherrer
Date
2020年1月21日 (火) 11:00 〜 12:00
Location
C016, Level C, Lab 1
Description
Abstract:
Starting from the standard Value and Policy Iteration, I shall describe many dimensions of Dynamic Programming algorithms for solving the Reinforcement Learning Problem. I will discuss their sensitivity to errors. I will also explain the connections to some of them to somewhat recent state-of-the-art algorithms.
Biography:
Bruno Scherrer has been a researcher at INRIA since 2004. He has contributed to the mathematical analysis of Dynamic Programming algorithms applied to Reinforcement Learning, in particular to approximation schemes.
All-OIST Category:
Intra-Group Category
Subscribe to the OIST Calendar: Right-click to download, then open in your calendar application.