Christina Lee Yu (Cornell University) & Sean Sinclair (Cornell University)
https://simons.berkeley.edu/talks/onl...
Data-Driven Decision Processes Boot Camp
This tutorial will focus on the online learning perspective on reinforcement learning, in which the model is unknown and one incurs regret for the actions selected during the learning process itself. Building on the preceding talks, as well as yesterday's tutorials on multi-armed bandits, we will focus on the challenges that Markovian dynamics introduce into the analysis of regret. We will also discuss the interaction between learning and function approximation, the role of structure, and existing challenges and open problems.
Video: Online Reinforcement Learning and Regret, uploaded by Simons Institute.