Tea Time Talks 2024: Alireza Kazemipour, Optimism and Mon-MDPs

Published: 27 September 2024
on channel: Amii

Tea Time Talks are back for another year. This summer lecture series, presented by Amii and the RLAI Lab at the University of Alberta, give researchers the chance to discuss early-stage ideas and prospective research. Join us for another series of informal 20-minute talks where AI leaders discuss the future of machine learning research.

Abstract:
Arguably, the principle of Optimism in the Face of Uncertainty (OFU) was the primary driving force behind the development of many sample efficient exploration algorithms for reinforcement learning throughout the whole 2000s. However, works such as The End of Optimism in 2016 showed inefficiencies of optimism-based exploration methods obscured in their worst-case regret guarantees, and pointed toward more theoretically sound exploration principles such as Information-Directed Sampling as alternatives.

Nevertheless, in this talk I intend to present that under Mon-MDP formulation, OFU is not effective let alone efficient and I will try to spend time explaining what challenge OFU faces once applied to Mon-MDPs.

Watch video Tea Time Talks 2024: Alireza Kazemipour, Optimism and Mon-MDPs online without registration, duration hours minute second in high quality. This video was added by user Amii 27 September 2024, don't forget to share it with your friends and acquaintances, it has been viewed on our site 35 once and liked it 0 people.

00:00:00