Stochastic Bandits: Foundations and Current Perspectives

Published: 01 January 1970
Channel: Simons Institute


Shipra Agrawal (Columbia University)
https://simons.berkeley.edu/talks/sto...
Data-Driven Decision Processes Boot Camp

This talk will focus on stochastic bandits, a fundamental model for sequential learning in which the rewards of each action are drawn independently and identically (i.i.d.) from a fixed distribution. We will cover the two main algorithms, Upper Confidence Bound (UCB) and Thompson Sampling, and then discuss how they can be adapted to incorporate various additional constraints.
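
For readers new to the two algorithms named in the abstract, here is a minimal Python sketch. It is not taken from the talk: it assumes Bernoulli rewards, uses the textbook UCB1 index (empirical mean plus sqrt(2 ln t / n)), and uses Thompson Sampling with uniform Beta(1, 1) priors. The function names `ucb1` and `thompson_sampling` and the arm means in the usage line are hypothetical choices for illustration.

```python
import math
import random


def ucb1(means, horizon, rng):
    """Run UCB1 on Bernoulli arms (assumed reward model); return pulls per arm."""
    k = len(means)
    counts = [0] * k   # number of times each arm was pulled
    sums = [0.0] * k   # total reward collected from each arm
    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1  # initialization: pull every arm once
        else:
            # Pick the arm with the largest upper confidence bound.
            arm = max(
                range(k),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2.0 * math.log(t) / counts[i]),
            )
        reward = 1.0 if rng.random() < means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
    return counts


def thompson_sampling(means, horizon, rng):
    """Run Beta-Bernoulli Thompson Sampling; return pulls per arm."""
    k = len(means)
    successes = [0] * k
    failures = [0] * k
    for _ in range(horizon):
        # Sample a plausible mean for each arm from its Beta posterior
        # (Beta(1, 1) prior assumed) and play the arm with the largest sample.
        arm = max(
            range(k),
            key=lambda i: rng.betavariate(successes[i] + 1, failures[i] + 1),
        )
        if rng.random() < means[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return [s + f for s, f in zip(successes, failures)]


if __name__ == "__main__":
    arm_means = [0.1, 0.5, 0.9]  # hypothetical true means, unknown to the learner
    print("UCB1 pulls:", ucb1(arm_means, 5000, random.Random(0)))
    print("TS pulls:  ", thompson_sampling(arm_means, 5000, random.Random(0)))
```

In both cases the pull counts should concentrate on the arm with the highest mean as the horizon grows. UCB1 does this through an explicit optimism bonus, while Thompson Sampling does it through posterior randomization.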
