Shipra Agrawal (Columbia University)
https://simons.berkeley.edu/talks/sto...
Data-Driven Decision Processes Boot Camp
This talk covers stochastic bandits, a fundamental model for sequential learning in which the rewards of each action are drawn independently and identically from a fixed distribution. We will present the two main algorithms for this setting, Upper Confidence Bound and Thompson Sampling, and then discuss how they can be adapted to incorporate various additional constraints.
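As a rough illustration of the two algorithms named above, here is a minimal sketch on a Bernoulli bandit. It is not taken from the talk. The function names `ucb1` and `thompson`, the Bernoulli reward model, the exploration bonus sqrt(2 ln t / n) (the common UCB1 form), and the uniform Beta(1, 1) prior are all assumptions chosen for the example.

```python
import math
import random


def ucb1(means, horizon, rng):
    """Run UCB1 on Bernoulli arms with hidden success rates `means`.

    Returns the number of times each arm was pulled.
    """
    k = len(means)
    counts = [0] * k   # pulls per arm
    sums = [0.0] * k   # total reward per arm
    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1  # pull every arm once to initialise the estimates
        else:
            # Pick the arm with the largest upper confidence bound:
            # empirical mean plus an exploration bonus (assumed UCB1 form).
            arm = max(
                range(k),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2 * math.log(t) / counts[i]),
            )
        reward = 1.0 if rng.random() < means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
    return counts


def thompson(means, horizon, rng):
    """Run Thompson Sampling with a Beta(1, 1) prior on each Bernoulli arm.

    Returns the number of times each arm was pulled.
    """
    k = len(means)
    wins = [0] * k
    losses = [0] * k
    for _ in range(horizon):
        # Sample a plausible mean for each arm from its Beta posterior,
        # then play the arm whose sample is largest.
        samples = [rng.betavariate(wins[i] + 1, losses[i] + 1) for i in range(k)]
        arm = max(range(k), key=lambda i: samples[i])
        if rng.random() < means[arm]:
            wins[arm] += 1
        else:
            losses[arm] += 1
    return [wins[i] + losses[i] for i in range(k)]


if __name__ == "__main__":
    arm_means = [0.1, 0.5, 0.9]  # hypothetical arms, unknown to the learner
    print("UCB1 pulls:    ", ucb1(arm_means, 5000, random.Random(0)))
    print("Thompson pulls:", thompson(arm_means, 5000, random.Random(0)))
```

Both routines should concentrate most of their pulls on the arm with the highest mean as the horizon grows. UCB1 does so through an explicit optimism bonus, while Thompson Sampling does so by randomising over the posterior.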
Video: Stochastic Bandits: Foundations and Current Perspectives (uploaded by Simons Institute).