CS 269S, Winter 2019: Theory Seminar
Bren Hall, Room 1423, 1pm

Februaru 1, 2019:

Recharging Bandits

James Liu

We introduce a general model of bandit problems in which the expected payout of an arm is an increasing concave function of the time since it was last played. We first develop a PTAS for the underlying optimization problem of determining a reward-maximizing sequence of arm pulls. We then show how to use this PTAS in a learning setting to obtain sublinear regret.

(Based on a paper by Immorlica and Kleinberg at FOCS 2018) http://ieee-focs.org/FOCS-2018-Papers/pdfs/59f309.pdf

CS 269S, Winter 2019: Theory Seminar Bren Hall, Room 1423, 1pm

Februaru 1, 2019:

Recharging Bandits

James Liu

CS 269S, Winter 2019: Theory Seminar
Bren Hall, Room 1423, 1pm