Fast rates for the multi-armed bandit

Published on Oct 06, 20142229 Views

Sébastien Bubeck

Since the seminal work of Lai and Robbins (1985) we know bandit strategies with normalized regret of order (i) 1/sqrt(T) for any stochastic bandit, and (ii) log(T) / T for 'benign' distributions. In B

NIPS Workshops 2013 - Lake Tahoe

Related categories

Text Mining

Fast rates for the multi-armed bandit

Sébastien Bubeck

NIPS Workshops 2013 - Lake Tahoe

Related categories

VIDEOLECTURES

LEGAL