Open problem : adversarial multiarmed bandits with limited advice


Autoria(s): Seldin, Yevgeny; Bartlett, Peter L.; Crammer, Koby
Data(s)

2013

Resumo

Adversarial multiarmed bandits with expert advice is one of the fundamental problems in studying the exploration-exploitation trade-o. It is known that if we observe the advice of all experts on every round we can achieve O(√KTlnN) regret, where K is the number of arms, T is the number of game rounds, and N is the number of experts. It is also known that if we observe the advice of just one expert on every round, we can achieve regret of order O(√NT). Our open problem is what can be achieved by asking M experts on every round, where 1 < M < N.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/70844/

Relação

http://eprints.qut.edu.au/70844/1/70844.pdf

http://jmlr.org/proceedings/papers/v30/Seldin13.pdf

Seldin, Yevgeny, Bartlett, Peter L., & Crammer, Koby (2013) Open problem : adversarial multiarmed bandits with limited advice. In Conference on Learning Theory (COLT 2013), June 12-14, 2013, Princeton, NJ.

http://purl.org/au-research/grants/ARC/FL110100281

Direitos

Copyright 2013 The Author(s)

Fonte

Science & Engineering Faculty

Tipo

Conference Paper