Open problem : adversarial multiarmed bandits with limited advice
Date(s) | 2013 |
---|---|
Abstract | Adversarial multiarmed bandits with expert advice is one of the fundamental problems in studying the exploration-exploitation trade-off. It is known that if we observe the advice of all experts on every round, we can achieve O(√(KT ln N)) regret, where K is the number of arms, T is the number of game rounds, and N is the number of experts. It is also known that if we observe the advice of just one expert on every round, we can achieve regret of order O(√(NT)). Our open problem is what can be achieved by asking M experts on every round, where 1 < M < N. |
Format | application/pdf |
Identifier | |
Relation | http://eprints.qut.edu.au/70844/1/70844.pdf http://jmlr.org/proceedings/papers/v30/Seldin13.pdf Seldin, Yevgeny, Bartlett, Peter L., & Crammer, Koby (2013) Open problem : adversarial multiarmed bandits with limited advice. In Conference on Learning Theory (COLT 2013), June 12-14, 2013, Princeton, NJ. http://purl.org/au-research/grants/ARC/FL110100281 |
Rights | Copyright 2013 The Author(s) |
Source | Science & Engineering Faculty |
Type | Conference Paper |