Optimal strategies and minimax lower bounds for online convex games


Autoria(s): Abernethy, Jacob D.; Bartlett, Peter L.; Rakhlin, Alexander; Tewari, Ambuj
Data(s)

22/02/2008

Resumo

A number of learning problems can be cast as an Online Convex Game: on each round, a learner makes a prediction x from a convex set, the environment plays a loss function f, and the learner’s long-term goal is to minimize regret. Algorithms have been proposed by Zinkevich, when f is assumed to be convex, and Hazan et al., when f is assumed to be strongly convex, that have provably low regret. We consider these two settings and analyze such games from a minimax perspective, proving minimax strategies and lower bounds in each case. These results prove that the existing algorithms are essentially optimal.

Identificador

http://eprints.qut.edu.au/44006/

Publicador

University of California, Berkeley

Relação

http://www.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-19.pdf

Abernethy, Jacob D., Bartlett, Peter L., Rakhlin, Alexander, & Tewari, Ambuj (2008) Optimal strategies and minimax lower bounds for online convex games. Technical Report, UCB/EECS-2008-19. University of California, Berkeley, Berkeley, California (USA).

Direitos

Copyright © 2008, by the author(s). All rights reserved.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission.

Fonte

Faculty of Science and Technology; Mathematical Sciences

Palavras-Chave #080600 INFORMATION SYSTEMS #OAVJ
Tipo

Report