Adaptive online gradient descent


Autoria(s): Bartlett, Peter L.; Hazan, Elad; Rakhlin, Alexander
Contribuinte(s)

Platt, J.C.

Koller, D.

Singer, Y.

Roweis, S.T.

Data(s)

2007

Resumo

We study the rates of growth of the regret in online convex optimization. First, we show that a simple extension of the algorithm of Hazan et al eliminates the need for a priori knowledge of the lower bound on the second derivatives of the observed functions. We then provide an algorithm, Adaptive Online Gradient Descent, which interpolates between the results of Zinkevich for linear functions and of Hazan et al for strongly convex functions, achieving intermediate rates between [square root T] and [log T]. Furthermore, we show strong optimality of the algorithm. Finally, we provide an extension of our results to general norms.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/81815/

Publicador

MIT Press

Relação

http://eprints.qut.edu.au/81815/1/__staffhome.qut.edu.au_staffgrouph%24_harbison_Desktop_3319-adaptive-online-gradient-descent.pdf

http://papers.nips.cc/paper/3319-adaptive-online-gradient-descent.pdf

Bartlett, Peter L., Hazan, Elad, & Rakhlin, Alexander (2007) Adaptive online gradient descent. In Platt, J.C., Koller, D., Singer, Y., & Roweis, S.T. (Eds.) Advances in Neural Information Processing Systems 20, MIT Press, Canada, pp. 65-72.

Direitos

Copyright © 2007, by the author(s). All rights reserved.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission.

Fonte

Faculty of Science and Technology; Mathematical Sciences

Palavras-Chave #080100 ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING #OAVJ
Tipo

Conference Paper