Restless bandits, partial conservation laws and indexability


Autoria(s): Niño-Mora, José
Contribuinte(s)

Universitat Pompeu Fabra. Departament d'Economia i Empresa

Data(s)

15/09/2005

Resumo

We show that if performance measures in a stochastic scheduling problem satisfy a set of so-called partial conservation laws (PCL), which extend previously studied generalized conservation laws (GCL), then the problem is solved optimally by a priority-index policy for an appropriate range of linear performance objectives, where the optimal indices are computed by a one-pass adaptive-greedy algorithm, based on Klimov's. We further apply this framework to investigate the indexability property of restless bandits introduced by Whittle, obtaining the following results: (1) we identify a class of restless bandits (PCL-indexable) which are indexable; membership in this class is tested through a single run of the adaptive-greedy algorithm, which also computes the Whittle indices when the test is positive; this provides a tractable sufficient condition for indexability; (2) we further indentify the class of GCL-indexable bandits, which includes classical bandits, having the property that they are indexable under any linear reward objective. The analysis is based on the so-called achievable region method, as the results follow fromnew linear programming formulations for the problems investigated.

Identificador

http://hdl.handle.net/10230/629

Idioma(s)

eng

Direitos

L'accés als continguts d'aquest document queda condicionat a l'acceptació de les condicions d'ús establertes per la següent llicència Creative Commons

info:eu-repo/semantics/openAccess

<a href="http://creativecommons.org/licenses/by-nc-nd/3.0/es/">http://creativecommons.org/licenses/by-nc-nd/3.0/es/</a>

Palavras-Chave #Operations Management #stochastic scheduling #markov decision chains #bandit problems #achievable region
Tipo

info:eu-repo/semantics/workingPaper