Average control of Markov decision processes with Feller transition probabilities and general action spaces


Autoria(s): Costa, Oswaldo Luiz do Valle; Dufour, F.
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

01/11/2013

01/11/2013

02/08/2013

Resumo

This paper studies the average control problem of discrete-time Markov Decision Processes (MDPs for short) with general state space, Feller transition probabilities, and possibly non-compact control constraint sets A(x). Two hypotheses are considered: either the cost function c is strictly unbounded or the multifunctions A(r)(x) = {a is an element of A(x) : c(x, a) <= r} are upper-semicontinuous and compact-valued for each real r. For these two cases we provide new results for the existence of a solution to the average-cost optimality equality and inequality using the vanishing discount approach. We also study the convergence of the policy iteration approach under these conditions. It should be pointed out that we do not make any assumptions regarding the convergence and the continuity of the limit function generated by the sequence of relative difference of the alpha-discounted value functions and the Poisson equations as often encountered in the literature. (C) 2012 Elsevier Inc. All rights reserved.

USP project MaCLinC

USP project MaCLinC

CNPq (Brazilian National Research Council) [301067/09-0]

Brazilian National Research Council (CNPq)

French National Agency of Research (ANR)

French National Agency of Research (ANR) [ANR-09-SEGI-004]

Identificador

JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, SAN DIEGO, v. 396, n. 1, supl. 1, Part 3, pp. 58-69, DEC 1, 2012

0022-247X

http://www.producao.usp.br/handle/BDPI/37182

10.1016/j.jmaa.2012.05.073

http://dx.doi.org/10.1016/j.jmaa.2012.05.073

Idioma(s)

eng

Publicador

ACADEMIC PRESS INC ELSEVIER SCIENCE

SAN DIEGO

Relação

JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS

Direitos

restrictedAccess

Copyright ACADEMIC PRESS INC ELSEVIER SCIENCE

Palavras-Chave #MARKOV DECISION PROCESSES #AVERAGE COST #GENERAL BOREL SPACES #FELLER TRANSITION PROBABILITIES #NON-COMPACT ACTION SET #POLICY ITERATION #BOREL SPACES #UNBOUNDED COSTS #STATE-SPACE #OPTIMALITY EQUATION #POLICY ITERATION #CRITERIA #MATHEMATICS, APPLIED #MATHEMATICS
Tipo

article

original article

publishedVersion