Biblioteca Digital

SINGULARLY PERTURBED DISCOUNTED MARKOV CONTROL PROCESSES IN A GENERAL STATE SPACE

**Autoria(s):** Costa, Oswaldo Luiz do Valle; Dufour, F.
Contribuinte(s)	UNIVERSIDADE DE SÃO PAULO
Data(s)	05/11/2013 05/11/2013 2012
Resumo	This paper studies the asymptotic optimality of discrete-time Markov decision processes (MDPs) with general state space and action space and having weak and strong interactions. By using a similar approach as developed by Liu, Zhang, and Yin [Appl. Math. Optim., 44 (2001), pp. 105-129], the idea in this paper is to consider an MDP with general state and action spaces and to reduce the dimension of the state space by considering an averaged model. This formulation is often described by introducing a small parameter epsilon > 0 in the definition of the transition kernel, leading to a singularly perturbed Markov model with two time scales. Our objective is twofold. First it is shown that the value function of the control problem for the perturbed system converges to the value function of a limit averaged control problem as epsilon goes to zero. In the second part of the paper, it is proved that a feedback control policy for the original control problem defined by using an optimal feedback policy for the limit problem is asymptotically optimal. Our work extends existing results of the literature in the following two directions: the underlying MDP is defined on general state and action spaces and we do not impose strong conditions on the recurrence structure of the MDP such as Doeblin's condition. USP USP Brazilian National Research Council (CNPq) CNPq (Brazilian National Research Council) [301067/09-0] ARPEGE of the French National Agency of Research (ANR) [ANR-09-SEGI-004] ARPEGE of the French National Agency of Research (ANR)
Identificador	SIAM JOURNAL ON CONTROL AND OPTIMIZATION, PHILADELPHIA, v. 50, n. 2, supl. 1, Part 3, pp. 720-747, APR, 2012 0363-0129 http://www.producao.usp.br/handle/BDPI/41676 10.1137/110827466 http://dx.doi.org/10.1137/110827466
Idioma(s)	eng
Publicador	SIAM PUBLICATIONS PHILADELPHIA
Relação	SIAM JOURNAL ON CONTROL AND OPTIMIZATION
Direitos	restrictedAccess Copyright SIAM PUBLICATIONS
Palavras-Chave	#MARKOV DECISION PROCESSES #OPTIMAL CONTROL #INFINITE DISCOUNTED EXPECTED COST #OPTIMAL CONTROL #SINGULAR PERTURBATION #DECISION-PROCESSES #DISCRETE-TIME #CHAINS #AUTOMATION & CONTROL SYSTEMS #MATHEMATICS, APPLIED
Tipo	article original article publishedVersion

Acesso ao item digital