Biblioteca Digital

The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes

**Autoria(s):** Costa, Oswaldo Luiz do Valle; DUFOUR, F.
Contribuinte(s)	UNIVERSIDADE DE SÃO PAULO
Data(s)	18/10/2012 18/10/2012 2010
Resumo	The main goal of this paper is to apply the so-called policy iteration algorithm (PIA) for the long run average continuous control problem of piecewise deterministic Markov processes (PDMP`s) taking values in a general Borel space and with compact action space depending on the state variable. In order to do that we first derive some important properties for a pseudo-Poisson equation associated to the problem. In the sequence it is shown that the convergence of the PIA to a solution satisfying the optimality equation holds under some classical hypotheses and that this optimal solution yields to an optimal control strategy for the average control problem for the continuous-time PDMP in a feedback form. CNPq (Brazilian National Research Council)[301067/09-0] French National Agency of Research (ANR)[ANR-09-SEGI-004]
Identificador	APPLIED MATHEMATICS AND OPTIMIZATION, v.62, n.2, p.185-204, 2010 0095-4616 http://producao.usp.br/handle/BDPI/18682 10.1007/s00245-010-9099-4 http://dx.doi.org/10.1007/s00245-010-9099-4
Idioma(s)	eng
Publicador	SPRINGER
Relação	Applied Mathematics and Optimization
Direitos	restrictedAccess Copyright SPRINGER
Palavras-Chave	#Piecewise-deterministic Markov Processes #Continuous-time #Long-run average cost #Optimal control #Integro-differential optimality inequation #Policy iteration algorithm #DECISION-PROCESSES #BOREL SPACES #OPTIMALITY #Mathematics, Applied
Tipo	article original article publishedVersion

Acesso ao item digital