Multilinear and Integer Programming for Markov Decision Processes with Imprecise Probabilities


Autoria(s): Shirota Filho, Ricardo; Cozman, Fabio Gagliardi; Trevizan, Felipe Werndl; de Campos, Cassio Polpo; de Barros, Leliane Nunes
Data(s)

2007

Resumo

Markov Decision Processes (MDPs) are extensively used to encode sequences of decisions with probabilistic effects. Markov Decision Processes with Imprecise Probabilities (MDPIPs) encode sequences of decisions whose effects are modeled using sets of probability distributions. In this paper we examine the computation of Γ-maximin policies for MDPIPs using multilinear and integer programming. We discuss the application of our algorithms to “factored” models and to a recent proposal, Markov Decision Processes with Set-valued Transitions (MDPSTs), that unifies the fields of probabilistic and “nondeterministic” planning in artificial intelligence research. 

Formato

application/pdf

Identificador

http://pure.qub.ac.uk/portal/en/publications/multilinear-and-integer-programming-for-markov-decision-processes-with-imprecise-probabilities(adba0bf3-e76e-4c15-92f7-cc002f9375a7).html

http://pure.qub.ac.uk/ws/files/29972728/decampos2007c.pdf

Idioma(s)

eng

Direitos

info:eu-repo/semantics/openAccess

Fonte

Shirota Filho , R , Cozman , F G , Trevizan , F W , de Campos , C P & de Barros , L N 2007 , Multilinear and Integer Programming for Markov Decision Processes with Imprecise Probabilities . in Proceedings of the 5th International Symposium on Imprecise Probability: Theories and Applications . pp. 395-404 , The 5th International Symposium on Imprecise Probability: Theories and Applications , Prague , Czech Republic , 16-19 July .

Tipo

contributionToPeriodical