A reinforcement learning based algorithm for finite horizon Markov decision processes


Author(s): Bhatnagar, Shalabh; Abdulla, Mohammed Shahid
Date(s)

2006

Abstract

We develop a simulation-based algorithm for finite-horizon Markov decision processes with finite state and action spaces. Illustrative numerical experiments with the proposed algorithm are shown for problems in flow control of communication networks and capacity switching in semiconductor fabrication.

Format

application/pdf

Identifier

http://eprints.iisc.ernet.in/30451/1/04177082.pdf

Bhatnagar, Shalabh and Abdulla, Mohammed Shahid (2006) A reinforcement learning based algorithm for finite horizon Markov decision processes. In: 45th IEEE Conference on Decision and Control, Dec 13-15, 2006, San Diego, CA, pp. 5519-5524.

Publisher

Institute of Electrical and Electronics Engineers

Relation

http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4177082

http://eprints.iisc.ernet.in/30451/

Keywords #Computer Science & Automation (Formerly, School of Automation)

Type

Conference Paper

PeerReviewed