A reinforcement learning based algorithm for finite horizon Markov decision processes
Date(s) |
2006 |
---|---|
Abstract |
We develop a simulation based algorithm for finite horizon Markov decision processes with finite state and finite action space. Illustrative numerical experiments with the proposed algorithm are shown for problems in flow control of communication networks and capacity switching in semiconductor fabrication. |
Format |
application/pdf |
Identifier |
http://eprints.iisc.ernet.in/30451/1/04177082.pdf Bhatnagar, Shalabh and Abdulla, Mohammed Shahid (2006) A reinforcement learning based algorithm for finite horizon Markov decision processes. In: 45th IEEE Conference on Decision and Control, Dec 13-15, 2006, San Diego, CA, pp. 5519-5524. |
Publisher |
Institute of Electrical and Electronics Engineers |
Relation |
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4177082 http://eprints.iisc.ernet.in/30451/ |
Keywords | #Computer Science & Automation (Formerly, School of Automation) |
Type |
Conference Paper PeerReviewed |