A Simultaneous Deterministic Perturbation Actor-Critic Algorithm with an Application to Optimal Mortgage Refinancing
Data(s) |
07/05/2006
|
---|---|
Resumo |
We develop a simulation-based, two-timescale actor-critic algorithm for infinite horizon Markov decision processes with finite state and action spaces, with a discounted reward criterion. The algorithm is of the gradient ascent type and performs a search in the space of stationary randomized policies. The algorithm uses certain simultaneous deterministic perturbation stochastic approximation (SDPSA) gradient estimates for enhanced performance. We show an application of our algorithm on a problem of mortgage refinancing. Our algorithm obtains the optimal refinancing strategies in a computationally efficient manner |
Formato |
application/pdf |
Identificador |
http://eprints.iisc.ernet.in/41969/1/A_Simultaneous.pdf Raju Chinthalapati, VL and Bhatnagar, S (2006) A Simultaneous Deterministic Perturbation Actor-Critic Algorithm with an Application to Optimal Mortgage Refinancing. In: Proceedings of the 45th IEEE Conference on Decision & Control Manchester Grand Hyatt Hotel, 13-15 Dec. 2006, San Diego, CA . |
Publicador |
IEEE |
Relação |
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4177608&tag=1 http://eprints.iisc.ernet.in/41969/ |
Palavras-Chave | #Computer Science & Automation (Formerly, School of Automation) |
Tipo |
Conference Paper PeerReviewed |