A Simultaneous Deterministic Perturbation Actor-Critic Algorithm with an Application to Optimal Mortgage Refinancing


Autoria(s): Raju Chinthalapati, VL; Bhatnagar, S
Data(s)

07/05/2006

Resumo

We develop a simulation-based, two-timescale actor-critic algorithm for infinite horizon Markov decision processes with finite state and action spaces, with a discounted reward criterion. The algorithm is of the gradient ascent type and performs a search in the space of stationary randomized policies. The algorithm uses certain simultaneous deterministic perturbation stochastic approximation (SDPSA) gradient estimates for enhanced performance. We show an application of our algorithm on a problem of mortgage refinancing. Our algorithm obtains the optimal refinancing strategies in a computationally efficient manner

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/41969/1/A_Simultaneous.pdf

Raju Chinthalapati, VL and Bhatnagar, S (2006) A Simultaneous Deterministic Perturbation Actor-Critic Algorithm with an Application to Optimal Mortgage Refinancing. In: Proceedings of the 45th IEEE Conference on Decision & Control Manchester Grand Hyatt Hotel, 13-15 Dec. 2006, San Diego, CA .

Publicador

IEEE

Relação

http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4177608&tag=1

http://eprints.iisc.ernet.in/41969/

Palavras-Chave #Computer Science & Automation (Formerly, School of Automation)
Tipo

Conference Paper

PeerReviewed