Biblioteca Digital

**Autoria(s):** Raju Chinthalapati, VL; Bhatnagar, S
Data(s)	07/05/2006
Resumo	We develop a simulation-based, two-timescale actor-critic algorithm for infinite horizon Markov decision processes with finite state and action spaces, with a discounted reward criterion. The algorithm is of the gradient ascent type and performs a search in the space of stationary randomized policies. The algorithm uses certain simultaneous deterministic perturbation stochastic approximation (SDPSA) gradient estimates for enhanced performance. We show an application of our algorithm on a problem of mortgage refinancing. Our algorithm obtains the optimal refinancing strategies in a computationally efficient manner
Formato	application/pdf
Identificador	http://eprints.iisc.ernet.in/41969/1/A_Simultaneous.pdf Raju Chinthalapati, VL and Bhatnagar, S (2006) A Simultaneous Deterministic Perturbation Actor-Critic Algorithm with an Application to Optimal Mortgage Refinancing. In: Proceedings of the 45th IEEE Conference on Decision & Control Manchester Grand Hyatt Hotel, 13-15 Dec. 2006, San Diego, CA .
Publicador	IEEE
Relação	http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4177608&tag=1 http://eprints.iisc.ernet.in/41969/
Palavras-Chave	#Computer Science & Automation (Formerly, School of Automation)
Tipo	Conference Paper PeerReviewed

Acesso ao item digital