Biblioteca Digital

**Autoria(s):** BIANCHI, Reinaldo A. C.; RIBEIRO, Carlos H. C.; Costa, Anna Helena Reali
Contribuinte(s)	UNIVERSIDADE DE SÃO PAULO
Data(s)	18/10/2012 18/10/2012 2008
Resumo	This paper investigates how to make improved action selection for online policy learning in robotic scenarios using reinforcement learning (RL) algorithms. Since finding control policies using any RL algorithm can be very time consuming, we propose to combine RL algorithms with heuristic functions for selecting promising actions during the learning process. With this aim, we investigate the use of heuristics for increasing the rate of convergence of RL algorithms and contribute with a new learning algorithm, Heuristically Accelerated Q-learning (HAQL), which incorporates heuristics for action selection to the Q-Learning algorithm. Experimental results on robot navigation show that the use of even very simple heuristic functions results in significant performance enhancement of the learning rate.
Identificador	JOURNAL OF HEURISTICS, v.14, n.2, p.135-168, 2008 1381-1231 http://producao.usp.br/handle/BDPI/18147 10.1007/s10732-007-9031-5 http://dx.doi.org/10.1007/s10732-007-9031-5
Idioma(s)	eng
Publicador	SPRINGER
Relação	Journal of Heuristics
Direitos	restrictedAccess Copyright SPRINGER
Palavras-Chave	#reinforcement learning #heuristic function #robot navigation #action selection #MOBILE ROBOTS #REINFORCEMENT #LOCALIZATION #Computer Science, Artificial Intelligence #Computer Science, Theory & Methods
Tipo	article original article publishedVersion

Acesso ao item digital