Accelerating autonomous learning by using heuristic selection of actions


Autoria(s): BIANCHI, Reinaldo A. C.; RIBEIRO, Carlos H. C.; Costa, Anna Helena Reali
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

18/10/2012

18/10/2012

2008

Resumo

This paper investigates how to make improved action selection for online policy learning in robotic scenarios using reinforcement learning (RL) algorithms. Since finding control policies using any RL algorithm can be very time consuming, we propose to combine RL algorithms with heuristic functions for selecting promising actions during the learning process. With this aim, we investigate the use of heuristics for increasing the rate of convergence of RL algorithms and contribute with a new learning algorithm, Heuristically Accelerated Q-learning (HAQL), which incorporates heuristics for action selection to the Q-Learning algorithm. Experimental results on robot navigation show that the use of even very simple heuristic functions results in significant performance enhancement of the learning rate.

Identificador

JOURNAL OF HEURISTICS, v.14, n.2, p.135-168, 2008

1381-1231

http://producao.usp.br/handle/BDPI/18147

10.1007/s10732-007-9031-5

http://dx.doi.org/10.1007/s10732-007-9031-5

Idioma(s)

eng

Publicador

SPRINGER

Relação

Journal of Heuristics

Direitos

restrictedAccess

Copyright SPRINGER

Palavras-Chave #reinforcement learning #heuristic function #robot navigation #action selection #MOBILE ROBOTS #REINFORCEMENT #LOCALIZATION #Computer Science, Artificial Intelligence #Computer Science, Theory & Methods
Tipo

article

original article

publishedVersion