1 resultado para tabu search algorithm
em Universitat de Girona, Spain
Filtro por publicador
- AMS Tesi di Dottorato - Alm@DL - Università di Bologna (6)
- AMS Tesi di Laurea - Alm@DL - Università di Bologna (4)
- ArchiMeD - Elektronische Publikationen der Universität Mainz - Alemanha (1)
- Aston University Research Archive (13)
- Biblioteca de Teses e Dissertações da USP (1)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (6)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP) (68)
- Biblioteca Virtual del Sistema Sanitario Público de Andalucía (BV-SSPA), Junta de Andalucía. Consejería de Salud y Bienestar Social, Spain (2)
- Biodiversity Heritage Library, United States (4)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (6)
- Brock University, Canada (23)
- Bucknell University Digital Commons - Pensilvania - USA (1)
- Bulgarian Digital Mathematics Library at IMI-BAS (13)
- CentAUR: Central Archive University of Reading - UK (26)
- CiencIPCA - Instituto Politécnico do Cávado e do Ave, Portugal (2)
- Cochin University of Science & Technology (CUSAT), India (1)
- Consorci de Serveis Universitaris de Catalunya (CSUC), Spain (103)
- Cor-Ciencia - Acuerdo de Bibliotecas Universitarias de Córdoba (ABUC), Argentina (4)
- Dalarna University College Electronic Archive (3)
- Department of Computer Science E-Repository - King's College London, Strand, London (2)
- Digital Commons - Michigan Tech (2)
- Digital Commons at Florida International University (6)
- DigitalCommons@The Texas Medical Center (2)
- DigitalCommons@University of Nebraska - Lincoln (5)
- Doria (National Library of Finland DSpace Services) - National Library of Finland, Finland (63)
- FUNDAJ - Fundação Joaquim Nabuco (5)
- Galway Mayo Institute of Technology, Ireland (2)
- Illinois Digital Environment for Access to Learning and Scholarship Repository (2)
- Institute of Public Health in Ireland, Ireland (1)
- Instituto Politécnico do Porto, Portugal (78)
- Iowa Publications Online (IPO) - State Library, State of Iowa (Iowa), United States (1)
- Lume - Repositório Digital da Universidade Federal do Rio Grande do Sul (1)
- Martin Luther Universitat Halle Wittenberg, Germany (6)
- Massachusetts Institute of Technology (3)
- National Center for Biotechnology Information - NCBI (2)
- Nottingham eTheses (20)
- Publishing Network for Geoscientific & Environmental Data (1)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (1)
- RDBU - Repositório Digital da Biblioteca da Unisinos (12)
- ReCiL - Repositório Científico Lusófona - Grupo Lusófona, Portugal (1)
- Repositório Aberto da Universidade Aberta de Portugal (1)
- Repositório Científico do Instituto Politécnico de Lisboa - Portugal (26)
- Repositório da Produção Científica e Intelectual da Unicamp (4)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (80)
- RUN (Repositório da Universidade Nova de Lisboa) - FCT (Faculdade de Cienecias e Technologia), Universidade Nova de Lisboa (UNL), Portugal (25)
- Scielo Saúde Pública - SP (44)
- Scottish Institute for Research in Economics (SIRE) (SIRE), United Kingdom (7)
- Universidad de Alicante (4)
- Universidad Politécnica de Madrid (14)
- Universidade Complutense de Madrid (1)
- Universidade do Minho (64)
- Universidade dos Açores - Portugal (1)
- Universidade Federal do Rio Grande do Norte (UFRN) (7)
- Universitat de Girona, Spain (1)
- Universitätsbibliothek Kassel, Universität Kassel, Germany (1)
- Université de Lausanne, Switzerland (82)
- Université de Montréal (2)
- Université de Montréal, Canada (11)
- Université Laval Mémoires et thèses électroniques (1)
- University of Queensland eSpace - Australia (74)
- University of Southampton, United Kingdom (1)
Resumo:
This paper proposes a high-level reinforcement learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, when using RL, has been to apply value function based algorithms, the system here detailed is characterized by the use of direct policy search methods. Rather than approximating a value function, these methodologies approximate a policy using an independent function approximator with its own parameters, trying to maximize the future expected reward. The policy based algorithm presented in this paper is used for learning the internal state/action mapping of a behavior. In this preliminary work, we demonstrate its feasibility with simulated experiments using the underwater robot GARBI in a target reaching task