Abstract:
Multi-Agent Reinforcement Learning (MARL) algorithms face two main difficulties: the curse of dimensionality, and environment non-stationarity due to the independent learning processes carried out concurrently by the agents. In this paper we formalize and prove the convergence of a Distributed Round Robin Q-learning (D-RR-QL) algorithm for cooperative systems. The computational complexity of this algorithm increases linearly with the number of agents. Moreover, it eliminates environment non-stationarity by carrying out a round-robin scheduling of action selection and execution. This learning scheme allows the implementation of Modular State-Action Vetoes (MSAV) in cooperative multi-agent systems, which speeds up learning convergence in over-constrained systems by vetoing state-action pairs that lead to undesired termination states (UTS) in the relevant state-action subspace. Each agent's local state-action value function learning is an independent process, including the MSAV policies. Coordination of locally optimal policies to obtain the globally optimal joint policy is achieved by a greedy selection procedure using message passing. We show that D-RR-QL improves over state-of-the-art approaches, such as Distributed Q-Learning, Team Q-Learning and Coordinated Reinforcement Learning, in a paradigmatic Linked Multi-Component Robotic System (L-MCRS) control problem: the hose transportation task. L-MCRS are over-constrained systems with many UTS induced by the interaction of the passive linking element and the active mobile robots.
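The abstract does not give the update rules, so the following is only a minimal Python sketch of two of the ideas it names: round-robin turn-taking, and a tabular Q-learning update that vetoes state-action pairs observed to end in a UTS. It is not the authors' D-RR-QL. The names `round_robin` and `VetoQAgent`, the hyperparameters, and the fallback used when every action in a state has been vetoed are all assumptions made for illustration. The message-passing coordination step is omitted.

```python
import random
from collections import defaultdict
from itertools import cycle, islice


def round_robin(n_agents, n_steps):
    """Return the index of the single agent allowed to act at each step."""
    return list(islice(cycle(range(n_agents)), n_steps))


class VetoQAgent:
    """Hypothetical tabular Q-learner that vetoes pairs seen to end in a UTS."""

    def __init__(self, actions, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)  # (state, action) -> value estimate
        self.vetoed = set()          # (state, action) pairs that led to a UTS
        self.rng = random.Random(seed)

    def allowed(self, state):
        """Non-vetoed actions; assumption: fall back to all if none remain."""
        free = [a for a in self.actions if (state, a) not in self.vetoed]
        return free or self.actions

    def select(self, state):
        """Epsilon-greedy choice restricted to the non-vetoed actions."""
        options = self.allowed(state)
        if self.rng.random() < self.epsilon:
            return self.rng.choice(options)
        return max(options, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state, done, undesired=False):
        """One Q-learning backup; record a veto if the step ended in a UTS."""
        if undesired:
            self.vetoed.add((state, action))
        future = 0.0 if done else max(
            self.q[(next_state, a)] for a in self.allowed(next_state)
        )
        target = reward + self.gamma * future
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])


if __name__ == "__main__":
    print(round_robin(3, 7))  # agents act strictly one at a time, in turn
    agent = VetoQAgent(actions=("left", "stay", "right"), epsilon=0.0)
    # A step that ends in an undesired termination state is vetoed from then on.
    agent.update("s0", "right", reward=-1.0, next_state="uts", done=True, undesired=True)
    print(agent.allowed("s0"))
```

Because only one agent acts per step in this schedule, each learner sees the others as fixed during its own turn, which is the intuition behind the abstract's claim about non-stationarity. The veto set shrinks the action set each agent explores, which is where the claimed speed-up in over-constrained systems would come from.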