1 result for Bicycle riding
at Massachusetts Institute of Technology
Abstract:
This paper presents a novel algorithm for learning in a class of stochastic Markov decision processes (MDPs) with continuous state and action spaces that trades speed for accuracy. A transform of the stochastic MDP into a deterministic one is presented which captures the essence of the original dynamics, in a sense made precise. In this transformed MDP, the calculation of values is greatly simplified. The online algorithm estimates the model of the transformed MDP and simultaneously performs policy search against it. Bounds on the error of this approximation are proven, and experimental results in a bicycle riding domain are presented. The algorithm learns near-optimal policies in orders of magnitude fewer interactions with the stochastic MDP, using less domain knowledge. All code used in the experiments is available on the project's website.