1 resultado para Eschatological opponent
em Massachusetts Institute of Technology
Filtro por publicador
- Acceda, el repositorio institucional de la Universidad de Las Palmas de Gran Canaria. España (1)
- Adam Mickiewicz University Repository (2)
- AMS Tesi di Dottorato - Alm@DL - Università di Bologna (2)
- ArchiMeD - Elektronische Publikationen der Universität Mainz - Alemanha (3)
- Archive of European Integration (2)
- Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco (3)
- Aston University Research Archive (4)
- Biblioteca Digital | Sistema Integrado de Documentación | UNCuyo - UNCUYO. UNIVERSIDAD NACIONAL DE CUYO. (1)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (3)
- Biblioteca Digital de la Universidad Católica Argentina (4)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (8)
- Boston University Digital Common (11)
- Brock University, Canada (3)
- Brunel University (1)
- Bucknell University Digital Commons - Pensilvania - USA (2)
- Cambridge University Engineering Department Publications Database (2)
- CentAUR: Central Archive University of Reading - UK (3)
- Chinese Academy of Sciences Institutional Repositories Grid Portal (1)
- Coffee Science - Universidade Federal de Lavras (1)
- CORA - Cork Open Research Archive - University College Cork - Ireland (3)
- Corvinus Research Archive - The institutional repository for the Corvinus University of Budapest (1)
- Dalarna University College Electronic Archive (1)
- Digital Commons at Florida International University (1)
- DigitalCommons@The Texas Medical Center (1)
- DRUM (Digital Repository at the University of Maryland) (1)
- Duke University (4)
- Glasgow Theses Service (1)
- Harvard University (1)
- Helda - Digital Repository of University of Helsinki (5)
- Helvia: Repositorio Institucional de la Universidad de Córdoba (1)
- Indian Institute of Science - Bangalore - Índia (4)
- Instituto Gulbenkian de Ciência (2)
- Instituto Politécnico de Santarém (1)
- Instituto Superior de Psicologia Aplicada - Lisboa (1)
- Lume - Repositório Digital da Universidade Federal do Rio Grande do Sul (1)
- Massachusetts Institute of Technology (1)
- Memoria Académica - FaHCE, UNLP - Argentina (15)
- National Center for Biotechnology Information - NCBI (3)
- Portal de Revistas Científicas Complutenses - Espanha (4)
- QSpace: Queen's University - Canada (2)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (20)
- Queensland University of Technology - ePrints Archive (18)
- RCAAP - Repositório Científico de Acesso Aberto de Portugal (2)
- Repositorio Académico de la Universidad Nacional de Costa Rica (1)
- Repositório Científico da Universidade de Évora - Portugal (1)
- Repositório Científico do Instituto Politécnico de Lisboa - Portugal (1)
- Repositório Científico do Instituto Politécnico de Santarém - Portugal (1)
- Repositório Institucional da Universidade de Aveiro - Portugal (2)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (13)
- RUN (Repositório da Universidade Nova de Lisboa) - FCT (Faculdade de Cienecias e Technologia), Universidade Nova de Lisboa (UNL), Portugal (2)
- SAPIENTIA - Universidade do Algarve - Portugal (2)
- South Carolina State Documents Depository (2)
- Universidad de Alicante (2)
- Universidad del Rosario, Colombia (1)
- Universidad Politécnica de Madrid (6)
- Universidade de Lisboa - Repositório Aberto (1)
- Universidade Federal do Pará (1)
- Universidade Federal do Rio Grande do Norte (UFRN) (2)
- Universidade Metodista de São Paulo (12)
- Universidade Técnica de Lisboa (1)
- Universitätsbibliothek Kassel, Universität Kassel, Germany (1)
- Université de Montréal (1)
- Université de Montréal, Canada (5)
- University of Michigan (3)
- University of Queensland eSpace - Australia (5)
- WestminsterResearch - UK (1)
Resumo:
There has been recent interest in using temporal difference learning methods to attack problems of prediction and control. While these algorithms have been brought to bear on many problems, they remain poorly understood. It is the purpose of this thesis to further explore these algorithms, presenting a framework for viewing them and raising a number of practical issues and exploring those issues in the context of several case studies. This includes applying the TD(lambda) algorithm to: 1) learning to play tic-tac-toe from the outcome of self-play and of play against a perfectly-playing opponent and 2) learning simple one-dimensional segmentation tasks.