1 resultado para Philosophy of the difference
em Massachusetts Institute of Technology
Filtro por publicador
- Aberystwyth University Repository - Reino Unido (2)
- Acceda, el repositorio institucional de la Universidad de Las Palmas de Gran Canaria. España (1)
- Aquatic Commons (21)
- Archive of European Integration (2)
- Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco (6)
- Aston University Research Archive (5)
- Biblioteca Digital | Sistema Integrado de Documentación | UNCuyo - UNCUYO. UNIVERSIDAD NACIONAL DE CUYO. (5)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP) (2)
- Biodiversity Heritage Library, United States (2)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (38)
- Boston University Digital Common (2)
- Brock University, Canada (3)
- Bucknell University Digital Commons - Pensilvania - USA (3)
- Bulgarian Digital Mathematics Library at IMI-BAS (2)
- CaltechTHESIS (9)
- Cambridge University Engineering Department Publications Database (13)
- CentAUR: Central Archive University of Reading - UK (21)
- Center for Jewish History Digital Collections (1)
- Central European University - Research Support Scheme (1)
- Chinese Academy of Sciences Institutional Repositories Grid Portal (99)
- Comissão Econômica para a América Latina e o Caribe (CEPAL) (1)
- CORA - Cork Open Research Archive - University College Cork - Ireland (8)
- Corvinus Research Archive - The institutional repository for the Corvinus University of Budapest (3)
- Dalarna University College Electronic Archive (3)
- Deakin Research Online - Australia (36)
- DI-fusion - The institutional repository of Université Libre de Bruxelles (1)
- Digital Archives@Colby (2)
- Digital Commons at Florida International University (2)
- Digitale Sammlungen - Goethe-Universität Frankfurt am Main (3)
- Diposit Digital de la UB - Universidade de Barcelona (1)
- DRUM (Digital Repository at the University of Maryland) (1)
- Duke University (7)
- eResearch Archive - Queensland Department of Agriculture; Fisheries and Forestry (8)
- Gallica, Bibliotheque Numerique - Bibliothèque nationale de France (French National Library) (BnF), France (1)
- Glasgow Theses Service (2)
- Greenwich Academic Literature Archive - UK (7)
- Harvard University (2)
- Helda - Digital Repository of University of Helsinki (35)
- Indian Institute of Science - Bangalore - Índia (41)
- Instituto Superior de Psicologia Aplicada - Lisboa (1)
- Massachusetts Institute of Technology (1)
- Memoria Académica - FaHCE, UNLP - Argentina (3)
- National Center for Biotechnology Information - NCBI (1)
- Plymouth Marine Science Electronic Archive (PlyMSEA) (3)
- Portal de Revistas Científicas Complutenses - Espanha (6)
- Publishing Network for Geoscientific & Environmental Data (1)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (18)
- Queensland University of Technology - ePrints Archive (180)
- Repositório Institucional da Universidade de Brasília (1)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (14)
- RUN (Repositório da Universidade Nova de Lisboa) - FCT (Faculdade de Cienecias e Technologia), Universidade Nova de Lisboa (UNL), Portugal (2)
- Scielo Uruguai (1)
- Universidad Autónoma de Nuevo León, Mexico (4)
- Universidad de Alicante (2)
- Universidad Politécnica de Madrid (6)
- Universidade Complutense de Madrid (2)
- Université de Montréal, Canada (1)
- University of Connecticut - USA (1)
- University of Michigan (303)
- University of Queensland eSpace - Australia (13)
- University of Southampton, United Kingdom (1)
- WestminsterResearch - UK (4)
- Worcester Research and Publications - Worcester Research and Publications - UK (2)
Resumo:
There has been recent interest in using temporal difference learning methods to attack problems of prediction and control. While these algorithms have been brought to bear on many problems, they remain poorly understood. It is the purpose of this thesis to further explore these algorithms, presenting a framework for viewing them and raising a number of practical issues and exploring those issues in the context of several case studies. This includes applying the TD(lambda) algorithm to: 1) learning to play tic-tac-toe from the outcome of self-play and of play against a perfectly-playing opponent and 2) learning simple one-dimensional segmentation tasks.