Spike-based decision learning of nash equilibria in two-player games
Data(s) |
2012
|
---|---|
Resumo |
Humans and animals face decision tasks in an uncertain multi-agent environment where an agent's strategy may change in time due to the co-adaptation of others strategies. The neuronal substrate and the computational algorithms underlying such adaptive decision making, however, is largely unknown. We propose a population coding model of spiking neurons with a policy gradient procedure that successfully acquires optimal strategies for classical game-theoretical tasks. The suggested population reinforcement learning reproduces data from human behavioral experiments for the blackjack and the inspector game. It performs optimally according to a pure (deterministic) and mixed (stochastic) Nash equilibrium, respectively. In contrast, temporal-difference(TD)-learning, covariance-learning, and basic reinforcement learning fail to perform optimally for the stochastic strategy. Spike-based population reinforcement learning, shown to follow the stochastic reward gradient, is therefore a viable candidate to explain automated decision learning of a Nash equilibrium in two-player games. |
Formato |
application/pdf |
Identificador |
http://boris.unibe.ch/14214/1/journal.pcbi.1002691.pdf Friedrich, Johannes; Senn, Walter (2012). Spike-based decision learning of nash equilibria in two-player games. PLoS computational biology, 8(9), pp. 1-12. San Francisco, Calif.: Public Library of Science 10.1371/journal.pcbi.1002691 <http://dx.doi.org/10.1371/journal.pcbi.1002691> doi:10.7892/boris.14214 info:doi:10.1371/journal.pcbi.1002691 info:pmid:23028289 urn:issn:1553-734X |
Idioma(s) |
eng |
Publicador |
Public Library of Science |
Relação |
http://boris.unibe.ch/14214/ |
Direitos |
info:eu-repo/semantics/openAccess |
Fonte |
Friedrich, Johannes; Senn, Walter (2012). Spike-based decision learning of nash equilibria in two-player games. PLoS computational biology, 8(9), pp. 1-12. San Francisco, Calif.: Public Library of Science 10.1371/journal.pcbi.1002691 <http://dx.doi.org/10.1371/journal.pcbi.1002691> |
Palavras-Chave | #610 Medicine & health |
Tipo |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion PeerReviewed |