31 results for Board Game

in BORIS: Bern Open Repository and Information System - Bern - Switzerland


Relevance:

60.00%

Publisher:

Abstract:

Learning by reinforcement is important in shaping animal behavior, and in particular in behavioral decision making. Such decision making is likely to involve the integration of many synaptic events in space and time. However, when a single reinforcement signal is used to modulate synaptic plasticity, as suggested in classical reinforcement learning algorithms, a twofold problem arises. Different synapses will have contributed differently to the behavioral decision, and even for one and the same synapse, releases at different times may have had different effects. Here we present a plasticity rule which solves this spatio-temporal credit assignment problem in a population of spiking neurons. The learning rule is spike-time dependent and maximizes the expected reward by following its stochastic gradient. Synaptic plasticity is modulated not only by the reward, but also by a population feedback signal. While this additional signal solves the spatial component of the problem, the temporal one is solved by means of synaptic eligibility traces. In contrast to temporal difference (TD) based approaches to reinforcement learning, our rule is explicit with regard to the assumed biophysical mechanisms. Neurotransmitter concentrations determine plasticity and learning occurs fully online. Further, it works even if the task to be learned is non-Markovian, i.e. when reinforcement is not determined by the current state of the system but may also depend on past events. The performance of the model is assessed by studying three non-Markovian tasks. In the first task, the reward is delayed beyond the last action, with unrelated stimuli and actions appearing in between. The second task involves an action sequence which is itself extended in time, and reward is only delivered at the last action, as is the case in any type of board game. The third task is the inspection game that has been studied in neuroeconomics, where an inspector tries to prevent a worker from shirking. Applying our algorithm to this game yields a learning behavior which is consistent with behavioral data from humans and monkeys, revealing properties of a mixed Nash equilibrium. The examples show that our neuronal implementation of reward-based learning copes with delayed and stochastic reward delivery, and also with the learning of mixed strategies in two-opponent games.
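
The rule summarized above combines three ingredients: stochastically spiking neurons, synaptic eligibility traces that handle temporal credit assignment, and a reward plus population feedback signal that modulates the accumulated traces. The following toy sketch illustrates these ingredients only in spirit; the network size, the reward criterion, the feedback term, and all constants are assumptions made for illustration and do not reproduce the exact rule of the paper.

```python
# Toy sketch (assumptions only, not the paper's exact rule) of reward-modulated
# plasticity with eligibility traces and a population feedback signal.
import numpy as np

rng = np.random.default_rng(0)

n_in, n_out = 20, 5        # population sizes (arbitrary)
T = 200                    # time steps per trial
tau_e = 50.0               # eligibility-trace decay constant (in steps)
eta = 0.01                 # learning rate

w = 0.1 * rng.standard_normal((n_out, n_in))   # synaptic weights

def run_trial(w):
    """One trial: accumulate eligibility traces and output spike counts."""
    e = np.zeros_like(w)        # instantaneous eligibility traces
    e_acc = np.zeros_like(w)    # traces accumulated over the whole trial
    counts = np.zeros(n_out)
    for _ in range(T):
        x = (rng.random(n_in) < 0.05).astype(float)     # presynaptic spikes
        p = 1.0 / (1.0 + np.exp(-(w @ x)))              # firing probabilities
        y = (rng.random(n_out) < p).astype(float)       # postsynaptic spikes
        # Hebbian-like term (y - p) * x, decaying with time constant tau_e,
        # so each synapse "remembers" how it contributed to recent spiking.
        e = (1.0 - 1.0 / tau_e) * e + np.outer(y - p, x)
        e_acc += e
        counts += y
    return e_acc, counts

for _ in range(1000):
    e_acc, counts = run_trial(w)
    # Population feedback: each neuron's activity relative to the population
    # mean (spatial credit assignment in this toy setting).
    feedback = counts - counts.mean()
    # Hypothetical delayed reward, delivered only after the trial has ended:
    # here the trial counts as rewarded if neuron 0 was the most active unit.
    reward = 1.0 if counts[0] == counts.max() else -1.0
    # Reward and population feedback jointly modulate the eligibility traces.
    w += eta * reward * feedback[:, None] * e_acc
```

The point of the sketch is that no synapse needs the reward at the moment it is active: the trace decays with time constant tau_e, so a reward delivered only at the end of the trial can still reach the synapses that contributed earlier, which is what the delayed-reward and board-game tasks above require.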

Relevance:

60.00%

Publisher:

Abstract:

Learning by reinforcement is important in shaping animal behavior, but behavioral decision making is likely to involve the integration of many synaptic events in space and time. When a single reinforcement signal is used to modulate synaptic plasticity, a twofold problem therefore arises. Different synapses will have contributed differently to the behavioral decision and, even for one and the same synapse, releases at different times may have had different effects. Here we present a plasticity rule which solves this spatio-temporal credit assignment problem in a population of spiking neurons. The learning rule is spike-time dependent and maximizes the expected reward by following its stochastic gradient. Synaptic plasticity is modulated not only by the reward but also by a population feedback signal. While this additional signal solves the spatial component of the problem, the temporal one is solved by means of synaptic eligibility traces. In contrast to temporal difference based approaches to reinforcement learning, our rule is explicit with regard to the assumed biophysical mechanisms. Neurotransmitter concentrations determine plasticity and learning occurs fully online. Further, it works even if the task to be learned is non-Markovian, i.e. when reinforcement is not determined by the current state of the system but may also depend on past events. The performance of the model is assessed by studying three non-Markovian tasks. In the first task, the reward is delayed beyond the last action, with unrelated stimuli and actions appearing in between. The second task involves an action sequence which is itself extended in time, and reward is only delivered at the last action, as is the case in any type of board game. The third is the inspection game that has been studied in neuroeconomics. It has only a mixed Nash equilibrium and exemplifies that the model also copes with stochastic reward delivery and the learning of mixed strategies.
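
The inspection game mentioned above has no pure-strategy equilibrium: whichever pure action the worker fixes, the inspector wants to deviate, and vice versa. In the mixed Nash equilibrium each player therefore randomizes so that the opponent is indifferent between its two actions. The sketch below uses purely hypothetical payoff numbers (not the payoffs analyzed in the paper) to show how such an equilibrium follows from the indifference conditions of a 2x2 inspection game.

```python
# Hypothetical 2x2 inspection game; payoff values are illustrative assumptions.
import numpy as np

# Rows: worker (work, shirk); columns: inspector (inspect, no inspect).
worker = np.array([[2.0, 2.0],      # work:  wage minus effort, inspected or not
                   [0.0, 3.0]])     # shirk: caught -> 0, not caught -> full wage
inspector = np.array([[1.0, 3.0],   # work:  work value minus inspection cost / no cost
                      [2.0, 0.0]])  # shirk: catches the shirker / pays wage for nothing

# Mixed equilibrium via indifference: the inspector inspects with probability q
# that makes the worker indifferent between working and shirking, and the worker
# shirks with probability p that makes the inspector indifferent between
# inspecting and not inspecting.
q = (worker[1, 1] - worker[0, 1]) / (
    worker[1, 1] - worker[0, 1] + worker[0, 0] - worker[1, 0])
p = (inspector[0, 1] - inspector[0, 0]) / (
    inspector[0, 1] - inspector[0, 0] + inspector[1, 0] - inspector[1, 1])

print(f"inspector inspects with prob {q:.2f}, worker shirks with prob {p:.2f}")
```

With these particular payoffs the indifference conditions give an inspection probability of 1/3 and a shirking probability of 1/2; other payoff values would shift these probabilities but would not restore a pure-strategy equilibrium, which is why learning in this game has to converge to a mixed strategy.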

Relevance:

60.00%

Publisher:

Abstract:

We tested the hypothesis that the interaction of self-control strength and state anxiety predicts perceptual–motor performance in a hand–eye coordination task. We predicted a stronger negative relation between anxiety and performance in a perceptual–motor task for participants whose self-control strength had been temporarily depleted compared to participants whose self-control strength was intact. In an experiment (N = 60), we manipulated self-control strength, measured state anxiety after an evaluative instruction, and assessed performance in the board game Operation as an indicator of perceptual–motor performance. The data supported our hypothesis: Only for participants whose self-control strength was temporarily depleted was there a statistically significant negative relation between anxiety and performance. Boosting self-control strength may help to prevent these potentially negative effects of anxiety.

Relevance:

20.00%

Publisher:

Abstract:

About 15 years ago, the Swiss Society of Pathology developed and implemented a board examination in anatomical pathology. We describe herein the contents covered by this 2-day exam (autopsy pathology, cytology, histopathology, molecular pathology, and basic knowledge about mechanisms of disease) and its exact modalities, sketch a brief history of the exam, and finish with a concise discussion of its possible objectives and putative benefits, weighed against the hardship that it imposes on the candidates.