21 resultados para Reward
Resumo:
The processing of spatial and mnemonic information is believed to depend on hippocampal theta oscillations (5–12 Hz). However, in rats both the power and the frequency of the theta rhythm are modulated by locomotor activity, which is a major confounding factor when estimating its cognitive correlates. Previous studies have suggested that hippocampal theta oscillations support decision-making processes. In this study, we investigated to what extent spatial decision making modulates hippocampal theta oscillations when controlling for variations in locomotion speed. We recorded local field potentials from the CA1 region of rats while animals had to choose one arm to enter for reward (goal) in a four-arm radial maze. We observed prominent theta oscillations during the decision-making period of the task, which occurred in the center of the maze before animals deliberately ran through an arm toward goal location. In speed-controlled analyses, theta power and frequency were higher during the decision period when compared to either an intertrial delay period (also at the maze center), or to the period of running toward goal location. In addition, theta activity was higher during decision periods preceding correct choices than during decision periods preceding incorrect choices. Altogether, our data support a cognitive function for the hippocampal theta rhythm in spatial decision making
Resumo:
Three populations of neurons expressing the vesicular glutamate transporter 2 (Vglut2) were recently described in the A10 area of the mouse midbrain, of which two populations were shown to express the gene encoding, the rate-limiting enzyme for catecholamine synthesis, tyrosine hydroxylase (TH).One of these populations (‘‘TH– Vglut2 Class1’’) also expressed the dopamine transporter (DAT) gene while one did not ("TH–Vglut2 Class2"), and the remaining population did not express TH at all ("TH-Vglut2-only"). TH is known to be expressed by a promoter which shows two phases of activation, a transient one early during embryonal development, and a later one which gives rise to stable endogenous expression of the TH gene. The transient phase is, however, not specific to catecholaminergic neurons, a feature taken to advantage here as it enabled Vglut2 gene targeting within all three A10 populations expressing this gene, thus creating a new conditional knockout. These knockout mice showed impairment in spatial memory function. Electrophysiological analyses revealed a profound alteration of oscillatory activity in the CA3 region of the hippocampus. In addition to identifying a novel role for Vglut2 in hippocampus function, this study points to the need for improved genetic tools for targeting of the diversity of subpopulations of the A10 area
Resumo:
We propose a new paradigm for collective learning in multi-agent systems (MAS) as a solution to the problem in which several agents acting over the same environment must learn how to perform tasks, simultaneously, based on feedbacks given by each one of the other agents. We introduce the proposed paradigm in the form of a reinforcement learning algorithm, nominating it as reinforcement learning with influence values. While learning by rewards, each agent evaluates the relation between the current state and/or action executed at this state (actual believe) together with the reward obtained after all agents that are interacting perform their actions. The reward is a result of the interference of others. The agent considers the opinions of all its colleagues in order to attempt to change the values of its states and/or actions. The idea is that the system, as a whole, must reach an equilibrium, where all agents get satisfied with the obtained results. This means that the values of the state/actions pairs match the reward obtained by each agent. This dynamical way of setting the values for states and/or actions makes this new reinforcement learning paradigm the first to include, naturally, the fact that the presence of other agents in the environment turns it a dynamical model. As a direct result, we implicitly include the internal state, the actions and the rewards obtained by all the other agents in the internal state of each agent. This makes our proposal the first complete solution to the conceptual problem that rises when applying reinforcement learning in multi-agent systems, which is caused by the difference existent between the environment and agent models. With basis on the proposed model, we create the IVQ-learning algorithm that is exhaustive tested in repetitive games with two, three and four agents and in stochastic games that need cooperation and in games that need collaboration. This algorithm shows to be a good option for obtaining solutions that guarantee convergence to the Nash optimum equilibrium in cooperative problems. Experiments performed clear shows that the proposed paradigm is theoretical and experimentally superior to the traditional approaches. Yet, with the creation of this new paradigm the set of reinforcement learning applications in MAS grows up. That is, besides the possibility of applying the algorithm in traditional learning problems in MAS, as for example coordination of tasks in multi-robot systems, it is possible to apply reinforcement learning in problems that are essentially collaborative
Resumo:
In practically all vertical markets and in every region of the planet, loyalty marketers have adopted the tactic of recognition and reward to identify, maintain and increase the yield of their customers. Several strategies have been adopted by companies, and the most popular among them is the loyalty program, which displays a loyalty club to manage these rewards. But the problem with loyalty programs is that customer identification and transfer of loyalty points are made in a semiautomatic. Aiming at this, this paper presents a master's embedded business automation solution called e-Points. The goal of e-Points is munir clubs allegiances with fully automated tooling technology to identify customers directly at the point of sales, ensuring greater control over the loyalty of associate members. For this, we developed a hardware platform with embedded system and RFID technology to be used in PCs tenant, a smart card to accumulate points with every purchase and a web server, which will provide services of interest to retailers and customers membership to the club
Resumo:
Self-organizing maps (SOM) are artificial neural networks widely used in the data mining field, mainly because they constitute a dimensionality reduction technique given the fixed grid of neurons associated with the network. In order to properly the partition and visualize the SOM network, the various methods available in the literature must be applied in a post-processing stage, that consists of inferring, through its neurons, relevant characteristics of the data set. In general, such processing applied to the network neurons, instead of the entire database, reduces the computational costs due to vector quantization. This work proposes a post-processing of the SOM neurons in the input and output spaces, combining visualization techniques with algorithms based on gravitational forces and the search for the shortest path with the greatest reward. Such methods take into account the connection strength between neighbouring neurons and characteristics of pattern density and distances among neurons, both associated with the position that the neurons occupy in the data space after training the network. Thus, the goal consists of defining more clearly the arrangement of the clusters present in the data. Experiments were carried out so as to evaluate the proposed methods using various artificially generated data sets, as well as real world data sets. The results obtained were compared with those from a number of well-known methods existent in the literature
Resumo:
The processing of spatial and episodic information during memory tasks depends on hippocampal theta oscillations. In the present study, I investigated the relationship between theta power and choice selection during spatial decision-making. I recorded local field potentials from the CA1 region of rats retrieving reward locations in a 4-arm maze. In trained but not in naïve animals, I observed a significant increase in theta power during decision-making, which could not be explained by changes in locomotion speed. Furthermore, a Bayesian decoder based on theta power predicted choice outcomes in speed-matched trials. The decoding time course revealed that performance increased above chance before the decision moment exclusively for theta power, remaining flat for other frequency bands. These results occurred for trained animals, but no significant prediction could be made for naïve animals. Altogether, the data support a mnemonic function of theta rhythm during spatial decision-making, indicating that these oscillations correlate with the retrieval of memories required for successful decisions
Resumo:
Timeplace learning is the capacity of organisms to associate both space and time with a biological relevant stimulus such as food. Experiments are usually done with food restricted animals due to the belief that food system activation is necessary for timeplace learning. Another line of thought suggest that, in addition to food system activation, response cost should be increased to effectively allow timeplace discrimination. The purpose of this experiment was to test whether a complex environment, which presumably implied in a heightened response cost, would facilitate timeplace association in satiated rats using a highly palatable food as reward. Nine rats were trained in a timeplace task for 30 nonconsecutive days. A large experimental box (1x1m) divided in four compartments was used. To access each compartment the animal had to overcome a series of obstacles such as ramps, staircases and mazes. Two feeders localized in opposite compartments were rewarded with sunflower seeds in two daily sessions. One feeder offered the reward during the morning sessions while the second feeder in afternoon sessions. After the 15th day of training, the animals began to show a preference for the correct feeder during the correct time of day expressed by increased frequency of visits as well as lower latency to access the feeders. These results suggest that satiated animals are also capable of learning a timespace task as far as the experimental context is complex enough to result in a higher response cost
Resumo:
The plus-maze discriminative avoidance paradigm has been used to study the relationship between aversive memory and anxiety. The present study aims to verify if the elevated plus-maze can provide information about appetitive memory and anxiety, through a task motivated by food reward. Animals were allowed to explore an elevated plus-maze and received reinforcement in one of the enclosed arms. In a test session performed 24h later, in the absence of reward, rats showed preference for the previously rewarded enclosed arm over the neutral enclosed arm. The administration of diazepam and pentylenetetrazole before training induced, respectively, anxiolytic and anxiogenic effects (as evaluated by open-arm exploration). Both drugs induced amnestic effects, i.e., lack of preference for the rewarded arm in the test session. The results suggest that appetitive memory can be influenced by anxiety levels as well. The plus-maze appetitive discrimination task seems to be a useful model to investigate the relationship between memory and anxiety
Resumo:
The ability to predict future rewards or threats is crucial for survival. Recent studies have addressed future event prediction by the hippocampus. Hippocampal neurons exhibit robust selectivity for spatial location. Thus, the activity of hippocampal neurons represents a cognitive map of space during navigation as well as during planning and recall. Spatial selectivity allows the hippocampus to be involved in the formation of spatial and episodic memories, including the sequential ordering of events. On the other hand, the discovery of reverberatory activity in multiple forebrain areas during slow wave and REM sleep underscored the role of sleep on the consolidation of recently acquired memory traces. To this date, there are no studies addressing whether neuronal activity in the hippocampus during sleep can predict regular environmental shifts. The aim of the present study was to investigate the activity of neuronal populations in the hippocampus during sleep sessions intercalated by spatial exploration periods, in which the location of reward changed in a predictable way. To this end, we performed the chronic implantation of 32-channel multielectrode arrays in the CA1 regions of the hippocampus in three male rats of the Wistar strain. In order to activate different neuronal subgroups at each cycle of the task, we exposed the animals to four spatial exploration sessions in a 4-arm elevated maze in which reward was delivered in a single arm per session. Reward location changed regularly at every session in a clockwise manner, traversing all the arms at the end of the daily recordings. Animals were recorded from 2-12 consecutive days. During spatial exploration of the 4-arm elevated maze, 67,5% of the recorded neurons showed firing rate differences across the maze arms. Furthermore, an average of 42% of the neurons showed increased correlation (R>0.3) between neuronal pairs in each arm. This allowed us to sort representative neuronal subgroups for each maze arm, and to analyze the activity of these subgroups across sleep sessions. We found that neuronal subgroups sorted by firing rate differences during spatial exploration sustained these differences across sleep sessions. This was not the case with neuronal subgroups sorted according to synchrony (correlation). In addition, the correlation levels between sleep sessions and waking patterns sampled in each arm were larger for the entire population of neurons than for the rate or synchrony subgroups. Neuronal activity during sleep of the entire neuronal population or subgroups did not show different correlations among the four arm mazes. On the other hand, we verified that neuronal activity during pre-exploration sleep sessions was significantly more similar to the activity patterns of the target arm than neuronal activity during pre-exploration sleep sessions. In other words, neuronal activity during sleep that precedes the task reflects more strongly the location of reward than neuronal activity during sleep that follows the task. Our results suggest that neuronal activity during sleep can predict regular environmental changes
Resumo:
The discrimination learning is assessed through instrumental tasks in which the individual is rewarded for choosing one item over another. Thus, in concurrent visual discrimination of objects the animal must learn that only one of the objects will be rewarded. The concurrent visual discrimination is relatively simple, and already been observed Callithrix jacchus is able to accomplish this task. As yet wasn't seen the influence of the qualitative aspects of the rewards, in the performance of concurrent visual discrimination of objects in nonhuman primates, and as in most tests are used isolated animals, the present study had two stages: at first we had as objective to analyze the influence of the caloric value of the reward on the performance in concurrent visual discrimination of objects in isolated animals; in the second, we had the intention analyze performance of C. jacchus in realization of discrimination task in different social contexts, as well as, analyze the influence of previous experience in task performance. In the first stage (Study 1), the animals were not able to discriminate foods that presented small caloric differences . This incapacity in discriminates the rewards was responsible by generating randomness in task of concurrent visual discrimination of objects. In the second stage (Study 2), observed that, independent of social context in which the task was presented, the performance both of the experienced animals as the inexperienced animals tended to randomness. In the first case, is likely that the pattern of responses of the experienced animals is a reflection of their own performance when they were observed in isolation. In the second case, in turn, the randomness was probably due to the small number of sessions. Although present a pattern of performance similar to inexperienced individuals, we verify that the experienced animals monopolize the food consumption when they were in the presence of inexperienced individuals. This was a consequence of the experienced animals have presented lower latency the approximation of apparatus and, consequently, obtain more food. In turn, the inexperienced animals, when were in the presence of experienced, had to adopt alternative strategies to obtain food. Thus, C. jacchus is able to use the previous information he had about the task of solving their own benefit.
Resumo:
The 3-hydroxytyramine/dopamine (DA) is a monoamine of catecholamineric group and consists in the progenitor substantia of synthesis of noradrenaline and adrenaline, having the enzyme tyrosine hydroxylase as a regulator of this process. Nuclei of midbrain expressing DA are the retrorubral field (RRF, A8 group), the substantia nigra pars compacta (SNc, A9 group) and the ventral tegmental area (VTA, A10 group). These nuclei are involved in three complex circuitry called mesostriatal, mesocortical and mesolimbic, which are related directly with various behavioral manifestations such as motor control, reward signaling in behavioural learning, motivation and pathological manifestations of Parkinson s disease and schizophrenia. The aim of this study was describe the morphology of midbrain dopaminergic neurons (A8, A9 and A10) of the rock cavy (Kerodon rupestris), a rodent belonging to the family Caviidae typical of the Brazilian Northeast, which is being adopted as a model for neuroanatomical studies in laboratory of neuroanatomy of the Federal University of Rio Grande do Norte. Coronal sections of brains of the rock cavies were submitted to staining by Nissl s method and immunohistochemistry against tyrosine hydroxylase. The nuclear organization of the midbrain dopaminergic nuclei of the rock cavy is very similar to that found in other animals of the order Rodentia, except by the presence of the tail of substantia nigra, which was found only in the studied species. We concluded that the midbrain dopaminergic nuclei are phylogenetically stable among species, but we think to be it necessary to expand the studies about the particularity found the rock cavy, investigating its occurrence in other species of rodents or investigating its functional relevance
Resumo:
Toxoplasma gondii is a protozoan parasite that induces behavioral changes in rodents. The aim of this study was to evaluate the effect of infection by T. gondii during the chronic phase in working memory and impulsivity in rodents as well as the effect of antipsychotics to reverse any behavioral changes resulting from infection. Female Wistar rats (n = 40) were infected with 25 cysts of the strain ME-49 T. gondii after 4 months the animals were subjected to behavioral tests: tolerance to delay gratification, in which the animal must choose between two rewards, a smaller and more immediate, but delayed and the test of spontaneous alternation, in which the animal must use spatial cues to remember previously visited arms. Antipsychotic drugs were intraperitoneally administered during the testing of the behavioral experiments, the antipsychotic is haloperidol (1.5 mg / kg) administered 60 min before the start of the session and the antipsychotic clozapine (2.5 mg / kg) 30 min before. Animals infected with the parasite did not show operating deficits of memory, and motor impairment did not develop, however motor impairment was observed only in animals treated with haloperidol. It was found that administration of clozapine and haloperidol increased the percentage of alternation in infected and control groups in task switching espontânea.Não no distinction between control animals and infected the test of tolerance to delay gratification in relation to the percentage of choices greatest reward, during the pre-training and training, in which there is a delay of 15 s to access the great reward, however it was observed that infected animals prefer the greatest reward, when there is a delay of 30 s when compared to control group. The administration of clozapine possible that infected animals chose the greatest reward in the delay of 30 seconds during the test. These data suggest that infected mice do not exhibit deficits in working memory and that clozapine has therapeutic efficacy in improving cognitive performance of mice infected
Resumo:
Recognizing the importance the workplace has on mental health of the individual, the objective of this study was to investigate the relationship between the burnout syndrome and the sources of physical and emotional wear which permeate the work conditions of the urban public transport system of the city of Natal. Although existent in international literature, research on burnout in the professional transport category and studies directed to this category are not a tradition in Brazil. The research was carried out using 412 drivers and money-changers of two transport companies of Natal. To collect the data, two questionnaires and a semi-structured interview were used. The first instrument, developed and validated during the research, investigated the sources of wear and the second, the syndrome of burnout. As its main results, two sources of empirical wear were identified as follows: (1) the Conflict of Values and the Lack of Justice at the Workplace, (2) Union and Reward. Besides these, it was observed that there is an incidence of the syndrome of burnout among the drivers and money-changers of urban transport by bus, not only in the caring occupations studied before in Natal and Brazil and that this incidence is related to the sources of wear which permeate the work conditions of these professionals
Resumo:
This master s dissertation deals with motivation and the meaning of work amongst bank employees. This is done considering a cognitive perception. Work is understood here under a social and subjective comprehension, once it deals with significance attribution. Motivation is the process that rules choice of the different possibilities of individual behavior, all of which according to the Expectation Theory. This study aims to analyze the implications of the productive restructure, since it is related to technological innovation, organizational changes and management, in motivation and work significance. Thus, the objective of the research is to verify motivational differences and the meaning of work amongst bank employees. This is done in two distinct moments of the productive restructure of bank employees in Natal-RN. The research is divided in two parts. In the first one, changes that occurred in banks between 1999 until 2005 were identified by the means of interviews with 7 bank managers. The analyzed perspective was training intensifying, quality emphasis of customer attendance, the use of automation/technology, staff stabilization, change in staff profile, work intensification, etc. In the second study the Inventory of Motivation and Work Meaning was applied. Thus, questions related to work focus, social demographic data, in 187 bank employees were dealt with. The collected data was compared to data from previous work. It was observed that productive restructure has a reflection in the meaning of work increasing self-expression, economical reward, and responsibility in work conditions. All of the item mention beforehand maintain the level of inhumanness and consummation and respond as being the characteristics of the real work environment. On the other hand, bank employees value less justice, self-expression and more the survival perspective, implying instrumental values to work. As for motivation, it is increased among bank employees. These employees have greater expectations that their work produce results since they believe in their interference in work results
Resumo:
RePART (Reward/Punishment ART) is a neural model that constitutes a variation of the Fuzzy Artmap model. This network was proposed in order to minimize the inherent problems in the Artmap-based model, such as the proliferation of categories and misclassification. RePART makes use of additional mechanisms, such as an instance counting parameter, a reward/punishment process and a variable vigilance parameter. The instance counting parameter, for instance, aims to minimize the misclassification problem, which is a consequence of the sensitivity to the noises, frequently presents in Artmap-based models. On the other hand, the use of the variable vigilance parameter tries to smoouth out the category proliferation problem, which is inherent of Artmap-based models, decreasing the complexity of the net. RePART was originally proposed in order to minimize the aforementioned problems and it was shown to have better performance (higer accuracy and lower complexity) than Artmap-based models. This work proposes an investigation of the performance of the RePART model in classifier ensembles. Different sizes, learning strategies and structures will be used in this investigation. As a result of this investigation, it is aimed to define the main advantages and drawbacks of this model, when used as a component in classifier ensembles. This can provide a broader foundation for the use of RePART in other pattern recognition applications