Biblioteca Digital

957 resultados para Function Learning

A Reinforcement Learning Approach to Economic Dispatch using Neural Networks

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper presents a Reinforcement Learning (RL) approach to economic dispatch (ED) using Radial Basis Function neural network. We formulate the ED as an N stage decision making problem. We propose a novel architecture to store Qvalues and present a learning algorithm to learn the weights of the neural network. Even though many stochastic search techniques like simulated annealing, genetic algorithm and evolutionary programming have been applied to ED, they require searching for the optimal solution for each load demand. Also they find limitation in handling stochastic cost functions. In our approach once we learn the Q-values, we can find the dispatch for any load demand. We have recently proposed a RL approach to ED. In that approach, we could find only the optimum dispatch for a set of specified discrete values of power demand. The performance of the proposed algorithm is validated by taking IEEE 6 bus system, considering transmission losses

The Informational Complexity of Learning from Examples

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This thesis attempts to quantify the amount of information needed to learn certain tasks. The tasks chosen vary from learning functions in a Sobolev space using radial basis function networks to learning grammars in the principles and parameters framework of modern linguistic theory. These problems are analyzed from the perspective of computational learning theory and certain unifying perspectives emerge.

Comparing Support Vector Machines with Gaussian Kernels to Radial Basis Function Classifiers

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Support Vector (SV) machine is a novel type of learning machine, based on statistical learning theory, which contains polynomial classifiers, neural networks, and radial basis function (RBF) networks as special cases. In the RBF case, the SV algorithm automatically determines centers, weights and threshold such as to minimize an upper bound on the expected test error. The present study is devoted to an experimental comparison of these machines with a classical approach, where the centers are determined by $k$--means clustering and the weights are found using error backpropagation. We consider three machines, namely a classical RBF machine, an SV machine with Gaussian kernel, and a hybrid system with the centers determined by the SV method and the weights trained by error backpropagation. Our results show that on the US postal service database of handwritten digits, the SV machine achieves the highest test accuracy, followed by the hybrid approach. The SV approach is thus not only theoretically well--founded, but also superior in a practical application.

Sequential Optimal Recovery: A Paradigm for Active Learning

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In most classical frameworks for learning from examples, it is assumed that examples are randomly drawn and presented to the learner. In this paper, we consider the possibility of a more active learner who is allowed to choose his/her own examples. Our investigations are carried out in a function approximation setting. In particular, using arguments from optimal recovery (Micchelli and Rivlin, 1976), we develop an adaptive sampling strategy (equivalent to adaptive approximation) for arbitrary approximation schemes. We provide a general formulation of the problem and show how it can be regarded as sequential optimal recovery. We demonstrate the application of this general formulation to two special cases of functions on the real line 1) monotonically increasing functions and 2) functions with bounded derivative. An extensive investigation of the sample complexity of approximating these functions is conducted yielding both theoretical and empirical results on test functions. Our theoretical results (stated insPAC-style), along with the simulations demonstrate the superiority of our active scheme over both passive learning as well as classical optimal recovery. The analysis of active function approximation is conducted in a worst-case setting, in contrast with other Bayesian paradigms obtained from optimal design (Mackay, 1992).

Learning from Incomplete Data

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Real-world learning tasks often involve high-dimensional data sets with complex patterns of missing features. In this paper we review the problem of learning from incomplete data from two statistical perspectives---the likelihood-based and the Bayesian. The goal is two-fold: to place current neural network approaches to missing data within a statistical framework, and to describe a set of algorithms, derived from the likelihood-based framework, that handle clustering, classification, and function approximation from incomplete data in a principled and efficient manner. These algorithms are based on mixture modeling and make two distinct appeals to the Expectation-Maximization (EM) principle (Dempster, Laird, and Rubin 1977)---both for the estimation of mixture components and for coping with the missing data.

Efficient learning of reactive robot behaviors with a Neural-Q_learning approach

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The purpose of this paper is to propose a Neural-Q_learning approach designed for online learning of simple and reactive robot behaviors. In this approach, the Q_function is generalized by a multi-layer neural network allowing the use of continuous states and actions. The algorithm uses a database of the most recent learning samples to accelerate and guarantee the convergence. Each Neural-Q_learning function represents an independent, reactive and adaptive behavior which maps sensorial states to robot control actions. A group of these behaviors constitutes a reactive control scheme designed to fulfill simple missions. The paper centers on the description of the Neural-Q_learning based behaviors showing their performance with an underwater robot in a target following task. Real experiments demonstrate the convergence and stability of the learning system, pointing out its suitability for online robot learning. Advantages and limitations are discussed

Towards Direct Policy Search Reinforcement Learning for Robot Control

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper proposes a high-level reinforcement learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, when using RL, has been to apply value function based algorithms, the system here detailed is characterized by the use of direct policy search methods. Rather than approximating a value function, these methodologies approximate a policy using an independent function approximator with its own parameters, trying to maximize the future expected reward. The policy based algorithm presented in this paper is used for learning the internal state/action mapping of a behavior. In this preliminary work, we demonstrate its feasibility with simulated experiments using the underwater robot GARBI in a target reaching task

An artificial economy based on reinforcement learning and agent based modeling

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper, we employ techniques from artificial intelligence such as reinforcement learning and agent based modeling as building blocks of a computational model for an economy based on conventions. First we model the interaction among firms in the private sector. These firms behave in an information environment based on conventions, meaning that a firm is likely to behave as its neighbors if it observes that their actions lead to a good pay off. On the other hand, we propose the use of reinforcement learning as a computational model for the role of the government in the economy, as the agent that determines the fiscal policy, and whose objective is to maximize the growth of the economy. We present the implementation of a simulator of the proposed model based on SWARM, that employs the SARSA(λ) algorithm combined with a multilayer perceptron as the function approximation for the action value function.

Efficiency and Equity in Learning Communities and schools

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Traditionally, school efficiency has been measured as a function of educational production. In the last two decades, however, studies in the economics of education have indicated that more is required to improve school efficiency: researchers must explore how significant changes in school organization affect the performance of at-risk students. In this paper we introduce Henry Levin’s adoption of the X-efficiency approach to education and we describe the efficient and cost-effective characteristics of one Learning Communities Project School that significantly improved its student outcomes and enrollment numbers and reduced its absenteeism rate to zero. The organizational change that facilitated these improvements defined specific issues to address. Students’ school success became the focus of the school project, which also offered specific incentives, selected teachers, involved parents and community members in decisions, and used the most efficient technologies and methods. This case analysis reveals new two elements—family training and community involvement—that were not explicit parts of Levin’s adaptation. The case of the Antonio Machado Public School should attract the attention of both social scientists and policy makers

Gradient-based reinforcement learning techniques for underwater robotics behavior learning

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Darrerament, l'interès pel desenvolupament d'aplicacions amb robots submarins autònoms (AUV) ha crescut de forma considerable. Els AUVs són atractius gràcies al seu tamany i el fet que no necessiten un operador humà per pilotar-los. Tot i això, és impossible comparar, en termes d'eficiència i flexibilitat, l'habilitat d'un pilot humà amb les escasses capacitats operatives que ofereixen els AUVs actuals. L'utilització de AUVs per cobrir grans àrees implica resoldre problemes complexos, especialment si es desitja que el nostre robot reaccioni en temps real a canvis sobtats en les condicions de treball. Per aquestes raons, el desenvolupament de sistemes de control autònom amb l'objectiu de millorar aquestes capacitats ha esdevingut una prioritat. Aquesta tesi tracta sobre el problema de la presa de decisions utilizant AUVs. El treball presentat es centra en l'estudi, disseny i aplicació de comportaments per a AUVs utilitzant tècniques d'aprenentatge per reforç (RL). La contribució principal d'aquesta tesi consisteix en l'aplicació de diverses tècniques de RL per tal de millorar l'autonomia dels robots submarins, amb l'objectiu final de demostrar la viabilitat d'aquests algoritmes per aprendre tasques submarines autònomes en temps real. En RL, el robot intenta maximitzar un reforç escalar obtingut com a conseqüència de la seva interacció amb l'entorn. L'objectiu és trobar una política òptima que relaciona tots els estats possibles amb les accions a executar per a cada estat que maximitzen la suma de reforços totals. Així, aquesta tesi investiga principalment dues tipologies d'algoritmes basats en RL: mètodes basats en funcions de valor (VF) i mètodes basats en el gradient (PG). Els resultats experimentals finals mostren el robot submarí Ictineu en una tasca autònoma real de seguiment de cables submarins. Per portar-la a terme, s'ha dissenyat un algoritme anomenat mètode d'Actor i Crític (AC), fruit de la fusió de mètodes VF amb tècniques de PG.

Food for thought: the role of dietary flavonoids in enhancing human memory, learning and neuro-cognitive performance

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Emerging evidence suggests that dietary-derived flavonoids have the potential to improve human memory and neuro-cognitive performance via their ability to protect vulnerable neurons, enhance existing neuronal function and stimulate neuronal regeneration. Long-term potentiation (LTP) is widely considered to be one of the major mechanisms underlying memory acquisition, consolidation and storage in the brain and is known to be controlled at the molecular level by the activation of a number of neuronal signalling pathways. These pathways include the phosphatidylinositol-3 kinase/protein kinase B/Akt (Akt), protein kinase C, protein kinase A, Ca-calmodulin kinase and mitogen-activated protein kinase pathways. Growing evidence suggests that flavonoids exert effects on LTP, and consequently memory and cognitive performance, through their interactions with these signalling pathways. Of particular interest is the ability of flavonoids to activate the extracellular signal-regulated kinase and the Akt signalling pathways leading to the activation of the cAMP-response element-binding protein, a transcription factor responsible for increasing the expression of a number of neurotrophins important in LTP and long-term memory. One such neurotrophin is brain-derived neurotrophic factor, which is known to be crucial in controlling synapse growth, in promoting an increase in dendritic spine density and in enhancing synaptic receptor density. The present review explores the potential of flavonoids and their metabolite forms to promote memory and learning through their interactions with neuronal signalling pathways pivotal in controlling LTP and memory in human subjects.

Dynamic automata for mobile robot learning

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Researchers at the University of Reading have developed over many years some simple mobile robots that explore an environment they perceive through simple ultrasonic sensors. Information from these sensors has allowed the robots to learn the simple task of moving around while avoiding dynamic obstacles using a static set of fuzzy automata, the choice of which has been criticised, due to its arbitrary nature. This paper considers how a dynamic set of automata can overcome this criticism. In addition, a new reinforcement learning function is outlined which is both scalable to different numbers and types of sensors. The innovations compare successfully with earlier work.

Neurofuzzy mixture of experts network parallel learning and model construction algorithms

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A connection between a fuzzy neural network model with the mixture of experts network (MEN) modelling approach is established. Based on this linkage, two new neuro-fuzzy MEN construction algorithms are proposed to overcome the curse of dimensionality that is inherent in the majority of associative memory networks and/or other rule based systems. The first construction algorithm employs a function selection manager module in an MEN system. The second construction algorithm is based on a new parallel learning algorithm in which each model rule is trained independently, for which the parameter convergence property of the new learning method is established. As with the first approach, an expert selection criterion is utilised in this algorithm. These two construction methods are equivalent in their effectiveness in overcoming the curse of dimensionality by reducing the dimensionality of the regression vector, but the latter has the additional computational advantage of parallel processing. The proposed algorithms are analysed for effectiveness followed by numerical examples to illustrate their efficacy for some difficult data based modelling problems.

Dual-orthogonal radial basis function networks for nonlinear time series prediction

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A new structure of Radial Basis Function (RBF) neural network called the Dual-orthogonal RBF Network (DRBF) is introduced for nonlinear time series prediction. The hidden nodes of a conventional RBF network compare the Euclidean distance between the network input vector and the centres, and the node responses are radially symmetrical. But in time series prediction where the system input vectors are lagged system outputs, which are usually highly correlated, the Euclidean distance measure may not be appropriate. The DRBF network modifies the distance metric by introducing a classification function which is based on the estimation data set. Training the DRBF networks consists of two stages. Learning the classification related basis functions and the important input nodes, followed by selecting the regressors and learning the weights of the hidden nodes. In both cases, a forward Orthogonal Least Squares (OLS) selection procedure is applied, initially to select the important input nodes and then to select the important centres. Simulation results of single-step and multi-step ahead predictions over a test data set are included to demonstrate the effectiveness of the new approach.

Flavonoids as modulators of memory and learning: molecular interactions resulting in behavioural effects

Relevância:

30.00% 30.00%

Publicador:

Resumo:

There is considerable interest in the potential of a group of dietary-derived phytochemicals known as flavonoids in modulating neuronal function and thereby influencing memory, learning and cognitive function. The present review begins by detailing the molecular events that underlie the acquisition and consolidation of new memories in the brain in order to provide a critical background to understanding the impact of flavonoid-rich diets or pure flavonoids on memory. Data suggests that despite limited brain bioavailability, dietary supplementation with flavonoid-rich foods, such as blueberry, green tea and Ginkgo biloba lead to significant reversals of age-related deficits on spatial memory and learning. Furthermore, animal and cellular studies suggest that the mechanisms underpinning their ability to induce improvements in memory are linked to the potential of absorbed flavonoids and their metabolites to interact with and modulate critical signalling pathways, transcription factors and gene and/or protein expression which control memory and learning processes in the hippocampus; the brain structure where spatial learning occurs. Overall, current evidence suggests that human translation of these animal investigations are warranted, as are further studies, to better understand the precise cause-and-effect relationship between flavonoid intake and cognitive outputs.

«
1
2
3
4
5
6
7
8
...
63
64
»