A new Jacobian matrix for optimal learning of single-layer neural networks


Autoria(s): Peng, Jian Xun; Li, Kang; Irwin, George
Data(s)

01/01/2008

Resumo

This paper investigates the learning of a wide class of single-hidden-layer feedforward neural networks (SLFNs) with two sets of adjustable parameters, i.e., the nonlinear parameters in the hidden nodes and the linear output weights. The main objective is to both speed up the convergence of second-order learning algorithms such as Levenberg-Marquardt (LM), as well as to improve the network performance. This is achieved here by reducing the dimension of the solution space and by introducing a new Jacobian matrix. Unlike conventional supervised learning methods which optimize these two sets of parameters simultaneously, the linear output weights are first converted into dependent parameters, thereby removing the need for their explicit computation. Consequently, the neural network (NN) learning is performed over a solution space of reduced dimension. A new Jacobian matrix is then proposed for use with the popular second-order learning methods in order to achieve a more accurate approximation of the cost function. The efficacy of the proposed method is shown through an analysis of the computational complexity and by presenting simulation results from four different examples.

Identificador

http://pure.qub.ac.uk/portal/en/publications/a-new-jacobian-matrix-for-optimal-learning-of-singlelayer-neural-networks(b0841e88-6627-42de-9150-e6f9cdd5010a).html

http://dx.doi.org/10.1109/TNN.2007.903150

http://www.scopus.com/inward/record.url?scp=39549096279&partnerID=8YFLogxK

Idioma(s)

eng

Direitos

info:eu-repo/semantics/restrictedAccess

Fonte

Peng , J X , Li , K & Irwin , G 2008 , ' A new Jacobian matrix for optimal learning of single-layer neural networks ' IEEE Transactions on Neural Networks , vol 19 , no. 1 , pp. 119-129 . DOI: 10.1109/TNN.2007.903150

Palavras-Chave #/dk/atira/pure/subjectarea/asjc/2200/2207 #Control and Systems Engineering #/dk/atira/pure/subjectarea/asjc/2600/2614 #Theoretical Computer Science #/dk/atira/pure/subjectarea/asjc/2200/2208 #Electrical and Electronic Engineering #/dk/atira/pure/subjectarea/asjc/1700/1702 #Artificial Intelligence #/dk/atira/pure/subjectarea/asjc/1700/1703 #Computational Theory and Mathematics #/dk/atira/pure/subjectarea/asjc/1700/1708 #Hardware and Architecture
Tipo

article