Fully probabilistic control design in an adaptive critic framework


Autoria(s): Herzallah, Randa; Kárný, Miroslav
Data(s)

01/12/2011

Resumo

Optimal stochastic controller pushes the closed-loop behavior as close as possible to the desired one. The fully probabilistic design (FPD) uses probabilistic description of the desired closed loop and minimizes Kullback-Leibler divergence of the closed-loop description to the desired one. Practical exploitation of the fully probabilistic design control theory continues to be hindered by the computational complexities involved in numerically solving the associated stochastic dynamic programming problem. In particular very hard multivariate integration and an approximate interpolation of the involved multivariate functions. This paper proposes a new fully probabilistic contro algorithm that uses the adaptive critic methods to circumvent the need for explicitly evaluating the optimal value function, thereby dramatically reducing computational requirements. This is a main contribution of this short paper.

Formato

application/pdf

Identificador

http://eprints.aston.ac.uk/24939/1/NeuralNetwork11.pdf

Herzallah, Randa and Kárný, Miroslav (2011). Fully probabilistic control design in an adaptive critic framework. Neural Networks, 24 (10), pp. 1128-1135.

Relação

http://eprints.aston.ac.uk/24939/

Tipo

Article

PeerReviewed