Biblioteca Digital

2 resultados para Hierarchical stochastic learning

em Massachusetts Institute of Technology

On the Convergence of Stochastic Iterative Dynamic Programming Algorithms

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms, including the TD(lambda) algorithm of Sutton (1988) and the Q-learning algorithm of Watkins (1989), can be motivated heuristically as approximations to dynamic programming (DP). In this paper we provide a rigorous proof of convergence of these DP-based learning algorithms by relating them to the powerful techniques of stochastic approximation theory via a new convergence theorem. The theorem establishes a general class of convergent algorithms to which both TD(lambda) and Q-learning belong.

Veja mais

Hierarchical Mixtures of Experts and the EM Algorithm

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We present a tree-structured architecture for supervised learning. The statistical model underlying the architecture is a hierarchical mixture model in which both the mixture coefficients and the mixture components are generalized linear models (GLIM's). Learning is treated as a maximum likelihood problem; in particular, we present an Expectation-Maximization (EM) algorithm for adjusting the parameters of the architecture. We also develop an on-line learning algorithm in which the parameters are updated incrementally. Comparative simulation results are presented in the robot dynamics domain.

Veja mais

2 resultados para Hierarchical stochastic learning

em Massachusetts Institute of Technology

Filtro por publicador

On the Convergence of Stochastic Iterative Dynamic Programming Algorithms

Hierarchical Mixtures of Experts and the EM Algorithm