Biblioteca Digital

4 resultados para deep learning

em Cambridge University Engineering Department Publications Database

Learning the Structure of Deep Sparse Graphical Models

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Deep belief networks are a powerful way to model complex probability distributions. However, learning the structure of a belief network, particularly one with hidden units, is difficult. The Indian buffet process has been used as a nonparametric Bayesian prior on the directed structure of a belief network with a single infinitely wide hidden layer. In this paper, we introduce the cascading Indian buffet process (CIBP), which provides a nonparametric prior on the structure of a layered, directed belief network that is unbounded in both depth and width, yet allows tractable inference. We use the CIBP prior with the nonlinear Gaussian belief network so each unit can additionally vary its behavior between discrete and continuous representations. We provide Markov chain Monte Carlo algorithms for inference in these belief networks and explore the structures learned on several image data sets.

Veja mais

Learning the Structure of Deep Sparse Graphical Models.

Relevância:

40.00% 40.00%

Publicador:

Veja mais

Flexible Martingale Priors for Deep Hierarchies.

Relevância:

30.00% 30.00%

Publicador:

Veja mais

Avoiding pathologies in very deep networks

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Choosing appropriate architectures and regularization strategies of deep networks is crucial to good predictive performance. To shed light on this problem, we analyze the analogous problem of constructing useful priors on compositions of functions. Specifically, we study the deep Gaussian process, a type of infinitely-wide, deep neural network. We show that in standard architectures, the representational capacity of the network tends to capture fewer degrees of freedom as the number of layers increases, retaining only a single degree of freedom in the limit. We propose an alternate network architecture which does not suffer from this pathology. We also examine deep covariance functions, obtained by composing infinitely many feature transforms. Lastly, we characterize the class of models obtained by performing dropout on Gaussian processes.

Veja mais

4 resultados para deep learning

em Cambridge University Engineering Department Publications Database

Filtro por publicador

Learning the Structure of Deep Sparse Graphical Models

Learning the Structure of Deep Sparse Graphical Models.

Flexible Martingale Priors for Deep Hierarchies.

Avoiding pathologies in very deep networks