Biblioteca Digital

112 resultados para automatic feeding

The modified Kanerva model for automatic speech recognition

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A parallel processing network derived from Kanerva's associative memory theory Kanerva 1984 is shown to be able to train rapidly on connected speech data and recognize further speech data with a label error rate of 0·68%. This modified Kanerva model can be trained substantially faster than other networks with comparable pattern discrimination properties. Kanerva presented his theory of a self-propagating search in 1984, and showed theoretically that large-scale versions of his model would have powerful pattern matching properties. This paper describes how the design for the modified Kanerva model is derived from Kanerva's original theory. Several designs are tested to discover which form may be implemented fastest while still maintaining versatile recognition performance. A method is developed to deal with the time varying nature of the speech signal by recognizing static patterns together with a fixed quantity of contextual information. In order to recognize speech features in different contexts it is necessary for a network to be able to model disjoint pattern classes. This type of modelling cannot be performed by a single layer of links. Network research was once held back by the inability of single-layer networks to solve this sort of problem, and the lack of a training algorithm for multi-layer networks. Rumelhart, Hinton & Williams 1985 provided one solution by demonstrating the "back propagation" training algorithm for multi-layer networks. A second alternative is used in the modified Kanerva model. A non-linear fixed transformation maps the pattern space into a space of higher dimensionality in which the speech features are linearly separable. A single-layer network may then be used to perform the recognition. The advantage of this solution over the other using multi-layer networks lies in the greater power and speed of the single-layer network training algorithm. © 1989.

Automatic face recognition for film character retrieval in feature-length films

Relevância:

20.00% 20.00%

Publicador:

Application of fuzzy logic to the automatic processing of particle image velocimetry images of a flow induced by adjacent spinning cylinders

Relevância:

20.00% 20.00%

Publicador:

Morpho-syntactic Information for Automatic Error Analysis of Statistical Machine Translation Output

Relevância:

20.00% 20.00%

Publicador:

Model predictive control: a new paradigm for automatic control - 2002 Presidential address

Relevância:

20.00% 20.00%

Publicador:

Model predictive control: a new paradigm for automatic control

Relevância:

20.00% 20.00%

Publicador:

Calibration and use of a new automatic sand pourer

Relevância:

20.00% 20.00%

Publicador:

Spectral Methods for Automatic Multiscale Data Clustering.

Relevância:

20.00% 20.00%

Publicador:

Predictive automatic relevance determination by expectation propagation.

Relevância:

20.00% 20.00%

Publicador:

Automatic object segmentation from calibrated images

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper addresses the problem of automatically obtaining the object/background segmentation of a rigid 3D object observed in a set of images that have been calibrated for camera pose and intrinsics. Such segmentations can be used to obtain a shape representation of a potentially texture-less object by computing a visual hull. We propose an automatic approach where the object to be segmented is identified by the pose of the cameras instead of user input such as 2D bounding rectangles or brush-strokes. The key behind our method is a pairwise MRF framework that combines (a) foreground/background appearance models, (b) epipolar constraints and (c) weak stereo correspondence into a single segmentation cost function that can be efficiently solved by Graph-cuts. The segmentation thus obtained is further improved using silhouette coherency and then used to update the foreground/background appearance models which are fed into the next Graph-cut computation. These two steps are iterated until segmentation convergences. Our method can automatically provide a 3D surface representation even in texture-less scenes where MVS methods might fail. Furthermore, it confers improved performance in images where the object is not readily separable from the background in colour space, an area that previous segmentation approaches have found challenging. © 2011 IEEE.

Use of automatic sand pourers for loose sand models

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The consistency of laboratory sand model preparation for physical testing is a fundamental criterion in representing identical geotechnical issues at prototype scale. This objective led to the development of robotic apparatus to eliminate the non-uniformity in manual pouring. Previous studies have shown consistent sand models with high relative density between 50 to 90% produced by the automatic moving-hopper sand pourer at the University of Cambridge, based primarily on a linear correlation to flow rate. However, in the case of loose samples, the influence of other parameters, particularly the drop height, becomes more apparent. In this paper, findings on the effect of flow rate and drop height are discussed in relation to the layer thickness and relative density of loose sand samples. Design charts are presented to illustrate their relationships. The effect of these factors on different sand types is also covered to extend the use of the equipment. © 2010 Taylor & Francis Group, London.

An automatic speaker recognition system for intelligence applications

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper presents an automatic speaker recognition system for intelligence applications. The system has to provide functionalities for a speaker skimming application in which databases of recorded conversations belonging to an ongoing investigation can be annotated and quickly browsed by an operator. The paper discusses the criticalities introduced by the characteristics of the audio signals under consideration - in particular background noise and channel/coding distortions - as well as the requirements and functionalities of the system under development. It is shown that the performance of state-of-the-art approaches degrades significantly in presence of moderately high background noise. Finally, a novel speaker recognizer based on phonetic features and an ensemble classifier is presented. Results show that the proposed approach improves performance on clean audio, and suggest that it can be employed towards improved real-world robustness. © EURASIP, 2009.

Automatic Integration of Construction Images using Digital Imaging and Pattern Recognition Tools

Relevância:

20.00% 20.00%

Publicador:

Detection of Construction Workers in Video Frames for Automatic Initialization of Vision Trackers

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Vision based tracking can provide the spatial location of construction entities such as equipment, workers, and materials in large scale, congested construction sites. It tracks entities in video streams by inferring their locations based on the entities’ visual features and motion histories. To initiate the process, it is necessary to determine the pixel areas corresponding to the construction entities to be tracked in the following consecutive video frames. In order to fully automate the process, an automated way of initialization is needed. This paper presents the method for construction worker detection which can automatically recognize and localize construction workers in video frames. The method first finds the foreground areas of moving objects using a background subtraction method. Within these foreground areas, construction workers are recognized based on the histogram of oriented gradients (HOG) and histogram of the HSV colors. HOG’s have proved to work effectively for detection of people, and the histogram of HSV colors helps differentiate between pedestrians and construction workers wearing safety vests. Preliminary experiments show that the proposed method has the potential to automate the initialization process of vision based tracking.

Learning based automatic face annotation for arbitrary poses and expressions from frontal images only

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Statistical approaches for building non-rigid deformable models, such as the Active Appearance Model (AAM), have enjoyed great popularity in recent years, but typically require tedious manual annotation of training images. In this paper, a learning based approach for the automatic annotation of visually deformable objects from a single annotated frontal image is presented and demonstrated on the example of automatically annotating face images that can be used for building AAMs for fitting and tracking. This approach employs the idea of initially learning the correspondences between landmarks in a frontal image and a set of training images with a face in arbitrary poses. Using this learner, virtual images of unseen faces at any arbitrary pose for which the learner was trained can be reconstructed by predicting the new landmark locations and warping the texture from the frontal image. View-based AAMs are then built from the virtual images and used for automatically annotating unseen images, including images of different facial expressions, at any random pose within the maximum range spanned by the virtually reconstructed images. The approach is experimentally validated by automatically annotating face images from three different databases. © 2009 IEEE.

«
1
2
3
4
5
6
7
8
»