Biblioteca Digital

In recent years there has been a growing interest amongst the speech research community into the use of spectral estimators which circumvent the traditional quasi-stationary assumption and provide greater time-frequency (t-f) resolution than conventional spectral estimators, such as the short time Fourier power spectrum (STFPS). One distribution in particular, the Wigner distribution (WD), has attracted considerable interest. However, experimental studies have indicated that, despite its improved t-f resolution, employing the WD as the front end of speech recognition system actually reduces recognition performance; only by explicitly re-introducing t-f smoothing into the WD are recognition rates improved. In this paper we provide an explanation for these findings. By treating the spectral estimation problem as one of optimization of a bias variance trade off, we show why additional t-f smoothing improves recognition rates, despite reducing the t-f resolution of the spectral estimator. A practical adaptive smoothing algorithm is presented, whicy attempts to match the degree of smoothing introduced into the WD with the time varying quasi-stationary regions within the speech waveform. The recognition performance of the resulting adaptively smoothed estimator is found to be comparable to that of conventional filterbank estimators, yet the average temporal sampling rate of the resulting spectral vectors is reduced by around a factor of 10. © 1992.

Veja mais

Real time feature-based facial tracking using Lie algebras

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We have developed a novel human facial tracking system that operates in real time at a video frame rate without needing any special hardware. The approach is based on the use of Lie algebra, and uses three-dimensional feature points on the targeted human face. It is assumed that the roughly estimated facial model (relative coordinates of the three-dimensional feature points) is known. First, the initial feature positions of the face are determined using a model fitting technique. Then, the tracking is operated by the following sequence: (1) capture the new video frame and render feature points to the image plane; (2) search for new positions of the feature points on the image plane; (3) get the Euclidean matrix from the moving vector and the three-dimensional information for the points; and (4) rotate and translate the feature points by using the Euclidean matrix, and render the new points on the image plane. The key algorithm of this tracker is to estimate the Euclidean matrix by using a least square technique based on Lie algebra. The resulting tracker performed very well on the task of tracking a human face.

Veja mais

On the use of a decimative spectral estimation method based on eigenanalysis and SVD for formant and bandwidth tracking of speech signals

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, a Decimative Spectral estimation method based on Eigenanalysis and SVD (Singular Value Decomposition) is presented and applied to speech signals in order to estimate Formant/Bandwidth values. The underlying model decomposes a signal into complex damped sinusoids. The algorithm is applied not only on speech samples but on a small amount of the autocorrelation coefficients of a speech frame as well, for finer estimation. Correct estimation of Formant/Bandwidth values depend on the model order thus, the requested number of poles. Overall, experimentation results indicate that the proposed methodology successfully estimates formant trajectories and their respective bandwidths.

Veja mais

Using a semiconductor optical amplifier integrated with a pump laser for mid-span spectral inversion and wavelength conversion

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A semiconductor optical amplifier monolithically integrated with a distributed feedback pump laser is used for non-degenerate four wave mixing applications. Experimental results are presented which illustrate the use of this compact device for both wavelength conversion and dispersion compensation applications at high data rates.

Veja mais

A single all-optical processor for multiple spectral amplitude code label recognition using four wave mixing

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We propose a novel label processor which can recognize multiple spectral-amplitude-code labels using four-wave-mixing sidebands and selective optical filtering. Ten code-labels x 10 Gbps variable-length packets are transmitted over a 200 km single-hop switched network.

Veja mais

21 Port self wavelength switching of 40 Gb/s spectral-amplitude-encoded DPSK signals

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Ultrafast self-switching of spectral-amplitude-encoded 40 Gb/s DPSK signals is demonstrated for the first time. Switching between 21 ports with 15nm maximum bin separation is achieved using a single correlator based on HNLF and an AWG. © 2009 IEEE.

Veja mais

On film character retrieval in feature-length films

Relevância:

20.00% 20.00%

Publicador:

Veja mais

112 resultados para spectral ridge feature

Filtro por publicador