Biblioteca Digital

972 resultados para Audio acoustics

Autoregressive decomposition and pole tracking applied to vocal fold nodule signals

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This letter describes a novel algorithm that is based on autoregressive decomposition and pole tracking used to recognize two patterns of speech data: normal voice and disphonic voice caused by nodules. The presented method relates the poles and the peaks of the signal spectrum which represent the periodic components of the voice. The results show that the perturbation contained in the signal is clearly depicted by pole's positions. Their variability is related to jitter and shimmer. The pole dispersion for pathological voices is about 20% higher than for normal voices, therefore, the proposed approach is a more trustworthy measure than the classical ones. © 2007.

A Bitstream Scalable Audio Coder Using a Hybrid WLPC-Wavelet Representation

Relevância:

30.00% 30.00%

Publicador:

A New Audio Coder using a Warped Linear Prediction Model and the Wavelet Transform

Relevância:

30.00% 30.00%

Publicador:

The navigation and visualisation of environmental audio using zooming spectrograms

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Acoustic recordings play an increasingly important role in monitoring terrestrial and aquatic environments. However, rapid advances in technology make it possible to accumulate thousands of hours of recordings, more than ecologists can ever listen to. Our approach to this big-data challenge is to visualize the content of long-duration audio recordings on multiple scales, from minutes, hours, days to years. The visualization should facilitate navigation and yield ecologically meaningful information prior to listening to the audio. To construct images, we calculate acoustic indices, statistics that describe the distribution of acoustic energy and reflect content of ecological interest. We combine various indices to produce false-color spectrogram images that reveal acoustic content and facilitate navigation. The technical challenge we investigate in this work is how to navigate recordings that are days or even months in duration. We introduce a method of zooming through multiple temporal scales, analogous to Google Maps. However, the “landscape” to be navigated is not geographical and not therefore intrinsically visual, but rather a graphical representation of the underlying audio. We describe solutions to navigating spectrograms that range over three orders of magnitude of temporal scale. We make three sets of observations: 1. We determine that at least ten intermediate scale steps are required to zoom over three orders of magnitude of temporal scale; 2. We determine that three different visual representations are required to cover the range of temporal scales; 3. We present a solution to the problem of maintaining visual continuity when stepping between different visual representations. Finally, we demonstrate the utility of the approach with four case studies.

Robust noise reduction for speech and audio signals

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Statistical model-based methods are presented for the reconstruction of autocorrelated signals in impulsive plus continuous noise environments. Signals are modelled as autoregressive and noise sources as discrete and continuous mixtures of Gaussians, allowing for robustness in highly impulsive and non-Gaussian environments. Markov Chain Monte Carlo methods are used for reconstruction of the corrupted waveforms within a Bayesian probabilistic framework and results are presented for contaminated voice and audio signals.

Towards a perceptually optimal spectral amplitude estimator for audio signal enhancement

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We present a statistical model-based approach to signal enhancement in the case of additive broadband noise. Because broadband noise is localised in neither time nor frequency, its removal is one of the most pervasive and difficult signal enhancement tasks. In order to improve perceived signal quality, we take advantage of human perception and define a best estimate of the original signal in terms of a cost function incorporating perceptual optimality criteria. We derive the resultant signal estimator and implement it in a short-time spectral attenuation framework. Audio examples, references, and further information may be found at http://www-sigproc.eng.cam.ac.uk/~pjw47.

Bayesian extensions to non-negative matrix factorisation for audio signal modelling

Relevância:

30.00% 30.00%

Publicador:

Multi-Object tracking of sinusoidal components in audio with the gaussian mixture probability hypothesis density filter

Relevância:

30.00% 30.00%

Publicador:

Sequential inference of rhythmic structure in musical audio

Relevância:

30.00% 30.00%

Publicador:

Sparse regression with structured priors: application to audio denoising

Relevância:

30.00% 30.00%

Publicador:

The multi-channel AR model for real-time audio restoration

Relevância:

30.00% 30.00%

Publicador:

Interpolation of missing data values for audio signal restoration using a Gabor regression model

Relevância:

30.00% 30.00%

Publicador:

A Gabor regression scheme for audio signal analysis

Relevância:

30.00% 30.00%

Publicador:

Detection of abrupt spectral changes using support vector machines; an application to audio signal segmentation

Relevância:

30.00% 30.00%

Publicador:

Sequential Monte Carlo simulation of dynamical models with slowly varying parameters: application to audio

Relevância:

30.00% 30.00%

Publicador:

«
1
2
3
4
5
6
7
8
...
64
65
»