925 resultados para Compressed speech


Relevância:

20.00% 20.00%

Publicador:

Resumo:

We present results of a study into the performance of a variety of different image transform-based feature types for speaker-independent visual speech recognition of isolated digits. This includes the first reported use of features extracted using a discrete curvelet transform. The study will show a comparison of some methods for selecting features of each feature type and show the relative benefits of both static and dynamic visual features. The performance of the features will be tested on both clean video data and also video data corrupted in a variety of ways to assess each feature type's robustness to potential real-world conditions. One of the test conditions involves a novel form of video corruption we call jitter which simulates camera and/or head movement during recording.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We have investigated the angular variation in elastic x-ray scattering from a dense, laser-shock-compressed aluminum foil. A comparison of the experiment with simulations using an embedded atom potential in a molecular dynamics simulation shows a significantly better agreement than simulations based on an unscreened one-component plasma model. These data illustrate, experimentally, the importance of screening for the dense plasma static structure factor.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We have performed short-pulse x-ray scattering measurements on laser-driven shock-compressed plastic samples in the warm dense matter regime, providing instantaneous snapshots of the system evolution. Time-resolved and angularly resolved scattered spectra sensitive to the correlation effects in the plasma show the appearance of short-range order within a few interionic separations. Comparison with radiation-hydrodynamic simulations indicates that the shocked plastic is compressed with a temperature of a few electron volts. These results are important for the understanding of the thermodynamic behavior of strongly correlated matter for conditions relevant to both laboratory astrophysics and inertial confinement fusion research.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Audio scrambling can be employed to ensure confidentiality in audio distribution. We first describe scrambling for raw audio using the discrete wavelet transform (DWT) first and then focus on MP3 audio scrambling. We perform scrambling based on a set of keys which allows for a set of audio outputs having different qualities. During descrambling, the number of keys provided and the number of rounds of descrambling performed will decide the audio output quality. We also perform scrambling by using multiple keys on the MP3 audio format. With a subset of keys, we can descramble to obtain a low quality audio. However, we can obtain the original quality audio by using all of the keys. Our experiments show that the proposed algorithms are effective, fast, simple to implement while providing flexible control over the progressive quality of the audio output. The security level provided by the scheme is sufficient for protecting MP3 music content.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper we present the application of Hidden Conditional Random Fields (HCRFs) to modelling speech for visual speech recognition. HCRFs may be easily adapted to model long range dependencies across an observation sequence. As a result visual word recognition performance can be improved as the model is able to take more of a contextual approach to generating state sequences. Results are presented from a speaker-dependent, isolated digit, visual speech recognition task using comparisons with a baseline HMM system. We firstly illustrate that word recognition rates on clean video using HCRFs can be improved by increasing the number of past and future observations being taken into account by each state. Secondly we compare model performances using various levels of video compression on the test set. As far as we are aware this is the first attempted use of HCRFs for visual speech recognition.