100 resultados para Audio acoustics


Relevância:

20.00% 20.00%

Publicador:

Resumo:

One important issue in designing state-of-the-art LVCSR systems is the choice of acoustic units. Context dependent (CD) phones remain the dominant form of acoustic units. They can capture the co-articulatory effect in speech via explicit modelling. However, for other more complicated phonological processes, they rely on the implicit modelling ability of the underlying statistical models. Alternatively, it is possible to construct acoustic models based on higher level linguistic units, for example, syllables, to explicitly capture these complex patterns. When sufficient training data is available, this approach may show an advantage over implicit acoustic modelling. In this paper a wide range of acoustic units are investigated to improve LVCSR system performance. Significant error rate gains up to 7.1% relative (0.8% abs.) were obtained on a state-of-the-art Mandarin Chinese broadcast audio recognition task using word and syllable position dependent triphone and quinphone models. © 2011 IEEE.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We present in this paper a new multivariate probabilistic approach to Acoustic Pulse Recognition (APR) for tangible interface applications. This model uses Principle Component Analysis (PCA) in a probabilistic framework to classify tapping pulses with a high degree of variability. It was found that this model, achieves a higher robustness to pulse variability than simpler template matching methods, specifically when allowed to train on data containing high variability. © 2011 IEEE.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The use of boundary-layer-ingesting, embedded propulsion systems can result in inlet flow distortions where the interaction of the boundary layer vorticity and the inlet lip causes horseshoe vortex formation and the ingestion of streamwise vortices into the inlet. A previously-developed body-force-based fan modeling approach was used to assess the change in fan rotor shock noise generation and propagation in a boundary-layer-ingesting, serpentine inlet. This approach is employed here in a parametric study to assess the effects of inlet geometry parameters (offset-to-diameter ratio and downstream-to-upstream area ratio) on flow distortion and rotor shock noise. Mechanisms related to the vortical inlet structures were found to govern changes in the rotor shock noise generation and propagation. The vortex whose circulation is in the opposite direction to the fan rotation (counter-swirling vortex) increases incidence angles on the fan blades near the tip, enhancing noise generation. The vortex with circulation in the direction of fan rotation (co-swirling vortex) creates a region of subsonic relative flow near the blade tip radius which decreases the sound power propagated to the far-field. The parametric study revealed that the overall sound power level at the fan leading edge is set by the ingested streamwise circulation, and that for inlet designs in which the streamwise vortices are displaced away from the duct wall, the sound power at the upstream inlet plane increased by as much as 9 dB. By comparing the far-field noise results obtained to those for a conventional inlet, it is deduced that the changes in rotor shock noise are predominantly due to the ingestion of streamwise vorticity.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The use of boundary-layer-ingesting, embedded propulsion systems can result in inlet flow distortions where the interaction of the boundary layer vorticity and the inlet lip causes horseshoe vortex formation and the ingestion of streamwise vortices into the inlet. A previously-developed body-force-based fan modeling approach was used to assess the change in fan rotor shock noise generation and propagation in a boundary-layer-ingesting, serpentine inlet. This approach is employed here in a parametric study to assess the effects of inlet geometry parameters (offset-to-diameter ratio and downstream-to-upstream area ratio) on flow distortion and rotor shock noise. Mechanisms related to the vortical inlet structures were found to govern changes in the rotor shock noise generation and propagation. The vortex whose circulation is in the opposite direction to the fan rotation (counter-swirling vortex) increases incidence angles on the fan blades near the tip, enhancing noise generation. The vortex with circulation in the direction of fan rotation (co-swirling vortex) creates a region of subsonic relative flow near the blade tip radius which decreases the sound power propagated to the far-field. The parametric study revealed that the overall sound power level at the fan leading edge is set by the ingested streamwise circulation, and that for inlet designs in which the streamwise vortices are displaced away from the duct wall, the sound power at the upstream inlet plane increased by as much as 9 dB. By comparing the far-field noise results obtained to those for a conventional inlet, it is deduced that the changes in rotor shock noise are predominantly due to the ingestion of streamwise vorticity.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper discusses the development of a computationally efficient numerical method for predicting the acoustics of rattle events upfront in the design cycle. The method combines Finite Elements, Boundary Elements and SEA and enables the loudness of a large number of rattle events to be efficiently predicted across a broad frequency range. A low frequency random vibro-acoustic model is used in conjunction with various closed form analytical expressions in order to quickly predict impact probabilities and locations. An existing method has been extended to estimate the statistics of the contact forces across a broad frequency range. Finally, broadband acoustic radiation is predicted using standard low, mid and high frequency vibro-acoustic methods and used to estimate impact loudness. The approach is discussed and a number of validation examples are presented.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We present a system for keyword search on Cantonese conversational telephony audio, collected for the IARPA Babel program, that achieves good performance by combining postings lists produced by diverse speech recognition systems from three different research groups. We describe the keyword search task, the data on which the work was done, four different speech recognition systems, and our approach to system combination for keyword search. We show that the combination of four systems outperforms the best single system by 7%, achieving an actual term-weighted value of 0.517. © 2013 IEEE.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The task in keyword spotting (KWS) is to hypothesise times at which any of a set of key terms occurs in audio. An important aspect of such systems are the scores assigned to these hypotheses, the accuracy of which have a significant impact on performance. Estimating these scores may be formulated as a confidence estimation problem, where a measure of confidence is assigned to each key term hypothesis. In this work, a set of discriminative features is defined, and combined using a conditional random field (CRF) model for improved confidence estimation. An extension to this model to directly address the problem of score normalisation across key terms is also introduced. The implicit score normalisation which results from applying this approach to separate systems in a hybrid configuration yields further benefits. Results are presented which show notable improvements in KWS performance using the techniques presented in this work. © 2013 IEEE.