993 results for word prediction
Abstract:
This paper describes recent improvements to the Cambridge Arabic Large Vocabulary Continuous Speech Recognition (LVCSR) Speech-to-Text (STT) system. It is shown that word-boundary context markers provide a powerful method to enhance graphemic systems with implicit phonetic information, improving their modelling capability. In addition, a robust technique for full covariance Gaussian modelling in the Minimum Phone Error (MPE) training framework is introduced. This reduces the full covariance training to a diagonal covariance training problem, thereby solving related robustness problems. The full system results show that the combined use of these and other techniques within a multi-branch combination framework reduces the Word Error Rate (WER) of the complete system by up to 5.9% relative. Copyright © 2011 ISCA.
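The abstract quotes its gain as a relative WER reduction. As a reminder of how such a figure is usually computed, here is a minimal sketch; the function name and the numbers in the usage line are hypothetical and are not taken from the paper.

```python
def relative_wer_reduction(baseline_wer: float, improved_wer: float) -> float:
    """Return the WER reduction as a percentage of the baseline WER."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return 100.0 * (baseline_wer - improved_wer) / baseline_wer


# Hypothetical WER values in percent, chosen only for illustration.
print(relative_wer_reduction(25.0, 24.0))
```

A relative reduction divides the absolute gain by the baseline, so the same absolute gain yields a larger relative figure on a system whose baseline WER is lower.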
Abstract:
The pressure oscillation within combustion chambers of aeroengines and industrial gas turbines is a major technical challenge to the development of high-performance and low-emission propulsion systems. In this paper, an approach integrating computational fluid dynamics and one-dimensional linear stability analysis is developed to predict the modes of oscillation in a combustor and their frequencies and growth rates. Linear acoustic theory was used to describe the acoustic waves propagating upstream and downstream of the combustion zone, which enables the computational fluid dynamics calculation to be efficiently concentrated on the combustion zone. A combustion oscillation was found to occur with its predicted frequency in agreement with experimental measurements. Furthermore, results from the computational fluid dynamics calculation provide the flame transfer function to describe unsteady heat release rate. Departures from ideal one-dimensional flows are described by shape factors. Combined with this information, low-order models can work out the possible oscillation modes and their initial growth rates. The approach developed here can be used in more general situations for the analysis of combustion oscillations. Copyright © 2012 by the American Institute of Aeronautics and Astronautics, Inc. All rights reserved.
Abstract:
We present a new online psycholinguistic resource for Greek based on analyses of written corpora combined with text processing technologies developed at the Institute for Language & Speech Processing (ILSP), Greece. The "ILSP PsychoLinguistic Resource" (IPLR) is a freely accessible service via a dedicated web page, at http://speech.ilsp.gr/iplr. IPLR provides analyses of user-submitted letter strings (words and nonwords) as well as frequency tables for important units and conditions such as syllables, bigrams, and neighbors, calculated over two word lists based on printed text corpora and their phonetic transcription. Online tools allow retrieval of words matching user-specified orthographic or phonetic patterns. All results and processing code (in the Python programming language) are freely available for noncommercial educational or research use. © 2010 Springer Science+Business Media B.V.
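The bigram frequency tables mentioned above can be illustrated with a short sketch. It assumes letter bigrams counted once per word in a list; the function name and the sample strings are hypothetical, and the abstract does not say how IPLR weights or normalises its counts.

```python
from collections import Counter


def letter_bigram_counts(words):
    """Count adjacent letter pairs over a list of words (one count per occurrence)."""
    counts = Counter()
    for word in words:
        # Slide a two-character window across the word.
        for i in range(len(word) - 1):
            counts[word[i:i + 2]] += 1
    return counts


# Hypothetical unaccented sample strings; Unicode normalisation is ignored here.
sample = ["και", "κακο"]
print(letter_bigram_counts(sample).most_common(3))
```

A real resource would likely also weight each word by its corpus frequency, which this sketch leaves out.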
Abstract:
Optical motion capture systems suffer from marker occlusions resulting in loss of useful information. This paper addresses the problem of real-time joint localisation of legged skeletons in the presence of such missing data. The data is assumed to be labelled 3D marker positions from a motion capture system. An integrated framework is presented which predicts the occluded marker positions using a Variable Turn Model within an Unscented Kalman filter. Inferred information from neighbouring markers is used as observation states; these constraints are efficient, simple, and real-time implementable. This work also takes advantage of the common case that missing markers are still visible to a single camera, by combining predictions with under-determined positions, resulting in more accurate predictions. An Inverse Kinematics technique is then applied ensuring that the bone lengths remain constant over time; the system can thereby maintain a continuous data-flow. The marker and Centre of Rotation (CoR) positions can be calculated with high accuracy even in cases where markers are occluded for a long period of time. Our methodology is tested against some of the most popular methods for marker prediction and the results confirm that our approach outperforms these methods in estimating both marker and CoR positions. © 2012 Springer-Verlag.
Abstract:
Emissions, fuel burn, and noise are the main drivers for innovative aircraft design. Embedded propulsion systems, such as those used in hybrid-wing body aircraft, can offer fuel burn and noise reduction benefits, but the impact of inlet flow distortion on the generation and propagation of turbomachinery noise has yet to be assessed. A novel approach is used to quantify the effects of non-uniform flow on the creation and propagation of multiple pure tone (MPT) noise. The ultimate goal is to conduct a parametric study of S-duct inlets to quantify the effects of inlet design parameters on the acoustic signature. The key challenge is that the effects of distortion transfer, noise source generation and propagation through the non-uniform flow field are inherently coupled such that a simultaneous computation of the aerodynamics and acoustics is required to capture the mechanisms at play. The technical approach is based on a body force description of the fan blade row that is able to capture the distortion transfer and the blade-to-blade flow variations that cause the MPT noise while reducing computational cost. A single, 3-D full-wheel CFD simulation, in which the Euler equations are solved to second-order spatial and temporal accuracy, simultaneously computes the MPT noise generation and its propagation in distorted inlet flow. A new method of producing the blade-to-blade variations in the body force field for MPT noise generation has been developed and validated. The numerical dissipation inherent to the solver is quantified and used to correct for non-physical attenuation in the far-field noise spectra. Source generation, acoustic propagation and acoustic energy transfer between modes are examined in detail. The new method is validated on NASA's Source Diagnostic Test fan and inlet, showing good agreement with experimental data for aerodynamic performance, acoustic source generation, and far-field noise spectra. The next steps involve the assessment of MPT noise in serpentine inlet ducts and the development of a reduced order formulation suitable for incorporation into NASA's ANOPP framework. © 2010 by Jeff Defoe, Alex Narkaj & Zoltan Spakovszky.
Abstract:
Embedded propulsion systems, such as those used in advanced hybrid-wing body aircraft, can potentially offer major fuel burn and noise reduction benefits but introduce challenges in the aerodynamic and acoustic integration of the high-bypass ratio fan system. A novel approach is proposed to quantify the effects of non-uniform flow on the generation and propagation of multiple pure tone noise (MPTs). The new method is validated on a conventional inlet geometry first. The ultimate goal is to conduct a parametric study of S-duct inlets in order to quantify the effects of inlet design parameters on the acoustic signature. The key challenge is that the mechanisms underlying the distortion transfer, noise source generation and propagation through the non-uniform flow field are inherently coupled such that a simultaneous computation of the aerodynamics and acoustics is required. The technical approach is based on a body force description of the fan blade row that is able to capture the distortion transfer and the MPT noise generation mechanisms while greatly reducing computational cost. A single, 3-D full-wheel unsteady CFD simulation, in which the Euler equations are solved to second-order spatial and temporal accuracy, simultaneously computes the MPT noise generation and its propagation in distorted mean flow. Several numerical tools were developed to enable the implementation of this new approach. Parametric studies were conducted to determine appropriate grid and time step sizes for the propagation of acoustic waves. The Ffowcs-Williams and Hawkings integral method is used to propagate the noise to far field receivers. Non-reflecting boundary conditions are implemented through the use of acoustic buffer zones. The body force modeling approach is validated and proof-of-concept studies demonstrate the generation of disturbances at both blade-passing and shaft-order frequencies using the perturbed body force method. The full methodology is currently being validated using NASA's Source Diagnostic Test (SDT) fan and inlet geometry. Copyright © 2009 by Jeff Defoe, Alex Narkaj & Zoltan Spakovszky.
Abstract:
Current commercial dialogue systems typically use hand-crafted grammars for Spoken Language Understanding (SLU) operating on the top one or two hypotheses output by the speech recogniser. These systems are expensive to develop and they suffer from significant degradation in performance when faced with recognition errors. This paper presents a robust method for SLU based on features extracted from the full posterior distribution of recognition hypotheses encoded in the form of word confusion networks. Following [1], the system uses SVM classifiers operating on n-gram features, trained on unaligned input/output pairs. Performance is evaluated on both an off-line corpus and on-line in a live user trial. It is shown that a statistical discriminative approach to SLU operating on the full posterior ASR output distribution can substantially improve performance both in terms of accuracy and overall dialogue reward. Furthermore, additional gains can be obtained by incorporating features from the previous system output. © 2012 IEEE.
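To make the idea of n-gram features drawn from a word confusion network more concrete, the sketch below computes expected n-gram counts from a toy network. It is an illustration under stated assumptions, not the paper's feature definition: the network is modelled as a list of slots mapping words to posterior probabilities, slots are treated as independent, null arcs are ignored, and the function name and example words are hypothetical.

```python
from collections import defaultdict
from itertools import product


def expected_ngram_counts(network, n):
    """Sum the posterior mass of every n-gram spanning n consecutive slots.

    `network` is a list of dicts, one per slot, mapping word -> posterior.
    This representation is an assumption made for illustration only.
    """
    counts = defaultdict(float)
    for start in range(len(network) - n + 1):
        window = network[start:start + n]
        # Enumerate one word choice per slot in the window.
        for combo in product(*(slot.items() for slot in window)):
            words = tuple(word for word, _ in combo)
            prob = 1.0
            for _, posterior in combo:
                prob *= posterior
            counts[words] += prob
    return dict(counts)


# Hypothetical three-slot confusion network.
toy_network = [
    {"book": 0.7, "look": 0.3},
    {"a": 1.0},
    {"flight": 0.6, "fight": 0.4},
]
for ngram, weight in sorted(expected_ngram_counts(toy_network, 2).items()):
    print(" ".join(ngram), round(weight, 2))
```

The resulting weighted counts could serve as a real-valued feature vector for a classifier such as an SVM. Unlike features taken from the top hypothesis only, they retain evidence for lower-ranked words such as "look" and "fight".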