919 resultados para Non-thresholding speech noise reduction


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Model-based approaches to handling additive background noise and channel distortion, such as Vector Taylor Series (VTS), have been intensively studied and extended in a number of ways. In previous work, VTS has been extended to handle both reverberant and background noise, yielding the Reverberant VTS (RVTS) scheme. In this work, rather than assuming the observation vector is generated by the reverberation of a sequence of background noise corrupted speech vectors, as in RVTS, the observation vector is modelled as a superposition of the background noise and the reverberation of clean speech. This yields a new compensation scheme RVTS Joint (RVTSJ), which allows an easy formulation for joint estimation of both additive and reverberation noise parameters. These two compensation schemes were evaluated and compared on a simulated reverberant noise corrupted AURORA4 task. Both yielded large gains over VTS baseline system, with RVTSJ outperforming the previous RVTS scheme. © 2011 IEEE.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Recently there has been interest in combining generative and discriminative classifiers. In these classifiers features for the discriminative models are derived from the generative kernels. One advantage of using generative kernels is that systematic approaches exist to introduce complex dependencies into the feature-space. Furthermore, as the features are based on generative models standard model-based compensation and adaptation techniques can be applied to make discriminative models robust to noise and speaker conditions. This paper extends previous work in this framework in several directions. First, it introduces derivative kernels based on context-dependent generative models. Second, it describes how derivative kernels can be incorporated in structured discriminative models. Third, it addresses the issues associated with large number of classes and parameters when context-dependent models and high-dimensional feature-spaces of derivative kernels are used. The approach is evaluated on two noise-corrupted tasks: small vocabulary AURORA 2 and medium-to-large vocabulary AURORA 4 task. © 2011 IEEE.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The modelling of the non-linear behaviour of MEMS oscillators is of interest to understand the effects of non-linearities on start-up, limit cycle behaviour and performance metrics such as output frequency and phase noise. This paper proposes an approach to integrate the non-linear modelling of the resonator, transducer and sustaining amplifier in a single numerical modelling environment so that their combined effects may be investigated simultaneously. The paper validates the proposed electrical model of the resonator through open-loop frequency response measurements on an electrically addressed flexural silicon MEMS resonator driven to large motional amplitudes. A square wave oscillator is constructed by embedding the same resonator as the primary frequency determining element. Measurements of output power and output frequency of the square wave oscillator as a function of resonator bias and driving voltage are consistent with model predictions ensuring that the model captures the essential non-linear behaviour of the resonator and the sustaining amplifier in a single mathematical equation. © 2012 IEEE.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Vibration modes of a submerged hull are excited by fluctuating forces generated at the propeller and transmitted to the hull via the propeller-shafting system. The low frequency hull vibrational modes result in significant sound radiation. This work investigates the reduction of the far-field radiated sound pressure by optimising the connection point of the shafting system to the hull. The submarine hull is modelled as a fluid loaded cylindrical hull with truncated conical shells at each end. The propeller-shafting system consists of the propeller, shaft, thrust bearing and foundation, and is modelled in a modular approach using a combination of spring-mass-damper elements and continuous systems (beams, plates, shells). The foundation is attached to the stern side end plate of the hull, which is modelled as a circular plate coupled to an annular plate. By tuning the connection radius of the foundation to the end plate, the maximum radiated noise in a given frequency range can be minimised.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The numerical solution of problems in unbounded physical space requires a truncation of the computational domain to a reasonable size. As a result, the conditions on the artificial boundaries are generally unknown. Assumptions like constant pressure or velocities are only valid in the far field and lead to spurious reflections if applied on the boundaries of the truncated domain. A number of attempts have been made over the past decades to design conditions that prevent such reflections. One approach is based on characteristics. The standard analysis assumes a spatially uniform mean flow field but this is often impractical. In the present paper we show how to extend the formulation to the more general case of a non-uniform mean velocity field. A number of test cases are provided and our results compare favourably with other boundary conditions. In principle the present approach can be extended to include non-uniformities in all variables.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper presents new methods for computing the step sizes of the subband-adaptive iterative shrinkage-thresholding algorithms proposed by Bayram & Selesnick and Vonesch & Unser. The method yields tighter wavelet-domain bounds of the system matrix, thus leading to improved convergence speeds. It is directly applicable to non-redundant wavelet bases, and we also adapt it for cases of redundant frames. It turns out that the simplest and most intuitive setting for the step sizes that ignores subband aliasing is often satisfactory in practice. We show that our methods can be used to advantage with reweighted least squares penalty functions as well as L1 penalties. We emphasize that the algorithms presented here are suitable for performing inverse filtering on very large datasets, including 3D data, since inversions are applied only to diagonal matrices and fast transforms are used to achieve all matrix-vector products.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abstract A theoretical model is developed for the sound scattered when a sound wave is incident on a cambered aerofoil at non-zero angle of attack. The model is based on the linearization of the Euler equations about a steady subsonic flow, and is an adaptation of previous work which considered incident vortical disturbances. Only high-frequency sound waves are considered. The aerofoil thickness, camber and angle of attack are restricted such that the steady flow past the aerofoil is a small perturbation to a uniform flow. The singular perturbation analysis identifies asymptotic regions around the aerofoil; local 'inner' regions, which scale on the incident wavelength, at the leading and trailing edges of the aerofoil; Fresnel regions emanating from the leading and trailing edges of the aerofoil due to the coalescence of singularities and points of stationary phase; a wake transition region downstream of the aerofoil leading and trailing edge; and an outer region far from the aerofoil and wake. An acoustic boundary layer on the aerofoil surface and within the transition region accounts for the effects of curvature. The final result is a uniformly-valid solution for the far-field sound; the effects of angle of attack, camber and thickness are investigated. © 2013 Cambridge University Press.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Spoken dialogue systems provide a convenient way for users to interact with a machine using only speech. However, they often rely on a rigid turn taking regime in which a voice activity detection (VAD) module is used to determine when the user is speaking and decide when is an appropriate time for the system to respond. This paper investigates replacing the VAD and discrete utterance recogniser of a conventional turn-taking system with a continuously operating recogniser that is always listening, and using the recogniser 1-best path to guide turn taking. In this way, a flexible framework for incremental dialogue management is possible. Experimental results show that it is possible to remove the VAD component and successfully use the recogniser best path to identify user speech, with more robustness to noise, potentially smaller latency times, and a reduction in overall recognition error rate compared to using the conventional approach. © 2013 IEEE.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Large margin criteria and discriminative models are two effective improvements for HMM-based speech recognition. This paper proposed a large margin trained log linear model with kernels for CSR. To avoid explicitly computing in the high dimensional feature space and to achieve the nonlinear decision boundaries, a kernel based training and decoding framework is proposed in this work. To make the system robust to noise a kernel adaptation scheme is also presented. Previous work in this area is extended in two directions. First, most kernels for CSR focus on measuring the similarity between two observation sequences. The proposed joint kernels defined a similarity between two observation-label sequence pairs on the sentence level. Second, this paper addresses how to efficiently employ kernels in large margin training and decoding with lattices. To the best of our knowledge, this is the first attempt at using large margin kernel-based log linear models for CSR. The model is evaluated on a noise corrupted continuous digit task: AURORA 2.0. © 2013 IEEE.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We present a novel reference compensation method for eliminating environmental noise in interferometric wavelength shift demodulation for dynamic fiber Bragg grating (FBG) sensors. By employing a shielded wavelength-division-multiplexed reference FBG in the system the environmental noise is mea, sured from the reference channel, and then subtracted from the demodulation result of each sensor channel. An approximate 40 dB reduction of the environmental noise has been experimentally achieved over a frequency range from 20 Hz to 2 kHz. This method is also suitable for the elimination of broadband environmental noise. The corresponding FBG sensor array system proposed in this paper has shown a wave-length resolution of 7 x 10(-4) pm/root Hz. (c) 2009 Elsevier B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Conventional quantum trajectory theory developed in quantum optics is largely based on the physical unravelling of a Lindblad-type master equation, which constitutes the theoretical basis of continuous quantum measurement and feedback control. In this work, in the context of continuous quantum measurement and feedback control of a solid-state charge qubit, we present a physical unravelling scheme of a non-Lindblad-type master equation. Self-consistency and numerical efficiency are well demonstrated. In particular, the control effect is manifested in the detector noise spectrum, and the effect of measurement voltage is discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper proposes a fast-settling frequency-presetting PLL frequency synthesizer. A mixed-signal VCO and a digital processor are developed to accurately preset the frequency of VCO and greatly reduce the settling time. An auxiliary tuning loop is introduced in order to reduce reference spur caused by leakage current. The digital processor can automatically compensate presetting frequency variation with process and temperature, and control the operation of the auxiliary tuning loop. A 1.2 GHz integer-N synthesizer with 1 MHz reference input Was implemented in a 0.18μm process. The measured results demonstrate that the typical settling time of the synthesizer is less than 3μs,and the phase noise is -108 dBc/Hz@1MHz.The reference spur is -52 dBc.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In the paper, we report an efficient method to prepare high yield (up to 97%) of silver nanoplates. Synthesis of silver nanoplates was carried Out in a binary solvent system of N,N-dimethylformamide (DMF) and toluene, in which DMF served as the reductant and polyvinylpyrrolidone (PVP) as the capping agent. By increasing the ratio of toluene to DMF to 7:6, silver nanoplates can be Successfully synthesized; otherwise other shaped nanoparticles would be the major products. The nanoplate sample was characterized by TEM, HRTEM, SAED, XRD, AFM and UV-visible spectroscopy, proving the high nanoplate purity of this sample. The influence of toluene content, other solvents, AgNO3 concentration, preparation temperature and chloride ions was also examined, which suggests that the function of nonpolar solvents in this system is to enhance the PVP coverage on silver surface and, furthermore, to facilitate the preferential adsorption of PVP on two (I I I) facets of silver nanoplates.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The reduction of Eu3+ to Eu2+ in SrB6O10 prepared in air by a high-temperature solid state reaction was studied. The luminescent properties of Eu2+ in this matrix show f-d broad band emission peaking at about 386 and 432 nm at room temperature. A charge compensation mechanism is proposed as a possible explanation. The luminescence of Eu3+ with f-f transitions was studied in this sample and reflected that the Eu3+ ion occupied a site with non-centro-symmetry. The ESR spectrum was used to detect the existence of Eu2+ in the samples. (C) 1998 Elsevier Science S.A.