105 results for Speech in Noise


Relevance:

90.00%

Publisher:

Abstract:

Narrowband spectrograms of voiced speech can be modeled as the outcome of a two-dimensional (2-D) modulation process. In this paper, we develop a demodulation algorithm to estimate the 2-D amplitude modulation (AM) and carrier of a given spectrogram patch. The demodulation algorithm is based on the Riesz transform, a unitary, shift-invariant operator obtained as a 2-D extension of the well-known 1-D Hilbert transform operator. Existing methods for spectrogram demodulation rely on an extension of the sinusoidal demodulation method from the communications literature and require a precise estimate of the 2-D carrier. In contrast, the proposed method based on the Riesz transform does not require a carrier estimate. The proposed method and the sinusoidal demodulation scheme are tested on real speech data. Experimental results show that the demodulated AM and carrier from Riesz demodulation represent the spectrogram patch more accurately than those obtained using sinusoidal demodulation. The signal-to-reconstruction error ratio was found to be about 2 to 6 dB higher for the proposed demodulation approach.
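The carrier-free idea can be illustrated with a minimal sketch. It assumes a zero-mean patch of the form a(x, y)·cos(phase) and an FFT-based Riesz transform; the function names `riesz_transform` and `riesz_demodulate` are hypothetical, and this is not the authors' implementation.

```python
import numpy as np

def riesz_transform(patch):
    """Return the two Riesz components of a 2-D array, computed via the FFT."""
    rows, cols = patch.shape
    v = np.fft.fftfreq(rows)[:, None]   # vertical frequency grid
    u = np.fft.fftfreq(cols)[None, :]   # horizontal frequency grid
    radius = np.sqrt(u**2 + v**2)
    radius[0, 0] = 1.0                  # avoid division by zero at DC
    spectrum = np.fft.fft2(patch)
    # Frequency responses -j*u/|w| and -j*v/|w| (2-D analogue of -j*sgn(w))
    r1 = np.real(np.fft.ifft2(-1j * u / radius * spectrum))
    r2 = np.real(np.fft.ifft2(-1j * v / radius * spectrum))
    return r1, r2

def riesz_demodulate(patch):
    """Estimate AM and carrier without any explicit carrier-frequency estimate."""
    patch = patch - patch.mean()        # assumption: work on the zero-mean patch
    r1, r2 = riesz_transform(patch)
    am = np.sqrt(patch**2 + r1**2 + r2**2)
    carrier = patch / np.maximum(am, 1e-12)
    return am, carrier
```

For a pure 2-D cosine the estimated AM is constant and the carrier equals the patch itself, which makes a convenient sanity check.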

Relevance:

90.00%

Publisher:

Abstract:

The goal of whisper activity detection (WAD) is to find the whispered speech segments in a given noisy recording of whispered speech. Since whispering lacks periodic glottal excitation, it resembles unvoiced speech. This noise-like nature of whispered speech makes WAD a more challenging task than a typical voice activity detection (VAD) problem. In this paper, we propose a feature for WAD based on the long-term variation of the logarithm of the short-time sub-band signal energy. We also propose an automatic sub-band selection algorithm to maximally discriminate noisy whisper from noise. Experiments with eight noise types in four different signal-to-noise ratio (SNR) conditions show that, for most of the noises, the performance of the proposed WAD scheme is significantly better than that of the existing VAD schemes and whisper detection schemes when used for WAD.
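One plausible reading of the feature is sketched below: the log energy of short frames of a single sub-band signal, followed by its variance over a sliding window of frames. The frame length, hop, window size, and the use of variance as the "variation" measure are all assumptions, the input is assumed to be already band-filtered, and the sub-band selection step is not reproduced.

```python
import numpy as np

def log_frame_energy(x, frame_len, hop):
    """Log of the short-time energy of each frame of a 1-D signal."""
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop : i * hop + frame_len] for i in range(n_frames)])
    return np.log(np.sum(frames**2, axis=1) + 1e-12)  # small floor avoids log(0)

def long_term_variation(log_energy, context):
    """Variance of the log energy over a sliding window of `context` frames."""
    out = np.empty(len(log_energy) - context + 1)
    for i in range(len(out)):
        out[i] = np.var(log_energy[i : i + context])
    return out
```

A stationary segment gives a value near zero, while a segment whose energy changes over the window gives a larger value, which is the kind of contrast a detector could threshold.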

Relevance:

90.00%

Publisher:

Abstract:

Oversmoothing of speech parameter trajectories is one of the causes of quality degradation in HMM-based speech synthesis. Various methods have been proposed to overcome this effect, the most recent ones being global variance (GV) and the modulation-spectrum-based post-filter (MSPF). However, there is still a significant quality gap between natural and synthesized speech. In this paper, we propose a two-fold post-filtering technique to partially alleviate the oversmoothing of spectral and excitation parameter trajectories in HMM-based speech synthesis. For the spectral parameters, we propose a sparse coding-based post-filter to match the trajectories of synthetic speech to those of natural speech, and for the excitation trajectory, we introduce a perceptually motivated post-filter. Experimental evaluations show quality improvement compared with existing methods.

Relevance:

90.00%

Publisher:

Abstract:

Algorithms for extracting epochs or glottal closure instants (GCIs) from voiced speech typically fall into two categories: i) ones which operate on linear prediction residual (LPR) and ii) those which operate directly on the speech signal. While the former class of algorithms (such as YAGA and DPI) tend to be more accurate, the latter ones (such as ZFR and SEDREAMS) tend to be more noise-robust. In this letter, a temporal measure termed the cumulative impulse strength is proposed for locating the impulses in a quasi-periodic impulse-sequence embedded in noise. Subsequently, it is applied for detecting the GCIs from the inverted integrated LPR using a recursive algorithm. Experiments on two large corpora of speech with simultaneous electroglottographic recordings demonstrate that the proposed method is more robust to additive noise than the state-of-the-art algorithms, despite operating on the LPR.

Relevance:

80.00%

Publisher:

Abstract:

The problem of detecting an unknown transient signal in noise is considered. The SNR of the observed data is first enhanced using a wavelet domain filter. The output of the wavelet domain filter is then transformed using a Wigner-Ville transform, which separates the spectrum of the observed signal into narrow frequency bands. Each subband signal at the output of the Wigner-Ville block is subjected to wavelet-based level-dependent denoising (WBLDD) to suppress colored noise. A weighted sum of the absolute values of the WBLDD outputs is passed through an energy detector, whose output is used as the test statistic for the final decision. By assigning weights proportional to the energy of the corresponding subband signals, the proposed detector approximates a frequency domain matched filter. Simulation results are presented to show that the performance of the proposed detector is better than that of the wavelet packet transform based detector.

Relevance:

80.00%

Publisher:

Abstract:

A fairly comprehensive computer program incorporating explicit expressions for the four-pole parameters of concentric-tube resonators, plug mufflers, and three-duct cross-flow perforated elements has been used for parametric studies. The parameters considered are hole diameter, the center-to-center distance between consecutive holes (which decides porosity), the incoming mean flow Mach number, the area expansion ratio, the number of partitions of chambers within a given overall shell length, and the relative lengths of these partitions or chambers, all normalized with respect to the exhaust pipe diameter. Transmission loss has been plotted as a function of a normalized frequency parameter. Additionally, the effect of the tail pipe length on insertion loss for an anechoic source has also been studied. These studies have been supplemented by empirical expressions for the normalized static pressure drop for different types of perforated-element mufflers developed from experimental observations.

Relevance:

80.00%

Publisher:

Abstract:

This paper considers the problem of spectrum sensing in cognitive radio networks when the primary user employs Orthogonal Frequency Division Multiplexing (OFDM). We specifically consider the scenario in which the channel between the primary and a secondary user is frequency selective. We develop cooperative sequential detection algorithms based on energy detectors. We modify the detectors to mitigate the effects of some common model uncertainties such as timing and frequency offset, IQ-imbalance and uncertainty in noise and transmit power. The performance of the proposed algorithms is studied via simulations. We show that the performance of the energy detector is not affected by the frequency selective channel. We also provide a theoretical analysis for some of our algorithms.

Relevance:

80.00%

Publisher:

Abstract:

Modeling of wave propagation in hoses, unlike in rigid pipes or waveguides, introduces a coupling between the inside medium, the hose wall, and the outside medium. This alters the axial wave number and thence the corresponding effective speed of sound inside the hose, resulting in sound radiation into the outside medium, also called the breakout or shell noise. The existing literature on the subject is such that a hose cannot be integrated into the whole piping system made up of sections of hoses, pipes, and mufflers to predict the acoustical performance in terms of transmission loss (TL). The present paper seeks to fill this gap. Three one-dimensional coupled wave equations are written to account for the presence of a yielding wall with a finite lumped transverse impedance of the hose material. The resulting wave equation can readily be reduced to a transfer matrix form using an effective wave number for a moving medium in a hose section. Incorporating the effect of fluid loading due to the outside medium also allows prediction of the transverse TL and the breakout noise. Axial TL and transverse TL have been combined into the net TL needed by designers. Predictions of the axial as well as transverse TL are shown to compare well with those of a rigorous 3-D analysis using only one-hundredth of the computation time. Finally, results of some parametric studies are reported for engineers involved in the acoustical design of hoses. (C) 1996 Institute of Noise Control Engineering.

Relevance:

80.00%

Publisher:

Abstract:

Variable cross-sectional area ducts are often used for attenuation at lower frequencies (of the order of the firing frequency), whereas concentric tube resonators provide attenuation at relatively higher frequencies. In this paper, a one-dimensional control volume analysis of conical concentric tube resonators is validated experimentally. The effects of mean flow and taper are investigated. The experimental setup is specially designed to measure the pressure transfer function in the form of Level Difference or Noise Reduction across the test muffler. It is shown that there is reasonably good agreement between the predicted and measured values of the Noise Reduction for incompressible mean flow as well as for a stationary medium. (C) 2011 Institute of Noise Control Engineering.

Relevance:

80.00%

Publisher:

Abstract:

This paper considers the problem of spectrum sensing in cognitive radio networks when the primary user is using Orthogonal Frequency Division Multiplexing (OFDM). For this we develop cooperative sequential detection algorithms that use the autocorrelation property of cyclic prefix (CP) used in OFDM systems. We study the effect of timing and frequency offset, IQ-imbalance and uncertainty in noise and transmit power. We also modify the detector to mitigate the effects of these impairments. The performance of the proposed algorithms is studied via simulations. We show that sequential detection can significantly improve the performance over a fixed sample size detector.
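The cyclic-prefix property that such detectors exploit can be shown with a small sketch. It generates QPSK OFDM symbols with a CP and computes a normalized autocorrelation at a lag equal to the FFT size; the function names, the QPSK mapping, and the parameter values are illustrative assumptions, and the sequential and cooperative parts of the scheme are not reproduced here.

```python
import numpy as np

def ofdm_symbols(n_symbols, n_fft, n_cp, rng):
    """Generate unit-power QPSK OFDM symbols, each preceded by a cyclic prefix."""
    bits_i = rng.choice([-1, 1], size=(n_symbols, n_fft))
    bits_q = rng.choice([-1, 1], size=(n_symbols, n_fft))
    data = (bits_i + 1j * bits_q) / np.sqrt(2)
    time = np.fft.ifft(data, axis=1) * np.sqrt(n_fft)
    # The CP is a copy of the last n_cp samples placed in front of the symbol
    with_cp = np.concatenate([time[:, -n_cp:], time], axis=1)
    return with_cp.ravel()

def cp_autocorrelation(x, n_fft):
    """Normalized magnitude of the autocorrelation of x at lag n_fft."""
    num = np.abs(np.vdot(x[:-n_fft], x[n_fft:]))  # vdot conjugates its first argument
    den = np.vdot(x, x).real
    return num / den
```

For an OFDM signal the statistic is roughly n_cp / (n_fft + n_cp), whereas for white noise it stays close to zero, so a threshold between the two separates the hypotheses in this idealized setting.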

Relevance:

80.00%

Publisher:

Abstract:

This paper presents computational and experimental results on a new burner configuration with a mild combustion concept with heat release rates up to 10 MW/m^3. The burner configuration is shown to achieve mild combustion by using air at ambient temperature at high recirculation rates (approximately 250%-290%) both experimentally and computationally. The principal features of the configuration are: (1) a burner with forward exit for exhaust gases; (2) injection of gaseous fuel and air as multiple, alternate, peripheral high-speed jets at the bottom at ambient temperature, thus creating high enough recirculation rates of the hot combustion products into fresh incoming reactants; and (3) use of a suitable geometric artifice, a frustum of a cone, to help recirculation. The computational studies have been used to reveal the details of the flow and to optimize the combustor geometry based on recirculation rates. Measures involving root mean square temperature fluctuations and the distribution of temperature and oxidizer concentration inside the proposed burner and a classical turbulent diffusion jet flame are used to distinguish between them quantitatively. The system, operated at heat release rates of 2 to 10 MW/m^3 (compared to 0.02 to 0.32 MW/m^3 in the earlier studies), shows a 10-15 dB reduction in noise in the mild combustion mode compared to a simple open-top burner and exhaust NOx emission below 10 ppm for a 3 kW burner with 10% excess air. The peak temperature is measured around 1750 K, approximately 300 K lower than the peak temperature in a conventional burner.

Relevance:

80.00%

Publisher:

Abstract:

In this work, one-dimensional flow-acoustic analysis of two basic configurations of air cleaners, (i) Rectangular Axial-Inlet, Axial-Outlet (RAIAO) and (ii) Rectangular Transverse-Inlet, Transverse-Outlet (RTITO), has been presented. This 1-D analytical approach has been verified with the help of 3-D FEM based software. Through subtraction of the acoustic performance of the bare plenum (without filter element) from that of the complete air cleaner box, the solitary performance of the filter element has been evaluated. Part of the present analysis illustrates that the analytical formulation remains effective even with offset positioning of the air pipes from the centre of the cross section of the air cleaner. The 1-D analytical tool computes much faster than its 3-D simulation counterpart. The present analysis not only predicts the acoustical impact of mean flow, but it also depicts the scenario with increased resistance of the filter element. Thus, the proposed 1-D analysis would help in the design of acoustically efficient air cleaners for automotive applications. (C) 2011 Institute of Noise Control Engineering.

Relevance:

80.00%

Publisher:

Abstract:

Transmission loss (TL) of an elliptical cylindrical chamber muffler having a single side/end inlet and multiple side/end outlets is analyzed by means of a 3-D semi-analytical formulation based upon the modal expansion (in terms of the angular and radial Mathieu functions) and the Green's function. The acoustic pressure response obtained in terms of the Green's function is integrated over the surface area of the side/end ports (modeled as rigid pistons) and, upon subsequent division by the port area, yields the acoustic pressure response or impedance [Z] matrix parameters due to the uniform piston-driven model. The 3-D semi-analytical results are found to be in excellent agreement with the results obtained by means of 3-D FEA (SYSNOISE) simulations, thereby validating the semi-analytical procedure suggested in this work. Parametric studies such as the effect of chamber length (L), angular and axial locations of the ports, interchanging the locations of inlet and outlet ports, as well as the addition of an outlet port for double outlet mufflers on the TL performance are reported, thereby leading to the formulation of design guidelines for obtaining muffler configurations exhibiting a broad-band TL spectrum. One such configuration is an axially long chamber having side-inlet and side-outlet ports such that one of the side ports is located at half the axial length on the major/minor axis and the other side port is located at three-quarters (or one-quarter) of the axial length on the minor/major axis. (C) 2012 Institute of Noise Control Engineering.

Relevance:

80.00%

Publisher:

Abstract:

The analytic signal (AS) was proposed by Gabor as a complex signal corresponding to a given real signal. The AS has a one-sided spectrum and gives rise to meaningful spectral averages. The Hilbert transform (HT) is a key component in Gabor's AS construction. We generalize the construction methodology by employing the fractional Hilbert transform (FrHT), without going through the standard fractional Fourier transform (FrFT) route. We discuss some properties of the fractional Hilbert operator and show how decomposition of the operator in terms of the identity and the standard Hilbert operators enables the construction of a family of analytic signals. We show that these analytic signals also satisfy Bedrosian-type properties and that their time-frequency localization properties are unaltered. We also propose a generalized-phase AS (GPAS) using a generalized-phase Hilbert transform (GPHT). We show that the GPHT shares many properties of the FrHT, in particular, selective highlighting of singularities, and a connection with Lie groups. We also investigate the duality between analyticity and causality concepts to arrive at a representation of causal signals in terms of the FrHT and GPHT. On the application front, we develop a secure multi-key single-sideband (SSB) modulation scheme and analyze its performance in noise and sensitivity to security key perturbations. (C) 2013 Elsevier B.V. All rights reserved.
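The decomposition into identity and standard Hilbert operators can be sketched numerically. The sketch assumes the commonly used definition of the fractional Hilbert operator, H_phi = cos(phi)·I + sin(phi)·H, applied to a periodic discrete signal through the FFT; the function names are hypothetical and the GPHT and the SSB scheme are not covered.

```python
import numpy as np

def hilbert_transform(x):
    """Discrete Hilbert transform of a real 1-D signal via the FFT."""
    spectrum = np.fft.fft(x)
    sign = np.sign(np.fft.fftfreq(len(x)))  # +1 positive, -1 negative, 0 at DC
    return np.real(np.fft.ifft(-1j * sign * spectrum))

def fractional_hilbert(x, phi):
    """Assumed form: cos(phi) times identity plus sin(phi) times Hilbert."""
    return np.cos(phi) * x + np.sin(phi) * hilbert_transform(x)

def analytic_signal(x):
    """Gabor's analytic signal: the real signal plus j times its Hilbert transform."""
    return x + 1j * hilbert_transform(x)
```

Under this assumed definition, applying the operator to a cosine simply shifts its phase by phi, and phi = pi/2 recovers the standard Hilbert transform.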

Relevance:

80.00%

Publisher:

Abstract:

Transmission loss (TL) of a simple expansion chamber (SEC) consists of periodic domes with sharp troughs. This limits practical application of the SEC in variable-speed automobile exhaust systems. Three-fourths of the troughs of the SEC can be lifted by appropriate tuning of the extended inlet/outlet lengths. However, such mufflers suffer from high back pressure and generation of aerodynamic noise due to free shear layers at the area discontinuities. Therefore, a perforate bridge is made between the extended inlet and outlet. It is shown that the TL curve of a concentric tube resonator (CTR) can also be lifted in a similar way by proper tuning of the extended unperforated lengths. Differential lengths have to be used to correct the inlet/outlet lengths in order to account for the perforate inertance. The resonance peak frequencies calculated by means of the 1-D analysis are compared with those of the 3-D FEM, and appropriate differential lengths are calculated. It is shown how different geometric characteristics of the muffler and mean flow affect the differential lengths. A general correlation is obtained for the differential lengths by considering seven relevant geometric and environmental parameters in a comprehensive parametric study. The resulting expressions would help in the design of extended-tube CTRs for wide-band TL. (C) 2014 Institute of Noise Control Engineering.
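The dome-and-trough pattern of the SEC follows from the classical plane-wave result for a stationary medium, TL = 10·log10(1 + 0.25·(m − 1/m)²·sin²(kL)), where m is the area expansion ratio, k the wave number, and L the chamber length. The sketch below evaluates that textbook formula only; the function name and the default sound speed are assumptions, and extended tubes, perforates, and mean flow are not modeled.

```python
import math

def expansion_chamber_tl(freq_hz, length_m, area_ratio, sound_speed=343.0):
    """Plane-wave TL (dB) of a simple expansion chamber in a stationary medium."""
    k = 2 * math.pi * freq_hz / sound_speed   # wave number
    m = area_ratio                            # chamber area / pipe area
    return 10 * math.log10(1 + 0.25 * (m - 1 / m) ** 2 * math.sin(k * length_m) ** 2)
```

The troughs occur where kL is a multiple of pi (chamber length equal to a whole number of half wavelengths), and the domes peak midway between them, which is why a fixed-geometry SEC is troublesome when the engine speed, and hence the firing frequency, varies.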