919 resultados para Non-thresholding speech noise reduction
Resumo:
We propose a two-dimensional (2-D) multicomponent amplitude-modulation, frequency-modulation (AM-FM) model for a spectrogram patch corresponding to voiced speech, and develop a new demodulation algorithm to effectively separate the AM, which is related to the vocal tract response, and the carrier, which is related to the excitation. The demodulation algorithm is based on the Riesz transform and is developed along the lines of Hilbert-transform-based demodulation for 1-D AM-FM signals. We compare the performance of the Riesz transform technique with that of the sinusoidal demodulation technique on real speech data. Experimental results show that the Riesz-transform-based demodulation technique represents spectrogram patches accurately. The spectrograms reconstructed from the demodulated AM and carrier are inverted and the corresponding speech signal is synthesized. The signal-to-noise ratio (SNR) of the reconstructed speech signal, with respect to clean speech, was found to be 2 to 4 dB higher in case of the Riesz transform technique than the sinusoidal demodulation technique.
Resumo:
The effect of multiplicative noise on a signal when compared with that of additive noise is very large. In this paper, we address the problem of suppressing multiplicative noise in one-dimensional signals. To deal with signals that are corrupted with multiplicative noise, we propose a denoising algorithm based on minimization of an unbiased estimator (MURE) of meansquare error (MSE). We derive an expression for an unbiased estimate of the MSE. The proposed denoising is carried out in wavelet domain (soft thresholding) by considering time-domain MURE. The parameters of thresholding function are obtained by minimizing the unbiased estimator MURE. We show that the parameters for optimal MURE are very close to the optimal parameters considering the oracle MSE. Experiments show that the SNR improvement for the proposed denoising algorithm is competitive with a state-of-the-art method.
Resumo:
This paper proposes a denoising algorithm which performs non-local means bilateral filtering. As existing literature suggests, non-local means (NLM) is one of the widely used denoising techniques, but has a critical drawback of smoothing of edges. In order to improve this, we perform fast and efficient NLM using Approximate Nearest Neighbour Fields and improve the edge content in denoising by formulating a joint-bilateral filter. Using the proposed joint bilateral, we are able to denoise smooth regions using the NLM approach and efficient edge reconstruction is obtained from the bilateral filter. Furthermore, to avoid tedious parameter selection, we carry out a noise estimation before performing joint bilateral filtering. The proposed approach is observed to perform well on high noise images.
Resumo:
Bacterial biofilms display a collective lifestyle, wherein the cells secrete extracellular polymeric substances (EPS) that helps in adhesion, aggregation, stability, and to protect the bacteria from antimicrobials. We asked whether the BPS could act as a public good for the biofilm and observed that infiltration of cells that do not produce matrix components weakened the biofilm of Salmonella enterica serovar Typhimurium. PS production was costly for the producing cells, as indicated by a significant reduction in the fitness of wild type (WT) cells during competitive planktonic growth relative to the non-producers. Infiltration frequency of non-producers in the biofilm showed a concomitant decrease in overall productivity. It was apparent in the confocal images that the non producing cells benefit from the BPS produced by the Wild Type (WT) to stay in the biofilm. The biofilm containing non-producing cells were more significantly susceptible to sodium hypochlorite and ciprofloxacin treatment than the WT biofilm. Biofilm infiltrated with non-producers delayed the pathogenesis, as tested in a murine model. The cell types were spatially assorted, with non producers being edged out in the biofilm. However, cellulose was found to act as a barrier to keep the non-producers away from the WT microcolony. Our results show that the infiltration of non-cooperating cell types can substantially weaken the biofilm making it vulnerable to antibacterials and delay their pathogenesis. Cellulose, a component of BPS, was shown to play a pivotal role of acting as the main public good, and to edge-out the non-producers away from the cooperating microcolony.
Resumo:
We report the synthesis of nitrogen doped vertically aligned multi-walled (MWNCNTs) carbon nanotubes by pyrolysis and its catalytic performance for degradation of methylene blue (MB) dye & oxygen reduction reaction (ORR). The degradation of MB was monitored spectrophotometrically with time. Kinetic studies show the degradation of MB follows a first order kinetic with rate constant k=0.0178 min(-1). The present rate constant is better than that reported for various supported/non-supported semiconducting nanomaterials. Further ORR performance in alkaline media makes MWNCNTs a promising cost-effective, fuel crossover tolerance, metal-free, eco-friendly cathode catalyst for direct alcohol fuel cell.
Resumo:
We propose apractical, feature-level and score-level fusion approach by combining acoustic and estimated articulatory information for both text independent and text dependent speaker verification. From a practical point of view, we study how to improve speaker verification performance by combining dynamic articulatory information with the conventional acoustic features. On text independent speaker verification, we find that concatenating articulatory features obtained from measured speech production data with conventional Mel-frequency cepstral coefficients (MFCCs) improves the performance dramatically. However, since directly measuring articulatory data is not feasible in many real world applications, we also experiment with estimated articulatory features obtained through acoustic-to-articulatory inversion. We explore both feature level and score level fusion methods and find that the overall system performance is significantly enhanced even with estimated articulatory features. Such a performance boost could be due to the inter-speaker variation information embedded in the estimated articulatory features. Since the dynamics of articulation contain important information, we included inverted articulatory trajectories in text dependent speaker verification. We demonstrate that the articulatory constraints introduced by inverted articulatory features help to reject wrong password trials and improve the performance after score level fusion. We evaluate the proposed methods on the X-ray Microbeam database and the RSR 2015 database, respectively, for the aforementioned two tasks. Experimental results show that we achieve more than 15% relative equal error rate reduction for both speaker verification tasks. (C) 2015 Elsevier Ltd. All rights reserved.
Resumo:
Speech enhancement in stationary noise is addressed using the ideal channel selection framework. In order to estimate the binary mask, we propose to classify each time-frequency (T-F) bin of the noisy signal as speech or noise using Discriminative Random Fields (DRF). The DRF function contains two terms - an enhancement function and a smoothing term. On each T-F bin, we propose to use an enhancement function based on likelihood ratio test for speech presence, while Ising model is used as smoothing function for spectro-temporal continuity in the estimated binary mask. The effect of the smoothing function over successive iterations is found to reduce musical noise as opposed to using only enhancement function. The binary mask is inferred from the noisy signal using Iterated Conditional Modes (ICM) algorithm. Sentences from NOIZEUS corpus are evaluated from 0 dB to 15 dB Signal to Noise Ratio (SNR) in 4 kinds of additive noise settings: additive white Gaussian noise, car noise, street noise and pink noise. The reconstructed speech using the proposed technique is evaluated in terms of average segmental SNR, Perceptual Evaluation of Speech Quality (PESQ) and Mean opinion Score (MOS).
Resumo:
A comparative study of field-induced domain switching and lattice strain was carried out by in situ electric-field-dependent high-energy synchrotron x-ray diffraction on a morphotropic phase boundary (MPB) and a near-MPB rhombohedral/pseudomonoclinic composition of a high-performance piezoelectric alloy (1-x) PbTiO3-(x)BiScO3. It is demonstrated that the MPB composition showing large d(33) similar to 425 pC/N exhibits significantly reduced propensity of field-induced domain switching as compared to the non-MPB rhombohedral composition (d(33) similar to 260 pC/N). These experimental observations contradict the basic premise of the martensitic-theory-based explanation which emphasizes on enhanced domain wall motion as the primary factor for the anomalous piezoelectric response in MPB piezoelectrics. Our results favor field-induced structural transformation to be the primary mechanism contributing to the large piezoresponse of the critical MPB composition of this system.
Resumo:
The method of density matching between the solid and liquid phases is often adopted to effectively eliminate the effect of sedimentation of suspensions in studies on dynamic behaviour of a colloidal system. However, the associated changes in the solvent composition may bring side effects to the properties investigated and therefore might lead to a faulty conclusion if the relevant correction is not made. To illustrate the importance of this side effect, we present an example of the sedimentation influence on the coagulation rate of suspensions of 2 μm (diameter) polystyrene. The liquid mixtures, in the proper proportions of water (H2O), deuterium oxide (D2O) and methanol (MeOH) as the liquid phase, density-matched and unmatched experiments are performed. Besides the influence of viscosity, the presence of methanol in solvent media, used to enhance the sedimentation effect, causes significant changes (reduction) in rapid coagulation rates compared to that in pure water. Without the relevant corrections for those non-gravitational factors it seems that gravitational sedimentation would retard the coagulation. The magnitude of the contribution from the non-gravitational factor is quantitatively determined, making the relevant correction possible. After necessary corrections for all factors, our experiments show that the influence of the sedimentation on coagulation rates at the initial stage of the coagulation is not observable.
Resumo:
This paper describes a curve-fitting approach for the design of capacity approaching coded modulation for orthogonal signal sets with non-coherent detection. In particular, bit-interleaved coded modulation with iterative decoding is considered. Decoder metrics are developed that do not require knowledge of the signal-to-noise ratio, yet still offer very good performance. © 2007 IEEE.
Resumo:
In this paper methods are developed for enhancement and analysis of autoregressive moving average (ARMA) signals observed in additive noise which can be represented as mixtures of heavy-tailed non-Gaussian sources and a Gaussian background component. Such models find application in systems such as atmospheric communications channels or early sound recordings which are prone to intermittent impulse noise. Markov Chain Monte Carlo (MCMC) simulation techniques are applied to the joint problem of signal extraction, model parameter estimation and detection of impulses within a fully Bayesian framework. The algorithms require only simple linear iterations for all of the unknowns, including the MA parameters, which is in contrast with existing MCMC methods for analysis of noise-free ARMA models. The methods are illustrated using synthetic data and noise-degraded sound recordings.
Resumo:
In this work, the drag reduction by gas injection for power-law fluid flow in stratified and slug flow regimes has been studied. Experimentswere conducted to measure the pressure gradient within air/CMC solutions in a horizontal Plexiglas pipe that had a diameter of 50mm and a length of 30 m. The drag reduction ratio in stratified flow regime was predicted using the two-fluid model. The results showed that the drag reduction should occur over the large range of the liquid holdup when the flow behaviour index remained at the low value. Furthermore, for turbulent gas-laminar liquid stratified flow, the drag reduction by gas injection for Newtonian fluid was more effective than that for shear-shinning fluid, when the dimensionless liquid height remained in the area of high value. The pressure gradient model for a gas/Newtonian liquid slug flow was extended to liquids possessing the Ostwald–de Waele power law model. The proposed model was validated against 340 experimental data point over a wide range of operating conditions, fluid characteristics and pipe diameters. The dimensionless pressure drop predicted was well inside the 20% deviation region for most of the experimental data. These results substantiated the general validity of the model presented for gas/non-Newtonian two-phase slug flows.
Resumo:
In this work. co-current flow characteristics of air/non-Newtonian liquid systems in inclined smooth pipes are studied experimentally and theoretically using transparent tubes of 20, 40 and 60 turn in diameter. Each tube includes two 10 m lone pipe branches connected by a U-bend that is capable of being inclined to any angle, from a completely horizontal to a fully vertical position. The flow rate of each phase is varied over a wide range. The studied flow phenomena are bubbly, plug flow, slug flow, churn flow and annular flow. These are observed and recorded by a high flow. stratified flow. -speed camera over a wide range of operating conditions. The effects of the liquid phase properties, the inclination angle and the pipe diameter on two-phase flow characteristics are systematically studied. The Heywood-Charles model for horizontal flow was modified to accommodate stratified flow in inclined pipes, taking into account the average void fraction and pressure drop of the mixture flow of a gas/non-Newtonian liquid. The pressure drop gradient model of Taitel and Barnea for a gas/Newtonian liquid slug flow was extended to include liquids possessing shear-thinning flow behaviour in inclined pipes. The comparison of the predicted values with the experimental data shows that the models presented here provide a reasonable estimate of the average void fraction and the corresponding pressure drop for the mixture flow of a gas/ non-Newtonian liquid. (C) 2007 Elsevier Ltd. All rights reserved.