133 resultados para Speech Production
Resumo:
Automatic and accurate detection of the closure-burst transition events of stops and affricates serves many applications in speech processing. A temporal measure named the plosion index is proposed to detect such events, which are characterized by an abrupt increase in energy. Using the maxima of the pitch-synchronous normalized cross correlation as an additional temporal feature, a rule-based algorithm is designed that aims at selecting only those events associated with the closure-burst transitions of stops and affricates. The performance of the algorithm, characterized by receiver operating characteristic curves and temporal accuracy, is evaluated using the labeled closure-burst transitions of stops and affricates of the entire TIMIT test and training databases. The robustness of the algorithm is studied with respect to global white and babble noise as well as local noise using the TIMIT test set and on telephone quality speech using the NTIMIT test set. For these experiments, the proposed algorithm, which does not require explicit statistical training and is based on two one-dimensional temporal measures, gives a performance comparable to or better than the state-of-the-art methods. In addition, to test the scalability, the algorithm is applied on the Buckeye conversational speech corpus and databases of two Indian languages. (C) 2014 Acoustical Society of America.
Resumo:
Narrowband spectrograms of voiced speech can be modeled as an outcome of two-dimensional (2-D) modulation process. In this paper, we develop a demodulation algorithm to estimate the 2-D amplitude modulation (AM) and carrier of a given spectrogram patch. The demodulation algorithm is based on the Riesz transform, which is a unitary, shift-invariant operator and is obtained as a 2-D extension of the well known 1-D Hilbert transform operator. Existing methods for spectrogram demodulation rely on extension of sinusoidal demodulation method from the communications literature and require precise estimate of the 2-D carrier. On the other hand, the proposed method based on Riesz transform does not require a carrier estimate. The proposed method and the sinusoidal demodulation scheme are tested on real speech data. Experimental results show that the demodulated AM and carrier from Riesz demodulation represent the spectrogram patch more accurately compared with those obtained using the sinusoidal demodulation. The signal-to-reconstruction error ratio was found to be about 2 to 6 dB higher in case of the proposed demodulation approach.
Resumo:
We consider ZH and WH production at the Large Hadron Collider, where the Higgs decays to a b (b) over bar pair. We use jet substructure techniques to reconstruct the Higgs boson and construct angular observables involving leptonic decay products of the vector bosons. These efficiently discriminate between the tensor structure of the HVV vertex expected in the Standard Model and that arising from possible new physics, as quantified by higher dimensional operators. This can then be used to examine the CP nature of the Higgs as well as CP mixing effects in the HZZ and HWW vertices separately. (C) 2014 Elsevier B.V.
Resumo:
We develop noise robust features using Gammatone wavelets derived from the popular Gammatone functions. These wavelets incorporate the characteristics of human peripheral auditory systems, in particular the spatially-varying frequency response of the basilar membrane. We refer to the new features as Gammatone Wavelet Cepstral Coefficients (GWCC). The procedure involved in extracting GWCC from a speech signal is similar to that of the conventional Mel-Frequency Cepstral Coefficients (MFCC) technique, with the difference being in the type of filterbank used. We replace the conventional mel filterbank in MFCC with a Gammatone wavelet filterbank, which we construct using Gammatone wavelets. We also explore the effect of Gammatone filterbank based features (Gammatone Cepstral Coefficients (GCC)) for robust speech recognition. On AURORA 2 database, a comparison of GWCCs and GCCs with MFCCs shows that Gammatone based features yield a better recognition performance at low SNRs.
Resumo:
Taxol (R) (generic name paclitaxel) represents one of the most clinically valuable natural products known to mankind in the recent past. More than two decades have elapsed since the notable discovery of the first Taxol (R) producing endophytic fungus, which was followed by a plethora of reports on other endophytes possessing similar biosynthetic potential. However, industrial-scale Taxol (R) production using fungal endophytes, although seemingly promising, has not seen the light of the day. In this opinion article, we embark on the current state of knowledge on Taxol (R) biosynthesis focusing on the chemical ecology of its producers, and ask whether it is actually possible to produce Taxol (R) using endophyte biotechnology. The key problems that have prevented the exploitation of potent endophytic fungi by industrial bioprocesses for sustained production of Taxol (R) are discussed.
Resumo:
This paper proposes an automatic acoustic-phonetic method for estimating voice-onset time of stops. This method requires neither transcription of the utterance nor training of a classifier. It makes use of the plosion index for the automatic detection of burst onsets of stops. Having detected the burst onset, the onset of the voicing following the burst is detected using the epochal information and a temporal measure named the maximum weighted inner product. For validation, several experiments are carried out on the entire TIMIT database and two of the CMU Arctic corpora. The performance of the proposed method compares well with three state-of-the-art techniques. (C) 2014 Acoustical Society of America
Resumo:
We estimate transverse spin single spin asymmetry(TSSA) in the process e + p(up arrow) -> J/psi + X using color evaporation model of charmonium production. We take into account transverse momentum dependent(TMD) evolution of Sivers function and parton distribution function and show that the there is a reduction in the asymmetry as compared to our earlier estimates wherein the Q(2) - evolution was implemented only through DGLAP evolution of unpolarized gluon densities.
Resumo:
Measurement of the self-coupling of the 125 GeV Higgs boson is one of the most crucial tasks for a high luminosity run of the LHC, and it can only be measured in the di-Higgs final state. In the minimal supersymmetric standard model, heavy CP even Higgs (H) can decay into a lighter 125 GeV Higgs boson (h) and, therefore, can influence the rate of di-Higgs production. We investigate the role of single H production in the context of measuring the self-coupling of h. We have found that the H -> hh decay can change the value of Higgs (h) self-coupling substantially, in a low tan beta regime where the mass of the heavy Higgs boson lies between 250 and 600 GeV and, depending on the parameter space, it may be seen as an enhancement of the self-coupling of the 125 GeV Higgs boson.
Resumo:
We analyse the hVV (V = W, Z) vertex in a model independent way using Vh production. To that end, we consider possible corrections to the Standard Model Higgs Lagrangian, in the form of higher dimensional operators which parametrise the effects of new physics. In our analysis, we pay special attention to linear observables that can be used to probe CP violation in the same. By considering the associated production of a Higgs boson with a vector boson (W or Z), we use jet substructure methods to define angular observables which are sensitive to new physics effects, including an asymmetry which is linearly sensitive to the presence of CP odd effects. We demonstrate how to use these observables to place bounds on the presence of higher dimensional operators, and quantify these statements using a log likelihood analysis. Our approach allows one to probe separately the hZZ and hWW vertices, involving arbitrary combinations of BSM operators, at the Large Hadron Collider.
Resumo:
A comparative study of two bacterial strains namely, Bacillus licheniformis and Bacillus firmus in the production of bioflocculants was made. The highest bioflocculant yield of 16.55 g/L was obtained from B. licheniformis (L) and 10 g/L from B. firmus (F). The bioflocculants obtained from the bacterial species were water soluble and insoluble in organic solvents. FTIR spectral analysis revealed the presence of hydroxyl, carboxyl and sugar derivatives in the bioflocculants. Thermal characterization by differential scanning calorimetry (DSC) showed the crystalline transition and the melting point (T-m) at 90-100 degrees C. Effects of bioflocculant dosage and pH on the flocculation of clay fines were evaluated. Highest bioflocculation efficiency on kaolin clay suspensions was observed at an optimum bioflocculant dosage of 5 g/L. The optimum pH range for the maximum bioflocculation was at pH 7-9. Bioflocculants exhibited high efficiency in dye decolorization. The maximum Cr (VI) removal was found to be 85 % for L (bioflocculant dosage at 2 g/L). This study demonstrates that microbial bioflocculants find potential applications in mineral processing such as selective flocculation of mineral fines, decolorization of dye solutions and in the remediation of toxic metal solutions. (C) 2015 Elsevier B.V. All rights reserved.
Resumo:
In this work, a methodology to achieve ordinary-, medium-, and high-strength self-consolidating concrete (SCC) with and without mineral additions is proposed. The inclusion of Class F fly ash increases the density of SCC but retards the hydration rate, resulting in substantial strength gain only after 28 days. This delayed strength gain due to the use of fly ash has been considered in the mixture design model. The accuracy of the proposed mixture design model is validated with the present test data and mixture and strength data obtained from diverse sources reported in the literature.
Resumo:
Bacteria can utilize multiple sources of carbon for growth, and for pathogenic bacteria like Mycobacterium tuberculosis, this ability is crucial for survival within the host. In addition, phenotypic changes are seen in mycobacteria grown under different carbon sources. In this study, we use Raman spectroscopy to analyze the biochemical components present in M. smegmatis cells when grown in three differently metabolized carbon sources. Our results show that carotenoid biosynthesis is enhanced when M. smegmatis is grown in glucose compared to glycerol and acetate. We demonstrate that this difference is most likely due to transcriptional upregulation of the carotenoid biosynthesis operon (crt) mediated by higher levels of the stress-responsive sigma factor SigF. Moreover, we find that increased SigF and carotenoid levels correlate with greater resistance of glucose-grown cells to oxidative stress. Thus, we demonstrate the use of Raman spectroscopy in unraveling unknown aspects of mycobacterial physiology and describe a novel effect of carbon source variation on mycobacteria.
Resumo:
The clever designs of natural transducers are a great source of inspiration for man-made systems. At small length scales, there are many transducers in nature that we are now beginning to understand and learn from. Here, we present an example of such a transducer that is used by field crickets to produce their characteristic song. This transducer uses two distinct components-a file of discrete teeth and a plectrum that engages intermittently to produce a series of impulses forming the loading, and an approximately triangular membrane, called the harp, that acts as a resonator and vibrates in response to the impulse-train loading. The file-and-plectrum act as a frequency multiplier taking the low wing beat frequency as the input and converting it into an impulse-train of sufficiently high frequency close to the resonant frequency of the harp. The forced vibration response results in beats producing the characteristic sound of the cricket song. With careful measurements of the harp geometry and experimental measurements of its mechanical properties (Young's modulus determined from nanoindentation tests), we construct a finite element (FE) model of the harp and carry out modal analysis to determine its natural frequency. We fine tune the model with appropriate elastic boundary conditions to match the natural frequency of the harp of a particular species-Gryllus bimaculatus. We model impulsive loading based on a loading scheme reported in literature and predict the transient response of the harp. We show that the harp indeed produces beats and its frequency content matches closely that of the recorded song. Subsequently, we use our FE model to show that the natural design is quite robust to perturbations in the file. The characteristic song frequency produced is unaffected by variations in the spacing of file-teeth and even by larger gaps. Based on the understanding of how this natural transducer works, one can design and fabricate efficient microscale acoustic devices such as microelectromechanical systems (MEMS) loudspeakers.