244 results for Speech Processing


Relevance: 20.00%

Abstract:

We present two discriminative language modelling techniques for a Lempel-Ziv-Welch (LZW) based language identification (LID) system. The previous approach to LID using the LZW algorithm was to use the LZW pattern tables directly for language modelling. However, since the patterns in one language's pattern table are shared by other languages' pattern tables, confusability prevailed in the LID task. To overcome this, we present two pruning techniques: (i) Language Specific (LS-LZW), in which patterns common to more than one pattern table are removed; and (ii) Length-Frequency product based (LF-LZW), in which patterns whose length-frequency product falls below a threshold are removed. These approaches reduce the classification score (compression ratio [LZW-CR] or weighted discriminant score [LZW-WDS]) for non-native languages and increase LID performance considerably. The memory and computational requirements of these techniques are also much lower than those of the basic LZW techniques.
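The two pruning rules described above can be illustrated with a minimal sketch. It assumes each language's pattern table is already available as a plain mapping from pattern string to frequency; the function names, the table layout, and the example values are hypothetical and are not taken from the paper.

```python
def prune_language_specific(tables):
    """LS-style pruning: drop every pattern that appears in more than one table.

    `tables` maps a language name to its pattern table (pattern -> frequency).
    This layout is an assumption made for illustration.
    """
    occurrences = {}
    for table in tables.values():
        for pattern in table:
            occurrences[pattern] = occurrences.get(pattern, 0) + 1
    return {
        lang: {p: f for p, f in table.items() if occurrences[p] == 1}
        for lang, table in tables.items()
    }


def prune_length_frequency(table, threshold):
    """LF-style pruning: keep patterns whose length * frequency >= threshold."""
    return {p: f for p, f in table.items() if len(p) * f >= threshold}


# Hypothetical toy tables for two languages.
tables = {
    "lang_a": {"ab": 5, "abc": 2, "x": 1},
    "lang_b": {"ab": 3, "bcd": 4, "y": 1},
}
ls_tables = prune_language_specific(tables)                      # "ab" is shared, so it is removed
lf_table = prune_length_frequency(tables["lang_a"], threshold=4) # "x" has product 1, so it is removed
```

How the pruned tables feed into the LZW-CR or LZW-WDS scores is not specified in the abstract, so the sketch stops at the pruning step.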

Relevance: 20.00%

Abstract:

Multiresolution synthetic aperture radar (SAR) image formation has proven beneficial in a variety of applications, such as improved imaging and target detection as well as speckle reduction. SAR signal processing, traditionally carried out in the Fourier domain, has inherent limitations in the context of image formation at hierarchical scales. We present a generalized approach to the formation of multiresolution SAR images using the biorthogonal shift-invariant discrete wavelet transform (SIDWT) in both the range and azimuth directions. In azimuth in particular, the inherent subband decomposition property of the wavelet packet transform is used to produce multiscale complex matched filtering without any approximations. This generalized approach also includes the formulation of multilook processing within the discrete wavelet transform (DWT) paradigm. The efficiency of the algorithm when executed in parallel to generate hierarchical-scale SAR images is shown. Analytical results and sample imagery of diffuse backscatter are presented to validate the method.

Relevance: 20.00%

Abstract:

We address the novel problem of jointly evaluating multiple speech patterns for automatic speech recognition and training. We propose solutions based on both the non-parametric dynamic time warping (DTW) algorithm and the parametric hidden Markov model (HMM). We show that a hybrid approach is quite effective for noisy speech recognition. We extend the concept to HMM training in which some patterns may be noisy or distorted. Using the concept of a "virtual pattern" developed for joint evaluation, we propose selective iterative training of HMMs. When evaluated on isolated word recognition with burst/transient noisy speech, the new algorithms give a significant improvement in recognition accuracy over those that do not use the joint evaluation strategy.
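For readers unfamiliar with DTW, the sketch below shows the standard pairwise algorithm on two one-dimensional sequences. It is only the textbook two-pattern building block, not the joint multi-pattern evaluation proposed in the abstract; the absolute-difference local cost and the function name are assumptions chosen for illustration.

```python
def dtw_distance(a, b):
    """Classic pairwise DTW between two 1-D sequences.

    Local cost is |a[i] - b[j]| (an illustrative choice); allowed steps are
    the usual horizontal, vertical and diagonal moves.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    # acc[i][j] = minimum accumulated cost of aligning a[:i] with b[:j]
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            acc[i][j] = cost + min(acc[i - 1][j], acc[i][j - 1], acc[i - 1][j - 1])
    return acc[n][m]


# A time-stretched copy of a pattern aligns with zero cost.
stretched = dtw_distance([1, 2, 3], [1, 2, 2, 3])
```

Real speech front ends would compare feature vectors (for example, cepstral frames) rather than scalars, with a vector distance as the local cost.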

Relevance: 20.00%

Abstract:

Processing of Sesbania mosaic virus (SeMV) polyproteins 2a and 2ab was reanalyzed in view of the new genome organization of sobemoviruses. Polyprotein 2a, when expressed in E. coli from the new cDNA clone, was cleaved at the previously identified sites E325-T326, E402-T403 and E498-S499 to release protease, VPg, P10 and P8, respectively. Additionally, a novel cleavage was identified within the protease domain at position E132-S133, which was found to be essential for efficient polyprotein processing. Products corresponding to the cleavages identified in E. coli were also detected in infected Sesbania leaves. Interestingly, although the sites are exactly the same in polyprotein 2ab, it was cleaved between protease and VPg but not between VPg and RdRp. This indicates a differential cleavage preference, probably governed by the conformation of 2ab. The studies also revealed that, in SeMV, processing is regulated by the mode of cleavage and the context of the cleavage site.

Relevance: 20.00%

Abstract:

Mycobacterium tuberculosis readily activates both CD4+ and Vδ2+ γδ T cells. Despite similarity in function, these T-cell subsets differ in the antigens they recognize and in the manner in which these antigens are presented by M. tuberculosis-infected monocytes. We investigated the mechanisms by which monocytes process M. tuberculosis antigens for presentation to human CD4+ and γδ T cells. Initial uptake of M. tuberculosis bacilli and subsequent processing were required for efficient presentation not only to CD4+ T cells but also to Vδ2+ γδ T cells. For γδ T cells, recognition of M. tuberculosis-infected monocytes was dependent on Vδ2+ T-cell-receptor expression. Recognition of M. tuberculosis antigens by CD4+ T cells was restricted by the class II major histocompatibility complex molecule HLA-DR. Processing of M. tuberculosis bacilli for Vδ2+ γδ T cells was inhibited by Brefeldin A, whereas processing of soluble mycobacterial antigens for γδ T cells was not sensitive to Brefeldin A. Processing of M. tuberculosis bacilli for CD4+ T cells was unaffected by Brefeldin A. Lysosomotropic agents such as chloroquine and ammonium chloride did not affect the processing of M. tuberculosis bacilli for CD4+ and γδ T cells. In contrast, both inhibitors blocked processing of soluble mycobacterial antigens for CD4+ T cells. The insensitivity of M. tuberculosis bacilli processing to chloroquine and ammonium chloride did not depend on the viability of the bacteria, since processing of both formaldehyde-fixed dead bacteria and mycobacterial antigens covalently coupled to latex beads was chloroquine insensitive. Thus, the manner in which mycobacterial antigens were taken up by monocytes (particulate versus soluble) influenced the antigen processing pathway for CD4+ and γδ T cells.

Relevance: 20.00%

Abstract:

In our earlier work [1], we employed MVDR (minimum variance distortionless response) based spectral estimation instead of the modified-linear prediction method [2] for pitch modification. Here, we use the Bauer method of MVDR spectral factorization, which leads to a causal inverse filter rather than the noncausal filter setup of MVDR spectral estimation [1]. This filter is used to obtain the source (or residual) signal from pitch-synchronous speech frames. The residual signal is resampled using DCT/IDCT according to the target pitch scale factor. Finally, forward filters realized from the above factorization are used to obtain the pitch-modified speech. The modified speech is evaluated subjectively by 10 listeners, and mean opinion scores (MOS) are tabulated. A modified Bark spectral distortion measure is also computed for objective evaluation. We find that the proposed algorithm performs better than time-domain pitch synchronous overlap [3] and the modified-LP method [2]. With its causal inverse and forward filter setup, the proposed algorithm achieves a good MOS compared with [1].
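The DCT/IDCT resampling step can be sketched generically as follows: take an orthonormal DCT-II of a frame, truncate or zero-pad the coefficients to the new length, rescale, and invert. This is a common DCT-domain resampling recipe and an assumption on our part; the abstract does not say how the authors implement the step or how the target frame length is derived from the pitch scale factor, so `new_len` is simply a parameter here and all function names are hypothetical.

```python
import math


def dct2(x):
    """Orthonormal DCT-II of a real sequence (direct O(N^2) evaluation)."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out


def idct2(coeffs):
    """Inverse of dct2 (orthonormal DCT-III)."""
    n = len(coeffs)
    out = []
    for i in range(n):
        s = coeffs[0] * math.sqrt(1.0 / n)
        s += sum(
            math.sqrt(2.0 / n) * coeffs[k] * math.cos(math.pi * (i + 0.5) * k / n)
            for k in range(1, n)
        )
        out.append(s)
    return out


def resample_dct(x, new_len):
    """Resample x to new_len samples by truncating or zero-padding its DCT."""
    n = len(x)
    coeffs = dct2(x)
    keep = min(n, new_len)
    padded = coeffs[:keep] + [0.0] * (new_len - keep)
    gain = math.sqrt(new_len / n)  # preserves amplitude under the orthonormal scaling
    return idct2([gain * c for c in padded])


# A constant frame keeps its level when stretched from 8 to 12 samples.
stretched_frame = resample_dct([1.0] * 8, 12)
```

The direct evaluation is quadratic in the frame length, which is acceptable for a sketch on short pitch-synchronous frames; a production implementation would use a fast transform.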

Relevance: 20.00%

Abstract:

Titanium alloys such as Ti-6Al-4V are the backbone materials of the aerospace, energy and chemical industries. Hypoeutectic boron addition to Ti-6Al-4V reduces the as-cast grain size by roughly an order of magnitude, which makes it possible to avoid the ingot breakdown step and thereby reduce the processing cost. In the present study, ISM-processed as-cast boron-added Ti-6Al-4V alloy is deformed in the (α+β)-phase field, where α-lath bending appeared to be the dominant deformation mechanism.

Relevance: 20.00%

Abstract:

We address a new problem: improving automatic speech recognition performance given multiple utterances of patterns from the same class. We formulate the problem of jointly decoding K multiple patterns given a single hidden Markov model. It is shown that such a solution is possible by aligning the K patterns using the proposed Multi Pattern Dynamic Time Warping algorithm, followed by the Constrained Multi Pattern Viterbi Algorithm. The new formulation is tested in the context of speaker-independent isolated word recognition for both clean and noisy patterns. When 10 percent of the speech is affected by burst noise at -5 dB signal-to-noise ratio (local), joint decoding using only two noisy patterns reduces the noisy speech recognition error rate to about 51 percent when compared to single-pattern decoding using the Viterbi algorithm. In contrast, a simple maximization of individual pattern likelihoods provides only about a 7 percent reduction in error rate.

Relevance: 20.00%

Abstract:

Visual tracking has been a challenging problem in computer vision for decades. Its applications are far-reaching, ranging from surveillance and monitoring to smart rooms. The mean-shift (MS) tracker, which has gained attention recently, is known for tracking objects in cluttered environments and for its low computational complexity. The major problem with the histogram-based MS tracker is its inability to track rapidly moving objects. To track fast-moving objects, we propose a new robust mean-shift tracker that uses both a spatial similarity measure and a color histogram-based similarity measure. The inability of the MS tracker to handle large displacements is circumvented by the spatial similarity-based tracking module, which on its own lacks robustness to changes in the object's appearance. The proposed tracker tracks fast-moving objects more accurately than either individual tracker.

Relevance: 20.00%

Abstract:

We propose a simple and energy-efficient distributed change detection scheme for sensor networks based on Page's parametric CUSUM algorithm. The sensor observations are IID over time and across the sensors, conditioned on the change variable. Each sensor runs CUSUM and transmits only when its CUSUM statistic is above some threshold. The transmissions from the sensors are fused at the physical layer. The channel is modeled as a multiple access channel (MAC) corrupted by IID noise. The fusion center, which is the global decision maker, performs another CUSUM to detect the change. We provide analysis and simulation results for our scheme and compare its performance with an existing scheme that ensures energy efficiency via optimal power selection.
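As a rough illustration of the per-sensor step, the sketch below runs Page's CUSUM recursion on one sensor's observations. It assumes, purely for illustration, a Gaussian mean shift from `mu0` to `mu1` with known standard deviation `sigma`; the abstract does not state the observation model, and the function name, parameters, and example values are hypothetical. The physical-layer fusion and the second CUSUM at the fusion center are not modeled.

```python
def cusum_gaussian(samples, mu0, mu1, sigma, threshold):
    """Page's CUSUM for a Gaussian mean shift (illustrative assumption).

    Returns the CUSUM statistic after each sample and the index of the first
    sample at which the statistic exceeds `threshold` (None if it never does).
    """
    stat = 0.0
    history = []
    first_crossing = None
    for k, x in enumerate(samples):
        # Log-likelihood ratio of N(mu1, sigma^2) against N(mu0, sigma^2) at x.
        llr = (mu1 - mu0) / sigma ** 2 * (x - (mu0 + mu1) / 2.0)
        stat = max(0.0, stat + llr)
        history.append(stat)
        if first_crossing is None and stat > threshold:
            first_crossing = k
    return history, first_crossing


# Noise-free toy data: the mean jumps from 0 to 1 at index 5.
history, first_crossing = cusum_gaussian(
    [0.0] * 5 + [1.0] * 5, mu0=0.0, mu1=1.0, sigma=1.0, threshold=1.2
)
```

In the scheme described above, a sensor would stay silent while its statistic is at or below the threshold and transmit otherwise, which is where the energy saving comes from.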

Relevance: 20.00%

Abstract:

We address the rate-distortion (R/D) performance optimality of the recently proposed switched split vector quantization (SSVQ) method. The distribution of the source is modeled using a Gaussian mixture density, and thus the non-parametric SSVQ is analyzed in a parametric model-based framework for achieving optimum R/D performance. Using high-rate quantization theory, we derive the optimum bit allocation formulae for the intra-cluster split vector quantizer (SVQ) and the inter-cluster switching. For wide-band speech line spectrum frequency (LSF) parameter quantization, it is shown that the Gaussian mixture model (GMM) based parametric SSVQ method provides a 1 bit/vector advantage over the non-parametric SSVQ method.

Relevance: 20.00%

Abstract:

We propose a new weighting function that is computationally simple and approximates the theoretically derived optimum weighting function reported in the literature. The proposed weighting function is perceptually motivated and provides improved vector quantization performance compared with several previously proposed weighting functions for line spectrum frequency (LSF) parameter quantization of both clean and noisy speech data.

Relevance: 20.00%

Abstract:

The formation of an ω-Al7Cu2Fe phase during laser cladding of quasicrystal-forming Al65Cu23.3Fe11.7 alloy on a pure aluminium substrate is reported. This phase is found to nucleate at the periphery of primary icosahedral-phase particles. A large number of ω-phase particles form an envelope around the icosahedral phase. On the outer side, they form an interface with an α-Al solid solution. Detailed transmission electron microscopic observations show that the ω phase exhibits an orientation relationship with the icosahedral phase. Analysis of experimental results suggests that the ω phase forms by precipitation on an icosahedral phase by heterogeneous nucleation and grows into the aluminium-rich melt until supersaturation is exhausted. The microstructural observations are explained in terms of available models of phase transformations.

Relevance: 20.00%

Abstract:

The hot deformation behaviour of Mg–3Al alloy has been studied using the processing-map technique. Compression tests were conducted in the temperature range 250–550 °C and strain rate range 3 × 10⁻⁴ to 10² s⁻¹, and the flow stress data obtained from the tests were used to develop the processing map. The various domains in the map corresponding to different dissipative characteristics have been identified as follows: (i) grain boundary sliding (GBS) domain accommodated by slip controlled by grain boundary diffusion at slow strain rates (<10⁻³ s⁻¹) in the temperature range from 350 to 450 °C, (ii) two different dynamic recrystallization (DRX) domains with a peak efficiency of 42% at 550 °C/10⁻¹ s⁻¹ and 425 °C/10² s⁻¹ governed by stress-assisted cross-slip and thermally activated climb as the respective rate controlling mechanisms and (iii) dynamic recovery (DRV) domain below 300 °C in the intermediate strain rate range from 3 × 10⁻² to 3 × 10⁻¹ s⁻¹. The regimes of flow instability have also been delineated in the processing map using an instability criterion. Adiabatic shear banding at higher strain rates (>10¹ s⁻¹) and solute drag by substitutional Al atoms at intermediate strain rates (3 × 10⁻² to 3 × 10⁻¹ s⁻¹) in the temperature range (350–450 °C) are responsible for flow instability. The relevance of these mechanisms with reference to hot working practice of the material has been indicated. The processing maps of Mg–3Al alloy and as-cast Mg have been compared qualitatively to elucidate the effect of alloying with aluminum on the deformation behaviour of magnesium.

Relevance: 20.00%

Abstract:

Dense ZrB2-ZrC and ZrB2-ZrCx (x ∼ 0.67) composites have been produced by reactive hot pressing (RHP) of stoichiometric and nonstoichiometric mixtures of Zr and B4C powders at 40 MPa and temperatures up to 1600 °C for 30 minutes. The effect of Ni addition on the reaction kinetics and densification of the composites has been studied. Composites of ∼97 pct relative density (RD) have been produced with the stoichiometric mixture at 1600 °C, while a composite of ∼99 pct RD has been obtained with excess Zr at 1200 °C, suggesting the formation of carbon-deficient ZrCx that significantly aids densification by plastic flow and vacancy diffusion mechanisms. Stoichiometric and nonstoichiometric composites have a hardness of ∼20 GPa. The grain sizes of ZrB2 and ZrCx (x ∼ 0.67) are ∼0.6 and 0.4 μm, respectively, which are finer than those reported in the literature.