244 resultados para Speech processing


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Power dissipation maps have been generated in the temperature range of 900 degrees C to 1150 degrees C and strain rate range of 10(-3) to 10 s(-1) for a cast aluminide alloy Ti-24Al-20Nb using dynamic material model. The results define two distinct regimes of temperature and strain rate in which efficiency of power dissipation is maximum. The first region, centered around 975 degrees C/0.1 s(-1), is shown to correspond to dynamic recrystallization of the alpha(2) phase and the second, centered around 1150 degrees C/0.001 s(-1), corresponds to dynamic recovery and superplastic deformation of the beta phase. Thermal activation analysis using the power law creep equation yielded apparent activation energies of 854 and 627 kJ/mol for the first and second regimes, respectively. Reanalyzing the data by alternate methods yielded activation energies in the range of 170 to 220 kJ/mol and 220 to 270 kJ/mol for the first and second regimes, respectively. Cross slip was shown to constitute the activation barrier in both cases. Two distinct regimes of processing instability-one at high strain rates and the other at the low strain rates in the lower temperature regions-have been identified, within which shear bands are formed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Al-Li-SiCp composites were fabricated by a simple and cost effective stir casting technique. A compound billet technique has been developed to overcome the problems encountered during hot extrusion of these composites. After successful fabrication hardness measurement and room temperature compressive test were carried out on 8090 Al and its composites reinforced with 8, 12 and 18vol.% SiC particles in as extruded and peak aged conditions. The addition of SiC increases the hardness. 0.2% proof stress and compressive strength of Al-Li-8%SiC and Al-Li-12%SiC composites are higher than the unreinforced alloy. in case of the Al-Li-18%SiC composite, the 0.2% proof stress and compressive strength were higher than the unreinforced alloy but lower than those of Al-Li-8%SiC and Al-Li-12%SiC composites. This is attributed to clustering of particles and poor interfacial bonding.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In the present investigation, two nozzle configurations are used for spray deposition, convergent nozzle (nozzle-A), and convergent nozzle with 2 mm parallel portion attached at its end (nozzle-C) without changing the exit area. First, the conditions for subambient aspiration pressure, i.e., pressure at the tip of the melt delivery tube, are established by varying the protrusion length of the melt delivery tube at different applied gas pressures for both of the nozzles. Using these conditions, spray deposits in a reproducible manner are successfully obtained for 7075 Al alloy. The effect of applied gas pressure, flight distance, and nozzle configuration on various characteristics of spray deposition, viz., yield, melt flow rate, and gas-to-metal ratio, is examined. The over-spray powder is also characterized with respect to powder size distribution, shape, and microstructure. Some of the results are explained with the help of numerical analysis presented in an earlier article.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A systematic study of Ar ion implantation in cupric oxide films has been reported. Oriented CuO films were deposited by pulsed excimer laser ablation technique on (1 0 0) YSZ substrates. X-ray diffraction (XRD) spectra showed the highly oriented nature of the deposited CuO films. The films were subjected to ion bombardment for studies of damage formation, Implantations were carried out using 100 keV Arf over a dose range between 5 x 10(12) and 5 x 10(15) ions/cm(2). The as-deposited and ion beam processed samples were characterized by XRD technique and resistance versus temperature (R-T) measurements. The activation energies for electrical conduction were found from In [R] versus 1/T curves. Defects play an important role in the conduction mechanism in the implanted samples. The conductivity of the film increases, and the corresponding activation energy decreases with respect to the dose value.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We address the issue of complexity for vector quantization (VQ) of wide-band speech LSF (line spectrum frequency) parameters. The recently proposed switched split VQ (SSVQ) method provides better rate-distortion (R/D) performance than the traditional split VQ (SVQ) method, even at the requirement of lower computational complexity. but at the expense of much higher memory. We develop the two stage SVQ (TsSVQ) method, by which we gain both the memory and computational advantages and still retain good R/D performance. The proposed TsSVQ method uses a full dimensional quantizer in its first stage for exploiting all the higher dimensional coding advantages and then, uses an SVQ method for quantizing the residual vector in the second stage so as to reduce the complexity. We also develop a transform domain residual coding method in this two stage architecture such that it further reduces the computational complexity. To design an effective residual codebook in the second stage, variance normalization of Voronoi regions is carried out which leads to the design of two new methods, referred to as normalized two stage SVQ (NTsSVQ) and normalized two stage transform domain SVQ (NTsTrSVQ). These two new methods have complimentary strengths and hence, they are combined in a switched VQ mode which leads to the further improvement in R/D performance, but retaining the low complexity requirement. We evaluate the performances of new methods for wide-band speech LSF parameter quantization and show their advantages over established SVQ and SSVQ methods.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The paper presents a new adaptive delta modulator, called the hybrid constant factor incremental delta modulator (HCFIDM), which uses instantaneous as well as syllabic adaptation of the step size. Three instantaneous algorithms have been used: two new instantaneous algorithms (CFIDM-3 and CFIDM-2) and the third, Song's voice ADM (SVADM). The quantisers have been simulated on a digital computer and their performances studied. The figure of merit used is the SNR with correlated, /?C-shaped Gaussian signals and real speech as the input. The results indicate that the hybrid technique is superior to the nonhybrid adaptive quantisers. Also, the two new instantaneous algorithms developed have improved SNR and fast response to step inputs as compared to the earlier systems.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We are addressing the problem of jointly using multiple noisy speech patterns for automatic speech recognition (ASR), given that they come from the same class. If the user utters a word K times, the ASR system should try to use the information content in all the K patterns of the word simultaneously and improve its speech recognition accuracy compared to that of the single pattern based speech recognition. T address this problem, recently we proposed a Multi Pattern Dynamic Time Warping (MPDTW) algorithm to align the K patterns by finding the least distortion path between them. A Constrained Multi Pattern Viterbi algorithm was used on this aligned path for isolated word recognition (IWR). In this paper, we explore the possibility of using only the MPDTW algorithm for IWR. We also study the properties of the MPDTW algorithm. We show that using only 2 noisy test patterns (10 percent burst noise at -5 dB SNR) reduces the noisy speech recognition error rate by 37.66 percent when compared to the single pattern recognition using the Dynamic Time Warping algorithm.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Al-5 wt pct Si alloy is processed by upset forging in the temperature range 300 K to 800 K and in the strain rate range 0.02 to 200 s−1. The hardness and tensile properties of the product have been studied. A “safe” window in the strain rate-temperature field has been identified for processing of this alloy to obtain maximum tensile ductility in the product. For the above strain rate range, the temperature range of processing is 550 K to 700 K for obtaining high ductility in the product. On the basis of microstructure and the ductility of the product, the temperature-strain rate regimes of damage due to cavity formation at particles and wedge cracking have been isolated for this alloy. The tensile fracture features recorded on the product specimens are in conformity with the above damage mechanisms. A high temperature treatment above ≈600 K followed by fairly fast cooling gives solid solution strengthening in the alloy at room temperature.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper investigates the problem of designing reverse channel training sequences for a TDD-MIMO spatial-multiplexing system. Assuming perfect channel state information at the receiver and spatial multiplexing at the transmitter with equal power allocation to them dominant modes of the estimated channel, the pilot is designed to ensure an stimate of the channel which improves the forward link capacity. Using perturbation techniques, a lower bound on the forward link capacity is derived with respect to which the training sequence is optimized. Thus, the reverse channel training sequence makes use of the channel knowledge at the receiver. The performance of orthogonal training sequence with MMSE estimation at the transmitter and the proposed training sequence are compared. Simulation results show a significant improvement in performance.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Using analysis-by-synthesis (AbS) approach, we develop a soft decision based switched vector quantization (VQ) method for high quality and low complexity coding of wideband speech line spectral frequency (LSF) parameters. For each switching region, a low complexity transform domain split VQ (TrSVQ) is designed. The overall rate-distortion (R/D) performance optimality of new switched quantizer is addressed in the Gaussian mixture model (GMM) based parametric framework. In the AbS approach, the reduction of quantization complexity is achieved through the use of nearest neighbor (NN) TrSVQs and splitting the transform domain vector into higher number of subvectors. Compared to the current LSF quantization methods, the new method is shown to provide competitive or better trade-off between R/D performance and complexity.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Experiments are described which show that a monobath can be used for rapid in situ processing in a liquid gate for real-time holographic interferometry. This also permits utilization of a very simple solution handling system. Changes in emulsion thickness are reduced to an acceptable level and problems of matching refractive indices are eliminated by exposing and viewing the holograms in water. Excellent null patterns are obtained and real-time holographic interferometry can be carried out over long periods of time.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Recently established moderate size free piston driven hypersonic shock tunnel HST3 along with its calibration is described here. The extreme thermodynamic conditions prevalent behind the reflected shock wave have been utilized to study the catalytic and non-catalytic reactions of shock heated test gases like Ar, N2 or O2 with different material like C60 carbon, zirconia and ceria substituted zirconia. The exposed test samples are investigated using different experimental methods. These studies show the formation of carbon nitride due to the non-catalytic interaction of shock heated nitrogen gas with C60 carbon film. On the other hand, the ZrO2 undergoes only phase transformation from cubic to monoclinic structure and Ce0.5Zr0.5O2 in fluorite cubic phase changes to pyrochlore (Ce2Zr2O7±δ) phase by releasing oxygen from the lattice due to heterogeneous catalytic surface reaction.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We propose a novel technique for robust voiced/unvoiced segment detection in noisy speech, based on local polynomial regression. The local polynomial model is well-suited for voiced segments in speech. The unvoiced segments are noise-like and do not exhibit any smooth structure. This property of smoothness is used for devising a new metric called the variance ratio metric, which, after thresholding, indicates the voiced/unvoiced boundaries with 75% accuracy for 0dB global signal-to-noise ratio (SNR). A novelty of our algorithm is that it processes the signal continuously, sample-by-sample rather than frame-by-frame. Simulation results on TIMIT speech database (downsampled to 8kHz) for various SNRs are presented to illustrate the performance of the new algorithm. Results indicate that the algorithm is robust even in high noise levels.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We investigate the use of a two stage transform vector quantizer (TSTVQ) for coding of line spectral frequency (LSF) parameters in wideband speech coding. The first stage quantizer of TSTVQ, provides better matching of source distribution and the second stage quantizer provides additional coding gain through using an individual cluster specific decorrelating transform and variance normalization. Further coding gain is shown to be achieved by exploiting the slow time-varying nature of speech spectra and thus using inter-frame cluster continuity (ICC) property in the first stage of TSTVQ method. The proposed method saves 3-4 bits and reduces the computational complexity by 58-66%, compared to the traditional split vector quantizer (SVQ), but at the expense of 1.5-2.5 times of memory.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Further improvement in performance, to achieve near transparent quality LSF quantization, is shown to be possible by using a higher order two dimensional (2-D) prediction in the coefficient domain. The prediction is performed in a closed-loop manner so that the LSF reconstruction error is the same as the quantization error of the prediction residual. We show that an optimum 2-D predictor, exploiting both inter-frame and intra-frame correlations, performs better than existing predictive methods. Computationally efficient split vector quantization technique is used to implement the proposed 2-D prediction based method. We show further improvement in performance by using weighted Euclidean distance.