922 resultados para very low rate speech coding


Relevância:

100.00% 100.00%

Publicador:

Resumo:

This thesis presents an original approach to parametric speech coding at rates below 1 kbitsjsec, primarily for speech storage applications. Essential processes considered in this research encompass efficient characterization of evolutionary configuration of vocal tract to follow phonemic features with high fidelity, representation of speech excitation using minimal parameters with minor degradation in naturalness of synthesized speech, and finally, quantization of resulting parameters at the nominated rates. For encoding speech spectral features, a new method relying on Temporal Decomposition (TD) is developed which efficiently compresses spectral information through interpolation between most steady points over time trajectories of spectral parameters using a new basis function. The compression ratio provided by the method is independent of the updating rate of the feature vectors, hence allows high resolution in tracking significant temporal variations of speech formants with no effect on the spectral data rate. Accordingly, regardless of the quantization technique employed, the method yields a high compression ratio without sacrificing speech intelligibility. Several new techniques for improving performance of the interpolation of spectral parameters through phonetically-based analysis are proposed and implemented in this research, comprising event approximated TD, near-optimal shaping event approximating functions, efficient speech parametrization for TD on the basis of an extensive investigation originally reported in this thesis, and a hierarchical error minimization algorithm for decomposition of feature parameters which significantly reduces the complexity of the interpolation process. Speech excitation in this work is characterized based on a novel Multi-Band Excitation paradigm which accurately determines the harmonic structure in the LPC (linear predictive coding) residual spectra, within individual bands, using the concept 11 of Instantaneous Frequency (IF) estimation in frequency domain. The model yields aneffective two-band approximation to excitation and computes pitch and voicing with high accuracy as well. New methods for interpolative coding of pitch and gain contours are also developed in this thesis. For pitch, relying on the correlation between phonetic evolution and pitch variations during voiced speech segments, TD is employed to interpolate the pitch contour between critical points introduced by event centroids. This compresses pitch contour in the ratio of about 1/10 with negligible error. To approximate gain contour, a set of uniformly-distributed Gaussian event-like functions is used which reduces the amount of gain information to about 1/6 with acceptable accuracy. The thesis also addresses a new quantization method applied to spectral features on the basis of statistical properties and spectral sensitivity of spectral parameters extracted from TD-based analysis. The experimental results show that good quality speech, comparable to that of conventional coders at rates over 2 kbits/sec, can be achieved at rates 650-990 bits/sec.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Research has been undertaken to investigate the use of artificial neural network (ANN) techniques to improve the performance of a low bit-rate vector transform coder. Considerable improvements in the perceptual quality of the coded speech have been obtained. New ANN-based methods for vector quantiser (VQ) design and for the adaptive updating of VQ codebook are introduced for use in speech coding applications.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The need for low bit-rate speech coding is the result of growing demand on the available radio bandwidth for mobile communications both for military purposes and for the public sector. To meet this growing demand it is required that the available bandwidth be utilized in the most economic way to accommodate more services. Two low bit-rate speech coders have been built and tested in this project. The two coders combine predictive coding with delta modulation, a property which enables them to achieve simultaneously the low bit-rate and good speech quality requirements. To enhance their efficiency, the predictor coefficients and the quantizer step size are updated periodically in each coder. This enables the coders to keep up with changes in the characteristics of the speech signal with time and with changes in the dynamic range of the speech waveform. However, the two coders differ in the method of updating their predictor coefficients. One updates the coefficients once every one hundred sampling periods and extracts the coefficients from input speech samples. This is known in this project as the Forward Adaptive Coder. Since the coefficients are extracted from input speech samples, these must be transmitted to the receiver to reconstruct the transmitted speech sample, thus adding to the transmission bit rate. The other updates its coefficients every sampling period, based on information of output data. This coder is known as the Backward Adaptive Coder. Results of subjective tests showed both coders to be reasonably robust to quantization noise. Both were graded quite good, with the Forward Adaptive performing slightly better, but with a slightly higher transmission bit rate for the same speech quality, than its Backward counterpart. The coders yielded acceptable speech quality of 9.6kbps for the Forward Adaptive and 8kbps for the Backward Adaptive.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The aim was to analyse the growth and compositional development of the receptive and expressive lexicons between the ages 0,9 and 2;0 in the full-term (FT) and the very-low-birth-weight (VLBW) children who are acquiring Finnish. The associations between the expressive lexicon and grammar at 1;6 and 2;0 in the FT children were also studied. In addition, the language skills of the VLBW children at 2;0 were analysed, as well as the predictive value of early lexicon to the later language performance. Four groups took part in the studies: the longitudinal (N = 35) and cross-sectional (N = 146) samples of the FT children, and the longitudinal (N = 32) and cross-sectional (N = 66) samples of VLBW children. The data was gathered by applying of the structured parental rating method (the Finnish version of the Communicative Development Inventory), through analysis of the children´s spontaneous speech and by administering a a formal test (Reynell Developmental Language Scales). The FT children acquired their receptive lexicons earlier, at a faster rate and with larger individual variation than their expressive lexicons. The acquisition rate of the expressive lexicon increased from slow to faster in most children (91%). Highly parallel developmental paths for lexical semantic categories were detected in the receptive and expressive lexicons of the Finnish children when they were analysed in relation to the growth of the lexicon size, as described in the literature for children acquiring other languages. The emergence of grammar was closely associated with expressive lexical growth. The VLBW children acquired their receptive lexicons at a slower rate and had weaker language skills at 2;0 than the full-term children. The compositional development of both lexicons happened at a slower rate in the VLBW children when compared to the FT controls. However, when the compositional development was analysed in relation to the growth of lexicon size, this development occurred qualitatively in a nearly parallel manner in the VLBW children as in the FT children. Early receptive and expressive lexicon sizes were significantly associated with later language skills in both groups. The effect of the background variables (gender, length of the mother s basic education, birth weight) on the language development in the FT and the VLBW children differed. The results provide new information of early language acquisition by the Finnish FT and VLBW children. The results support the view that the early acquisition of the semantic lexical categories is related to lexicon growth. The current findings also propose that the early grammatical acquisition is closely related to the growth of expressive vocabulary size. The language development of the VLBW children should be followed in clinical work.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The dynamics of reactions with low internal barriers are studied both analytically and numerically for two different models. Exact expressions for the average rate,kI, are obtained by solving the associated first passage time problems. Both the average rate constant, kI, and the numerically calculated long-time rate constant, kL, show a fractional power law dependence on the barrier height for very low barriers. The crossover of the reaction dynamics from low to high barrier is investigated.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Nanocrystalline silicon thin films were deposited on single-crystal silicon and glass substrates simultaneously by inductively coupled plasma-assisted chemical vapor deposition from the reactive silane reactant gas diluted with hydrogen at a substrate temperature of 200 °C. The effect of hydrogen dilution ratio X (X is defined as the flow rate ratio of hydrogen to silane gas), ranging from 1 to 20, on the structural and optical properties of the deposited films, is extensively investigated by Raman spectroscopy, X-ray diffraction, Fourier transform infrared absorption spectroscopy, UV/VIS spectroscopy, and scanning electron microscopy. Our experimental results reveal that, with the increase of the hydrogen dilution ratio X, the deposition rate Rd and hydrogen content CH are reduced while the crystalline fraction Fc, mean grain size δ and optical bandgap ETauc are increased. In comparison with other plasma enhanced chemical vapor deposition methods of nanocrystalline silicon films where a very high hydrogen dilution ratio X is routinely required (e.g. X > 16), we have achieved nanocrystalline silicon films at a very low hydrogen dilution ratio of 1, featuring a high deposition rate of 1.57 nm/s, a high crystalline fraction of 67.1%, a very low hydrogen content of 4.4 at.%, an optical bandgap of 1.89 eV, and an almost vertically aligned columnar structure with a mean grain size of approximately 19 nm. We have also shown that a sufficient amount of atomic hydrogen on the growth surface essential for the formation of nanocrystalline silicon is obtained through highly-effective dissociation of silane and hydrogen molecules in the high-density inductively coupled plasmas. © 2009 The Royal Society of Chemistry.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Loose saturated sandy soils may undergo liquefaction under cyclic loading, generating positive excess pore pressures due to their contractile nature and inability to dissipate pore pressures rapidly during earthquake loading. These liquefied soils have a near-zero effective stress state, and hence have very low strength and stiffness, causing severe damage to structures founded upon them. The duration for which this near-zero effective stress state persists is a function of the rate of reconsolidation of the liquefied soil, which in turn is a function of the permeability and stiffness of the soil at this very low effective stress. Existing literature based on observation of physical model tests suggests that the consolidation coefficient C v associated with this reconsolidation of liquefied sand is significantly lower than that of the same soil at moderate stress levels. In this paper, the results of a series of novel fluidisation tests in which permeability k and coefficient of consolidation C v were independently measured will be presented. These results allow calculation of the variation of stiffness E 0 and permeability k with effective stress. It is shown that while permeability increases markedly at very low effective stresses, the simultaneous drop in stiffness measured results in a decrease in consolidation coefficient and hence an increase in the duration for which the soil remains liquefied.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Speech coding might have an impact on music perception of cochlear implant users. This questionnaire study compares the musical activities and perception of postlingually deafened cochlear implant users with three different coding strategies (CIS, ACE, SPEAK) using the Munich Music Questionnaire. Overall, the self-reported perception of music of CIS, SPEAK, and ACE users did not differ by very much.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Patients with adult GH deficiency are often dyslipidemic and may have an increased risk of cardiovascular disease. The secretion and clearance of very low density lipoprotein apolipoprotein B 100 (VLDL apoB) are important determinants of plasma lipid concentrations. This study examined the effect of GH replacement therapy on VLDL apoB metabolism using a stable isotope turnover technique. VLDL apoB kinetics were determined in 14 adult patients with GH deficiency before and after 3 months GH or placebo treatment in a randomized double blind, placebo-controlled study using a primed constant [1-(13)C]leucine infusion. VLDL apoB enrichment was determined by gas chromatography-mass spectrometry. GH replacement therapy increased plasma insulin-like growth factor I concentrations 2.9 +/- 0.5-fold (P < 0.001), fasting insulin concentrations 1.8 +/- 0.6-fold (P < 0.04), and hemoglobin A1C from 5.0 +/- 0.2% to 5.3 +/- 0.2% (mean +/- SEM; P < 0.001). It decreased fat mass by 3.4 +/- 1.3 kg (P < 0.05) and increased lean body mass by 3.5 +/- 0.8 kg (P < 0.01). The total cholesterol concentration (P < 0.02), the low density lipoprotein cholesterol concentration (P < 0.02), and the VLDL cholesterol/VLDL apoB ratio (P < 0.005) decreased. GH therapy did not significantly change the VLDL apoB pool size, but increased the VLDL apoB secretion rate from 9.2 +/- 2.0 to 25.9 +/- 10.3 mg/kg x day (P < 0.01) and the MCR from 11.5 +/- 2.7 to 20.3 +/- 3.2 mL/min (P < 0.03). No significant changes were observed in the placebo group. This study suggests that GH replacement therapy improves lipid profile by increasing the removal of VLDL apoB. Although GH therapy stimulates VLDL apoB secretion, this is offset by the increase in the VLDL apoB clearance rate, which we postulate is due to its effects in up-regulating low density lipoprotein receptors and modifying VLDL composition.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Increased cardiovascular mortality in adult growth hormone deficiency (GHD) may be, in part, explained by the dyslipidaemia associated with this condition. It is possible that abnormalities of very low density lipoprotein apolipoprotein B-100 (VLDL apoB) metabolism contribute to this dyslipidaemia. To test this hypothesis, we measured VLDL apoB kinetics in adult GH deficient patients (4 females, 3 males; age 50.1 +/- 4.7 yr (mean +/- SEM); BMI 28.2 +/- 1.1 kg/m2; total cholesterol (TC) 6.6 +/- 0.3 mmol/l; triglyceride (TG) 2.8 +/- 0.6 mmol/l; HDL cholesterol 1.1 +/- 0.1 mmol/l) and in control subjects (4 females, 3 male; age 47.0 +/- 4.7 yr; BMI 27.0 +/- 2.6 kg/m2; TC 5.0 +/- 0.4 mmol/l; TG 0.9 +/- 0.2 mmol/l; HDL cholesterol 1.4 +/- 0.1 mmol/l). [1-(13)C] leucine was administered by a primed (1 mg/kg), constant intravenous infusion (1 mg/kg/hr) and VLDL apoB enrichment with 13C leucine was determined using gas-chromatography mass-spectrometry. The GHD patients had a significantly higher hepatic secretion rate of VLDL apoB (15.5 +/- 1.8 mg/kg/day vs 9.4 +/- 0.6 mg/kg/day p = 0.007) and reduced catabolism ofVLDL apoB (metabolic clearance rate; 12.3 +/- 1.7 ml/min vs 24.3 +/- 4.8 ml/min p < 0.05) compared with control subjects. These findings suggest that GH is integrally involved in the regulation of VLDL apoB metabolism.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: There is a continuous debate regarding the best bottle nipple to be used to enhance the bottle-feeding performance of a preterm infant. Aim: To verify that feeding performance can be improved by using the bottle nipple with the physical characteristics that enhance infants' sucking skills. METHODS: Ten "healthy" VLBW infants (941+/-273 g) were recruited. Feeding performance was monitored at two time periods, when taking 1-2 and 6-8 oral feedings/d. At each time and within 24 h, performance was monitored using three different bottle nipples offered in a randomized order. Rate of milk transfer (ml/min) was the primary outcome measure. The sucking skills monitored comprised stage of sucking, suction amplitude, and duration of the generated negative intraoral suction pressure. RESULTS: At both times, infants demonstrated a similar rate of milk transfer among all three nipples. However, the stage of sucking, suction amplitude, and duration of the generated suction were significantly different between nipples at 1-2, but not 6-8 oral feedings/d.CONCLUSION: We did not identify a particular bottle nipple that enhanced bottle feeding in healthy VLBW infants. Based on the notion that afferent sensory feedback may allow infants to adapt to changing conditions, we speculate that infants can modify their sucking skills in order to maintain a rate of milk transfer that is appropriate with the level of suck-swallow-breathe coordination achieved at a particular time. Therefore, it is proposed that caretakers should be more concerned over monitoring the coordination of suck-swallow-breathe than over the selection of bottle nipples.