988 results for Speech Rate
Abstract:
This paper presents a systematic construction of high-rate, full-diversity space-frequency block codes for MIMO-OFDM systems. While all prior constructions offer a maximum rate of only one complex symbol per channel use, our construction yields a rate equal to the number of transmit antennas and simultaneously achieves full diversity. The proposed construction works for an arbitrary number of transmit antennas and an arbitrary channel power delay profile. A key step in this construction is the generalization of the stacked matrix code design criteria given by Bolcskei et al. (IEEE WCNC 2000). Explicit equivalence of our generalized code design criteria with the Hadamard-product-based criteria of W. Su et al. (IEEE Trans. Sig. Proc., Nov. 2003) is established, and new high-rate codes are constructed using our criteria.
Abstract:
The problem of constructing space-time (ST) block codes over a fixed, desired signal constellation is considered. In this situation, there is a tradeoff between the transmission rate, measured in constellation symbols per channel use, and the transmit diversity gain achieved by the code. The transmit diversity is a measure of the rate of polynomial decay of the pairwise error probability of the code as the signal-to-noise ratio (SNR) increases. In the setting of a quasi-static channel model, let n_t denote the number of transmit antennas and T the block interval. For any n_t <= T, a unified construction of (n_t x T) ST codes is provided here for a class of signal constellations that includes the familiar pulse-amplitude (PAM), quadrature-amplitude (QAM), and 2^K-ary phase-shift-keying (PSK) modulations as special cases. The construction is optimal as measured by the rate-diversity tradeoff and can achieve any given integer point on the rate-diversity tradeoff curve. An estimate of the coding gain realized is given. Other results presented here include i) an extension of the optimal unified construction to the multiple fading block case, ii) a version of the optimal unified construction in which the underlying binary block codes are replaced by trellis codes, iii) a linear dispersion form for the underlying binary block codes, iv) a Gray-mapped version of the unified construction, and v) a generalization of the construction to the S-ary case, corresponding to constellations of size S^K. Items ii) and iii) are aimed at simplifying the decoding of this class of ST codes.
Abstract:
Sequence design and resource allocation for a symbol-asynchronous, chip-synchronous code division multiple access (CDMA) system are considered in this paper. A simple lower bound on the minimum sum power required for a non-oversized system, based on the best achievable for a non-spread system, and an analogous upper bound on the sum rate are first summarised. Subsequently, an algorithm of Sundaresan and Padakandla is shown to achieve the lower bound on minimum sum power (upper bound on sum rate, respectively). Analogous to the synchronous case, by splitting oversized users in a system with processing gain N, a system with no oversized users is easily obtained, and the lower bound on sum power (upper bound on sum rate, respectively) is shown to be achieved by using N orthogonal sequences. The total number of splits is at most N - 1.
Abstract:
A better-performing product code vector quantization (VQ) method is proposed for coding the line spectrum frequency (LSF) parameters; the method is referred to as sequential split vector quantization (SeSVQ). The split sub-vectors of the full LSF vector are quantized in sequence, each using the conditional distribution derived from the previously quantized sub-vectors. Unlike the traditional split vector quantization (SVQ) method, SeSVQ exploits the inter-sub-vector correlation and thus provides improved rate-distortion performance, but at the expense of higher memory. We compare the quantization performance of SeSVQ with that of traditional SVQ and transform-domain split VQ (TrSVQ) methods. Compared to SVQ, SeSVQ saves 1 bit and nearly 3 bits for telephone-band and wide-band speech coding applications, respectively.
Abstract:
Superplastic materials exhibit very large elongations to failure, typically >500%, and this enables commercial forming of complex-shaped components at slow strain rates of ~10^-4 s^-1. We report extraordinary record superplastic elongations to failure of up to 5300% at both high strain rates and low temperature in electrodeposited nanocrystalline Ni and some Ni alloys. Superplasticity is not related to the presence of sulfur or a low-melting phase at grain boundaries. (C) 2010 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.
New Method for Delexicalization and its Application to Prosodic Tagging for Text-to-Speech Synthesis
Abstract:
This paper describes a new flexible delexicalization method based on a glottal-excited parametric speech synthesis scheme. The system utilizes inverse-filtered glottal flow and all-pole modelling of the vocal tract. The method makes it possible to retain and manipulate all relevant prosodic features of any kind of speech. Most importantly, the features include voice quality, which has not been properly modeled in earlier delexicalization methods. The functionality of the new method was tested in a prosodic tagging experiment aimed at providing word prominence data for a text-to-speech synthesis system. The experiment confirmed the usefulness of the method and further corroborated earlier evidence that linguistic factors influence the perception of prosodic prominence.
Abstract:
The low-temperature plastic flow of alpha-zirconium was studied by employing constant-rate tensile tests and differential-stress creep experiments. The activation parameters, enthalpy and area, have been obtained as a function of stress for pure as well as commercial zirconium. The activation area is independent of grain size and purity and falls to about 9b^2 at high stresses. The deformation mechanism below about 700 K is found to be controlled by a single thermally activated process, and not a two-stage activation mechanism. Several dislocation mechanisms are examined, and it is concluded that overcoming the Peierls energy humps by the formation of kink pairs in a length of dislocation is the rate-controlling mechanism. The total energy needed to nucleate a double kink is about 0.8 eV in pure zirconium and 1 eV in commercial zirconium.
Abstract:
Layered LiNi1/3Co1/3Mn1/3O2, which is isostructural with LiCoO2, is considered a potential cathode material for Li-ion batteries. Submicrometer-sized porous particles are useful for high discharge rates. The present work involves the synthesis of submicrometer-sized porous particles of LiNi1/3Co1/3Mn1/3O2 using a triblock copolymer as a soft template. The precursor obtained from the reaction is heated at different temperatures between 600 and 900 °C for 6 h to obtain the final product samples. The compound attains increased crystallinity with an increase in the preparation temperature. However, there is a decrease in the surface area and also in the porosity of the sample. Nevertheless, the LiNi1/3Co1/3Mn1/3O2 sample prepared at 900 °C exhibits a high rate capability and stable capacity retention on cycling. The electrochemical performance of LiNi1/3Co1/3Mn1/3O2 prepared in the absence of the polymer template is inferior to that of the sample prepared in the presence of the polymer template. (C) 2010 The Electrochemical Society. [DOI: 10.1149/1.3364944] All rights reserved.
Abstract:
Nanocrystalline Li4Ti5O12 (LTO) crystallizing in the cubic spinel phase has been synthesized by a single-step solution-combustion method in less than one minute. The LTO particles thus synthesized are flaky and highly porous, with a surface area of 12 m^2/g. Transmission electron micrographs indicate the primary particles to be agglomerated crystallites of varying size between 20 and 50 nm with a 3-dimensional interconnected porous network. During galvanostatic charge-discharge at varying rates, LTO electrodes yield a capacity close to the theoretical value of 175 mA h/g at C/2 rate. The electrodes also exhibit promising capacity retention, with little capacity loss over 100 cycles at varying discharge rates, together with attractive discharge-rate capabilities, yielding capacity values of 140 mA h/g and 70 mA h/g at 10 C and 100 C discharge rates, respectively. The improved electrode performance is ascribed to the nanoscale, highly porous morphology of the electrodes, which provides short diffusion paths for Li in conjunction with electrolyte percolation through the electrode pores, ensuring a high flux of Li.
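To put the quoted rate figures in perspective, the short sketch below works through the arithmetic. It assumes the common convention that 1 C is the current that would deliver the theoretical capacity (175 mA h/g) in one hour; the abstract does not state which reference capacity was used, and the function name `rate_summary` is purely illustrative.

```python
# Assumption: 1 C is referenced to the theoretical capacity quoted in the abstract.
THEORETICAL_CAPACITY = 175.0  # mA h/g


def rate_summary(c_rate, delivered_capacity):
    """Return (current in mA/g, fraction of theoretical capacity, discharge time in minutes)."""
    current = c_rate * THEORETICAL_CAPACITY               # mA/g
    fraction = delivered_capacity / THEORETICAL_CAPACITY  # dimensionless
    minutes = 60.0 * delivered_capacity / current         # (mA h/g) / (mA/g) = h, then to min
    return current, fraction, minutes


for c_rate, capacity in [(10, 140.0), (100, 70.0)]:
    current, fraction, minutes = rate_summary(c_rate, capacity)
    print(f"{c_rate} C: {current:.0f} mA/g, {fraction:.0%} of theoretical, {minutes:.2f} min")
```

Under this assumption, the 10 C figure corresponds to retaining 80% of the theoretical capacity in a discharge lasting under five minutes, and the 100 C figure to 40% in well under a minute.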
Abstract:
We propose a simple speech/music discriminator that uses features based on the HILN (Harmonics, Individual Lines and Noise) model. We tested the strength of the feature set on a standard database of 66 files and obtained an accuracy of around 97%. We also tested on sung queries and polyphonic music and obtained very good results. The current algorithm is being used to discriminate between sung queries and played queries (using an instrument such as a flute) for a Query by Humming (QBH) system currently under development in the lab.
Abstract:
Non-uniform sampling of a signal is formulated as an optimization problem that minimizes the signal reconstruction error. Dynamic programming (DP) has been used to solve this problem efficiently for a finite-duration signal. Further, the optimum samples are quantized to realize a speech coder. The quantizer and the DP-based optimum search for non-uniform samples (DP-NUS) can be combined in a closed-loop manner, which provides a distinct advantage over the open-loop formulation. The DP-NUS formulation provides useful control over the trade-off between bitrate and performance (reconstruction error). It is shown that a 5-10 dB SNR improvement is possible using DP-NUS compared to the extrema sampling approach. In addition, the closed-loop DP-NUS gives a 4-5 dB improvement in reconstruction error.
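The idea of choosing sample locations by dynamic programming can be illustrated with a minimal sketch. The abstract does not specify the reconstruction method or the error measure, so the code below assumes linear interpolation between kept samples, squared error, and that the first and last samples are always kept; the function names are hypothetical, and the quantizer and closed-loop variant are not modelled.

```python
def segment_error(x, i, j):
    """Squared error of linearly interpolating x between kept samples i and j."""
    err = 0.0
    for t in range(i + 1, j):
        interp = x[i] + (x[j] - x[i]) * (t - i) / (j - i)
        err += (x[t] - interp) ** 2
    return err


def dp_nonuniform_samples(x, k):
    """Pick k sample indices (endpoints included) that minimize interpolation error."""
    n = len(x)
    if not 2 <= k <= n:
        raise ValueError("k must satisfy 2 <= k <= len(x)")
    inf = float("inf")
    # best[m][j]: least error reconstructing x[0..j] with m kept samples, the last one at j
    best = [[inf] * n for _ in range(k + 1)]
    prev = [[-1] * n for _ in range(k + 1)]
    best[1][0] = 0.0  # the first sample is always kept
    for m in range(2, k + 1):
        for j in range(m - 1, n):
            for i in range(m - 2, j):
                if best[m - 1][i] == inf:
                    continue
                cand = best[m - 1][i] + segment_error(x, i, j)
                if cand < best[m][j]:
                    best[m][j] = cand
                    prev[m][j] = i
    # Backtrack from the last sample to recover the chosen indices.
    indices = []
    j = n - 1
    for m in range(k, 0, -1):
        indices.append(j)
        j = prev[m][j]
    indices.reverse()
    return indices, best[k][n - 1]


signal = [0, 1, 2, 3, 2, 1, 0]
print(dp_nonuniform_samples(signal, 3))
```

For the triangular test signal, three samples suffice: the search keeps the two endpoints and the peak, and the interpolation error is zero. Varying `k` is the knob that trades the number of transmitted samples against reconstruction error.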
Abstract:
For p x n complex orthogonal designs in k variables, where p is the number of channel uses and n is the number of transmit antennas, the maximal rate L of the design is asymptotically one half as n increases. However, for such maximal-rate codes, the decoding delay p increases exponentially. If, to control the delay, we impose the restriction p = n, i.e., consider only square designs, then the rate decreases exponentially as n increases. This necessitates the study of the maximal rate of designs with restrictions of the form p = n+1, p = n+2, p = n+3, etc. In this paper, we study the maximal rate of complex orthogonal designs with the restrictions p = n+1 and p = n+2. We derive upper and lower bounds on the maximal rate for p = n+1 and p = n+2. Also, for the case p = n+1, we show that if the orthogonal design admits only the variables, their negatives, multiples of these by sqrt(-1), and zeros as entries of the matrix (other complex linear combinations are not allowed), then the maximal rate always equals the lower bound.
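As a concrete instance of the definitions used above, the sketch below checks the orthogonality property for the well-known 2 x 2 Alamouti design, a square design (p = n = 2) in k = 2 variables with rate k/p = 1. It is offered only as an illustration of what "complex orthogonal design" means, not as one of the designs studied in the paper; the helper names are hypothetical.

```python
def alamouti(x1, x2):
    """2 x 2 design: rows are channel uses (p = 2), columns are transmit antennas (n = 2)."""
    return [[x1, x2],
            [-x2.conjugate(), x1.conjugate()]]


def gram(g):
    """Return G^H G, where G^H is the conjugate transpose of G."""
    p, n = len(g), len(g[0])
    return [[sum(g[t][a].conjugate() * g[t][b] for t in range(p))
             for b in range(n)]
            for a in range(n)]


x1, x2 = 1 + 2j, 3 - 1j
energy = abs(x1) ** 2 + abs(x2) ** 2
# For an orthogonal design, G^H G equals (|x1|^2 + |x2|^2) times the identity.
for row in gram(alamouti(x1, x2)):
    print(row)
print("expected diagonal:", energy, "rate k/p:", 2 / 2)
```

The off-diagonal entries vanish for any choice of x1 and x2, which is the orthogonality condition; every entry of the matrix is a variable, its negative, or a conjugate of these.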