325 resultados para Free speech


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-based speech synthesis. Various methods have been proposed to overcome this effect, the most recent ones being global variance (GV) and modulation-spectrum-based post-filter (MSPF). However, there is still a significant quality gap between natural and synthesized speech. In this paper, we propose a two-fold post-filtering technique to alleviate to a certain extent the oversmoothing of spectral and excitation parameter trajectories of HMM-based speech synthesis. For the spectral parameters, we propose a sparse coding-based post-filter to match the trajectories of synthetic speech to that of natural speech, and for the excitation trajectory, we introduce a perceptually motivated post-filter. Experimental evaluations show quality improvement compared with existing methods.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Speech enhancement in stationary noise is addressed using the ideal channel selection framework. In order to estimate the binary mask, we propose to classify each time-frequency (T-F) bin of the noisy signal as speech or noise using Discriminative Random Fields (DRF). The DRF function contains two terms - an enhancement function and a smoothing term. On each T-F bin, we propose to use an enhancement function based on likelihood ratio test for speech presence, while Ising model is used as smoothing function for spectro-temporal continuity in the estimated binary mask. The effect of the smoothing function over successive iterations is found to reduce musical noise as opposed to using only enhancement function. The binary mask is inferred from the noisy signal using Iterated Conditional Modes (ICM) algorithm. Sentences from NOIZEUS corpus are evaluated from 0 dB to 15 dB Signal to Noise Ratio (SNR) in 4 kinds of additive noise settings: additive white Gaussian noise, car noise, street noise and pink noise. The reconstructed speech using the proposed technique is evaluated in terms of average segmental SNR, Perceptual Evaluation of Speech Quality (PESQ) and Mean opinion Score (MOS).

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A method for acylation for heteroarenes under metal-free conditions has been described using NCS as an additive and TBHP as an oxidant. This method has been successfully employed in acylation of a variety of aldehyde with heteroarenes. The application of the method has been illustrated in synthesizing isoquinoline derived natural products. This strategy provides an efficient, mild and inexpensive method for acylation of heteroarenes. (C) 2015 Elsevier Ltd. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We report here the first general method for the geminal diamination and an intermolecular metal-free, geminal aminooxygenation of vinylarenes using hypervalent iodine reagent. A new m-CPBA mediated geminal aminooxygenation is also reported. A novel reagent-switch for the control of migrating group by controlling the two independent geminal addition paths is developed. Deuterium labelling studies and the control studies have provided unambiguous evidences for the phenyl migration and hydride migration in the oxidative geminal difunctionalization process mediated by Phl(OCOCF3)(2) and m-CPBA, respectively through a semi-pinacol rearrangement. (C) 2016 Elsevier Ltd. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Two shape-persistent covalent cages (CC1(r) and CC2(r)) have been devised from triphenyl amine-based trialdehydes and cyclohexane diamine building blocks utilizing the dynamic imine chemistry followed by imine bond reduction. The cage compounds have been characterized by several spectroscopic techniques which suggest that CC1(r) and CC2(r) are 2+3] and 8+12] self-assembled architectures, respectively. These state-of-the-art molecules have a porous interior and stable aromatic backbone with multiple palladium binding sites to engineer the controlled synthesis and stabilization of ultrafine palladium nanoparticles (PdNPs). As-synthesized cage-embedded PdNPs have been characterized by transmission electron microscopy (TEM), scanning electron microscopy (SEM), and powder X-ray diffraction (PXRD). Inductively coupled plasma optical emission spectrometry reveals that Pd@CC1(r) and Pd@CC2(r) have 40 and 25 wt% palladium loading, respectively. On the basis of TEM analysis, it has been estimated that as small as similar to 1.8 nm PdNPs could be stabilized inside the CC1(r), while larger CC2(r) could stabilize similar to 3.7 nm NPs. In contrast, reduction of palladium salts in the absence of the cages form structure less agglomerates. The well-dispersed cage-embedded NPs exhibit efficient catalytic performance in the cyanation of aryl halides under heterogeneous, additive-free condition. Moreover, these materials have excellent stability and recyclability without any agglomeration of PdNPs after several cycles.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Computer Assisted Assessment (CAA) has been existing for several years now. While some forms of CAA do not require sophisticated text understanding (e.g., multiple choice questions), there are also student answers that consist of free text and require analysis of text in the answer. Research towards the latter till date has concentrated on two main sub-tasks: (i) grading of essays, which is done mainly by checking the style, correctness of grammar, and coherence of the essay and (ii) assessment of short free-text answers. In this paper, we present a structured view of relevant research in automated assessment techniques for short free-text answers. We review papers spanning the last 15 years of research with emphasis on recent papers. Our main objectives are two folds. First we present the survey in a structured way by segregating information on dataset, problem formulation, techniques, and evaluation measures. Second we present a discussion on some of the potential future directions in this domain which we hope would be helpful for researchers.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Sn4+-doped In2O3 (ITO) is a benchmark transparent conducting oxide material. We prepared ligand-free but colloidal ITO (8nm, 10% Sn4+) nanocrystals (NCs) by using a post-synthesis surface-modification reaction. (CH3)(3)OBF4 removes the native oleylamine ligand from NC surfaces to give ligand-free, positively charged NCs that form a colloidal dispersion in polar solvents. Both oleylamine-capped and ligand-free ITO NCs exhibit intense absorption peaks, due to localized surface plasmon resonance (LSPR) at around =1950nm. Compared with oleylamine-capped NCs, the electrical resistivity of ligand-free ITO NCs is lower by an order of magnitude (approximate to 35mcm(-1)). Resistivity over a wide range of temperatures can be consistently described as a composite of metallic ITO grains embedded in an insulating matrix by using a simple equivalent circuit, which provides an insight into the conduction mechanism in these systems.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Nanocrystalline powders of Ba1-xMgxZr0.1Ti0.9O3 (x = 0.025-0.1) were synthesized via citrate assisted sol-gel method. Interestingly, the one with x = 0.05 in the system Ba1-xMgxZr0.1Ti0.9O3 exhibited fairly good piezoelectric response aside from the other physical properties. The phase and structural confirmation of synthesized powder was established by X-ray powder diffraction (XRD) and Raman Spectroscopic techniques. Two distinct Raman bands i.e., 303 and 723 cm(-1) characteristic of tetragonal phase were observed. Thermogravimetric analysis (TGA) was performed to evaluate the phase decomposition of the as-synthesized Ba0.95Mg0.05Zr0.1Ti0.9O3 sample as a function of temperature. The average crystallite size associated with Ba0.95Mg0.05Zr0.1Ti0.9O3 was calculated using Scherrer formula based on the XRD data and was found to be 25 nm. However, Scanning and Transmission Electron Microscopy studies revealed the average crystallite size to be in the range of 30-40 nm, respectively. Kubelka-Munk function was employed to determine the optical band gap of these nanocrystallites. A piezoelectric response of 26 pm/V was observed for Ba0.95Mg0.05Zr0.1Ti0.9O3 nanocrystal by Piezoresponse Force Microscopy (PFM) technique. Photoluminescence (PL) study carried out on these nanocrystals exhibited a blue emission (470 nm) at room temperature.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Acoustic feature based speech (syllable) rate estimation and syllable nuclei detection are important problems in automatic speech recognition (ASR), computer assisted language learning (CALL) and fluency analysis. A typical solution for both the problems consists of two stages. The first stage involves computing a short-time feature contour such that most of the peaks of the contour correspond to the syllabic nuclei. In the second stage, the peaks corresponding to the syllable nuclei are detected. In this work, instead of the peak detection, we perform a mode-shape classification, which is formulated as a supervised binary classification problem - mode-shapes representing the syllabic nuclei as one class and remaining as the other. We use the temporal correlation and selected sub-band correlation (TCSSBC) feature contour and the mode-shapes in the TCSSBC feature contour are converted into a set of feature vectors using an interpolation technique. A support vector machine classifier is used for the classification. Experiments are performed separately using Switchboard, TIMIT and CTIMIT corpora in a five-fold cross validation setup. The average correlation coefficients for the syllable rate estimation turn out to be 0.6761, 0.6928 and 0.3604 for three corpora respectively, which outperform those obtained by the best of the existing peak detection techniques. Similarly, the average F-scores (syllable level) for the syllable nuclei detection are 0.8917, 0.8200 and 0.7637 for three corpora respectively. (C) 2016 Elsevier B.V. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Na0.5Bi0.5TiO3- based lead-free piezoelectrics exhibiting giant piezostrain are technologically interesting materials for actuator applications. The lack of clarity with regard to the structure of the nonpolar phase of this system has hindered the understanding of the structural mechanism associated with the giant piezostrain and other related phenomena. In this paper, we have investigated the structure and field-induced phase transformation behavior of a model system (0.94 - x) Na0.5Bi0.5TiO3-0.06BaTiO(3)-xK(0.5)Na(0.5)NbO(3) (0.0 <= x <= 0.025). A detailed structural analysis using neutron powder diffraction revealed that the nonpolar phase is neither cubic nor a mixture of rhombohedral (R3c) and tetragonal (P4bm) phases as commonly reported in literature but exhibits a long-period modulated structure, which is most probably of the type root 2 x root 2 x n with n = 16. Our results suggest that the giant piezoelectric strain is associated with a field-induced phase transformation of the long-period modulated structure to rhombohedral R3c structure above a critical field. We also demonstrate that the giant piezostrain is lost if the system retains a fraction of the field-induced R3c phase. A possible correlation among depolarization temperature, giant piezostrain, and its electrical fatigue behavior has also been indicated.