46 resultados para JFA
Resumo:
This paper investigates the use of mel-frequency deltaphase (MFDP) features in comparison to, and in fusion with, traditional mel-frequency cepstral coefficient (MFCC) features within joint factor analysis (JFA) speaker verification. MFCC features, commonly used in speaker recognition systems, are derived purely from the magnitude spectrum, with the phase spectrum completely discarded. In this paper, we investigate if features derived from the phase spectrum can provide additional speaker discriminant information to the traditional MFCC approach in a JFA based speaker verification system. Results are presented which provide a comparison of MFCC-only, MFDPonly and score fusion of the two approaches within a JFA speaker verification approach. Based upon the results presented using the NIST 2008 Speaker Recognition Evaluation (SRE) dataset, we believe that, while MFDP features alone cannot compete with MFCC features, MFDP can provide complementary information that result in improved speaker verification performance when both approaches are combined in score fusion, particularly in the case of shorter utterances.
Resumo:
This work aims to take advantage of recent developments in joint factor analysis (JFA) in the context of a phonetically conditioned GMM speaker verification system. Previous work has shown performance advantages through phonetic conditioning, but this has not been shown to date with the JFA framework. Our focus is particularly on strategies for combining the phone-conditioned systems. We show that the classic fusion of the scores is suboptimal when using multiple GMM systems. We investigate several combination strategies in the model space, and demonstrate improvement over score-level combination as well as over a non-phonetic baseline system. This work was conducted during the 2008 CLSP Workshop at Johns Hopkins University.
Resumo:
This work presents an extended Joint Factor Analysis model including explicit modelling of unwanted within-session variability. The goals of the proposed extended JFA model are to improve verification performance with short utterances by compensating for the effects of limited or imbalanced phonetic coverage, and to produce a flexible JFA model that is effective over a wide range of utterance lengths without adjusting model parameters such as retraining session subspaces. Experimental results on the 2006 NIST SRE corpus demonstrate the flexibility of the proposed model by providing competitive results over a wide range of utterance lengths without retraining and also yielding modest improvements in a number of conditions over current state-of-the-art.
Resumo:
In this paper we extend the concept of speaker annotation within a single-recording, or speaker diarization, to a collection wide approach we call speaker attribution. Accordingly, speaker attribution is the task of clustering expectantly homogenous intersession clusters obtained using diarization according to common cross-recording identities. The result of attribution is a collection of spoken audio across multiple recordings attributed to speaker identities. In this paper, an attribution system is proposed using mean-only MAP adaptation of a combined-gender UBM to model clusters from a perfect diarization system, as well as a JFA-based system with session variability compensation. The normalized cross-likelihood ratio is calculated for each pair of clusters to construct an attribution matrix and the complete linkage algorithm is employed to conduct clustering of the inter-session clusters. A matched cluster purity and coverage of 87.1% was obtained on the NIST 2008 SRE corpus.
Resumo:
Robust speaker verification on short utterances remains a key consideration when deploying automatic speaker recognition, as many real world applications often have access to only limited duration speech data. This paper explores how the recent technologies focused around total variability modeling behave when training and testing utterance lengths are reduced. Results are presented which provide a comparison of Joint Factor Analysis (JFA) and i-vector based systems including various compensation techniques; Within-Class Covariance Normalization (WCCN), LDA, Scatter Difference Nuisance Attribute Projection (SDNAP) and Gaussian Probabilistic Linear Discriminant Analysis (GPLDA). Speaker verification performance for utterances with as little as 2 sec of data taken from the NIST Speaker Recognition Evaluations are presented to provide a clearer picture of the current performance characteristics of these techniques in short utterance conditions.
Resumo:
We consider convolution equations of the type f * T = g, where f, g is an element of L-P (R-n) and T is a compactly supported distribution. Under natural assumptions on the zero set of the Fourier transform of T, we show that f is compactly supported, provided g is. Similar results are proved for non-compact symmetric spaces as well. (C) 2010 Elsevier Inc. All rights reserved.
Resumo:
Fluorescence and stopped-flow spectrophotometric studies on three plant lectins fromPsophocarpus tetragonolobus (winged bean),Glycine max (soybean) andArtocarpus integrifolia (jack fruit) have been studied usingN-dansylgalactosamine as a fluorescent ligand. The best monosaccharide for the winged bean agglutinin I (WBA I) and soybean (SBA) is Me-agrGalNAc and for jack fruit agglutinin (JFA) is Me-agrGal. Examination of the percentage enhancement and association constants (1.51×106, 6.56×106 and 4.17×105 M–1 for SBA, WBA I and JFA, respectively) suggests that the combining regions of the lectins SBA and WBA I are apolar whereas that of JFA is polar. Thermodynamic parameters obtained for the binding of several monosaccharides to these lectins are enthalpically favourable. The binding of monosaccharides to these lectins suggests that the-OH groups at C-1, C-2, C-4 and C-6 in thed-galactose configuration are important loci for interaction with these lectins. An important finding is that the JFA binds specifically to Galß1-3GaINAc with much higher affinity than the other disaccharides which are structurally and topographically similar.The results of stopped-flow spectrometry on the binding ofN-dansylgalactosamine to these lectins are consistent with a bimolecular single step mechanism. The association rate constants (2.4×105, 1.3×104, and 11.7×105 M–1 sec–1 for SBA, WBA I and JFA, respectively) obtained are several orders of magnitude slower than the ones expected for diffusion controlled reactions. The dissociation rate constants (0.2, 3.2×10–2, 83.3 sec–1 for SBA, WBA I and JFA, respectively) obtained for the dissociation ofN-dansylgalactosamine from its lectin complex are slowest for SBA and WBA I when compared with any other lectin-ligand dissociation process.
Resumo:
In this article we deal with a variation of a theorem of Mauceri concerning the L-P boundedness of operators M which are known to be bounded on L-2. We obtain sufficient conditions on the kernel of the operator M so that it satisfies weighted L-P estimates. As an application we prove L-P boundedness of Hermite pseudo-multipliers. (C) 2014 Elsevier Inc. All rights reserved.
Resumo:
The purpose of the present paper is to lay the foundations for a systematic study of tensor products of operator systems. After giving an axiomatic definition of tensor products in this category, we examine in detail several particular examples of tensor products, including a minimal, maximal, maximal commuting, maximal injective and some asymmetric tensor products. We characterize these tensor products in terms of their universal properties and give descriptions of their positive cones. We also characterize the corresponding tensor products of operator spaces induced by a certain canonical inclusion of an operator space into an operator system. We examine notions of nuclearity for our tensor products which, on the category of C*-algebras, reduce to the classical notion. We exhibit an operator system S which is not completely order isomorphic to a C*-algebra yet has the property that for every C*-algebra A, the minimal and maximal tensor product of S and A are equal.