18 resultados para Speech and Audio Research Laboratory


Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper presents speaker normalization approaches for audio search task. Conventional state-of-the-art feature set, viz., Mel Frequency Cepstral Coefficients (MFCC) is known to contain speaker-specific and linguistic information implicitly. This might create problem for speaker-independent audio search task. In this paper, universal warping-based approach is used for vocal tract length normalization in audio search. In particular, features such as scale transform and warped linear prediction are used to compensate speaker variability in audio matching. The advantage of these features over conventional feature set is that they apply universal frequency warping for both the templates to be matched during audio search. The performance of Scale Transform Cepstral Coefficients (STCC) and Warped Linear Prediction Cepstral Coefficients (WLPCC) are about 3% higher than the state-of-the-art MFCC feature sets on TIMIT database.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We propose apractical, feature-level and score-level fusion approach by combining acoustic and estimated articulatory information for both text independent and text dependent speaker verification. From a practical point of view, we study how to improve speaker verification performance by combining dynamic articulatory information with the conventional acoustic features. On text independent speaker verification, we find that concatenating articulatory features obtained from measured speech production data with conventional Mel-frequency cepstral coefficients (MFCCs) improves the performance dramatically. However, since directly measuring articulatory data is not feasible in many real world applications, we also experiment with estimated articulatory features obtained through acoustic-to-articulatory inversion. We explore both feature level and score level fusion methods and find that the overall system performance is significantly enhanced even with estimated articulatory features. Such a performance boost could be due to the inter-speaker variation information embedded in the estimated articulatory features. Since the dynamics of articulation contain important information, we included inverted articulatory trajectories in text dependent speaker verification. We demonstrate that the articulatory constraints introduced by inverted articulatory features help to reject wrong password trials and improve the performance after score level fusion. We evaluate the proposed methods on the X-ray Microbeam database and the RSR 2015 database, respectively, for the aforementioned two tasks. Experimental results show that we achieve more than 15% relative equal error rate reduction for both speaker verification tasks. (C) 2015 Elsevier Ltd. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The characteristics of neurological, psychiatric, developmental and substance-use disorders in low-and middle-income countries are unique and the burden that they have will be different from country to country. Many of the differences are explained by the wide variation in population demographics and size, poverty, conflict, culture, land area and quality, and genetics. Neurological, psychiatric, developmental and substance-use disorders that result from, or are worsened by, a lack of adequate nutrition and infectious disease still afflict much of sub-Saharan Africa, although disorders related to increasing longevity, such as stroke, are on the rise. In the Middle East and North Africa, major depressive disorders and post-traumatic stress disorder are a primary concern because of the conflict-ridden environment. Consanguinity is a serious concern that leads to the high prevalence of recessive disorders in the Middle East and North Africa and possibly other regions. The burden of these disorders in Latin American and Asian countries largely surrounds stroke and vascular disease, dementia and lifestyle factors that are influenced by genetics. Although much knowledge has been gained over the past 10 years, the epidemiology of the conditions in low-and middle-income countries still needs more research. Prevention and treatments could be better informed with more longitudinal studies of risk factors. Challenges and opportunities for ameliorating nervous-system disorders can benefit from both local and regional research collaborations. The lack of resources and infrastructure for health-care and related research, both in terms of personnel and equipment, along with the stigma associated with the physical or behavioural manifestations of some disorders have hampered progress in understanding the disease burden and improving brain health. Individual countries, and regions within countries, have specific needs in terms of research priorities.