Digital Library

924 results for acoustic speech recognition system

Subjective and objective measures of adult bimodal users' listening

Relevance: 100.00%

Abstract:

Inconsistencies exist between traditional objective measures such as speech recognition and localization, and subjective reports of bimodal benefit. The purpose of this study was to expand the set of objective measures of bimodal benefit to include non-traditional listening tests, and to examine possible correlations between objective measures of auditory perception and subjective satisfaction reports.

The effects of difficulty and gain versus loss on vocal physiology and acoustics

Relevance: 100.00%

Abstract:

To examine the basis of emotional changes to the voice, physiological and electroglottal measures were combined with acoustic speech analysis of 30 men performing a computer task in which they lost or gained points under two levels of difficulty. Predictions of the main effects of difficulty and reward on the voice were not borne out by the data. Instead, vocal changes depended largely on interactions between gain versus loss and difficulty. The rate at which the vocal folds open and close (fundamental frequency; f0) was higher for loss than for gain when difficulty was high, but not when difficulty was low. Electroglottal measures revealed that f0 changes corresponded to shorter glottal open times for the loss conditions. Longer closed and shorter open phases were consistent with raised laryngeal tension in difficult loss conditions. Similarly, skin conductance indicated higher sympathetic arousal in loss than gain conditions, particularly when difficulty was high. The results provide evidence of the physiological basis of affective vocal responses, confirming the utility of measuring physiology and voice in the study of emotion.

Neural network feature maps for Chinese phonemes

Relevance: 100.00%

Abstract:

A number of experiments have shown that neural networks can be used for a phonetic typewriter. The algorithms can be viewed as producing self-organizing feature maps that correspond to phonemes. In Chinese, the utterance of a character consists of a very simple string of Chinese phonemes. With this as a starting point, a neural network feature map for Chinese phonemes can be built up. In this paper, feature map structures for Chinese phonemes are discussed and tested. This research on a Chinese phonetic feature map is important both for Chinese speech recognition and for building a Chinese phonetic typewriter.

Lexical access across languages: a multinomial model of auditory distraction

Relevance: 100.00%

Abstract:

Recall in many types of verbal memory task is reliably disrupted by the presence of auditory distracters, with verbal distracters frequently proving the most disruptive (Beaman, 2005). A multinomial processing tree model (Schweickert, 1993) is applied to the effects on free recall of background speech from a known or an unknown language. The model reproduces the free recall curve and the impact on memory of verbal distracters for which a lexical entry exists (i.e., verbal items from a known language). The effect of semantic relatedness of distracters within a language is found to depend upon a redintegrative factor thought to reflect the contribution of the speech-production system. The differential impacts of known and unknown languages cannot be accounted for in this way, but the same effects of distraction are observed amongst bilinguals, regardless of distracter language.

Spectral modulation detection in normal hearing children

Relevance: 100.00%

Abstract:

This Capstone Project attempts to determine the ability of normal hearing children to resolve spectral information, and the relationship between spectral resolution ability and speech recognition ability in noise. This study also examines how these abilities develop with age.

Wavelet-based dynamic time warping

Relevance: 100.00%

Abstract:

Dynamic Time Warping (DTW), a pattern matching technique traditionally used for restricted-vocabulary speech recognition, is based on a temporal alignment of the input signal with the template models. The principal drawback of DTW is its high computational cost as the lengths of the signals increase. This paper extends the results of our previously published conference paper, which introduced an optimized version of DTW that is based on the Discrete Wavelet Transform (DWT). (C) 2008 Elsevier B.V. All rights reserved.
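
The abstract does not describe the optimized algorithm itself, so the following is only a minimal sketch of the two ingredients it names. `dtw_distance` is the classic dynamic-programming DTW, whose table has one cell per pair of samples (hence the cost growing with signal length). `haar_approximation` is one level of a Haar DWT approximation; running DTW on the approximations is one plausible way a wavelet transform could cut the cost, and is an assumption here, not the paper's method. Both function names and the sample signals are hypothetical.

```python
def dtw_distance(a, b):
    """Classic DTW: minimal cumulative |a[i] - b[j]| over monotone alignments."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # a advances alone
                                 cost[i][j - 1],      # b advances alone
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]


def haar_approximation(x):
    """One level of Haar DWT approximation coefficients (length is halved).

    A trailing sample is dropped when len(x) is odd.
    """
    scale = 2 ** 0.5
    return [(x[k] + x[k + 1]) / scale for k in range(0, len(x) - 1, 2)]


# Made-up signals: the second is the first delayed by one sample.
template = [0, 1, 2, 3, 2, 1, 0, 0]
signal = [0, 0, 1, 2, 3, 2, 1, 0]

full = dtw_distance(template, signal)  # 8 x 8 table
coarse = dtw_distance(haar_approximation(template),
                      haar_approximation(signal))  # 4 x 4 table
```

Each level of approximation halves both sequences, so the DTW table shrinks by roughly a factor of four, at the price of comparing smoothed signals.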

Fingerprint Segmentation

Relevance: 100.00%

Abstract:

In this thesis, a new algorithm is proposed to segment the foreground of a fingerprint from the image under consideration. The algorithm uses three features: mean, variance, and coherence. Based on these features, a rule system is built to help the algorithm segment the image efficiently. In addition, the proposed algorithm combines split-and-merge with a modified Otsu method. Enhancement techniques such as Gaussian filtering and histogram equalization are applied to improve the quality of the image. Finally, a post-processing technique is implemented to counter undesirable effects in the segmented image. Fingerprint recognition is one of the oldest recognition systems in biometrics. Everyone has a unique and unchangeable fingerprint. Based on this uniqueness and distinctness, fingerprint identification has been used in many applications for a long time. A fingerprint image is a pattern that consists of two regions, foreground and background. The foreground contains all the important information needed by automatic fingerprint recognition systems. The background, however, is a noisy region that contributes to the extraction of false minutiae. To avoid extracting false minutiae, several steps should be followed, such as preprocessing and enhancement. One of these steps is the transformation of the fingerprint image from a gray-scale image to a black-and-white image. This transformation is called segmentation or binarization. The aim of fingerprint segmentation is to separate the foreground from the background. Due to the nature of fingerprint images, segmentation is an important and challenging task. The proposed algorithm is applied to the FVC2000 database. Manual examination by human experts shows that the proposed algorithm provides efficient segmentation results. These improved results are demonstrated in diverse experiments.
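
To make the binarization step concrete, here is a minimal sketch of the standard Otsu threshold, which picks the gray level that maximizes the between-class variance of the histogram. This is the textbook method, not the modified variant or the split-and-merge rule system that the thesis proposes, neither of which is specified in the abstract. The function name `otsu_threshold` and the pixel values are hypothetical.

```python
def otsu_threshold(pixels, levels=256):
    """Return the gray level t that best splits pixels into <= t and > t."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with gray level <= t
    sum0 = 0  # sum of gray levels of those pixels
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0:
            continue  # no pixels in the lower class yet
        if w1 == 0:
            break     # upper class is empty from here on
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        # Between-class variance, up to a constant factor of 1 / total**2.
        between = w0 * w1 * (mu0 - mu1) ** 2
        if between > best_var:
            best_t, best_var = t, between
    return best_t


# Made-up flat "image": 50 dark pixels and 50 bright pixels.
pixels = [10] * 50 + [200] * 50
t = otsu_threshold(pixels)
mask = [p > t for p in pixels]  # True marks the brighter class
```

Which of the two classes counts as fingerprint foreground depends on the sensor and on the preprocessing, so the mask polarity above is an arbitrary choice.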

Up-converter nanophosphor Y2O2S:Er,Yb aminofunctionalized containing or not spherical silica conjugated with BSA

Relevance: 100.00%

Abstract:

This work reports on the study of the nanophosphor Y2O2S:Er(2%),Yb(1%), obtained from a polymeric resin, to be evaluated as a fluorescent label with suitable features for conjugation with biomolecules in bioassay up-converting phosphor technology (UPT) applications. A conjugation protocol between bovine serum albumin (BSA) and the aminofunctionalized nanophosphor, with or without spherical silica, was established. UV-vis results indicated an effective conjugation between the nanophosphor particles and the protein. Up-conversion measurements under 980 nm excitation, performed on samples before and after aminofunctionalization, showed that the luminescence features of the nanophosphor particles remain unchanged in all cases. All results suggest that the adapted protocol is feasible for providing an effective nanoparticle-protein conjugation while preserving the nanophosphor's optical features. The presence of spherical silica can be considered advantageous for increasing conjugation efficiency. Therefore, the developed procedure is applicable to future conjugations between the chosen nanophosphor and the streptavidin protein that takes part in the well-known avidin-biotin self-recognition system. (C) 2009 Elsevier B.V. All rights reserved.

Application of the SSW test in individuals with sensorineural hearing loss who use and do not use hearing aids

Relevance: 100.00%

Abstract:

OBJECTIVE: to compare the performance of hearing aid users and non-users by means of the SSW test. METHOD: the study was conducted with 13 subjects aged between 55 and 85 years with bilateral hearing loss, six of whom were bilateral hearing aid users and seven of whom were non-users. The auditory processing test applied was the SSW test, a disyllable recognition test in a dichotic task. Statistical analysis was performed using the bootstrap technique and the Kolmogorov-Smirnov hypothesis test. RESULTS: the group of users performed better than the group of non-users in the conditions studied, especially in the competitive conditions. CONCLUSION: the results obtained in this research point to the effectiveness of hearing aid use in improving speech comprehension in the population studied, not only by compensating for peripheral hearing loss but also by interfering in the aging process of the central auditory nervous system.

Optimum-Path Forest Classifier for Large Scale Biometric Applications

Relevance: 100.00%

Abstract:

This paper addresses biometric identification using large databases, in particular iris databases. In such applications, it is critical to have a low response time while maintaining an acceptable recognition rate. Thus, the trade-off between speed and accuracy must be evaluated for the processing and recognition parts of an identification system. In this paper, a graph-based framework for pattern recognition, called Optimum-Path Forest (OPF), is utilized as a classifier in a pre-developed iris recognition system. The aim of this paper is to verify the effectiveness of OPF in the field of iris recognition, and its performance for iris databases of various scales. The existing Gauss-Laguerre Wavelet based coding scheme is used for iris encoding. The performance of the OPF classifier and two others, Hamming and Bayesian, is compared using small, medium, and large-scale databases. The comparison shows that the OPF has a faster response for large-scale databases, thus performing better than the more accurate, but slower, classifiers.

A Fast Large Scale Iris Database Classification with Optimum-Path Forest Technique: A Case Study

Relevance: 100.00%

Abstract:

The majority of biometric researchers focus on matching accuracy using biometric databases, including iris databases, while scalability and speed issues have been neglected. In applications such as identification at airports and borders, it is critical for the identification system to have a low response time. In this paper, a graph-based framework for pattern recognition, called Optimum-Path Forest (OPF), is utilized as a classifier in a pre-developed iris recognition system. The aim of this paper is to verify the effectiveness of OPF in the field of iris recognition, and its performance for iris databases of various scales. This paper investigates several classifiers that are widely used in iris recognition papers, along with their response time and accuracy. The existing Gauss-Laguerre Wavelet based iris coding scheme, which shows perfect discrimination with a rotary Hamming distance classifier, is used for iris coding. The performance of the classifiers is compared using small, medium, and large-scale databases. The comparison shows that OPF has a faster response for large-scale databases, thus performing better than the more accurate but slower Bayesian classifier.
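
The abstract names a rotary Hamming distance classifier without defining it. A common reading, assumed here, is the minimum normalized Hamming distance over a few circular shifts of the binary iris code, which compensates for eye rotation. The sketch below follows that reading. The function names, the 8-bit codes, and the gallery are hypothetical, and the Gauss-Laguerre coding step is not reproduced.

```python
def hamming_fraction(a, b):
    """Fraction of positions at which two equal-length bit lists differ."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


def rotary_hamming(code, reference, max_shift=3):
    """Minimum Hamming fraction over circular shifts of code in [-max_shift, max_shift]."""
    n = len(code)
    best = 1.0
    for shift in range(-max_shift, max_shift + 1):
        k = shift % n
        rotated = code[k:] + code[:k]
        best = min(best, hamming_fraction(rotated, reference))
    return best


def identify(probe, gallery, max_shift=3):
    """Return the gallery key whose code is nearest to the probe (exhaustive scan)."""
    return min(gallery,
               key=lambda name: rotary_hamming(probe, gallery[name], max_shift))


# Made-up 8-bit codes; real iris codes are far longer.
gallery = {
    "subject_a": [1, 0, 0, 1, 1, 0, 1, 0],
    "subject_b": [1, 1, 1, 1, 0, 0, 0, 0],
}
# The probe is subject_a's code rotated by two positions.
probe = gallery["subject_a"][2:] + gallery["subject_a"][:2]
match = identify(probe, gallery)
```

The exhaustive scan in `identify` touches every enrolled code for every shift, which illustrates why response time becomes the bottleneck on large databases.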

Autoregressive decomposition and pole tracking applied to vocal fold nodule signals

Relevance: 100.00%

Abstract:

This letter describes a novel algorithm, based on autoregressive decomposition and pole tracking, for recognizing two patterns of speech data: normal voice and dysphonic voice caused by nodules. The presented method relates the poles to the peaks of the signal spectrum, which represent the periodic components of the voice. The results show that the perturbation contained in the signal is clearly depicted by the pole positions. Their variability is related to jitter and shimmer. The pole dispersion for pathological voices is about 20% higher than for normal voices; therefore, the proposed approach is a more trustworthy measure than the classical ones. © 2007.

On the determination of epsilon during discriminative GMM training

Relevance: 100.00%

Abstract:

Discriminative training of Gaussian Mixture Models (GMMs) for speech or speaker recognition is usually based on the gradient descent method, in which the iteration step size, ε, is typically set experimentally. In this letter, we derive an equation to determine ε adaptively, by showing that the second-order Newton-Raphson iterative method for finding roots of equations is equivalent to the gradient descent algorithm. © 2010 IEEE.
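
The letter's actual equation for GMM parameters is not given in the abstract. The sketch below only illustrates the textbook one-dimensional relationship that the abstract alludes to: applying Newton-Raphson to the root-finding problem f'(x) = 0 gives the same update as a gradient descent step whose step size is ε = 1 / f''(x). The toy objective f(x) = x^4 and all function names are assumptions made for illustration.

```python
def gradient_step(x, grad, eps):
    """One gradient descent update with a fixed step size eps."""
    return x - eps * grad(x)


def newton_step(x, grad, hess):
    """One Newton-Raphson update for the root-finding problem grad(x) = 0."""
    return x - grad(x) / hess(x)


def adaptive_eps(x, hess):
    """Step size that makes the gradient step coincide with the Newton step."""
    return 1.0 / hess(x)


# Toy objective f(x) = x**4, chosen only for illustration.
def grad(x):
    return 4 * x ** 3   # f'(x)


def hess(x):
    return 12 * x ** 2  # f''(x)


x0 = 2.0
x_newton = newton_step(x0, grad, hess)
x_gd = gradient_step(x0, grad, adaptive_eps(x0, hess))
# x_newton and x_gd agree up to floating-point rounding.
```

With a hand-tuned constant ε, the same gradient step would either crawl or overshoot depending on the local curvature, which is the motivation for choosing ε adaptively.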

Study of methods for classification and precise localization of patterns using a structured light system

Relevance: 100.00%

Abstract:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Biomobile: a user identification system for mobile devices on the Android platform using face recognition from video

Relevance: 100.00%

Abstract:

Graduate Program in Computer Science - IBILCE
