984 results for Audiovisual speech recognition
Abstract:
This special issue aims to cover some problems related to non-linear and non-conventional speech processing. The origin of this volume is in the ISCA Tutorial and Research Workshop on Non-Linear Speech Processing, NOLISP’09, held at the Universitat de Vic (Catalonia, Spain) on June 25–27, 2009. The series of NOLISP workshops, started in 2003, has become a biennial event whose aim is to discuss alternative techniques for speech processing that, in a sense, do not fit into mainstream approaches. A selection of papers based on the presentations delivered at NOLISP’09 has given rise to this issue of Cognitive Computation.
Abstract:
The work presented here is part of a larger study to identify novel technologies and biomarkers for early Alzheimer's disease (AD) detection, and it focuses on evaluating the suitability of a new approach for early AD diagnosis by non-invasive methods. The purpose is to examine, in a pilot study, the potential of applying intelligent algorithms to speech features obtained from suspected patients in order to improve the diagnosis of AD and of its degree of severity. To this end, Artificial Neural Networks (ANN) have been used for the automatic classification of the two classes (AD and control subjects). Two human aspects have been analyzed for feature selection: Spontaneous Speech and Emotional Response. Not only linear features but also non-linear ones, such as Fractal Dimension, have been explored. The approach is non-invasive, low-cost and free of side effects. The experimental results obtained were very satisfactory and promising for the early diagnosis and classification of AD patients.
Abstract:
Within the context of rising competition between territories, identity has become the most important element of recognition, differentiation and commodification in the communicative process within which cities, regions and countries position themselves. Geographical spaces thus compete in terms of this identity, which is then subjected to fierce comparison and competition (Nogué, 1999; Anholt, 2007a). The territorial brand thus entails the reinvention of places through a process of brand construction (branding) based on the promotion of the individual and collective identities of geographical spaces; these identities, in turn, are imbued with the intangible factors associated with their respective territorial identities.
Abstract:
Alzheimer's disease is the most prevalent form of progressive degenerative dementia; it has a high socio-economic impact in Western countries and is therefore one of the most active research areas today. Alzheimer's is sometimes diagnosed by excluding other dementias, and definitive confirmation is only obtained through a post-mortem study of the patient's brain tissue. The work presented here is part of a larger study that aims to identify novel technologies and biomarkers for early Alzheimer's disease detection, and it focuses on evaluating the suitability of a new approach for early diagnosis of Alzheimer's disease by non-invasive methods. The purpose is to examine, in a pilot study, the potential of applying Machine Learning algorithms to speech features obtained from suspected Alzheimer's sufferers in order to help diagnose this disease and determine its degree of severity. Two human capabilities relevant to communication have been analyzed for feature selection: Spontaneous Speech and Emotional Response. The experimental results obtained were very satisfactory and promising for the early diagnosis and classification of Alzheimer's disease patients.
Abstract:
In this work we present a simulation of a recognition process that uses the perimeter characterization of simple plant leaves as the sole discriminating parameter. Data coding that allows for independence from leaf size and orientation may penalize recognition performance for some varieties. Border description sequences are then used, and Principal Component Analysis (PCA) is applied in order to study the best number of components for the classification task, which is implemented by means of a Support Vector Machine (SVM) system. The results obtained are satisfactory and, compared with [4], our system improves the recognition success while reducing the variance.
Abstract:
In this work we present a simulation of a recognition process that uses the perimeter characterization of simple plant leaves as the sole discriminating parameter. Data coding that allows for independence from leaf size and orientation may penalize recognition performance for some varieties. Border description sequences are then used to characterize the leaves. Independent Component Analysis (ICA) is then applied in order to study the best number of components to be considered for the classification task, which is implemented by means of an Artificial Neural Network (ANN). The results obtained with ICA as a pre-processing tool are satisfactory and, compared with some references, our system improves the recognition success up to 80.8%, depending on the number of independent components considered.
Abstract:
In this work we explore multivariate empirical mode decomposition (mEMD) combined with a Neural Network classifier as a technique for face recognition tasks. Images are simultaneously decomposed by means of EMD, and then the distance between the modes of the image and the modes of the representative image of each class is calculated using three different distance measures. A neural network is then trained using 10-fold cross-validation in order to derive a classifier. Preliminary results (a classification rate of over 98%) are satisfactory and justify a deeper investigation into how to apply mEMD to face recognition.
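The 10-fold cross-validation step mentioned in the abstract above can be illustrated with a minimal sketch. It only shows how sample indices might be partitioned into folds; the function name `k_fold_indices`, the absence of shuffling, and the sample count are assumptions made for illustration and are not taken from the paper.

```python
def k_fold_indices(n_samples, k=10):
    """Yield (train_idx, test_idx) index lists for k-fold cross-validation.

    Hypothetical helper: folds are contiguous and unshuffled, which is an
    assumption of this sketch rather than the procedure used in the paper.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size


# Each sample is used for testing exactly once across the k folds.
folds = list(k_fold_indices(25, k=10))
```

In each round a classifier would be trained on `train_idx` and evaluated on `test_idx`, and the ten scores averaged to estimate the classification rate.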
Abstract:
Alzheimer’s disease (AD) is the most prevalent form of progressive degenerative dementia and has a high socio-economic impact in Western countries; it is therefore one of the most active research areas today. Its diagnosis is sometimes made by excluding other dementias, and definitive confirmation must be obtained through a post-mortem study of the patient's brain tissue. The purpose of this paper is to contribute to the improvement of the early diagnosis of AD and of its degree of severity by means of an automatic analysis performed with non-invasive intelligent methods. The methods selected in this case are Automatic Spontaneous Speech Analysis (ASSA) and Emotional Temperature (ET), which have the great advantage of being non-invasive, low-cost and free of side effects.
Abstract:
Evidence from neuropsychological and activation studies (Clarke et al., 2000; Maeder et al., 2000) suggests that sound recognition and localisation are processed by two anatomically and functionally distinct cortical networks. We report here on the case of a patient who had an interruption of auditory information, and we show: i) the effects of this interruption on cortical auditory processing; ii) the effect of the workload on the activation pattern. A 36-year-old man suffered from a small left mesencephalic haemorrhage due to a cavernous angioma; the left inferior colliculus was resected in the surgical approach to the vascular malformation. In the acute stage, the patient complained of auditory hallucinations and of auditory loss in the right ear, while tonal audiometry was normal. At 12 months, auditory recognition, auditory localisation (assessed by ITD and IID cues) and auditory motion perception were normal (Clarke et al., 2000), while verbal dichotic listening was deficient on the right side. Sound recognition and sound localisation activation patterns were investigated with fMRI, using a passive and an active paradigm. In normal subjects, distinct cortical networks were involved in sound recognition and localisation, in both the passive and the active paradigm (Maeder et al., 2000a, 2000b). Passive listening to environmental and spatial stimuli, as compared to rest, strongly activated the right auditory cortex but failed to activate the left primary auditory cortex. The specialised networks for sound recognition and localisation could not be visualised on the right convexity and only minimally on the left. A very different activation pattern was obtained in the active condition, where a motor response was required. Workload not only increased the activation of the right auditory cortex, but also allowed the activation of the left primary auditory cortex. The specialised networks for sound recognition and localisation were almost completely present in both hemispheres. These results show that increasing the workload can i) help to recruit cortical regions in the auditory deafferented hemisphere; and ii) lead to processing of auditory information within specific cortical networks.
References: Clarke et al. (2000), Neuropsychologia 38: 797-807. Maeder et al. (2000a), Neuroimage 11: S52. Maeder et al. (2000b), Neuroimage 11: S33.
Abstract:
Studying the advertising communication of the 21st century allows us to understand the importance of the current development of this type of communication. This development undoubtedly has an enormous effect on the way brands carry out their advertising strategies with their audiences. Yet the audiovisual format remains the format most used by advertisers and agencies to reach their audiences. The 20th century was defined by the development of television as a medium and the 21st century by the development of the internet, but the question this media shift really raises is whether we are talking about a change of medium or a change in the audiovisual advertising strategies and formats used to reach the receiver. Current advertising messages stand out for greater interactivity with the viewer, for the viewer's voluntary contact, in many cases, with the campaign, and even for the consumer's active participation in distributing the campaign. Formats evolve, and so does the consumer's relationship with the audiovisual advertising message and the product.
Abstract:
The evolution of thought has been driven by the various media of communication: dialogue in classical Greece, or the printing press in the Renaissance. With the technical revolution on which today's media are based, we are witnessing a new change that must be incorporated into teaching: teaching oral and written language is no longer enough; teaching the audiovisual medium is now imperative. The author of this article holds that the LOGSE has not met the expectations raised in this area, because schools are not provided with the necessary materials, nor teachers with adequate training.
Abstract:
The value of earmarks as an efficient means of personal identification is still subject to debate. It has been argued that the field lacks a firm, systematic and structured data basis to help practitioners form their conclusions. Typically, there is a paucity of research offering guidance on the selectivity of the features used in the comparison process between an earmark and reference earprints taken from an individual. This study proposes a system for the automatic comparison of earprints and earmarks, operating without any manual extraction of key-points or manual annotations. For each donor, a model is created using multiple reference prints, hence capturing the donor's within-source variability. For each comparison between a mark and a model, images are automatically aligned and a proximity score, based on a normalized 2D correlation coefficient, is calculated. Appropriate use of this score allows a likelihood ratio to be derived that can be explored under known states of affairs (both in cases where it is known that the mark was left by the donor who gave the model and, conversely, in cases where it is established that the mark originates from a different source). To assess the system performance, a first dataset containing 1229 donors, compiled during the FearID research project, was used. Based on these data, for mark-to-print comparisons, the system performed with an equal error rate (EER) of 2.3%, and about 88% of marks were found in the first 3 positions of a hitlist. For print-to-print comparisons, results show an equal error rate of 0.5%. The system was then tested using real-case data obtained from police forces.
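As an illustration of the proximity score mentioned in the abstract above, the following sketch computes a normalized 2D correlation coefficient between two equally sized, already aligned grayscale images. The abstract does not give the exact formula; this sketch assumes the common zero-mean (Pearson-style) form, and the function name and toy images are hypothetical.

```python
from math import sqrt


def normalized_correlation_2d(a, b):
    """Zero-mean normalized correlation between two 2D images (lists of rows).

    Assumed definition: sum((a - mean_a) * (b - mean_b)) divided by
    sqrt(sum((a - mean_a)^2) * sum((b - mean_b)^2)). Result lies in [-1, 1].
    """
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("images must have the same shape")
    flat_a = [v for row in a for v in row]
    flat_b = [v for row in b for v in row]
    n = len(flat_a)
    mean_a = sum(flat_a) / n
    mean_b = sum(flat_b) / n
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(flat_a, flat_b))
    den = sqrt(sum((x - mean_a) ** 2 for x in flat_a)
               * sum((y - mean_b) ** 2 for y in flat_b))
    if den == 0:
        # A constant image has no variance; report 0 by convention here.
        return 0.0
    return num / den


# Toy data (hypothetical): a "mark", a brightness/contrast-shifted copy,
# and an intensity-inverted copy.
mark = [[0, 1, 2], [3, 4, 5]]
brighter = [[10, 12, 14], [16, 18, 20]]
inverted = [[5, 4, 3], [2, 1, 0]]

score_same = normalized_correlation_2d(mark, brighter)
score_inverted = normalized_correlation_2d(mark, inverted)
```

Because the means are subtracted and the result is divided by both standard deviations, the score is insensitive to uniform changes in brightness and contrast, which is one plausible reason to prefer it over a raw pixel difference when comparing prints.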
Abstract:
In contrast with the low frequency of most single-epitope-reactive T cells in the preimmune repertoire, up to 1 of 1,000 naive CD8(+) T cells from A2(+) individuals specifically bind fluorescent A2/peptide multimers incorporating the A27L analogue of the immunodominant 26-35 peptide from the melanocyte differentiation and melanoma-associated antigen Melan-A. This represents the only naive antigen-specific T cell repertoire accessible to direct analysis in humans to date. To gain insight into the molecular basis for the selection and maintenance of such an abundant repertoire, we analyzed the functional diversity of the T cells composing this repertoire ex vivo at the clonal level. Surprisingly, we found a significant proportion of multimer(+) clonotypes that failed to recognize both the Melan-A analogue and parental peptides in a functional assay but efficiently recognized peptides from proteins of self or pathogen origin selected for their potential functional cross-reactivity with Melan-A. Consistent with these data, multimers incorporating some of the most frequently recognized peptides specifically stained a proportion of naive CD8(+) T cells similar to that observed with Melan-A multimers. Altogether, these results indicate that the high frequency of Melan-A multimer(+) T cells can be explained by the existence of largely cross-reactive subsets of naive CD8(+) T cells displaying multiple specificities.