9 results for Audio-Visual Automatic Speech Recognition

in Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco


Relevance:

100.00% 100.00%

Publisher:

Abstract:

Query-by-Example Spoken Term Detection (QbE STD) aims to retrieve data from a speech repository given an acoustic query containing the term of interest. It has recently attracted much interest because of the large volume of information stored in audio or audiovisual format. QbE STD differs from automatic speech recognition (ASR) and from keyword spotting (KWS)/spoken term detection (STD): ASR is interested in all the terms/words that appear in the speech signal, whereas KWS/STD relies on a textual transcription of the search term to retrieve the speech data. This paper presents the systems submitted to the ALBAYZIN 2012 QbE STD evaluation, held as part of the ALBAYZIN 2012 evaluation campaign within the context of the IberSPEECH 2012 Conference(a). The evaluation consists of retrieving the speech files that contain the input queries and indicating their start and end timestamps within the appropriate speech file. Evaluation is conducted on a Spanish spontaneous speech database containing a set of talks from MAVIR workshops(b), which amount to about 7 h of speech in total. We present the database, the metric, and the submitted systems, along with all results and some discussion. Four different research groups took part in the evaluation. The results show the difficulty of this task, and the limited performance indicates that there is still a lot of room for improvement. The best result is achieved by a dynamic time warping-based search over Gaussian posteriorgrams/posterior phoneme probabilities. This paper also compares the systems in order to establish the best technique for this difficult task and to identify promising directions for this relatively novel task.
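
As an illustration of the kind of search the abstract names, the following is a minimal sketch of subsequence dynamic time warping over posteriorgrams. It is not the implementation of any submitted system: the function names, the negative-log dot-product local cost, the three-step path constraint, and the lack of path-length normalisation are all assumptions made for the example.

```python
import math

def local_cost(p, q, floor=1e-10):
    """Assumed local cost: -log of the dot product of two posterior vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    return -math.log(max(dot, floor))

def subsequence_dtw(query, utterance):
    """Find the utterance segment that best matches the whole query.

    query, utterance: lists of posterior vectors (one vector per frame).
    Returns (accumulated_cost, start_frame, end_frame), with 0-based
    frame indices into the utterance.
    """
    n, m = len(query), len(utterance)
    # acc[i][j]: cheapest cost of aligning query[0..i] to a segment ending at frame j.
    # start[i][j]: utterance frame where that cheapest alignment begins.
    acc = [[0.0] * m for _ in range(n)]
    start = [[0] * m for _ in range(n)]

    # The first query frame may be aligned with any utterance frame (free start).
    for j in range(m):
        acc[0][j] = local_cost(query[0], utterance[j])
        start[0][j] = j

    for i in range(1, n):
        for j in range(m):
            candidates = [(acc[i - 1][j], start[i - 1][j])]
            if j > 0:
                candidates.append((acc[i][j - 1], start[i][j - 1]))
                candidates.append((acc[i - 1][j - 1], start[i - 1][j - 1]))
            best_cost, best_start = min(candidates, key=lambda t: t[0])
            acc[i][j] = best_cost + local_cost(query[i], utterance[j])
            start[i][j] = best_start

    # Free end: the match may finish at any utterance frame.
    end = min(range(m), key=lambda j: acc[n - 1][j])
    return acc[n - 1][end], start[n - 1][end], end
```

The returned start and end frames correspond to the timestamps the evaluation asks for; a real system would typically also normalise the cost by path length and apply a detection threshold.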

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The study of emotions in human-computer interaction is a growing research area. This paper presents an attempt to select the most significant features for emotion recognition in spoken Basque and Spanish using different feature selection methods. The RekEmozio database was used as the experimental data set. Several Machine Learning paradigms were used for the emotion classification task. Experiments were executed in three phases, using a different set of features as classification variables in each phase. Moreover, feature subset selection was applied at each phase to search for the most relevant feature subset. The three-phase approach was selected to check the validity of the proposed approach. The results show that an instance-based learning algorithm using feature subset selection techniques based on evolutionary algorithms is the best Machine Learning paradigm for automatic emotion recognition across all the feature sets, obtaining a mean emotion recognition rate of 80.05% in Basque and 74.82% in Spanish. To check the goodness of the proposed process, a greedy search approach (FSS-Forward) was also applied, and a comparison between the two is provided. Based on the results, a set of the most relevant non-speaker-dependent features is proposed for both languages, and new perspectives are suggested.
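
To make the greedy comparison method concrete, here is a minimal sketch of forward feature subset selection wrapped around an instance-based classifier. The 1-nearest-neighbour rule, the leave-one-out scoring, the stopping criterion, and all function names are assumptions chosen for illustration; the abstract does not specify these details.

```python
def loo_accuracy_1nn(X, y, features):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier
    that only looks at the feature indices in `features`."""
    correct = 0
    for i in range(len(X)):
        best_dist, best_label = None, None
        for k in range(len(X)):
            if k == i:
                continue  # hold out the sample being classified
            dist = sum((X[i][f] - X[k][f]) ** 2 for f in features)
            if best_dist is None or dist < best_dist:
                best_dist, best_label = dist, y[k]
        correct += best_label == y[i]
    return correct / len(X)

def forward_selection(X, y, score=loo_accuracy_1nn):
    """Greedy forward selection: repeatedly add the single feature that
    improves the score most, and stop when no feature improves it."""
    remaining = list(range(len(X[0])))
    selected, best_score = [], 0.0
    while remaining:
        scored = [(score(X, y, selected + [f]), f) for f in remaining]
        top_score, top_feature = max(scored, key=lambda t: t[0])
        if top_score <= best_score:
            break
        selected.append(top_feature)
        remaining.remove(top_feature)
        best_score = top_score
    return selected, best_score
```

An evolutionary search, as favoured in the abstract, would explore subsets more broadly than this greedy procedure, at a higher computational cost.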

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Deep neural networks have recently gained popularity for improving state-of-the-art machine learning algorithms in diverse areas such as speech recognition, computer vision and bioinformatics. Convolutional networks in particular have shown prowess in visual recognition tasks such as object recognition and detection, which are the focus of this work. Modern award-winning architectures have systematically surpassed previous attempts at tackling computer vision problems and keep winning most current competitions. After a brief study of deep learning architectures and readily available frameworks and libraries, the LeNet handwritten digit recognition network is developed as a case study, and lastly a deep learning network for playing simple video games is reviewed.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Eguíluz, Federico; Merino, Raquel; Olsen, Vickie; Pajares, Eterio; Santamaría, José Miguel (eds.)

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Thesis with International Doctorate Mention

Relevance:

100.00% 100.00%

Publisher:

Abstract:

[ES] The concept of the history of historical-educational heritage has changed with the emergence of new ways of communicating. We have gone from visiting national or regional archives to obtain documents to being able to do so from the comfort of our homes with a few simple clicks. Even so, this adventure of sorts can only succeed once we learn to be researcher-sifters: once we learn to use our sieve to separate truthful information, which can plausibly become scientific knowledge, from the "infoxicated" information that distorts our inquiries as historians and fills them with noise.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

[SPA] The aim of the research is to determine the quantitative and qualitative contribution of audiovisual documentation to the news that television offers daily. The fieldwork covers the years 1993 and 1994, and its geographical scope is the channels broadcasting in Spain. The study starts from a theoretical approach to journalistic documentation, audiovisual documentation, and mass communication studies, and carries out field research in three areas: 1) Analysis of daily news programmes from six television channels (ETB, TVE, Canal Sur, TV3, Antena 3 and Canal+), through three independent samples. 2) Analysis of the requests for audiovisual documentation made by news programme newsrooms to the documentation services. 3) Study of the functions, tasks, structure and organisation of television documentation services, based on surveys, visits and interviews. An annex provides the detailed analysis of 620 news items, as well as information on the documentation centres. The research concludes that audiovisual documentation is one of the constituent elements of current-affairs news, both because of its quantitative presence (more than 40% of the news items broadcast use it) and because of its qualitative contribution and its widespread use across all news sections. The conclusions indicate that the importance of a news item has a positive effect on the use of audiovisual documentation, and they summarise the functions of this documentation and the specific characteristics of its use. They confirm the feedback nature of news documentation in television. They point to a use of this documentation as purely visual documentation. And they state that audiovisual documentation, besides contributing to production, helps improve the quality of news programmes, insofar as it makes it easier to offer more complete and contextualised information.

Relevance:

50.00% 50.00%

Publisher:

Abstract:

The work presented here is part of a larger study to identify novel technologies and biomarkers for early Alzheimer's disease (AD) detection, and it focuses on evaluating the suitability of a new approach for early AD diagnosis by non-invasive methods. The purpose is to examine, in a pilot study, the potential of applying intelligent algorithms to speech features obtained from suspected patients in order to improve the diagnosis of AD and of its degree of severity. To this end, Artificial Neural Networks (ANN) have been used for the automatic classification of the two classes (AD and control subjects). Two human aspects have been analyzed for feature selection: Spontaneous Speech and Emotional Response. Not only linear features but also non-linear ones, such as Fractal Dimension, have been explored. The approach is non-invasive, low cost and free of side effects. The experimental results obtained were very satisfactory and promising for early diagnosis and classification of AD patients.
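
As an example of a non-linear feature of the kind mentioned above, the sketch below computes a fractal dimension estimate for a sampled signal. The abstract does not say which estimator was used; Katz's method, in its common amplitude-only form, is assumed here purely for illustration, and the function name is hypothetical.

```python
import math

def katz_fractal_dimension(signal):
    """Katz fractal dimension of a 1-D signal (amplitude-only variant).

    FD = log10(n) / (log10(n) + log10(d / L)), where
    L is the total length of the curve (sum of absolute successive differences),
    d is the largest distance from the first sample to any other sample,
    n is the number of steps in the curve.
    """
    steps = len(signal) - 1
    if steps < 1:
        raise ValueError("signal must contain at least two samples")
    total_length = sum(abs(signal[i + 1] - signal[i]) for i in range(steps))
    extent = max(abs(x - signal[0]) for x in signal[1:])
    if total_length == 0 or extent == 0:
        raise ValueError("signal must not be constant")
    log_n = math.log10(steps)
    return log_n / (log_n + math.log10(extent / total_length))
```

A monotone ramp gives a value of 1, while a more irregular waveform gives a larger value; such a scalar could be fed to a classifier alongside linear speech features.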