935 resultados para audiovisual speech


Relevância:

20.00% 20.00%

Publicador:

Resumo:

We describe a series of experiments in which we start with English to French and English to Japanese versions of an Open Source rule-based speech translation system for a medical domain, and bootstrap correspondign statistical systems. Comparative evaluation reveals that the rule-based systems are still significantly better than the statistical ones, despite the fact that considerable effort has been invested in tuning both the recognition and translation components; also, a hybrid system only marginally improved recall at the cost of a los in precision. The result suggests that rule-based architectures may still be preferable to statistical ones for safety-critical speech translation tasks.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In a system where tens of thousands of words are made up of a limited number of phonemes, many words are bound to sound alike. This similarity of the words in the lexicon as characterized by phonological neighbourhood density (PhND) has been shown to affect speed and accuracy of word comprehension and production. Whereas there is a consensus about the interfering nature of neighbourhood effects in comprehension, the language production literature offers a more contradictory picture with mainly facilitatory but also interfering effects reported on word production. Here we report both of these two types of effects in the same study. Multiple regression mixed models analyses were conducted on PhND effects on errors produced in a naming task by a group of 21 participants with aphasia. These participants produced more formal errors (interfering effect) for words in dense phonological neighbourhoods, but produced fewer nonwords and semantic errors (a facilitatory effect) with increasing density. In order to investigate the nature of these opposite effects of PhND, we further analysed a subset of formal errors and nonword errors by distinguishing errors differing on a single phoneme from the target (corresponding to the definition of phonological neighbours) from those differing on two or more phonemes. This analysis confirmed that only formal errors that were phonological neighbours of the target increased in dense neighbourhoods, while all other errors decreased. Based on additional observations favouring a lexical origin of these formal errors (they exceeded the probability of producing a real-word error by chance, were of a higher frequency, and preserved the grammatical category of the targets), we suggest that the interfering effect of PhND is due to competition between lexical neighbours and target words in dense neighbourhoods.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Typically developing (TD) preschoolers and age-matched preschoolers with specific language impairment (SLI) received event-related potentials (ERPs) to four monosyllabic speech sounds prior to treatment and, in the SLI group, after 6 months of grammatical treatment. Before treatment, the TD group processed speech sounds faster than the SLI group. The SLI group increased the speed of their speech processing after treatment. Posttreatment speed of speech processing predicted later impairment in comprehending phrase elaboration in the SLI group. During the treatment phase, change in speed of speech processing predicted growth rate of grammar in the SLI group.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Subjects with autism often show language difficulties, but it is unclear how they relate to neurophysiological anomalies of cortical speech processing. We used combined EEG and fMRI in 13 subjects with autism and 13 control participants and show that in autism, gamma and theta cortical activity do not engage synergistically in response to speech. Theta activity in left auditory cortex fails to track speech modulations, and to down-regulate gamma oscillations in the group with autism. This deficit predicts the severity of both verbal impairment and autism symptoms in the affected sample. Finally, we found that oscillation-based connectivity between auditory and other language cortices is altered in autism. These results suggest that the verbal disorder in autism could be associated with an altered balance of slow and fast auditory oscillations, and that this anomaly could compromise the mapping between sensory input and higher-level cognitive representations.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The last 2 years have seen exciting advances in the genetics of Landau-Kleffner syndrome and related disorders, encompassed within the epilepsy-aphasia spectrum (EAS). The striking finding of mutations in the N-methyl-D-aspartate (NMDA) receptor subunit gene GRIN2A as the first monogenic cause in up to 20 % of patients with EAS suggests that excitatory glutamate receptors play a key role in these disorders. Patients with GRIN2A mutations have a recognizable speech and language phenotype that may assist with diagnosis. Other molecules involved in RNA binding and cell adhesion have been implicated in EAS; copy number variations are also found. The emerging picture highlights the overlap between the genetic determinants of EAS with speech and language disorders, intellectual disability, autism spectrum disorders and more complex developmental phenotypes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Alzheimer’s disease (AD) is the most prevalent form of progressive degenerative dementia and it has a high socio-economic impact in Western countries, therefore is one of the most active research areas today. Its diagnosis is sometimes made by excluding other dementias, and definitive confirmation must be done trough a post-mortem study of the brain tissue of the patient. The purpose of this paper is to contribute to improvement of early diagnosis of AD and its degree of severity, from an automatic analysis performed by non-invasive intelligent methods. The methods selected in this case are Automatic Spontaneous Speech Analysis (ASSA) and Emotional Temperature (ET), that have the great advantage of being non invasive, low cost and without any side effects.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper analyzes applications of cumulant analysis in speech processing. A special focus is made on different second-order statistics. A dominant role is played by an integral representation for cumulants by means of integrals involving cyclic products of kernels.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The purpose of our project is to contribute to earlier diagnosis of AD and better estimates of its severity by using automatic analysis performed through new biomarkers extracted from non-invasive intelligent methods. The methods selected in this case are speech biomarkers oriented to Sponta-neous Speech and Emotional Response Analysis. Thus the main goal of the present work is feature search in Spontaneous Speech oriented to pre-clinical evaluation for the definition of test for AD diagnosis by One-class classifier. One-class classifi-cation problem differs from multi-class classifier in one essen-tial aspect. In one-class classification it is assumed that only information of one of the classes, the target class, is available. In this work we explore the problem of imbalanced datasets that is particularly crucial in applications where the goal is to maximize recognition of the minority class as in medical diag-nosis. The use of information about outlier and Fractal Dimen-sion features improves the system performance.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

OBJECTIVE: To identify and quantify sources of variability in scores on the speech, spatial, and qualities of hearing scale (SSQ) and its short forms among normal-hearing and hearing-impaired subjects using a French-language version of the SSQ. DESIGN: Multi-regression analyses of SSQ scores were performed using age, gender, years of education, hearing loss, and hearing-loss asymmetry as predictors. Similar analyses were performed for each subscale (Speech, Spatial, and Qualities), for several SSQ short forms, and for differences in subscale scores. STUDY SAMPLE: One hundred normal-hearing subjects (NHS) and 230 hearing-impaired subjects (HIS). RESULTS: Hearing loss in the better ear and hearing-loss asymmetry were the two main predictors of scores on the overall SSQ, the three main subscales, and the SSQ short forms. The greatest difference between the NHS and HIS was observed for the Speech subscale, and the NHS showed scores well below the maximum of 10. An age effect was observed mostly on the Speech subscale items, and the number of years of education had a significant influence on several Spatial and Qualities subscale items. CONCLUSION: Strong similarities between SSQ scores obtained across different populations and languages, and between SSQ and short forms, underline their potential international use.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

El objetivo del presente trabajo es presentar la propuesta de desarrollo de la materia Proyectos del Grado de Comunicación Audiovisual de la Universidad de Barcelona, que incluye las asignaturas Proyectos I y Proyectos II. Ambas asignaturas son especialmente idóneas para trabajar las competencias transversales del grado, dado que el objetivo de la materia a la que pertenecen es integrar las competencias adquiridas en el conjunto de asignaturas cursadas por los alumnos hasta este momento, poniendo en relación los diferente lenguajes (escrito, oral, audiovisual y multimedia). Todo esto permite que el estudiante adquiera una visión integral y transversal. El presente trabajo reflexiona sobre los mecanismos que permitan al profesorado diseñar de forma colaborativa pautas y estrategias de enseñanza-aprendizaje; los modos de evaluación de las competencias de estas asignaturas, y todos aquellos aspectos claves que deben recoger los planes docentes. Materias como la de Proyectos suponen un reto en la actividad docente, al requerir del trabajo interdisciplinar e integrador de las áreas de conocimiento implicadas y de los docentes vinculados, al tiempo que facilitan la generación de puentes entre el ámbito académico y profesional.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper gives a full description of the phonetics and phonology of Traditional Cockney and Popular London speech, treating these varieties as constituting a continuum rather than two separate dialects. Exemplification of the vowels, diphthongs and consonants is provided, both in isolate words and in connected speech, along with their range of variation. The frequencies of the vowels have been charted on the basis of the pronunciation of three elderly male speakers. Regarding the consonants, there are detailed observations on the features typically associated with the linguistic varieties examined: strong aspiration of unvoiced plosives, glottalization, H-dropping, L-vocalization and TH-fronting. A section on prosody provides coverage of lexical stress, rhythm and intonation. The paper takes into account up-to-date research on these phenomena, but does not deal with the most recent vowel shifts, some of which form part of Multi-cultural London English.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

With the aim of preserving artistic heritage, museums have typically removed paintings and furniture from the places they were created for. Over the decades, the curators of these places have begun to request that these artistic works be returned, conscious of the significance that many of these works now have. Some institutions and museums have responded to these requests by providing copies of the original works. Although traditionally these copies were handmade, digital resources, such as audiovisual technology, are now being used. The Taüll 1123 project (Lleida, Spain) is an example of the use of these new tools for the benefit of artistic heritage and of modern visitors.