25 resultados para Linguística de corpus

em QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast


Relevância:

20.00% 20.00%

Publicador:

Resumo:

The objective of this paper is to describe and evaluate the application of the Text Encoding Initiative (TEI) Guidelines to a corpus of oral French, this being the first corpus of oral French where the TEI has been used. The paper explains the purpose of the corpus, both in creating a specialist corpus of néo-contage that will broaden the range of oral corpora available, and, more importantly, in creating a dataset to explore a variety of oral French that has a particularly interesting status in terms of factors such as conception orale/écrite, réalisation médiale and comportement communicatif (Koch and Oesterreicher 2001). The linguistic phenomena to be encoded are both stylistic (speech and thought presentation) and syntactic (negation, detachment, inversion), and all represent areas where previous research has highlighted the significance of factors such as medium, register and discourse type, as well as a host of linguistic factors (syntactic, phonetic, lexical). After a discussion of how a tagset can be designed and applied within the TEI to encode speech and thought presentation, negation, detachment and inversion, the final section of the paper evaluates the benefits and possible drawbacks of the methodology offered by the TEI when applied to a syntactic and stylistic markup of an oral corpus.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Speech recognition and language analysis of spontaneous speech arising in naturally spoken conversations are becoming the subject of much research. However, there is a shortage of spontaneous speech corpora that are freely available for academics. We therefore undertook the building of a natural conversation speech database, recording over 200 hours of conversations in English by over 600 local university students. With few exceptions, the students used their own cell phones from their own rooms or homes to speak to one another, and they were permitted to speak on any topic they chose. Although they knew that they were being recorded and that they would receive a small payment, their conversations in the corpus are probably very close to being natural and spontaneous. This paper describes a detailed case study of the problems we faced and the methods we used to make the recordings and control the collection of these social science data on a limited budget.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Atrophic gastritis can develop in patients with Helicobacter pylori infection leading to a reduction in basal acid output. Whether the atrophy that develops is reversible is controversial.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We studied the relationship between corpus callosum area and both inter-hemispheric facilitation and interference in schizophrenics and controls. Mid-sagittal sections through the corpus callosum were measured using structural magnetic resonance imaging on 42 patients and 43 normal controls, along with symptom profiles. In a sub-sample, a modified version of the Stroop Test was also performed (27 patients and 29 controls) to assess inter-hemispheric facilitation and interference of colour naming. In the larger sample (total subjects, n=85), there were no significant differences between patients and controls in CC area but a trend towards smaller values in patients in all but the posterior segment. In the sub-sample, bilateral facilitation was greater, and interference, less in schizophrenics compared with controls. There was a positive correlation between facilitation and posterior CC area, parallelled by a negative correlation between interference and posterior CC area, in both patients and controls, which only reached statistical significance when both groups were combined. These findings suggest that the link, between CC size and neuropsychological processes involving inter-hemispheric transfer of information, is common to both schizophrenics and normal controls. There were significant negative correlations between anterior CC area and psychomotor poverty (avolition, anhedonia and affective flattening), and a suggestion that the negative correlation between age and CC size in controls was not present in patients.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The present study aimed to investigate the presence of corpus callosum (CC) volume deficits in a population-based recent-onset psychosis (ROP) sample, and whether CC volume relates to interhemispheric communication deficits. For this purpose, we used voxel-based morphometry comparisons of magnetic resonance imaging data between ROP (n = 122) and healthy control (n = 94) subjects. Subgroups (38 ROP and 39 controls) were investigated for correlations between CC volumes and performance on the Crossed Finger Localization Test (CFLT). Significant CC volume reductions in ROP subjects versus controls emerged after excluding substance misuse and non-right-handedness. CC reductions retained significance in the schizophrenia subgroup but not in affective psychoses subjects. There were significant positive correlations between CC volumes and CFLT scores in ROP subjects, specifically in subtasks involving interhemispheric communication. From these results, we can conclude that CC volume reductions are present in association with ROP. The relationship between such deficits and CFLT performance suggests that interhemispheric communication impairments are directly linked to CC abnormalities in ROP. (C) 2010 Elsevier Ireland Ltd. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Thinning of the corpus callosum (CC) is often observed in individuals who were born very preterm. Damage to the CC during neurodevelopment may be associated with poor neuropsychological performance. This study aimed to explore any evidence of CC pathology in adolescents aged 14-15 years who were born very preterm, and to investigate the relationship between CC areas and verbal skills. Seventy-two individuals born before 33 weeks of gestation and 51 age- and sex-matched full-term controls received structural MRI and neuropsychological assessment. Total CC area in very preterm adolescents was 7.5% smaller than in controls, after adjusting for total white matter volume (P=0.015). The absolute size of callosal subregions differed between preterm and fullterm adolescents: preterm individuals had a 14.7% decrease in posterior (P<0.0001) and an 11.6% decrease in mid-posterior CC quarters (P=0.029). Preterm individuals who had experienced periventricular haemorrhage and ventricular dilatation in the neonatal period showed the greatest decrease in CC area. In very preterm boys only, verbal IQ and verbal fluency scores were positively associated with total mid-sagittal CC size and midposterior surface area. These results suggest that very preterm birth adversely affects the development of the CC, particularly its posterior quarter, and this impairs verbal skills in boys.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Temporal dynamics and speaker characteristics are two important features of speech that distinguish speech from noise. In this paper, we propose a method to maximally extract these two features of speech for speech enhancement. We demonstrate that this can reduce the requirement for prior information about the noise, which can be difficult to estimate for fast-varying noise. Given noisy speech, the new approach estimates clean speech by recognizing long segments of the clean speech as whole units. In the recognition, clean speech sentences, taken from a speech corpus, are used as examples. Matching segments are identified between the noisy sentence and the corpus sentences. The estimate is formed by using the longest matching segments found in the corpus sentences. Longer speech segments as whole units contain more distinct dynamics and richer speaker characteristics, and can be identified more accurately from noise than shorter speech segments. Therefore, estimation based on the longest recognized segments increases the noise immunity and hence the estimation accuracy. The new approach consists of a statistical model to represent up to sentence-long temporal dynamics in the corpus speech, and an algorithm to identify the longest matching segments between the noisy sentence and the corpus sentences. The algorithm is made more robust to noise uncertainty by introducing missing-feature based noise compensation into the corpus sentences. Experiments have been conducted on the TIMIT database for speech enhancement from various types of nonstationary noise including song, music, and crosstalk speech. The new approach has shown improved performance over conventional enhancement algorithms in both objective and subjective evaluations.

Relevância:

20.00% 20.00%

Publicador: