Annotating an Oral Corpus using the Text Encoding Initiative. Methodology, Problems, Solutions


Autoria(s): Carruthers, Janice
Data(s)

01/03/2008

Resumo

The objective of this paper is to describe and evaluate the application of the Text Encoding Initiative (TEI) Guidelines to a corpus of oral French, this being the first corpus of oral French where the TEI has been used. The paper explains the purpose of the corpus, both in creating a specialist corpus of néo-contage that will broaden the range of oral corpora available, and, more importantly, in creating a dataset to explore a variety of oral French that has a particularly interesting status in terms of factors such as conception orale/écrite, réalisation médiale and comportement communicatif (Koch and Oesterreicher 2001). The linguistic phenomena to be encoded are both stylistic (speech and thought presentation) and syntactic (negation, detachment, inversion), and all represent areas where previous research has highlighted the significance of factors such as medium, register and discourse type, as well as a host of linguistic factors (syntactic, phonetic, lexical). After a discussion of how a tagset can be designed and applied within the TEI to encode speech and thought presentation, negation, detachment and inversion, the final section of the paper evaluates the benefits and possible drawbacks of the methodology offered by the TEI when applied to a syntactic and stylistic markup of an oral corpus.

Identificador

http://pure.qub.ac.uk/portal/en/publications/annotating-an-oral-corpus-using-the-text-encoding-initiative-methodology-problems-solutions(d07cdf1f-b118-41c8-9a59-f5b6bb041314).html

http://dx.doi.org/10.1017/S0959269507003183

Idioma(s)

eng

Direitos

info:eu-repo/semantics/restrictedAccess

Fonte

Carruthers , J 2008 , ' Annotating an Oral Corpus using the Text Encoding Initiative. Methodology, Problems, Solutions ' Journal of French Language Studies , vol 18 , no. 1 , pp. 103-119 . DOI: 10.1017/S0959269507003183

Tipo

article