80 resultados para Text Linguistics


Relevância:

30.00% 30.00%

Publicador:

Resumo:

The analytic advantages of central concepts from linguistics and information theory, and the analogies demonstrated between them, for understanding patterns of retrieval from full-text indexes to documents are developed. The interaction between the syntagm and the paradigm in computational operations on written language in indexing, searching, and retrieval is used to account for transformations of the signified or meaning between documents and their representation and between queries and documents retrieved. Characteristics of the message, and messages for selection for written language, are brought to explain the relative frequency of occurrence of words and multiple word sequences in documents. The examples given in the companion article are revisited and a fuller example introduced. The signified of the sequence stood for, the term classically used in the definitions of the sign, as something standing for something else, can itself change rapidly according to its syntagm. A greater than ordinary discourse understanding of patterns in retrieval is obtained.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

An analogy is established between the syntagm and paradigm from Saussurean linguistics and the message and messages for selection from the information theory initiated by Claude Shannon. The analogy is pursued both as an end itself and for its analytic value in understanding patterns of retrieval from full text systems. The multivalency of individual words when isolated from their syntagm is contrasted with the relative stability of meaning of multi-word sequences, when searching ordinary written discourse. The syntagm is understood as the linear sequence of oral and written language. Saussureâ??s understanding of the word, as a unit which compels recognition by the mind, is endorsed, although not regarded as final. The lesser multivalency of multi-word sequences is understood as the greater determination of signification by the extended syntagm. The paradigm is primarily understood as the network of associations a word acquires when considered apart from the syntagm. The restriction of information theory to expression or signals, and its focus on the combinatorial aspects of the message, is sustained. The message in the model of communication in information theory can include sequences of written language. Shannonâ??s understanding of the written word, as a cohesive group of letters, with strong internal statistical influences, is added to the Saussurean conception. Sequences of more than one word are regarded as weakly correlated concatenations of cohesive units.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this article we review recent work on the history of French negation in relation to three key issues in socio-historical linguistics: identifying appropriate sources, interpreting scant or anomalous data, and interpreting generational differences in historical data. We then turn to a new case study, that of verbal agreement with la plupart, to see whether this can shed fresh light on these issues. We argue that organising data according to the author’s date of birth is methodologically sounder than according to date of publication. We explore the extent to which different genres and text types reflect changing patterns of usage and suggest that additional, different case-studies are required in order to make more secure generalisations about the reliability of different sources.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Vector space models (VSMs) represent word meanings as points in a high dimensional space. VSMs are typically created using a large text corpora, and so represent word semantics as observed in text. We present a new algorithm (JNNSE) that can incorporate a measure of semantics not previously used to create VSMs: brain activation data recorded while people read words. The resulting model takes advantage of the complementary strengths and weaknesses of corpus and brain activation data to give a more complete representation of semantics. Evaluations show that the model 1) matches a behavioral measure of semantics more closely, 2) can be used to predict corpus data for unseen words and 3) has predictive power that generalizes across brain imaging technologies and across subjects. We believe that the model is thus a more faithful representation of mental vocabularies.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

David Norbrook, Review of English Studies 56 (Sept. 2005), 675-6.
‘We have waited a long time for a study of Marvell’s Latin poetry; fortunately, Estelle Haan’s monograph generously makes good the loss ... One of her most intriguing suggestions … is that Marvell may have presented paired poems like ‘Ros’ and ‘On a Drop of Dew’, and the poems to the obligingly named Dr Witty, to his student Maria Fairfax as his own patterns for the pedagogical practice of double translation. Perhaps the most original parts of the book, however, move beyond the familiar canon to cover the generic range of the Latin verse. Haan offers a very full contextualization of the early Horatian Ode to Charles I in seventeenth-century exercises in parodia. In a rewarding reading of the poem to Dr Ingelo she shows how Marvell deploys the language of Ovid’s Tristia to present Sweden as a place of shivering exile, only to subvert this model with a neo-Virgilian celebration of Christina as a virtuous, city-building Dido. She draws extensively on historical as well as literary sources to offer very detailed contextualizations of the poem to Maniban and ‘Scaevola Scotto-Britannus’... This monograph opens up many new ways into the Latin verse, not least because it is rounded off with new texts and prose translations of the Latin poems. These make a substantial contribution in their own right. They are the best and most accurate translations to date (those in Smith’s edition having some lapses); they avoid poeticisms but bring out the structure of the poems' wordplay very clearly. This book brings us a lot closer to seeing Marvell whole.'

Relevância:

20.00% 20.00%

Publicador: