94 results for word boundaries

in CentAUR: Central Archive University of Reading - UK


Relevance:

20.00%

Abstract:

Word sense disambiguation is the task of determining which sense of a word is intended from its context. Previous methods have found the lack of training data and the restrictiveness of dictionaries' choices of senses to be major stumbling blocks. A robust novel algorithm is presented that uses multiple dictionaries, the Internet, clustering and triangulation to attempt to discern the most useful senses of a given word and learn how they can be disambiguated. The algorithm is explained, and some promising sample results are given.

Relevance:

20.00%

Abstract:

Infants (12 to 17 months) were taught 2 novel words for 2 images of novel objects, by pairing isolated auditory labels with to-be-associated images. Comprehension was tested using a preferential looking task in which the infant was presented with both images together with an isolated auditory label. The auditory label usually, but not always, matched one of the images. Infants looked preferentially at images that matched the auditory stimulus. The experiment controlled within-subjects for both side bias and preference for previously named items. Infants showed learning after 12 presentations of the new words. Evidence is presented that, in certain circumstances, the duration of longest look at a target may be a more robust measure of target preference than overall looking time. The experiment provides support for previous demonstrations of rapid word learning by pre-vocabulary spurt children, and offers some methodological improvements to the preferential looking task.

Relevance:

20.00%

Abstract:

Greek speakers say "ουρά", Germans "schwanz" and the French "queue" to describe what English speakers call a 'tail', but all of these languages use a related form of 'two' to describe the number after one. Among more than 100 Indo-European languages and dialects, the words for some meanings (such as 'tail') evolve rapidly, being expressed across languages by dozens of unrelated words, while others evolve much more slowly, such as the number 'two', for which all Indo-European language speakers use the same related word-form(1). No general linguistic mechanism has been advanced to explain this striking variation in rates of lexical replacement among meanings. Here we use four large and divergent language corpora (English(2), Spanish(3), Russian(4) and Greek(5)) and a comparative database of 200 fundamental vocabulary meanings in 87 Indo-European languages(6) to show that the frequency with which these words are used in modern language predicts their rate of replacement over thousands of years of Indo-European language evolution. Across all 200 meanings, frequently used words evolve at slower rates and infrequently used words evolve more rapidly. This relationship holds separately and identically across parts of speech for each of the four language corpora, and accounts for approximately 50% of the variation in historical rates of lexical replacement. We propose that the frequency with which specific words are used in everyday language exerts a general and law-like influence on their rates of evolution. Our findings are consistent with social models of word change that emphasize the role of selection, and suggest that owing to the ways that humans use language, some words will evolve slowly and others rapidly across all languages.

Relevance:

20.00%

Abstract:

This study investigated the relationships between phonological awareness and reading in Oriya and English. Oriya is the official language of Orissa, an eastern state of India. The writing system is an alphasyllabary. Ninety-nine fifth grade children (mean age 9 years 7 months) were assessed on measures of phonological awareness, word reading and pseudo-word reading in both languages. Forty-eight of the children attended Oriya-medium schools where they received literacy instruction in Oriya from grade 1 and learned English from grade 2. Fifty-one children attended English-medium schools where they received literacy instruction in English from grade 1 and in Oriya from grade 2. The results showed that phonological awareness in Oriya contributed significantly to reading Oriya and English words and pseudo-words for the children in the Oriya-medium schools. However, it only contributed to Oriya pseudo-word reading and English word reading for children in the English-medium schools. Phonological awareness in English contributed to English word and pseudo-word reading for both groups. Further analyses investigated the contribution of awareness of large phonological units (syllables, onsets and rimes) and small phonological units (phonemes) to reading in each language. The data suggest that cross-language transfer and facilitation of phonological awareness to word reading is not symmetrical across languages and may depend both on the characteristics of the different orthographies of the languages being learned and on whether the first literacy language is also the first spoken language.