982 results for CONTENT WORDS
Abstract:
The purpose of this study was to investigate the exchange of disfluencies from function words to content words with age in Brazilian Portuguese speakers who do and do not stutter. Ninety stuttering individuals and 90 controls, native speakers of Brazilian Portuguese, were divided into three age groups (children, adolescents and adults). The study method involved analyzing the occurrence of stuttering on content and function words based on spontaneous speech samples. Results indicated that children tend to be more disfluent on function words. With the increase in age, teenagers and adults who stutter presented a higher number of disfluencies on content words. These findings support the current literature, indicating that with the aging process, there is an exchange of disfluencies from function to content words. This shift in the disfluency pattern may account for a more advanced type of stuttering. The study also demonstrated that disfluencies in Portuguese speakers follow the same pattern of shifting from function to content words with age as for English speakers.
Abstract:
In order to explore the impact of a degraded semantic system on the structure of language production, we analysed transcripts from autobiographical memory interviews to identify naturally-occurring speech errors by eight patients with semantic dementia (SD) and eight age-matched normal speakers. Relative to controls, patients were significantly more likely to (a) substitute and omit open class words, (b) substitute (but not omit) closed class words, (c) substitute incorrect complex morphological forms and (d) produce semantically and/or syntactically anomalous sentences. Phonological errors were scarce in both groups. The study confirms previous evidence of SD patients’ problems with open class content words which are replaced by higher frequency, less specific terms. It presents the first evidence that SD patients have problems with closed class items and make syntactic as well as semantic speech errors, although these grammatical abnormalities are mostly subtle rather than gross. The results can be explained by the semantic deficit which disrupts the representation of a pre-verbal message, lexical retrieval and the early stages of grammatical encoding.
Abstract:
One universal feature of human languages is the division between grammatical functors and content words. From a learnability point of view, functors might provide entry points or anchors into the syntactic structure of utterances due to their high frequency. Despite its potentially universal scope, this hypothesis has not yet been tested on typologically different languages and on populations of different ages. Here we report a corpus study and an artificial grammar learning experiment testing the anchoring hypothesis in Basque, Japanese, French, and Italian adults. We show that adults are sensitive to the distribution of functors in their native language and use them when learning new linguistic material. However, compared to infants’ performance on a similar task, adults exhibit a slightly different behavior, matching the frequency distributions of their native language more closely than infants do. This finding bears on the issue of the continuity of language learning mechanisms.
Abstract:
As Levelt and Meyer (2000) noted, studies of lexical access during the production of multiword utterances, such as phrases and sentences, raise two novel questions that studies of single-word production do not. First, does the access of different words in a sentence occur in a parallel or a serial fashion? Second, does the access of the different words in a sentence occur in an interactive or a discrete fashion? The latter question concerns the horizontal information flow (Smith & Wheeldon, 2004), which is a very important aspect of continuous speech production. A variant of the picture–word interference paradigm, combined with eye tracking and a dual-task paradigm, was used in seven experiments to investigate the horizontal flow of semantic and phonological information between nouns in spoken Mandarin Chinese sentences. The results suggested that: 1. Before speech onset, semantic information for words across the whole sentence has been activated, while phonological activation is limited to the first phrase of the sentence. 2. Before speech onset, speakers look ahead and check the semantic information of later words while the first noun is being processed; such looking ahead for phonological information occurs only within the first phrase of the sentence. 3. After speech onset, speakers concentrate on the content words beyond the first one and check the semantic information of other words within the same sentence. 4. Lexical access to multiple words during spoken sentence production proceeds in a partly serial and partly parallel manner, which supports the unit-by-unit and incremental view proposed by Levelt (2000). 5. The horizontal information flow during spoken sentence production is not an automatic process and is constrained by cognitive resources.
Abstract:
We present three natural language marking strategies based on fast and reliable shallow parsing techniques, and on widely available lexical resources: lexical substitution, adjective conjunction swaps, and relativiser switching. We test these techniques on a random sample of the British National Corpus. Individual candidate marks are checked for goodness of structural and semantic fit, using both lexical resources, and the web as a corpus. A representative sample of marks is given to 25 human judges to evaluate for acceptability and preservation of meaning. This establishes a correlation between corpus based felicity measures and perceived quality, and makes qualified predictions. Grammatical acceptability correlates with our automatic measure strongly (Pearson's r = 0.795, p = 0.001), allowing us to account for about two thirds of variability in human judgements. A moderate but statistically insignificant (Pearson's r = 0.422, p = 0.356) correlation is found with judgements of meaning preservation, indicating that the contextual window of five content words used for our automatic measure may need to be extended. © 2007 SPIE-IS&T.
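The "about two thirds of variability" statement in the abstract above follows from squaring the reported correlation coefficient. The short Python check below is only an illustrative sketch: the two r values are taken from the abstract, while the function and variable names are assumptions introduced here.

```python
# Reported Pearson correlations, copied from the abstract above.
r_grammaticality = 0.795  # automatic measure vs. grammatical acceptability
r_meaning = 0.422         # automatic measure vs. meaning preservation


def variance_explained(r: float) -> float:
    """Share of variance accounted for by a linear relationship: r squared."""
    return r * r


for label, r in [("grammaticality", r_grammaticality), ("meaning", r_meaning)]:
    print(f"{label}: r = {r:.3f}, r^2 = {variance_explained(r):.3f}")
```

Squaring 0.795 gives roughly 0.63, which is the "about two thirds" reported for grammatical acceptability; the weaker correlation for meaning preservation corresponds to under a fifth of the variance.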
Abstract:
Documents published by companies, such as press releases, contain a wealth of information about various company activities. They are a valuable source for business intelligence analyses. However, given their large volume, tools must be developed to exploit this source automatically. This thesis describes work within one area of business intelligence, namely the detection of business relations between the companies described in press releases. In this thesis, we propose a classification-based approach. Existing classification methods do not yield satisfactory performance. This is mainly due to two problems: representing the text by all of its words, which does not necessarily help to specify a business relation, and the imbalance between classes. To address the first problem, we propose a representation based on pivot words, that is, the names of the companies concerned, in order to better identify the words likely to describe them. For the second problem, we propose a two-stage classification. This method proves more appropriate than traditional resampling methods. We tested our approaches on a collection of press releases in the automotive domain. Our experiments show that the proposed approaches can improve classification performance. In particular, the document representation based on pivot words lets us focus more closely on the words that are useful for detecting business relations. The two-stage classification provides an effective solution to the class imbalance problem. This work shows that the automatic detection of business relations is a feasible task. The result of this detection could be used in business intelligence analysis.
Abstract:
Pauses and hesitations in spontaneous speech have been shown to occur systematically, for example, "between sentences, after discourse markers and conjunctions and before accented content words." (Hansson [15]) This is certainly plausible in English, where pauses and hesitations can and often do occur before content words such as nominals, for example, "uh, there's a … man." (Chafe [8]) However, if hesitations are, in fact, evidence of "deciding what to talk about next," (Chafe [8]) then the complex grammatical system of German should render this pausing position precarious, since pre-modifiers must account for the gender of the nominals they modify. In this paper, I present data to test the hypothesis that pre-nominal hesitation patterns in German are dissimilar to those in English. Hesitations in German will be shown, in fact, to occur within noun phrase units. Nevertheless, native speakers most often succeed in supplying a nominal which conforms to the gender indicated by the determiner or pre-modifier. Corrections, or repairs, of infelicitous pre-modifiers indicate that the speaker was unable to supply a nominal of the same gender which the choice of pre-modifier had committed him/her to. The frequency of such repairs is shown to vary according to task, with fewest repairs occurring in elicited speech which allows for linguistic freedom and therefore is most like spontaneous speech. The data sets indicate that among German native speakers, hesitations occurring before noun phrase units (pre-NPU hesitations) indicate deliberation of what to say, while hesitations within or before the head of the noun phrase (pre-NPH hesitations) indicate deliberation of how to say what has already been decided (cf. Chafe [8]).
Abstract:
This article explores one aspect of the processing perspective in L2 learning in an EST context: the processing of new content words, in English, of the type ‘cognates’ and ‘false friends’, by Spanish-speaking engineering students. The paper does not try to offer a comprehensive overview of language acquisition mechanisms, but rather it is intended to review more narrowly how our conceptual systems, governed by intricately linked networks of neural connections in the brain, make language development possible, creating, at the same time, some L2 processing problems. The case of ‘cognates and false friends’ in specialised contexts is brought here to illustrate some of the processing problems that the L2 learner has to confront, and how mappings in the visual, phonological and semantic (conceptual) brain structures function in second language processing of new vocabulary.
Abstract:
The purpose of the current thesis is to develop a better understanding of the interaction between Spanish and Quichua in the Salcedo region and provide more information for the processes that might have given rise to Media Lengua, a ‘mixed’ language comprised of a Quichua grammar and Spanish lexicon. Muysken attributes the formation of Media Lengua to relexification, ruling out any influence from other bilingual phenomena. I argue that the only characteristic that distinguishes Media Lengua from other language contact varieties in central Ecuador is the quantity of the overall Spanish borrowings and not the type of processes that might have been employed by Quichua speakers during the genesis of Media Lengua. The results from the Salcedo data that I have collected show how processes such as adlexification, code-mixing, and structural convergence produce Media Lengua-type sentences, evidence that supports an alternative analysis to Muysken’s relexification hypothesis. Overall, this dissertation is developed around four main objectives: (1) to describe the variation of Spanish loanwords within a bilingual community in Salcedo; (2) to analyze some of the prominent and recent structural changes in Quichua and Spanish; (3) to determine whether Spanish loanword use can be explained by the relationship consultants have with particular social categories; and (4) to analyze the consultants’ language ideologies toward syncretic uses of Spanish and Quichua. Overall, 58% of the content words, 39% of the basic vocabulary, and 50% of the subject pronouns in the Salcedo corpus were derived from Spanish. When compared to Muysken’s description of highlander Quichua in the 1970’s, Spanish loanwords have more than doubled in each category. The overall level of Spanish loanwords in Salcedo Quichua has grown to a level between highlander Quichua in the 1970’s and Media Lengua. 
Similar to Spanish’s lexical influence in Media Lengua, the increase of Spanish borrowings in today’s rural Quichua can be seen in non-basic and basic vocabularies as well as the subject pronoun system. Significantly, most of the growth has occurred through forms of adlexification, i.e., doublets, well-established borrowings, and cultural borrowings, suggesting that ‘ordinary’ lexical borrowing is also capable of producing Media Lengua-type sentences. I approach the second objective by investigating two separate phenomena related to structural convergence. The first examines the complex verbal constructions that have developed in Quichua through Spanish loan translations, while the second describes the type of Quichua particles that are attached to Spanish lexemes while speaking Spanish. Calques of the complex verbal constructions from Spanish were employed when speaking standard Quichua. Since this standard form is typically used by language purists, I argue that their use of calques is a strategy of exploiting the full range of expression from Spanish without incorporating any of the Spanish lexemes which would give the appearance of ‘contamination’. The use of Quichua particles in local varieties of Spanish is a defining characteristic of Quichuacized Spanish, spoken most frequently by women and young children in the community. Although the use of Quichua particles was probably not the main catalyst engendering Media Lengua, I argue that its contribution as a source language to other ‘mixed’ varieties, such as Media Lengua, needs to be accounted for in descriptions of BML genesis. Contrary to Muysken’s representation of relatively ‘unmixed’ Spanish and Quichua as the two source languages of Media Lengua, I propose that local varieties of Spanish might have already been ‘mixed’ to a large degree before Media Lengua was created. The third objective attempts to draw a relationship between particular social variables and the use of Spanish loanwords.
Whisker boxplots and ANOVAs were used to determine which social groups, if any, have been introducing new Spanish borrowings into the bilingual communities in Salcedo. Specifically, I controlled for age, education, native language, urban migration, and gender. The results indicate that none of the groups within any of the five social variables shows higher or lower loanword use. The implications of these results are twofold: either (a) when lexical borrowing occurs, it is immediately adopted as the community-wide norm and spoken by members from different backgrounds and generations, or (b) this level of Spanish borrowing (58%) is not a recent phenomenon. The fourth and final objective draws on my ethnographic research that addresses attitudes toward syncretic language use. I observed that Quichuacized Spanish and Hispanicized Quichua are highly stigmatized varieties spoken by the country’s most marginalized populations and families, yet within the community, syncretic ways of speaking are in fact the norm. It was shown that there exists a range of different linguistic definitions for ‘Chaupi Lengua’ and other syncretic language practices as well as many contrasting connotations, most of which were negative. One theme that emerged from the interviews was that speaking syncretic varieties of Quichua weakened the consultant’s claim to an indigenous identity. The linguistic and social data presented in this dissertation support an alternative view to Muysken’s relexification hypothesis, one that has the advantage of operating with well-precedented linguistic processes and which is actually observable in the present-day Salcedo area. The results from the study on lexical borrowing are significant because they demonstrate how a dynamic bilingual speech community has gradually diversified its Quichua lexicon under intense pressure to shift toward Spanish.
They also show that Hispanicized Quichua (Quichua with heavy lexical borrowing) clearly arose from adlexification and prolonged lexical borrowing, and is one of at least six identifiable speech styles found in Salcedo. These results challenge particular interpretations of language contact outcomes, such as those that depict source languages as discrete and ‘unmixed.’ The bilingual continuum presented in this thesis shows, on the one hand, the range of speech styles that are accessible to different speakers, and on the other hand, the overlapping, syncretic features that are shared among the different registers and language varieties. It was observed that syncretic speech styles in Salcedo are employed by different consultants in varied interactional contexts, and in turn, produce different evaluations by fellow community members. In the current dissertation, I challenge the claim that relexification and Media Lengua-type sentences develop in isolation and without the influence of other bilingual phenomena. Based on Muysken's Media Lengua example sentences and the speech styles from the Salcedo corpus, I argue that Media Lengua may have arisen as an institutionalized variant of the highly mixed "middle ground" within the range of the Salcedo bilingual continuum discussed above. Such syncretic forms of Spanish and Quichua strongly resemble Media Lengua sentences in Muysken’s research, and therefore demonstrate how its development could have occurred through several different language contact processes and not only through relexification.
Abstract:
Students in the middle years encounter an increasing range of unfamiliar visuals. Visual literacy, the ability to encode and decode visuals and to think visually, is an expectation of all middle years curriculum areas and an expectation of NAPLAN literacy and numeracy tests. This article presents a multidisciplinary approach to teaching visual literacy that links the content of all learning areas and encourages students to transfer skills from familiar to unfamiliar contexts. It proposes a classification of visuals in six parts: one-dimensional; two-dimensional; map; shape; connection; and picture, based on the properties, rather than the purpose, of the visual. By placing a visual in one of these six categories, students learn to transfer the skills used to decode familiar visuals to unfamiliar cases in the same category. The article also discusses a range of other teaching strategies that can be used to complement this multi-disciplinary approach.
Abstract:
Language processing is an example of implicit learning of multiple statistical cues that provide probabilistic information regarding word structure and use. Much of the current debate about language embodiment is devoted to how action words are represented in the brain, with motor cortex activity evoked by these words assumed to selectively reflect conceptual content and/or its simulation. We investigated whether motor cortex activity evoked by manual action words (e.g., caress) might reflect sensitivity to probabilistic orthographic-phonological cues to grammatical category embedded within individual words. We first review neuroimaging data demonstrating that nonwords evoke activity much more reliably than action words along the entire motor strip, encompassing regions proposed to be action category specific. Using fMRI, we found that disyllabic words denoting manual actions evoked increased motor cortex activity compared with non-body-part-related words (e.g., canyon), activity which overlaps that evoked by observing and executing hand movements. This result is typically interpreted in support of language embodiment. Crucially, we also found that disyllabic nonwords containing endings with probabilistic cues predictive of verb status (e.g., -eve) evoked increased activity compared with nonwords with endings predictive of noun status (e.g., -age) in the identical motor area. Thus, motor cortex responses to action words cannot be assumed to selectively reflect conceptual content and/or its simulation. Our results clearly demonstrate motor cortex activity reflects implicit processing of ortho-phonological statistical regularities that help to distinguish a word's grammatical class.
Abstract:
This paper reviews the remarkably similar experiences of school science reported by high school students in Sweden, England, and Australia. It compares student narratives from interpretive studies by Lindahl, by Osborne and Collins, and by Lyons, identifying core themes relating to critical contemporary issues in science education. These themes revolve around the transmissive pedagogy, decontextualized content, and unnecessary difficulty of school science commonly reported by students in the studies. Their collective experiences are used as a framework for examining student conceptions of, and attitudes to, school science more generally, drawing on an extensive range of international literature. The paper argues that the experiences of students in the three studies provide important insights into the widespread declines in interest and enrolments in high school and university science courses.
Abstract:
In this paper, we present the results of an exploratory study that examined the problem of automating content analysis of student online discussion transcripts. We looked at the problem of coding discussion transcripts for the levels of cognitive presence, one of the three main constructs in the Community of Inquiry (CoI) model of distance education. Using Coh-Metrix and LIWC features, together with a set of custom features developed to capture discussion context, we developed a random forest classification system that achieved 70.3% classification accuracy and 0.63 Cohen's kappa, which is significantly higher than values reported in previous studies. Besides the improvement in classification accuracy, the developed system is also less prone to overfitting, as it uses only 205 classification features, around 100 times fewer than similar systems based on bag-of-words features. We also provide an overview of the classification features most indicative of the different phases of cognitive presence, which gives additional insights into the nature of the cognitive presence learning cycle. Overall, our results show great potential for the proposed approach, with the added benefit of providing further characterization of the cognitive presence coding scheme.
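The abstract above reports both accuracy and Cohen's kappa. Kappa corrects raw agreement for the agreement expected by chance, which is why it is lower than accuracy. The following Python sketch shows the standard two-rater formula; the function name and the toy labels are assumptions for illustration and are not taken from the study.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and equally long")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of the marginal label proportions, summed.
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    expected = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)
    if expected == 1.0:
        return float("nan")  # undefined when both raters use a single label
    return (observed - expected) / (1.0 - expected)


# Hypothetical codes: human coder vs. classifier on four messages.
human = [1, 1, 0, 0]
model = [1, 0, 0, 0]
print(cohens_kappa(human, model))
```

In this toy case the raters agree on three of four items (0.75) while chance agreement is 0.5, so kappa is (0.75 - 0.5) / (1 - 0.5).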
Abstract:
While a significant number of geotechnical structures are subjected to static loading, many, such as pavement subgrade, also are subjected to cyclic or dynamic loading. While the performance of saturated soils under repeated, cyclic or dynamic loading conditions is still a topic of research, similar interests are growing when the soil condition is unsaturated. This paper examines the performance of unsaturated soils under repeated loading. As part of the research, a triaxial system was developed which incorporates small strain measurements using Hall-effect transducers, in addition to suction measurements taken using a psychrometer. Tests were conducted on samples of kaolin under constant water mass conditions. The results address the effects of compaction effort and water content at the time of compaction on the overall performance of unsaturated soils, under different amplitudes of loading and different confining pressures. The results show that suction in the sample reduced with increasing number of loading cycles of the same magnitude. The resilient modulus initially increased with increasing water content up to approximately optimum water content, and substantially reduced with further increase in water content. Key Words: suction, resilient modulus, subgrade, repeated loading, small strain measurements, compaction.
Abstract:
Background: Search filters are combinations of words and phrases designed to retrieve an optimal set of records on a particular topic (subject filters) or study design (methodological filters). Information specialists are increasingly turning to reusable filters to focus their searches. However, the extent of the academic literature on search filters is unknown. We provide a broad overview of the academic literature on search filters.
Objectives: To map the academic literature on search filters from 2004 to 2015 using a novel form of content analysis.
Methods: We conducted a comprehensive search for literature between 2004 and 2015 across eight databases using a subjectively derived search strategy. We identified key words from titles, grouped them into categories, and examined their frequency and co-occurrences.
Results: The majority of records were housed in Embase (n = 178) and MEDLINE (n = 154). Over the last decade, both databases appeared to exhibit a bimodal distribution with the number of publications on search filters rising until 2006, before dipping in 2007, and steadily increasing until 2012. Few articles appeared in social science databases over the same time frame (e.g. Social Services Abstracts, n = 3).
Unsurprisingly, the term ‘search’ appeared in most titles and was quite often used as a noun adjunct for the words ‘filter’ and ‘strategy’. Across the papers, the purpose of searches as a means of ‘identifying’ information and gathering ‘evidence’ from ‘databases’ emerged quite strongly. Other terms relating to the methodological assessment of search filters, such as precision and validation, also appeared, albeit less frequently.
Conclusions: Our findings show surprising commonality across the papers with regard to the literature on search filters. Much of the literature seems to be focused on developing search filters to identify and retrieve information, as opposed to testing or validating such filters. Furthermore, the literature is mostly housed in health-related databases, namely MEDLINE, CINAHL, and Embase, implying that it is medically driven. Relatively few papers focus on the use of search filters in the social sciences.
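The Methods above describe identifying key words from titles and examining their frequency and co-occurrence. A minimal Python sketch of that kind of count is given below. It is not the authors' procedure: the stopword list, the tokenisation rule, and the example titles are all assumptions, and the grouping of key words into categories is not shown.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical stopword list; a real study would choose its own.
STOPWORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to", "with"}


def title_keywords(title):
    """Lower-case a title and return its unique non-stopword tokens, sorted."""
    tokens = re.findall(r"[a-z]+", title.lower())
    return sorted({t for t in tokens if t not in STOPWORDS})


def keyword_statistics(titles):
    """Count in how many titles each key word, and each pair, appears."""
    frequency, cooccurrence = Counter(), Counter()
    for title in titles:
        keywords = title_keywords(title)
        frequency.update(keywords)
        # Sorted input means each pair is stored once, in alphabetical order.
        cooccurrence.update(combinations(keywords, 2))
    return frequency, cooccurrence


# Invented titles, for illustration only.
titles = ["Search filters for MEDLINE", "Validation of a search filter"]
frequency, cooccurrence = keyword_statistics(titles)
print(frequency.most_common(3))
print(cooccurrence.most_common(3))
```

Counting each key word once per title keeps a long title from inflating the totals. Note that this sketch treats "filter" and "filters" as distinct; merging such variants would require a stemming or lemmatisation step.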