991 resultados para grammatical category
Resumo:
During the process of language development, one of the most important tasks that children must face is that of identifying the grammatical category to which words in their language belong. This is essential in order to be able to form grammatically correct utterances. How do children proceed in order to classify words in their language and assign them to their corresponding grammatical category? The present study investigates the usefulness of phonological information for the categorization of nouns in English, given the fact that it is phonology the first source of information that might be available to prelinguistic infants who lack access to semantic information or complex morphosyntactic information. We analyse four different corpora containing linguistic samples of English speaking mothers addressing their children in order to explore the reliability with which words are represented in mothers’ speech based on several phonological criteria. The results of the analysis confirm the prediction that most of the words to which English learning infants are exposed during the first two years of life can be accounted for in terms of their phonological resemblance
Resumo:
Research on the relationship between grammatical aspect and motion event construal has posited that speakers of non-aspect languages are more prone to encoding event endpoints than are speakers of aspect languages (e.g., von Stutterheim and Carroll 2011). In the present study, we test this hypothesis by extending this line of inquiry to Afrikaans, a non-aspect language which is previously unexplored in this regard. Motion endpoint behavior among Afrikaans speakers was measured by means of a linguistic retelling task and a non-linguistic similarity judgment task, and then compared with the behavior of speakers of a non-aspect language (Swedish) and speakers of an aspect language (English). Results showed the Afrikaans speakers' endpoint patterns aligned with Swedish patterns, but were significantly different from English patterns. It was also found that the variation among the Afrikaans speakers could be partially explained by taking into account their frequency of use of English, such that those who used English more frequently exhibited an endpoint behavior that was more similar to English speakers. The current study thus lends further support to the hypothesis that speakers of different languages attend differently to event endpoints as a function of the grammatical category of aspect.
Resumo:
El objetivo de la presente investigación es analizar el tratamiento que algunos de los diccionarios generales monolingües del español aparecidos en los últimos diez años han dado al fenómeno de las colocaciones léxicas, a saber, aquellas combinaciones de palabras que, desde el punto de vista de la norma, presentan ciertas restricciones combinatorias, esencialmente de carácter semántico, impuestas por el uso (Corpas: 1996). Los repertorios objeto de análisis han sido: el "Diccionario Salamanca de la lengua española", dirigido por Juan Gutiérrez (1996); el "Diccionario del español actual", de Manuel Seco, Olimpia Andrés y Gabino Ramos (1999); la vigésima segunda edición del diccionario de la RAE (2001); y el "Gran diccionario de uso del español actual. Basado en el corpus Cumbre", dirigido por Aquilino Sánchez (2001). Nuestro estudio se ha fundamentado en un corpus de 52 colocaciones léxicas confeccionado a partir del análisis de las subentradas contenidas en la letra "b" de cada uno de los diccionarios seleccionados. Posteriormente, hemos examinado las entradas correspondientes a cada uno de los elementos que constituyen la colocación (base y colocativo) con el fin de observar si los diccionarios estudiados dan cuenta de estas mismas combinaciones en otras partes del artículo lexicográfico, como son las definiciones o los ejemplos. A la hora de analizar la información lexicográfica hemos centrado nuestra atención en cuatro aspectos: a) la información contenida en las páginas preliminares de cada una de las obras; b) la ubicación de las colocaciones en el artículo lexicográfico; c) la asignación de la colocación a un artículo determinado; y d) la marcación gramatical.
Resumo:
In a system where tens of thousands of words are made up of a limited number of phonemes, many words are bound to sound alike. This similarity of the words in the lexicon as characterized by phonological neighbourhood density (PhND) has been shown to affect speed and accuracy of word comprehension and production. Whereas there is a consensus about the interfering nature of neighbourhood effects in comprehension, the language production literature offers a more contradictory picture with mainly facilitatory but also interfering effects reported on word production. Here we report both of these two types of effects in the same study. Multiple regression mixed models analyses were conducted on PhND effects on errors produced in a naming task by a group of 21 participants with aphasia. These participants produced more formal errors (interfering effect) for words in dense phonological neighbourhoods, but produced fewer nonwords and semantic errors (a facilitatory effect) with increasing density. In order to investigate the nature of these opposite effects of PhND, we further analysed a subset of formal errors and nonword errors by distinguishing errors differing on a single phoneme from the target (corresponding to the definition of phonological neighbours) from those differing on two or more phonemes. This analysis confirmed that only formal errors that were phonological neighbours of the target increased in dense neighbourhoods, while all other errors decreased. Based on additional observations favouring a lexical origin of these formal errors (they exceeded the probability of producing a real-word error by chance, were of a higher frequency, and preserved the grammatical category of the targets), we suggest that the interfering effect of PhND is due to competition between lexical neighbours and target words in dense neighbourhoods.
Resumo:
Ce mémoire de maitrise vise à dresser un portrait des erreurs syntaxiques des élèves du secondaire en analysant un corpus de textes de cinq groupes du Québec, de la 1re à la 5e secondaire. Les résultats actuels aux épreuves ministérielles des élèves de 5e secondaire nous indiquent que les élèves éprouvent des difficultés avec l’écriture du français. Une analyse approfondie nous permet de comprendre que l’amélioration de la situation passe par une meilleure connaissance des erreurs syntaxiques des élèves. En nous appuyant sur la grille de Boivin et Pinsonneault (2014), nous avons analysé les données provenant du codage des textes d’élèves de la 1re à la 5e secondaire. L’analyse de ces données nous a permis de constater que parmi les sept grandes catégories d’erreurs de la grille, c’est en syntaxe que les élèves commettent le plus d’erreurs. Une incursion au cœur des six sous-catégories de la syntaxe a révélé que la ponctuation causait le plus de problème aux élèves, et ce, à tous les niveaux. Les erreurs liées à la détermination de la catégorie grammaticale des mots (homophones) arrivaient en deuxième place. Par la suite, nous avons précisé davantage l’analyse en déterminant, pour chacun des codes, l’évolution du nombre d’erreurs d’un niveau du secondaire à l’autre. Il est ressorti de cette étude que les deux principales erreurs, basées sur les sous-catégories syntaxiques, sont celles portant sur l’usage de la virgule et celles liées à la confusion qui existe encore un verbe terminant par «er» et un adjectif ou un participe passé terminant par «é-e-s».
Resumo:
This monograph is a comparative study of past time reference in Spanish and Russian.The ambition is to present a functional perspective of how both languages systemically express temporal and aspectual information. The verb, naturally, attracts the main attention of the thesis and the focus is almost exclusively on verbs in the indicative mood.The definition of the parameters of Time and Aspect plays an important part in the present dissertation. A particular emphasis concerns the elaboration and testing of the ‘ABC’ model, which represents a graphic definition of verbal aspect as a grammatical category.Another important issue is the distinction of aspect and Aktionsart; these concepts are closely related but operate on different functional levels. The analysis is essentially based on linguistic material from parallel corpora, constructed for this purpose. At first the material is treated statistically in order to create astarting point for the qualitative part of the analysis. The three main areas of investigation dealt with are:1) The relation between the simple past tenses, pretérito and imperfecto, in Spanish and their imperfective and perfective counterparts in Russian.2) The relation between the compound tenses in Spanish and the Russian verbal system.The analysis of this relation also comprises a critique of the traditional interpretationof the aspectual contents of the compound tenses.3) The usage of alternative strategies in both languages. In this part of the analysis the focus is widened to include verbal periphrasis, infinite verb forms and subordination. The results of the analysis demonstrate that verbal aspect, according to the definition represented by ‘the ABC model’, works as a grammatical category in both Spanish and Russian. It is also shown that there are systemic differences in the manifestation of this functional category in both languages. Another important result is that the neither the compound tenses nor the progressive express verbal aspect, at least not in a narrow sense of the word but represent different verbal functions related to aspect.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
O presente trabalho descreve características da linguagem, em especial de alguns dos aspectos discursivos, de idosos com envelhecimento saudável, com declínio cognitivo senil, ou com doença de Alzheimer leve e moderada. Foram avaliados um total de 44 idosos sendo 22 saudáveis, 4 com declínio cognitivo senil, 9 com doença de Alzheimer leve e 9 com doença de Alzheimer moderada, classificados pelos critérios do CDR. Foram aplicados os testes neuropsicológicos do mini exame do estado mental, nomeação de Boston resumido, fluência verbal e narrativa por confronto visual (figura do roubo dos bolinhos). Foram estimados os desempenhos nos testes selecionados e aplicada a avaliação dos relatos pelos critérios propostos por Groves-Wright (2004) assim como foi feita a tipificação da narrativa. Tratamento estatístico paramétrico determinou os valores de média, erro-padrão e o nível de significância das diferenças entre as médias foi fixado para valores de p<0.05. Em seguida foi realizado inventário e análise do léxico e das categorias gramaticas das narrativas, e realizada análise paramétrica a partir dos escores Z, através do programa STABLEX. Encontrou-se que a fluência verbal semântica é melhor nos idosos saudáveis quando comparados aos com declínio cognitivo leve. As narrativas mostraram diferenças estatisticamente significantes entre os idosos saudáveis e os com declínio cognitivo leve nas análises de frequência de uso do vocabulário e das categorias gramaticais como um todo, e também entre saudáveis e todos os demais grupos nas análises de cada uma das categorias gramaticais. A análise do vocabulário e das categorias gramaticais permitiu identificar comprometimentos da função narrativa medida pelo tipo de vocabulário e pelas categorias gramaticais preferidas ou rejeitadas. Os resultados contribuem para distinguir as características da narrativa de idosos saudáveis, daquelas de idosos com declínio cognitivo leve ou com Alzheimer leve ou moderado, apontando alterações que possivelmente são indicadores precoces que podem ser usados para avaliar o curso temporal da doença.
Resumo:
This paper describes the prepositions sob and sobre in the Functional Discourse Grammar framework (HENGEVELD; MACKENZIE, 2008) aiming at checking their lexical or grammatical status on the basis of classification criteria postulated by Keizer (2007). The following aspects point to their lexical status : (i) they consist of an Ascription Subact; (ii) they contain a specific content on the vertical axis signaling inferiority and superiority position in relation to a limit; (iii) they are not required by any predicate, but they are predicates by themselves, which require complementation by an argument playing Reference semantic function; (iv) they may be combined with de, em, para and por; which are genuine grammatical prepositions; and, finally, (v) they are not subjected to any phonological process of reduction and fusion.
Resumo:
Pós-graduação em Linguística e Língua Portuguesa - FCLAR
Resumo:
Feminist linguists claim that masculine forms used in a generic sense (e.g. he referring to a doctor irrespective of sex) facilitate the cognitive representation of men compared to women and make women less visible. A number of experimental studies have confirmed this assumption with regard to the English language. Concerning other languages, however, this question has been addressed only in very few studies, although gender is a much more pervasive grammatical category and masculine generics are more prominent in languages such as French, Spanish or German. This paper reports three experiments with native speakers of German which were conducted to determine the influence of different types of German generics on the cognitive inclusion of women. Results indicate that inclusion of women is higher with 'non-sexist' alternatives than with masculine generics, a tendency which was consistent over studies. But the different alternative forms show different effects which also vary depending on the context. These results are discussed with regard to their practical consequences in situations such as nominating women and men for awards, political offices etc.
Resumo:
The goal of the present thesis was to investigate the production of code-switched utterances in bilinguals’ speech production. This study investigates the availability of grammatical-category information during bilingual language processing. The specific aim is to examine the processes involved in the production of Persian-English bilingual compound verbs (BCVs). A bilingual compound verb is formed when the nominal constituent of a compound verb is replaced by an item from the other language. In the present cases of BCVs the nominal constituents are replaced by a verb from the other language. The main question addressed is how a lexical element corresponding to a verb node can be placed in a slot that corresponds to a noun lemma. This study also investigates how the production of BCVs might be captured within a model of BCVs and how such a model may be integrated within incremental network models of speech production. In the present study, both naturalistic and experimental data were used to investigate the processes involved in the production of BCVs. In the first part of the present study, I collected 2298 minutes of a popular Iranian TV program and found 962 code-switched utterances. In 83 (8%) of the switched cases, insertions occurred within the Persian compound verb structure, hence, resulting in BCVs. As to the second part of my work, a picture-word interference experiment was conducted. This study addressed whether in the case of the production of Persian-English BCVs, English verbs compete with the corresponding Persian compound verbs as a whole, or whether English verbs compete with the nominal constituents of Persian compound verbs only. Persian-English bilinguals named pictures depicting actions in 4 conditions in Persian (L1). In condition 1, participants named pictures of action using the whole Persian compound verb in the context of its English equivalent distractor verb. In condition 2, only the nominal constituent was produced in the presence of the light verb of the target Persian compound verb and in the context of a semantically closely related English distractor verb. In condition 3, the whole Persian compound verb was produced in the context of a semantically unrelated English distractor verb. In condition 4, only the nominal constituent was produced in the presence of the light verb of the target Persian compound verb and in the context of a semantically unrelated English distractor verb. The main effect of linguistic unit was significant by participants and items. Naming latencies were longer in the nominal linguistic unit compared to the compound verb (CV) linguistic unit. That is, participants were slower to produce the nominal constituent of compound verbs in the context of a semantically closely related English distractor verb compared to producing the whole compound verbs in the context of a semantically closely related English distractor verb. The three-way interaction between version of the experiment (CV and nominal versions), linguistic unit (nominal and CV linguistic units), and relation (semantically related and unrelated distractor words) was significant by participants. In both versions, naming latencies were longer in the semantically related nominal linguistic unit compared to the response latencies in the semantically related CV linguistic unit. In both versions, naming latencies were longer in the semantically related nominal linguistic unit compared to response latencies in the semantically unrelated nominal linguistic unit. Both the analysis of the naturalistic data and the results of the experiment revealed that in the case of the production of the nominal constituent of BCVs, a verb from the other language may compete with a noun from the base language, suggesting that grammatical category does not necessarily provide a constraint on lexical access during the production of the nominal constituent of BCVs. There was a minimal context in condition 2 (the nominal linguistic unit) in which the nominal constituent was produced in the presence of its corresponding light verb. The results suggest that generating words within a context may not guarantee that the effect of grammatical class becomes available. A model is proposed in order to characterize the processes involved in the production of BCVs. Implications for models of bilingual language production are discussed.
Resumo:
OntoTag - A Linguistic and Ontological Annotation Model Suitable for the Semantic Web
1. INTRODUCTION. LINGUISTIC TOOLS AND ANNOTATIONS: THEIR LIGHTS AND SHADOWS
Computational Linguistics is already a consolidated research area. It builds upon the results of other two major ones, namely Linguistics and Computer Science and Engineering, and it aims at developing computational models of human language (or natural language, as it is termed in this area). Possibly, its most well-known applications are the different tools developed so far for processing human language, such as machine translation systems and speech recognizers or dictation programs.
These tools for processing human language are commonly referred to as linguistic tools. Apart from the examples mentioned above, there are also other types of linguistic tools that perhaps are not so well-known, but on which most of the other applications of Computational Linguistics are built. These other types of linguistic tools comprise POS taggers, natural language parsers and semantic taggers, amongst others. All of them can be termed linguistic annotation tools.
Linguistic annotation tools are important assets. In fact, POS and semantic taggers (and, to a lesser extent, also natural language parsers) have become critical resources for the computer applications that process natural language. Hence, any computer application that has to analyse a text automatically and ‘intelligently’ will include at least a module for POS tagging. The more an application needs to ‘understand’ the meaning of the text it processes, the more linguistic tools and/or modules it will incorporate and integrate.
However, linguistic annotation tools have still some limitations, which can be summarised as follows:
1. Normally, they perform annotations only at a certain linguistic level (that is, Morphology, Syntax, Semantics, etc.).
2. They usually introduce a certain rate of errors and ambiguities when tagging. This error rate ranges from 10 percent up to 50 percent of the units annotated for unrestricted, general texts.
3. Their annotations are most frequently formulated in terms of an annotation schema designed and implemented ad hoc.
A priori, it seems that the interoperation and the integration of several linguistic tools into an appropriate software architecture could most likely solve the limitations stated in (1). Besides, integrating several linguistic annotation tools and making them interoperate could also minimise the limitation stated in (2). Nevertheless, in the latter case, all these tools should produce annotations for a common level, which would have to be combined in order to correct their corresponding errors and inaccuracies. Yet, the limitation stated in (3) prevents both types of integration and interoperation from being easily achieved.
In addition, most high-level annotation tools rely on other lower-level annotation tools and their outputs to generate their own ones. For example, sense-tagging tools (operating at the semantic level) often use POS taggers (operating at a lower level, i.e., the morphosyntactic) to identify the grammatical category of the word or lexical unit they are annotating. Accordingly, if a faulty or inaccurate low-level annotation tool is to be used by other higher-level one in its process, the errors and inaccuracies of the former should be minimised in advance. Otherwise, these errors and inaccuracies would be transferred to (and even magnified in) the annotations of the high-level annotation tool.
Therefore, it would be quite useful to find a way to
(i) correct or, at least, reduce the errors and the inaccuracies of lower-level linguistic tools;
(ii) unify the annotation schemas of different linguistic annotation tools or, more generally speaking, make these tools (as well as their annotations) interoperate.
Clearly, solving (i) and (ii) should ease the automatic annotation of web pages by means of linguistic tools, and their transformation into Semantic Web pages (Berners-Lee, Hendler and Lassila, 2001). Yet, as stated above, (ii) is a type of interoperability problem. There again, ontologies (Gruber, 1993; Borst, 1997) have been successfully applied thus far to solve several interoperability problems. Hence, ontologies should help solve also the problems and limitations of linguistic annotation tools aforementioned.
Thus, to summarise, the main aim of the present work was to combine somehow these separated approaches, mechanisms and tools for annotation from Linguistics and Ontological Engineering (and the Semantic Web) in a sort of hybrid (linguistic and ontological) annotation model, suitable for both areas. This hybrid (semantic) annotation model should (a) benefit from the advances, models, techniques, mechanisms and tools of these two areas; (b) minimise (and even solve, when possible) some of the problems found in each of them; and (c) be suitable for the Semantic Web. The concrete goals that helped attain this aim are presented in the following section.
2. GOALS OF THE PRESENT WORK
As mentioned above, the main goal of this work was to specify a hybrid (that is, linguistically-motivated and ontology-based) model of annotation suitable for the Semantic Web (i.e. it had to produce a semantic annotation of web page contents). This entailed that the tags included in the annotations of the model had to (1) represent linguistic concepts (or linguistic categories, as they are termed in ISO/DCR (2008)), in order for this model to be linguistically-motivated; (2) be ontological terms (i.e., use an ontological vocabulary), in order for the model to be ontology-based; and (3) be structured (linked) as a collection of ontology-based
Portrait des difficultés des élèves du secondaire relativement à l’orthographe des formes homophones
Resumo:
Cette recherche descriptive vise à établir un portrait des principales difficultés rencontrées par les élèves du secondaire relativement à l’orthographe des homophones et cela à travers différents angles d’analyse. Nous avons d’abord fait ressortir l’importance des difficultés orthographiques chez les élèves du secondaire québécois et mis en relief la proportion de ces erreurs attribuée à l’orthographe des homophones. À partir des données recueillies par le groupe de recherche Projet grammaire-écriture qui s’est donné comme objectif, dans un premier temps, de recueillir de nombreuses données à travers deux instruments de collecte (une dictée et une production écrite), nous avons tout d’abord relevé les erreurs d’homophonie commises le plus fréquemment par les élèves pour ensuite analyser chacune des formes homophones problématiques en fonction de critères variés tels que leur fréquence lexicale dans la langue française, leur appartenance à une catégorie grammaticale particulière ou encore la structure syntaxique qui les sous-tend. Les erreurs les plus importantes ont fait l’objet d’une observation plus poussée : nous avons établi le pourcentage de graphies correctes versus erronées dans tous les textes des élèves. Finalement, nous avons aussi comparé nos résultats à ceux obtenus par McNicoll et Roy (1984) auprès d’une population de niveau primaire. Les résultats révélés par notre analyse montrent que ce sont principalement les finales verbales en /E/ qui posent problème aux élèves du secondaire, suivies par les formes homophones s’est/c’est/ces/ses et se/ce.