929 resultados para second language processing


Relevância:

90.00% 90.00%

Publicador:

Resumo:

Language Resources are a critical component for Natural Language Processing applications. Throughout the years many resources were manually created for the same task, but with different granularity and coverage information. To create richer resources for a broad range of potential reuses, nformation from all resources has to be joined into one. The hight cost of comparing and merging different resources by hand has been a bottleneck for merging existing resources. With the objective of reducing human intervention, we present a new method for automating merging resources. We have addressed the merging of two verbs subcategorization frame (SCF) lexica for Spanish. The results achieved, a new lexicon with enriched information and conflicting information signalled, reinforce our idea that this approach can be applied for other task of NLP.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This thesis is a pedagogical and methodological work related to the Teacher’s use of the students’ common language in 7th grade (beginners-level 1) Capeverdean English classroom. It discusses the importance of a limited and judicious use of the students’ common language (Creole/Portuguese) as a teaching technique to assist in the teaching and learning process. This thesis contains four chapters. The first chapter defines and shows the difference between mother tongue, second language and foreign language, talks about the methods and approaches (classroom procedures) to teach English as a foreign language, the different opinions about the teacher’s use of the students’ first language in the EFL classroom, and presents two studies already conducted on the use of the students’ mother tongue in the English classroom in two different EFL context. The second Chapter describes the methodology of research to conduct a study on the use of the students’ common language (Creole/Portuguese) in the EFL Capeverdean context with 7th grade students. The third chapter is the presentation of the Results and Analyses of the field research. And finally the fourth chapter is the recommendations and conclusions.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Neste trabalho analisamos um conteúdo gramatical - o modo conjuntivo num contexto de aprendizagem formal, seguindo duas vias: a teórica, em que fizemos uma abordagem sobre as perspectivas de vários gramáticos; a prática, que se traduz na análise de um corpus. Desenvolvemos esta análise com base nas produções escritas dos aprendentes do terceiro ciclo, na Escola Secundária Manuel Lopes, Praia, com o objectivo de fazer uma abordagem crítica respeitante ao ensino do Português Língua Segunda, em particular, ao ensino e uso do conjuntivo na supracitada língua em Cabo Verde. Com efeito, foram abordadas questões relativas às teorias de aquisição/aprendizagem segundo as perspectivas de diversos autores. A análise dos textos escritos orientou-nos para o levantamento e estudo dos erros de uso do conjuntivo e para termos uma noção do perfil sociolinguístico dos aprendentes. A descrição linguística efectuada conduziu-nos a uma reflexão sobre as formas de tratamento e de recuperação dos erros . Os resultados sugerem que o insucesso na aprendizagem do Português Língua Segunda deve-se a factores vários, entre os quais, sociolinguísticos e políticos. Nesta sequência, perspectivamos a superação de erros mediante estratégias e sugestões metodológicas para um ensino eficaz do conjuntivo em Português Língua Segunda, neste país.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Esta tese centra-se em aspectos relevantes do inglês como uma língua universal, no actual contexto globalizado e examina possíveis mudanças relacionadas com o seu uso, em especial no continente africano, particularmente no caso de Cabo Verde, no sentido de ponderar eventuais alternativas nas pedagogias linguísticas no ensino desta língua que impliquem uma adaptação à realidade contemporânea. Uma vez que, nos nossos tempos, o inglês é a língua de eleição para a comunicação intercultural entre povos com várias experiências culturais e linguísticas, o conhecimento deste idioma torna-se, a cada dia que passa, impreterível e indispensável, na interacção intercultural. Em África, as funções desempenhadas pelo inglês são complexas; além da língua inglesa ser usada para comunicação entre etnias, com o estatuto de língua franca, também tem o papel de preservar a identidade nacional e de estabelecer a unidade entre os povos da mesma nação. Por conseguinte, é de considerar talvez ainda com mais pertinência, a adopção de uma nova filosofia de pedagogia de ensino que permita dotar os seus cidadãos de capacidades que lhes possibilitem comunicar de forma inteligível com povos de outras culturas e línguas. O primeiro capítulo aborda aspectos teóricos relacionados com a expansão, comunicação e mudança associadas à língua inglesa e suas implicações no ensino em países onde esta não é língua nativa (L1). O segundo capítulo reflecte, em primeiro lugar, sobre a situação linguística em África e as línguas francas predominantes no continente, incluindo a língua inglesa. Considera também questões relacionadas com o multilinguismo e a identidade, bem como assuntos relacionados com as implicações da diversidade linguística para a educação dos povos africanos.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

La globalización cuestiona la existencia de una relación mimética entre ciudadanía y Estado-nación. Las identidades homogéneas, sustentadas ideológicamente en nociones como «lengua nacional», plantean problemas en sociedades en las que ha crecido espectacularmente la diversidad lingüística e identitaria. Cataluña es un territorio en el que una parte de la población afirma una identidad catalana distinta a la española y viceversa. Además, se ha teorizado que la identidad catalana y la lengua catalana coexisten mutuamente. Por eso, se suceden voces que defienden la presencia del catalán en la educación escolar como fuente de la identidad nacional catalana, mientras que otras voces defienden su presencia simplemente como una buena manera de aprender el catalán cuando no se puede aprender en el medio social y familiar. En los últimos años, Cataluña ha recibido casi un millón de personas extranjeras que han modificado notablemente su situación sociolingüística. Las últimas encuestas manifiestan que un 6,3% de la población utiliza habitualmente una lengua distinta del catalán y del castellano. En este marco, mostramos las construcciones identitarias de un grupo de adolescentes de origen extranjero que están en el segundo ciclo de la ESO. Los datos fueron recogidos mediante dos grupos de discusión de seis-siete estudiantes de distinto origen, lengua propia y tiempo de residencia en Cataluña. Los resultados muestran la importancia del lugar de origen en la construcción de la identidad. Además, los participantes que afirman sentimientos catalanes o españoles no los relacionan con la lengua sino con los intercambios sociales que han establecido con sus iguales de origen naciona. Las intervenciones muestran también las dificultades para promover identidades múltiples desde el contexto escolar que eviten actitudes racistas y xenófobas y sirvan para promover proyectos colectivos de futuro en los que se pueda vivir desde una cierta diferencia

Relevância:

90.00% 90.00%

Publicador:

Resumo:

La infancia extranjera se escolariza en Cataluña en un programa de cambio de lengua del hogar a la escuela. Las investigaciones afirman que este alumnado tarda un mínimo de seis años en equiparar sus habilidades lingüístico-cognitivas con sus pares autóctonos, no así las habilidades conversacionales, las cuales se adquieren antes de los dos años de residencia. Sin embargo, no existen estudios sobre los efectos de la escolarización en el parvulario del alumnado alófono, así como de su lengua familiar, en relación con la adquisición de la lengua escolar. El artículo es un estudio comparativo de la adquisición del catalán de 567 autóctonos y 434 alófonos, al final del parvulario, en 50 escuelas de Cataluña que escolarizan a alumnado de origen extranjero. Las lenguas del alumnado autóctono son el catalán, el castellano y el bilingüismo catalán-castellano y las lenguas del alumnado alófono son el árabe, el soninké y el castellano. Los factores utilizados más relevantes han sido el nivel socioprofesional y educativo de las familias, el tiempo de residencia y el momento de escolarización del alumnado, el porcentaje de alumnado catalanohablante y de alumnado alófono en el aula y el contexto sociolingüístico del centro escolar. Los resultados muestran que el alumnado autóctono sabe más catalán que el alumnado alófono, pero las diferencias desaparecen respecto a algunos factores, de los cuales los más relevantes son los relacionados con las características del alumnado de las aulas. La lengua familiar del alumnado alófono no incide en sus resultados

Relevância:

90.00% 90.00%

Publicador:

Resumo:

En contra del que molts pares i mestres pensen, i en contra del que s'ha trobat en contextos d'adquisició natural de segones llengües i d'immersió, començar al més aviat possible l'aprenentatge d'idiomes, sobretot pel que fa a les tasques cognitivament més exigents, com l'escriptura, no sembla l'opció més eficaç. Els resultats del Grup de Recerca en Adquisició de Llengües (GRAL) estudià aprenents d'anglès que començaren als vuit i onze anys i trobaren, sistemàticament, que a llarg termini, després de les mateixes hores d'instrucció, eren els més grans, que havien començat als onze anys, els que obtenien millors resultats en totes les proves orals i escrites d'anglès excepte en alguna de prova de reconeixement fonètic. La maduresa cognitiva dels alumnes més grans i els diferents mecanismes d¿aprenentatge implícit i explícit entre els nens més petits i els adults ajuden a explicar per què només començar abans l'aprenentatge d'idiomes no sembla suficient per obtenir millors resultats.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Although paraphrasing is the linguistic mechanism underlying many plagiarism cases, little attention has been paid to its analysis in the framework of automatic plagiarism detection. Therefore, state-of-the-art plagiarism detectors find it difficult to detect cases of paraphrase plagiarism. In this article, we analyse the relationship between paraphrasing and plagiarism, paying special attention to which paraphrase phenomena underlie acts of plagiarism and which of them are detected by plagiarism detection systems. With this aim in mind, we created the P4P corpus, a new resource which uses a paraphrase typology to annotate a subset of the PAN-PC-10 corpus for automatic plagiarism detection. The results of the Second International Competition on Plagiarism Detection were analysed in the light of this annotation. The presented experiments show that (i) more complex paraphrase phenomena and a high density of paraphrase mechanisms make plagiarism detection more difficult, (ii) lexical substitutions are the paraphrase mechanisms used the most when plagiarising, and (iii) paraphrase mechanisms tend to shorten the plagiarized text. For the first time, the paraphrase mechanisms behind plagiarism have been analysed, providing critical insights for the improvement of automatic plagiarism detection systems.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The prediction filters are well known models for signal estimation, in communications, control and many others areas. The classical method for deriving linear prediction coding (LPC) filters is often based on the minimization of a mean square error (MSE). Consequently, second order statistics are only required, but the estimation is only optimal if the residue is independent and identically distributed (iid) Gaussian. In this paper, we derive the ML estimate of the prediction filter. Relationships with robust estimation of auto-regressive (AR) processes, with blind deconvolution and with source separation based on mutual information minimization are then detailed. The algorithm, based on the minimization of a high-order statistics criterion, uses on-line estimation of the residue statistics. Experimental results emphasize on the interest of this approach.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In this thesis we study the field of opinion mining by giving a comprehensive review of the available research that has been done in this topic. Also using this available knowledge we present a case study of a multilevel opinion mining system for a student organization's sales management system. We describe the field of opinion mining by discussing its historical roots, its motivations and applications as well as the different scientific approaches that have been used to solve this challenging problem of mining opinions. To deal with this huge subfield of natural language processing, we first give an abstraction of the problem of opinion mining and describe the theoretical frameworks that are available for dealing with appraisal language. Then we discuss the relation between opinion mining and computational linguistics which is a crucial pre-processing step for the accuracy of the subsequent steps of opinion mining. The second part of our thesis deals with the semantics of opinions where we describe the different ways used to collect lists of opinion words as well as the methods and techniques available for extracting knowledge from opinions present in unstructured textual data. In the part about collecting lists of opinion words we describe manual, semi manual and automatic ways to do so and give a review of the available lists that are used as gold standards in opinion mining research. For the methods and techniques of opinion mining we divide the task into three levels that are the document, sentence and feature level. The techniques that are presented in the document and sentence level are divided into supervised and unsupervised approaches that are used to determine the subjectivity and polarity of texts and sentences at these levels of analysis. At the feature level we give a description of the techniques available for finding the opinion targets, the polarity of the opinions about these opinion targets and the opinion holders. Also at the feature level we discuss the various ways to summarize and visualize the results of this level of analysis. In the third part of our thesis we present a case study of a sales management system that uses free form text and that can benefit from an opinion mining system. Using the knowledge gathered in the review of this field we provide a theoretical multi level opinion mining system (MLOM) that can perform most of the tasks needed from an opinion mining system. Based on the previous research we give some hints that many of the laborious market research tasks that are done by the sales force, which uses this sales management system, can improve their insight about their partners and by that increase the quality of their sales services and their overall results.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Language switching is omnipresent in bilingual individuals. In fact, the ability to switch languages (code switching) is a very fast, efficient, and flexible process that seems to be a fundamental aspect of bilingual language processing. In this study, we aimed to characterize psychometrically self-perceived individual differences in language switching and to create a reliable measure of this behavioral pattern by introducing a bilingual switching questionnaire. As a working hypothesis based on the previous literature about code switching, we decomposed language switching into four constructs: (i) L1 switching tendencies (the tendency to switch to L1; L1-switch); (ii) L2 switching tendencies (L2-switch); (iii) contextual switch, which indexes the frequency of switches usually triggered by a particular situation, topic, or environment; and (iv) unintended switch, which measures the lack of intention and awareness of the language switches. A total of 582 SpanishCatalan bilingual university students were studied. Twelve items were selected (three for each construct). The correlation matrix was factor-analyzed using minimum rank factor analysis followed by oblique direct oblimin rotation. The overall proportion of common variance explained by the four extracted factors was 0.86. Finally, to assess the external validity of the individual differences scored with the new questionnaire, we evaluated the correlations between these measures and several psychometric (language proficiency) and behavioral measures related to cognitive and attentional control. The present study highlights the importance of evaluating individual differences in language switching using self-assessment instruments when studying the interface between cognitive control and bilingualism.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Across Latin America 420 indigenous languages are spoken. Spanish is considered a second language in indigenous communities and is progressively introduced in education. However, most of the tools to support teaching processes of a second language have been developed for the most common languages such as English, French, German, Italian, etc. As a result, only a small amount of learning objects and authoring tools have been developed for indigenous people considering the specific needs of their population. This paper introduces Multilingual–Tiny as a web authoring tool to support the virtual experience of indigenous students and teachers when they are creating learning objects in indigenous languages or in Spanish language, in particular, when they have to deal with the grammatical structures of Spanish. Multilingual–Tiny has a module based on the Case-based Reasoning technique to provide recommendations in real time when teachers and students write texts in Spanish. An experiment was performed in order to compare some local similarity functions to retrieve cases from the case library taking into account the grammatical structures. As a result we found the similarity function with the best performance

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Los programas de inmersión lingüística han constituido y constituyen dentro del Sistema Educativo catalánla principal forma para que el alumnado de lengua familiar no-catalana aprenda una nueva lengua, el catalán,sin que, en su proceso de aprendizaje, vea mermado ni el desarrollo de su propia lengua ni su rendimientoacadémico. El éxito de la inmersión lingüística en las décadas anteriores ha sido frecuentemente utilizado comouno de los argumentos orientativos para justificar la política lingüística que se sigue en la escolarización de lainfancia extranjera. Sin embargo, los resultados obtenidos por investigaciones recientes parece que no avalanempíricamente dicho argumento. Este artículo analiza dichos resultados y expone, a partir del Plan para laLengua y Cohesión Social puesto en marcha por el Departamento de Educación de la Generalitat de Cataluña,cuáles son los retos que se presentan a su Sistema Educativo dentro del nuevo marco que supone el aumento de ladiversidad cultural y lingüística en la actual sociedad catalana

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The human language-learning ability persists throughout life, indicating considerable flexibility at the cognitive and neural level. This ability spans from expanding the vocabulary in the mother tongue to acquisition of a new language with its lexicon and grammar. The present thesis consists of five studies that tap both of these aspects of adult language learning by using magnetoencephalography (MEG) and functional magnetic resonance imaging (fMRI) during language processing and language learning tasks. The thesis shows that learning novel phonological word forms, either in the native tongue or when exposed to a foreign phonology, activates the brain in similar ways. The results also show that novel native words readily become integrated in the mental lexicon. Several studies in the thesis highlight the left temporal cortex as an important brain region in learning and accessing phonological forms. Incidental learning of foreign phonological word forms was reflected in functionally distinct temporal lobe areas that, respectively, reflected short-term memory processes and more stable learning that persisted to the next day. In a study where explicitly trained items were tracked for ten months, it was found that enhanced naming-related temporal and frontal activation one week after learning was predictive of good long-term memory. The results suggest that memory maintenance is an active process that depends on mechanisms of reconsolidation, and that these process vary considerably between individuals. The thesis put special emphasis on studying language learning in the context of language production. The neural foundation of language production has been studied considerably less than that of perceptive language, especially on the sentence level. A well-known paradigm in language production studies is picture naming, also used as a clinical tool in neuropsychology. This thesis shows that accessing the meaning and phonological form of a depicted object are subserved by different neural implementations. Moreover, a comparison between action and object naming from identical images indicated that the grammatical class of the retrieved word (verb, noun) is less important than the visual content of the image. In the present thesis, the picture naming was further modified into a novel paradigm in order to probe sentence-level speech production in a newly learned miniature language. Neural activity related to grammatical processing did not differ between the novel language and the mother tongue, but stronger neural activation for the novel language was observed during the planning of the upcoming output, likely related to more demanding lexical retrieval and short-term memory. In sum, the thesis aimed at examining language learning by combining different linguistic domains, such as phonology, semantics, and grammar, in a dynamic description of language processing in the human brain.