41 results for anaphora


Several studies of different bilingual groups, including L2 learners, child bilinguals, heritage speakers and L1 attriters, reveal similar performance on syntax-discourse interface properties such as anaphora resolution (Sorace, 2011 and references therein). Specifically, bilinguals seem to allow more optionality in the interpretation of overt subject pronouns in null subject languages such as Greek, Italian and Spanish, while their interpretation of null subject pronouns is indistinguishable from that of monolingual natives. Nevertheless, there is some evidence of bilingualism effects on the interpretation of null subject pronouns as well in heritage speakers’ grammars (Montrul, 2004), attributed to some form of ‘arrested’ development in this group of bilinguals. The present study investigates similarities and differences between two Greek–Swedish bilingual groups, heritage speakers and L1 attriters, in anaphora resolution of null and overt subject pronouns in Greek, using a self-paced listening task with a sentence-picture matching decision at the end of each sentence. The two groups differ in crucial ways: the heritage speakers were simultaneous or early bilinguals, while the L1 attriters were adult learners of the second language, Swedish. Our findings reveal differences from monolingual preferences in the interpretation of the overt pronoun for both heritage and attrited speakers, while the differences attested between the two groups in the interpretation of null subject pronouns affect only response times, with heritage speakers being faster than attrited speakers. We argue that our results do not support age-of-onset or differential-input effects on bilingual performance in pronoun resolution.


This paper presents a salience-based technique for the annotation of directly quoted speech from fiction text. In particular, this paper determines to what extent a naïve (without the use of complex machine learning or knowledge-based techniques) scoring technique can be used for the identification of the speaker of speech quotes. The presented technique makes use of a scoring technique, similar to that commonly found in knowledge-poor anaphora resolution research, as well as a set of hand-coded rules for the final identification of the speaker of each quote in the text. Speaker identification is shown to be achieved using three tasks: the identification of a speech-verb associated with a quote with a recall of 94.41%; the identification of the actor associated with a quote with a recall of 88.22%; and the selection of a speaker with an accuracy of 79.40%.
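
To make the idea of a knowledge-poor scoring technique concrete, here is a minimal sketch of salience-based speaker selection. It is not the paper's implementation: the `Candidate` fields, the weights in `WEIGHTS`, and the function names are all hypothetical, chosen only to illustrate how candidate speakers for a quote might be ranked by a sum of hand-set indicator scores.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Candidate:
    """A possible speaker for one quote (all fields are illustrative assumptions)."""
    name: str
    sentence_distance: int        # sentences between the mention and the quote
    is_speech_verb_subject: bool  # actor of a speech verb attached to the quote
    mentions_in_window: int       # recent mentions of this character


# Hypothetical hand-set weights; the abstract does not give actual values.
WEIGHTS = {
    "speech_verb_subject": 10.0,
    "per_mention": 1.0,
    "per_sentence_distance": -2.0,
}


def salience(c: Candidate) -> float:
    """Sum the weighted indicators for one candidate."""
    score = 0.0
    if c.is_speech_verb_subject:
        score += WEIGHTS["speech_verb_subject"]
    score += WEIGHTS["per_mention"] * c.mentions_in_window
    score += WEIGHTS["per_sentence_distance"] * c.sentence_distance
    return score


def pick_speaker(candidates: List[Candidate]) -> Optional[str]:
    """Return the name of the highest-scoring candidate, or None if there are none."""
    if not candidates:
        return None
    return max(candidates, key=salience).name


if __name__ == "__main__":
    quote_candidates = [
        Candidate("Alice", sentence_distance=0, is_speech_verb_subject=True, mentions_in_window=1),
        Candidate("Bob", sentence_distance=1, is_speech_verb_subject=False, mentions_in_window=3),
    ]
    print(pick_speaker(quote_candidates))
```

In a fuller system, hand-coded rules such as those the abstract mentions would presumably run after this ranking step to make the final decision; that part is not sketched here.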


Relying mainly on the accessibility approach to anaphora, this article analyzes the types, distribution and retrieval of anaphors in two forms of spoken discourse: casual and controlled talk. For the purposes of the study, twenty Iranian sophomore students were randomly selected to conduct the talks. The subjects were divided into two groups, casual talk and controlled talk. According to the settings and topics adopted, the casual talk group was further divided into two subgroups, dorm talk and academic talk. It was observed that as the talk situations vary, the types, frequencies, distances, retrieval qualities and thematic structure (patterning) of anaphors undergo dramatic changes too. Further analyses of the data show that pronominal anaphors far outnumber NP anaphors in dorm casual talk, whereas in academic casual talk the number of NP anaphors exceeds that of the former talk group. The distribution of anaphors in the controlled talk groups, however, has been shown to be more moderate with regard to the types of anaphors used. Overall, the distributional patterns of the various anaphoric devices in different talk situations are considered to be a function of the speakers’ evaluation of the cognitive states of the listeners/addressees.
Average distances and frequencies of the different types of zero, pronominal, and NP anaphors have also been shown to undergo dramatic changes as talk situations vary.


This study addresses referencing through direct and indirect anaphora in texts written by elementary school (Ensino Fundamental) students. As the cornerstone of our investigation, we ground ourselves in referencing and in studies of anaphora within Text Linguistics, following Mondada and Dubois (2003), Koch (2003) and Marcuschi (2005). In line with previous studies on the topic, we defend the notion of referencing as a process of meaning construction that takes place in discourse, that is, in interactive activities between subjects, and that therefore requires collaborative action between interlocutors. In this respect, interaction must exist, since the referents of an anaphoric term are not always found on the textual surface; they are often constructed from a representation or mental model available in discourse memory. From this perspective, we analyze the referencing process built from direct and indirect anaphora, identifying the strategies used in texts produced in classroom situations. Direct anaphora is, in general, defined by the relation of coreferentiality established between the anaphor and its antecedent, whereas indirect anaphora is seen as a referential strategy of association without an explicit referent: to establish referential continuity in the text, it relies on the activation (mental referencing) of new elements rather than on the reactivation of already known referents, which constitutes a process of implicit referencing. To achieve our objective, we draw, as theoretical support, on information processing in cognitive memory, on reference, and on studies of anaphora within Text Linguistics.
The corpus of this work consists of sixty texts written by 7th-grade students (7ª série, now 8º ano), among which we analyzed the referencing process brought about by direct and indirect anaphora in twelve texts. Considering the data analyzed, among the types of anaphora studied, we found a predominance of direct anaphora, most notably co-significative coreferential direct anaphora, which appeared most often in direct resumptions by pronominalization.


Graduate Program in Letters (Pós-graduação em Letras) - FCLAS


This study investigates the use of anaphoric nominal descriptions in oral narratives, more precisely, the relation between the use of these forms and the parts that make up the structure of the narrative. We observe the functions that anaphoric nominal descriptions perform as a result of this relation, as well as the argumentative orientation that such expressions impart to the narrator's discourse. To this end, we follow a theoretical path in which we discuss concepts of oral narrative, define narrative structure, and problematize referencing, anaphora, and referencing strategies with nominal heads, among which we highlight nominal descriptions. For the study of oral narrative and its structure, we draw in particular on Labov (1972). As for referencing, anaphora and referencing strategies, we follow the postulates of Marcuschi (2005; 2007), Koch (1996; 2001; 2004; 2005; 2006; 2008), Mondada and Dubois (2003) and Lima (2004). The corpus of this study consists of ten oral narratives recorded on video and subsequently transcribed. The informants are night watchmen who work in the center of Castanhal (PA). Based on our analysis, we classify the functions that anaphoric nominal descriptions perform in the different parts of the narrative. We also propose a classification of these forms that takes into account their different degrees of argumentativity.


Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)


The conference Advances in Kartvelian Morphology and Syntax took place in 2009 at the University of Bremen as part of the Festival of Languages. Selected presentations are published in this volume. The authors discuss topics such as anaphora in Svan, intonation in Georgian, and the pragmatics of subordinate clauses in Georgian, as well as research on modern developments such as SMS communication in the Georgian-speaking area.
CONTENTS: NINO AMIRIDZE, TAMAR RESECK & MANANA TOPADZE GÄUMANN: Preface; KEVIN TUITE: The Kartvelian suffixal intransitive; MANANA KOBAIDZE: Towards the morphological and syntactical classification of Georgian verbs; RENÉ LACROIX: Origin of Sets I–II suffixes in South Caucasian through reanalysis; STAVROS SKOPETEAS & CAROLINE FÉRY: Prosodic cues for exhaustive interpretations: a production study on Georgian intonation; WINFRIED BOEDER: Anaphora in Svan; YASUHIRO KOJIMA: The position of rom and the pragmatics of subordinate clauses in Georgian; NATIA AMAGHLOBELI: Morphological aspects of Georgian SMS language.


This paper aims to continue the general line opened by Krivochen and Luder (2012) in developing formal tools for the study of classical languages, Latin in particular. These tools, developed from axiomatic-formal methods, are tested against concrete data and help us better understand not only how a language works as a system but also the cognitive-biological substrate that makes the development of such a system possible. Our focus in this paper is the notion of "locality" in syntactic derivations, which has been at the center of much Chomskyan-oriented research over the last decade. Drawing on concrete problems that arise when considering examples attested in the texts, we offer an alternative explanation to Chomskyan orthodoxy. The aim is to provide the reader with a set of explicit theoretical tools that can be used in the analysis of locality in Latin.


An important part of human intelligence, both historically and operationally, is our ability to communicate. We learn how to communicate, and maintain our communicative skills, in a society of communicators, a highly effective way to reach and maintain proficiency in this complex skill. Principles that might allow artificial agents to learn language this way are incompletely known at present; the multi-dimensional nature of socio-communicative skills is beyond every machine learning framework so far proposed. Our work begins to address the challenge of proposing a way for observation-based machine learning of natural language and communication. Our framework can learn complex communicative skills with minimal up-front knowledge. The system learns by incrementally producing predictive models of causal relationships in observed data, guided by goal-inference and reasoning using forward-inverse models. We present results from two experiments where our S1 agent learns human communication by observing two humans interacting in a realtime TV-style interview, using multimodal communicative gesture and situated language to talk about recycling of various materials and objects. S1 can learn multimodal complex language and multimodal communicative acts, with a vocabulary of 100 words forming natural sentences with relatively complex sentence structure, including manual deictic reference and anaphora. S1 is seeded only with high-level information about the goals of the interviewer and interviewee, and a small ontology; no grammar or other information is provided to S1 a priori. The agent learns the pragmatics, semantics, and syntax of complex spoken utterances and gestures from scratch, by observing the humans compare and contrast the cost and pollution related to recycling aluminum cans, glass bottles, newspaper, plastic, and wood. After 20 hours of observation S1 can perform an unscripted TV interview with a human, in the same style, without making mistakes.


An important part of human intelligence is the ability to use language. Humans learn how to use language in a society of language users, which is probably the most effective way to learn a language from the ground up. Principles that might allow artificial agents to learn language this way are not known at present. Here we present a framework which begins to address this challenge. Our auto-catalytic, endogenous, reflective architecture (AERA) supports the creation of agents that can learn natural language by observation. We present results from two experiments where our S1 agent learns human communication by observing two humans interacting in a realtime mock television interview, using gesture and situated language. Results show that S1 can learn multimodal complex language and multimodal communicative acts, using a vocabulary of 100 words with numerous sentence formats, by observing unscripted interaction between the humans. No grammar is provided to it a priori, only high-level information about the format of the human interaction, in the form of high-level goals of the interviewer and interviewee and a small ontology. The agent learns the pragmatics, semantics, and syntax of complex sentences spoken by the human subjects on the topic of recycling of objects such as aluminum cans, glass bottles, plastic, and wood, as well as the use of manual deictic reference and anaphora.


The purpose of this thesis is the automatic construction of ontologies from texts, within the area known as Ontology Learning. This discipline aims to automate the construction of domain models from structured or unstructured information sources. It originated at the beginning of the millennium, as a result of the exponential growth in the volume of information accessible on the Internet. Since most information on the web is presented as text, automatic ontology learning has focused on the analysis of this type of source, drawing over the years on very different techniques from areas such as Information Retrieval, Information Extraction, Summarization and, in general, areas related to natural language processing. The main contribution of this thesis is that, in contrast with the majority of current techniques, the proposed method does not analyze the surface syntactic structure of the language but explores its deep semantic level. Its objective, therefore, is to infer the domain model from the way the meanings of sentences are articulated in natural language. Since the deep semantic level does not depend on the language, the method can operate in multilingual scenarios, where it is necessary to combine information from texts in different languages. To access this level of the language, the method uses the interlingua model. These formalisms, which come from the area of machine translation, make it possible to represent the meaning of sentences independently of the language. Specifically, UNL (Universal Networking Language) is used, which is considered to be the only standardized general-purpose interlingua.
The approach used in this thesis continues previous work carried out both by its author and by the research group of which he is part, which studied how to use the interlingua model in the areas of multilingual information extraction and retrieval. Basically, the procedure defined in the method tries to identify, in the UNL representation of texts, certain regularities that allow the parts of the domain ontology to be deduced. Since UNL is a formalism based on semantic networks, these regularities take the form of graphs, which are generalized into structures called linguistic patterns. On the other hand, UNL still preserves certain mechanisms of discourse cohesion from natural languages, such as the phenomenon of anaphora. In order to increase the effectiveness in the understanding of expressions, the method provides, as another significant contribution, the definition of an algorithm for the resolution of pronominal anaphora within the interlingua model, limited to the case of third-person personal pronouns whose antecedent is a proper noun. The proposed method rests on the definition of a formal framework, built by adapting some definitions from graph theory and incorporating new ones, in order to accommodate the notions of UNL expression and linguistic pattern, as well as the pattern-matching operations that underlie the processes of the method. Both the formal framework and all the processes defined by the method have been implemented in order to carry out the experimentation, which was applied to an article from the UNESCO EOLSS collection, the "Encyclopedia of Life Support Systems".
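
The restricted anaphora case named above (third-person personal pronouns whose antecedent is a proper noun) can be illustrated with a small sketch. This is not the thesis's algorithm, which operates on UNL graphs; it is a plain recency heuristic over a flat token list, and the `Token` type, the `PRONOUNS` table, and the gender annotations are all hypothetical, introduced only for illustration.

```python
from typing import Dict, List, NamedTuple, Optional


class Token(NamedTuple):
    """A pre-annotated token (annotations are assumed to be given, not computed here)."""
    text: str
    is_proper_noun: bool = False
    gender: Optional[str] = None  # "m", "f", or None when unknown


# Hypothetical table of English third-person personal pronouns and their gender.
PRONOUNS = {"he": "m", "him": "m", "she": "f", "her": "f"}


def resolve_pronouns(tokens: List[Token]) -> Dict[int, int]:
    """Link each pronoun index to the nearest preceding compatible proper noun.

    A proper noun is compatible when its gender matches the pronoun's
    or is unknown. Pronouns with no compatible antecedent are left unlinked.
    """
    links: Dict[int, int] = {}
    for i, tok in enumerate(tokens):
        gender = PRONOUNS.get(tok.text.lower())
        if gender is None:
            continue  # not a pronoun covered by this sketch
        # Scan backwards from the pronoun: the most recent candidate wins.
        for j in range(i - 1, -1, -1):
            cand = tokens[j]
            if cand.is_proper_noun and cand.gender in (gender, None):
                links[i] = j
                break
    return links


if __name__ == "__main__":
    sentence = [
        Token("Maria", True, "f"), Token("met"), Token("Pedro", True, "m"),
        Token("."), Token("She"), Token("smiled"), Token("."),
    ]
    for pronoun_idx, antecedent_idx in resolve_pronouns(sentence).items():
        print(sentence[pronoun_idx].text, "->", sentence[antecedent_idx].text)
```

A graph-based version would presumably replace the backward token scan with a search over nodes of the semantic network, but the compatibility check between pronoun and proper-noun candidate would play the same role.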