107 resultados para Pronominal anaphora


Relevância:

10.00% 10.00%

Publicador:

Resumo:

Comunicación presentada en Cross-Language Evaluation Forum (CLEF 2008), Aarhus, Denmark, September 17-19, 2008.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper we address two issues. The first one analyzes whether the performance of a text summarization method depends on the topic of a document. The second one is concerned with how certain linguistic properties of a text may affect the performance of a number of automatic text summarization methods. For this we consider semantic analysis methods, such as textual entailment and anaphora resolution, and we study how they are related to proper noun, pronoun and noun ratios calculated over original documents that are grouped into related topics. Given the obtained results, we can conclude that although our first hypothesis is not supported, since it has been found no evident relationship between the topic of a document and the performance of the methods employed, adapting summarization systems to the linguistic properties of input documents benefits the process of summarization.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper reports on the further results of the ongoing research analyzing the impact of a range of commonly used statistical and semantic features in the context of extractive text summarization. The features experimented with include word frequency, inverse sentence and term frequencies, stopwords filtering, word senses, resolved anaphora and textual entailment. The obtained results demonstrate the relative importance of each feature and the limitations of the tools available. It has been shown that the inverse sentence frequency combined with the term frequency yields almost the same results as the latter combined with stopwords filtering that in its turn proved to be a highly competitive baseline. To improve the suboptimal results of anaphora resolution, the system was extended with the second anaphora resolution module. The present paper also describes the first attempts of the internal document data representation.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Decision support systems (DSS) support business or organizational decision-making activities, which require the access to information that is internally stored in databases or data warehouses, and externally in the Web accessed by Information Retrieval (IR) or Question Answering (QA) systems. Graphical interfaces to query these sources of information ease to constrain dynamically query formulation based on user selections, but they present a lack of flexibility in query formulation, since the expressivity power is reduced to the user interface design. Natural language interfaces (NLI) are expected as the optimal solution. However, especially for non-expert users, a real natural communication is the most difficult to realize effectively. In this paper, we propose an NLI that improves the interaction between the user and the DSS by means of referencing previous questions or their answers (i.e. anaphora such as the pronoun reference in “What traits are affected by them?”), or by eliding parts of the question (i.e. ellipsis such as “And to glume colour?” after the question “Tell me the QTLs related to awn colour in wheat”). Moreover, in order to overcome one of the main problems of NLIs about the difficulty to adapt an NLI to a new domain, our proposal is based on ontologies that are obtained semi-automatically from a framework that allows the integration of internal and external, structured and unstructured information. Therefore, our proposal can interface with databases, data warehouses, QA and IR systems. Because of the high NL ambiguity of the resolution process, our proposal is presented as an authoring tool that helps the user to query efficiently in natural language. Finally, our proposal is tested on a DSS case scenario about Biotechnology and Agriculture, whose knowledge base is the CEREALAB database as internal structured data, and the Web (e.g. PubMed) as external unstructured information.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Bd. 24. De elegantia Caesaris sive de commentariorum de b. G. et de b. c. differentiis animadversiones / O. Dernoscheck, 1903--Der Zug des Cimbern und Teutonen / A. Helbling, 1898--Unsere Armeesprache im Dienste der Caesar-Übersetzung / M. Hodermann, 1903--Cäsar, der Eroberer Galliens / R. Lange, 1896--Divico oder die von Caesar den Ost-Galliern und Süd-Germanen gegenüber Vertretene Politik, Lfg. I-III / H. von Müllinen, 1898-1901--Die Unterwerfung Galliens durch Cäsar verglichen mit der Bezwingung Frankreichs durch die deutsche Armee im Feldzuge 1870/71 / A. von Oertzen, 1904.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We extend Cuervo's (2003) analysis of the Lower Applicative Dative DP in Spanish to account for the animate definite DP preceded by a and the fact that it is not possible to have both an animate dative definite direct object and a dative indirect object in the same clause. We argue that the presence of such a dative DP 'blocks' the upward movement of the direct object DP to the specifier of the Lower Applicative phrase. We analyse the case ‘mismatch’ between the third person accusative clitic and the co-referring dative DP with animate definite reference in River Plate Spanish as resulting from the raising of the accusative clitic to the head of the Applicative phrase and the movement of the DP to its specifier, where dative case is always assigned in Spanish. We propose that similar phenomena observed in some Australian languages are amenable to a similar analysis.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This thesis sets out to investigate the role of cohesion in the organisation and processing of three text types in English and Arabic. In other words, it attempts to shed some light on the descriptive and explanatory power of cohesion in different text typologies. To this effect, three text types, namely, literary fictional narrative, newspaper editorial and science were analysed to ascertain the intra- and inter-sentential trends in textual cohesion characteristic of each text type in each language. In addition, two small scale experiments which aimed at exploring the facilitatory effect of one cohesive device (i.e. lexical repetition) on the comprehension of three English text types by Arab learners were carried out. The first experiment examined this effect in an English science text; the second covered three English text types, i.e. fictional narrative, culturally-oriented and science. Some interesting and significant results have emerged from the textual analysis and the pilot studies. Most importantly, each text type tends to utilize the cohesive trends that are compatible with its readership, reader knowledge, reading style and pedagogical purpose. Whereas fictional narratives largely cohere through pronominal co-reference, editorials and science texts derive much cohesion from lexical repetition. As for cross-language differences English opts for economy in the use of cohesive devices, while Arabic largely coheres through the redundant effect created by the high frequency of most of those devices. Thus, cohesion is proved to be a variable rather than a homogeneous phenomenon which is dictated by text type among other factors. The results of the experiments suggest that lexical repetition does facilitate the comprehension of English texts by Arab learners. Fictional narratives are found to be easier to process and understand than expository texts. Consequently, cohesion can assist in the processing of text as it can in its creation.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

I conducted this study to provide insights toward deepening understanding of association between culture and writing by building, assessing, and refining a conceptual model of second language writing. To do this, I examined culture and coherence as well as the relationship between them through a mixed methods research design. Coherence has been an important and complex concept in ESL/EFL writing. I intended to study the concept of coherence in the research context of contrastive rhetoric, comparing the coherence quality in argumentative essays written by undergraduates in Mainland China and their U.S. peers. In order to analyze the complex concept of coherence, I synthesized five linguistic theories of coherence: Halliday and Hasan's cohesion theory, Carroll's theory of coherence, Enkvist's theory of coherence, Topical Structure Analysis, and Toulmin's Model. Based upon the synthesis, 16 variables were generated. Across these 16 variables, Hotelling t-test statistical analysis was conducted to predict differences in argumentative coherence between essays written by two groups of participants. In order to complement the statistical analysis, I conducted 30 interviews of the writers in the studies. Participants' responses were analyzed with open and axial coding. By analyzing the empirical data, I refined the conceptual model by adding more categories and establishing associations among them. The study found that U.S. students made use of more pronominal reference. Chinese students adopted more lexical devices of reiteration and extended paralleling progression. The interview data implied that the difference may be associated with the difference in linguistic features and rhetorical conventions in Chinese and English. As far as Toulmin's Model is concerned, Chinese students scored higher on data than their U.S. peers. According to the interview data, this may be due to the fact that Toulmin's Model, modified as three elements of arguments, have been widely and long taught in Chinese writing instruction while U.S. interview participants said that they were not taught to write essays according to Toulmin's Model. Implications were generated from the process of textual data analysis and the formulation of structural model defining coherence. These implications were aimed at informing writing instruction, assessment, peer-review, and self-revision.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Based on the theoretical and methodological presuppositions of the theory of language variation and change (cf. WEINREICH; LABOV; HERZOG, 2006 [1968]), it is described and analyzed in this article the process of variation/change concerning the second person possessive pronouns in letters from readers of Brazilian newspapers from the XIX and XX centuries. These letters feature a portrait of the Brazilian press from the South (Santa Catarina), Southeast (Rio de Janeiro) and Northeast (Bahia and Rio Grande do Norte) regions in each century and are part of the Project for Brazilian Portuguese History‘s (PHPB) printed common minimal corpus. The point of departure of this work is the idea that the use of variant forms of expressing second person possessive pronouns – teu and seu – results from the interaction characterizing the varied social roles performed by the letters‘ senders. Arranging communicative units, which gather elements/features denoting time and space, conditioned and determined by socio-historical and cultural aspects, the readers‘ letters, turn out to be a promising research field under the light of this paper. More specifically, In the row of presented results in studies about the pronominal system in the diachroneity of/in Brazilian Portuguese (PB) (FARACO, 2002; LORENGIAN-PENKAL, 2007; CALLOU; LOPES, 2003; LOPES; DUARTE, 2003; MENON, 2005; ARDUIN; COELHO, 2006; LOPES, 2009; MARCOTULIO, 2010), the results featured in here point at different usages of the possessives, noticing the coexistence of the forms teu/tua and seu/sua strongly conditioned by the socio-discursive nature of the readers‘ letters in the course of the centuries and through different regions.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Based on the theoretical and methodological presuppositions of the theory of language variation and change (cf. WEINREICH; LABOV; HERZOG, 2006 [1968]), it is described and analyzed in this article the process of variation/change concerning the second person possessive pronouns in letters from readers of Brazilian newspapers from the XIX and XX centuries. These letters feature a portrait of the Brazilian press from the South (Santa Catarina), Southeast (Rio de Janeiro) and Northeast (Bahia and Rio Grande do Norte) regions in each century and are part of the Project for Brazilian Portuguese History‘s (PHPB) printed common minimal corpus. The point of departure of this work is the idea that the use of variant forms of expressing second person possessive pronouns – teu and seu – results from the interaction characterizing the varied social roles performed by the letters‘ senders. Arranging communicative units, which gather elements/features denoting time and space, conditioned and determined by socio-historical and cultural aspects, the readers‘ letters, turn out to be a promising research field under the light of this paper. More specifically, In the row of presented results in studies about the pronominal system in the diachroneity of/in Brazilian Portuguese (PB) (FARACO, 2002; LORENGIAN-PENKAL, 2007; CALLOU; LOPES, 2003; LOPES; DUARTE, 2003; MENON, 2005; ARDUIN; COELHO, 2006; LOPES, 2009; MARCOTULIO, 2010), the results featured in here point at different usages of the possessives, noticing the coexistence of the forms teu/tua and seu/sua strongly conditioned by the socio-discursive nature of the readers‘ letters in the course of the centuries and through different regions.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper is a study about the way in which se structures are represented in 20 verb entries of nine dictionaries of Spanish language. There is a large number of these structures and they are problematic for native and non native speakers. Verbs of the analysis are middle-high frequency and, in the most part of the cases, very polysemous, and this allows to observe interconnections between the different se structures and the different meanings of each verb. Data of the lexicographic analysis are cross-checked with corpus analysis of the same units. As a result, it is observed that there is a large variety in the data which are offered in each dictionary and in the way they are offered, inter and intradictionary. The reasons range from the theoretical overall of each Project to practical performance. This leads to the conclusion that it is necessary to further progress in the dictionary model it is being handled, in order to offer lexico-grammatical phenomenon such as se verbs in an accurate, clear and exhaustive way.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Esta investigación analiza el uso del sufijo diminutivo en un corpus oral de jóvenes de la República Dominicana. El material procede de la transcripción de veinte entrevistas orales realizadas en los años noventa en Santo Domingo. En este estudio se realiza un análisis de las ocurrencias documentadas, su morfología, sus preferencias en cuanto a la selección de las clases de palabras que se toman como base para la formación de diminutivos, sus posibles valores semánticos y comunicativos, y, por último, se determina la frecuencia de uso del diminutivo en función del sexo de los hablantes.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

En el marco de la escala de accesibilidad (Givenness Hierarchy), este trabajo presenta el mecanismo en chino que lleva a cabo la misma función anafórica que desempeña el artículo definido en español y analiza desde una perspectiva contrastiva las aportaciones que contribuye la anáfora nominal a la construcción del discurso. Se llega a la conclusión de que a pesar de algunas diferencias en los comportamientos concretos, en ambas lenguas la anáfora favorece a la organización del discurso manteniendo la coherencia discursiva y diversificando las expresiones.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This PhD thesis examines a phenomenon known as Monosyllabic Circumflexion (MC, hereafter) from a historical linguistics / phonological point of view. MC denotes a Lithuanian or Balto-Slavic phenomenon according to which long vowels and diphthongs in monosyllabic words exhibit a circumflex tone instead of the expected acute tone.  It is observed in the following four categories: I. 3rd person future forms of monosyllabic stems (e.g., šõks ― šókti `to jump;' vy͂s ― výti `to drive') II. reflexes of PIE root nouns (e.g., Latv. gùovs `cow;' Lith. šuõ `dog') III. prepositions/adverbs (e.g., nuõ `from' ~  nùotaka `bride;' vė͂l `again' ~ Latv. vêl `still, yet,' tė͂ (permissive particle) < *teh1) IV. pronominal forms (e.g., tuõ ~ gerúoju `the good (m.~sg.~instr.),' tie͂ ~ tíeji `id. (pl.nom)'). The unexpected circumflex tone in these categories is problematic and important for the solution of a Balto-Slavic accentological question on the etymological background of acute and non-acute tones. The aim of this thesis is to partially contribute to the solution of this problem by establishing the existence of MC and its relative chronology. The first category, the 3rd person future forms, provides a substantial number of examples and counterexamples. The examination of them has revealed the fact that the counterexamples constitute a morpho-semantic group of verbs whose future stems underwent considerable morphological changes in the prehistory, hence not exhibiting MC. This shows that the regular tonal reflex of the 3rd person future forms of monosyllabic acute stem must be circumflex, allowing for the establishment of MC as a regular phonological process, although this category does not provide much information on the relative chronology of MC. The second category, the reflexes of Proto-Indo-European root nouns, gives an important clue as to where MC is located in the relative chronology of Balto-Slavic sound changes. Next, there is a discussion of whether the results of the examinations of the first two categories can be maintained for the data of the third and fourth categories, which show an irregular distribution of the acute and circumflex tones in monosyllabic forms. It is shown that various morphological factors, such as homonymic clashes within the paradigms for pronouns, can explain why some monosyllabic forms have acute tone. Also, the linguistic feature of West Aukštaitian dialects of Lithuanian that tend to preserve the results of MC is revealed. These dialects are known to have played an important role in the formation of standard Lithuanian. In this way, the monosyllabic forms with unexpected circumflex tone in Lithuanian are explained as a combination of MC in the Proto-Balto-Slavic time and the dialectal tendency of West Aukštaitian dialects of Lithuanian.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Thesis (Master's)--University of Washington, 2016-09