960 resultados para LFG grammar and parsing
Resumo:
Dissertação apresentada para cumprimento dos requisitos necessários à obtenção do grau de Mestre em Terminologia e Gestão da Informação de Especialidade
Resumo:
These are the proceedings for the eighth national conference on XML, its Associated Technologies and its Applications (XATA'2010). The paper selection resulted in 33% of papers accepted as full papers, and 33% of papers accepted as short papers. While these two types of papers were distinguish during the conference, and they had different talk duration, they all had the same limit of 12 pages. We are happy that the selected papers focus both aspects of the conference: XML technologies, and XML applications. In the first group we can include the articles on parsing and transformation technologies, like “Processing XML: a rewriting system approach", “Visual Programming of XSLT from examples", “A Refactoring Model for XML Documents", “A Performance based Approach for Processing Large XML Files in Multicore Machines", “XML to paper publishing with manual intervention" and “Parsing XML Documents in Java using Annotations". XML-core related papers are also available, focusing XML tools testing on “Test::XML::Generator: Generating XML for Unit Testing" and “XML Archive for Testing: a benchmark for GuessXQ". XML as the base for application development is also present, being discussed on different areas, like “Web Service for Interactive Products and Orders Configuration", “XML Description for Automata Manipulations", “Integration of repositories in Moodle", “XML, Annotations and Database: a Comparative Study of Metadata Definition Strategies for Frameworks", “CardioML: Integrating Personal Cardiac Information for Ubiquous Diagnosis and Analysis", “A Semantic Representation of Users Emotions when Watching Videos" and “Integrating SVG and SMIL in DAISY DTB production to enhance the contents accessibility in the Open Library for Higher Education". The wide spread of subjects makes us believe that for the time being XML is here to stay what enhances the importance of gathering this community to discuss related science and technology. Small conferences are traversing a bad period. Authors look for impact and numbers and only submit their works to big conferences sponsored by the right institutions. However the group of people behind this conference still believes that spaces like this should be preserved and maintained. This 8th gathering marks the beginning of a new cycle. We know who we are, what is our identity and we will keep working to preserve that. We hope the publication containing the works of this year's edition will catch the same attention and interest of the previous editions and above all that this publication helps in some other's work. Finally, we would like to thank all authors for their work and interest in the conference, and to the scientific committee members for their review work.
Resumo:
The purpose of this PhD thesis is to investigate a semantic relation present in the connection of sentences (more specifically: propositional units). This relation, which we refer to as contrast, includes the traditional categories of adversatives - predominantly represented by the connector but in English and pero in Modern Spanish - and concessives, prototypically verbalised through although / aunque. The aim is to describe, analyse and - as far as possible - to explain the emergence and evolution of different syntactic schemes marking contrast during the first three centuries of Spanish (also referred to as Castilian) as a literary language, i.e., from the 13th to the 15th century. The starting point of this question is a commonplace in syntax, whereby the semantic and syntactic complexity of clause linkage correlates with the degree of textual elaboration. In historical linguistics, i.e., applied to the phylogeny of a language, it is commonly referred to as the parataxis hypothesis A crucial part of the thesis is dedicated by the definition of contrast as a semantic relation. Although the label contrast has been used in this sense, mainly in functional grammar and text linguistics, mainstream grammaticography and linguistics remain attached to the traditional categories adversatives and concessives. In opposition to this traditional view, we present our own model of contrast, based on a pragma-semantic description proposed for the analysis of adversatives by Oswald Ducrot and subsequently adopted by Ekkehard König for the analysis of concessives. We refine and further develop this model in order for it to accommodate all, not just the prototypical instances of contrast in Spanish, arguing that the relationship between adversatives and concessives is a marked opposition, i.e., that the higher degree of semantic and syntactic integration of concessives restricts some possible readings that the adversatives may have, but that this difference is almost systematically neutralised by contextual factors, thus justifying the assumption of contrast as a comprehensive onomasiological category. This theoretical focus is completed by a state-of-the-question overview attempting to account for all relevant forms in which contrast is expressed in Medieval Spanish, with the aid of lexicographic and grammaticographical sources, and an empirical study investigating the expression of corpus in a corpus study on the textual functions of contrast in nine Medieval Spanish texts: Cantar de Mio Cid, Libro de Alexandre, Milagros de Nuestra Sehora, Estoria de Espana, Primera Partida, Lapidario, Libro de buen amor, Conde Lucanor, and Corbacho. This corpus is analysed using quantitative and qualitative tools, and the study is accompanied by a series of methodological remarks on how to investigate a pragma-semantic category in historical linguistics. The corpus study shows that the parataxis hypothesis fails to prove from a statistical viewpoint, although a qualitative analysis shows that the use of subordination does increase over time in some particular contexts.
Resumo:
En väsentlig fråga inom såväl lingvistiska som kognitiva teorier är, hur språket beskriver kausala relationer. I finskan finns det en speciell typ av kausativa verb avledda med suffixet (U)ttA som används för att uttrycka att handlingen i fråga utförs av någon annan än subjektreferenten, t.ex. Maija haetuttaa Matilla kirjastosta kirjan ’Maija låter Matti hämta boken från biblioteket’ och Matti juoksuttaa Maijan kaupunkiin ’Matti låter Maija springa till staden’. Syftet med denna avhandling var att med exempel av sociala dominansens kausativer undersöka ordbildningens natur samt begreppet ’socialt förorsakande’. För att beskriva avledningars regelbundna argumentstruktur i form av kopplingen mellan syntaxen och semantiken upprättades deras prototypiska strukturer. Dessa verb har emellertid också specifika användningsområden som framhäver variationer i sociala relationer. Säregna egenskaper hos den sociala dominansens kausativer inkluderades i undersökningen och definierades som konstruktioner. Konstruktionerna omfattar speciella syntaktiska och/eller semantiska element och utöver det också pragmatiska värderande implikationer. Uppbyggnaden av den sociala dimensionen hos de undersökta verben består av egenskaper förbundna med typen av förorsakande, argumentens agentiva egenskaper (aktivitet eller passivitet, dominans, kontroll, viljestyrdhet och ansvarighet) samt konventionaliserade attityder och tolkningar. Ett exempel på en s.k. 'tolkningskonstruktion’ är den negativa dominansens uttryck som i avhandlingen kallas Maktmissbrukskonstruktionen. Denna konstruktion inkluderar talarens starkt kritiska hållning till den uttryckta situationen, t.ex. Asiakas juoksuttaa lentoemäntää ’Kunden låter flygvärdinnan springa’. Dessa konstruktioner fyller en viktig funktion i språklig kommunikation: att beskriva avvikande av sociala normer och att foga expressivitet till budskapet. Metodologiskt kombinerar denna avhandling teorier som baseras på det aktuella språkbruket och teoretisk lingvistisk analys. Verbens samt konstruktionernas konceptuella lexikala struktur och prototypstrukturerna analyserades med hjälp av den konceptuella semantikens verktyg, som har utvecklats av Jackendoff, Nikanne och Pörn.
Resumo:
Transcript (original grammar and spelling retained): My dear wife I take this time to inform you that I am well hoping that these few lines will Reach you and find you the same I shall in form you of all our Battles that we have had sence I left home we crossed in to Canada the 2 day of July and took fort Erie on the 3 day of July without loss of one man. We then marched down to Chipway eighteen miles below the Fort Erie we got there on the forth day and had our first battle on the 5 day our loss was not jistly known But the inemy loss was double to ours. The 6 day we started with the 2 Brigade to make a bridge a crost the crick two miles a bove the fort in Building the Bridge the inemy Brought up their Canon and playd upon us with their artiliery a bout two hours We drove them from the fort our loss was none the inemy loss was nineteen ciled dead on the ground we then marched to Queenston when we got thare our inemy had fledfrom the fort we then remained thair to Queenston ten days then we marched down to Fort George But that caurdly Chaney did not a rive with the fleet so we had to return back to Queenston thare was a bout six hundred militia formed on the heights of land thay fired up on us from their pickets and retreated to the mane body our flankers ciled and wounded and took about twenty before they got to the Maine body we then marched up the hill they gave us two firs but did not damage and then retreated from the field we stayed there one knight and then marched to Chipway and stayed there one night and the next day just as the sun set the first Brigade marched up in order to give them Battle a bout two miles from the Crick and began the Battle the 2 Brigade has to March up to the Niagara path and ingaged them we charged up on their artlery and took all their Canon Miller commanded the four companys that charged....the battles lasted three hours and forty minutes our loss was about 8 hundred cild and wounded our inemies loss was a bout fourteen hundred cild and wounded the next morning we Marched up in order to give them Battle a gin but thay was afraid to ingage us we then marched to Fort Erie and went to fortiffing and made a strong place the inemy folered us up and Began to cananade and held it fifty three days thay a tacked the fort the fifteenth of august thay atacked a bout one hiour be fore day Light we saw them and Blue up our maggerzean & two hundred of our inemy our loss wasa bout forty cild and wounded and our inemy loss was a bout one thousand on the 7 Day of September we atacked them and took their batteries and Broke all their canon and drove them from the field our loss was a Bout two hundred cild and wounded our inemy loss was a Bout 8 hundred cild and wounded...we crossed in to Canada with five thousand and came out with fifteen hundred we then Marched to Sackett’s harbor....am well and harty for the present....a bout comming home it uncarting for there is not any....given this winter as yet But I shall try to Come home if I Can But if I Cant I want you should take good car of the Phiddness[?] I have not Received any Money as yet But soon as I do receive some I send some home. I want you should write to me as soon as you receive this and and how Much Stock you wintor I Received your Letter with Great pleasure I feel uneasy a bout you I am a frade that you are sick or dead this is from your husband Chase Clough
Resumo:
The present study investigates the effects of child internal (age/time) and child external/environmental factors on the development of a wide range of language domains in successive bilingual (L2) Turkish-English children of homogeneously low SES. Forty-three L2 children were tested on standardized assessments examining the acquisition of vocabulary and morpho-syntax. The L2 children exhibited a differential acquisition of the various domains: they were better on the general comprehension of grammar and tense morphology and less accurate on the acquisition of vocabulary and (complex) morpho-syntax. Profile effects were confirmed by the differential effects of internal and external factors on the language domains. The development of vocabulary and complex syntax were affected by internal and external factors, whereas external factors had no contribution to the development of tense morphology. These results are discussed in light of previous studies on the impact of internal and external factors in child L2 acquisition.
Resumo:
Much recent research in SLA is guided by the hypothesis of L2 interface vulnerability (see Sorace 2005). This study contributes to this general project by examining the acquisition of two classes of subjunctive complement clauses in L2 Spanish: subjunctive complements of volitional predicates (purely syntactic) and subjunctive vs. indicative complements with negated epistemic matrix predicates, where the mood distinction is discourse dependent (thus involving the syntax-discourse interface). We provide an analysis of the volitional subjunctive in English and Spanish, suggesting that English learners of L2 Spanish need to access the functional projection Mood P and an uninterpretable modal feature on the Force head available to them from their formal English register grammar, and simultaneously must unacquire the structure of English for-to clauses. For negated epistemic predicates, our analysis maintains that they need to revalue the modal feature on the Force head from uninterpretable to interpretable, within the L2 grammar.With others (e.g. Borgonovo & Prévost 2003; Borgonovo, Bruhn de Garavito & Prévost 2005) and in line with Sorace's (2000, 2003, 2005) notion of interface vulnerability, we maintain that the latter case is more difficult for L2 learners, which is borne out in the data we present. However, the data also show that the indicative/subjunctive distinction with negated epistemics can be acquired by advanced stages of acquisition, questioning the notion of obligatory residual optionality for all properties which require the integration of syntactic and discourse information.
Resumo:
Este trabalho pretende mostrar que a Teoria da Otimidade proporciona novas formas para explicar mudanças de som que não a re-ordenação no ranqueamento deconstraints. Ele examina os aspectos diacrônicos de harmonia nasal na família Mundurukú, tronco Tupi. A comparação entre os sistemas modernos de Mundurukú e Kuruaya salienta que o sistema original, Proto-Mundurukú, tem propriedades semelhantes às atualmente observadas em Kuruaya. Em especial, os alvos do espalhamento de nasalidadeincluiamoclusivas sonoras e soantes, enquanto que as obstruintes surdas eram transparentes. Esse sistema evoluiu para outro em Pré-Munduruku, quando novos contrastes foram introduzidos na língua, transformando obstruintes em segmentos opacos e, portanto, bloqueando a nasalização. A análise, formalizada dentro da Teoria da Otimidade, demonstra que não houve uma re-ordenação dos constraints harmônicos; eles apenas se tornaram mais restritos, como mostra a cronologia relativa que deu origem ao sistema moderno de Mundurukú. Além disso, o estudo discute também as consequências dessa mudança para a gramática sincrônica, e como isso explica as irregularidades do processo.
Resumo:
Abstract Background The objective of this study was to investigate the effects of the Brazilian Scaptotrigona sp propolis, a widely used folk medicine, in corneal wound healing and inflammation. Methods Corneal epithelial defects of 1 mm in diameter were made in the right eyes of Wistar male adult rats by cauterization with silver nitrate sticks. Subsequently, they were divided in two groups (n = 40 rats/group): Brazilian propolis (BP) group was topically treated with a microemulsion containing 1% Brazilian propolis; vehicle (VH) group received the same formulation without propolis. The epithelial defect area was photographed and measured at t = 0 (wound induction), and after 12, 24, 48 and 120 h of treatment. The inflammatory response was evaluated based on counting of neutrophils. Epithelial regeneration rates were determined based on Ki-67 expression in basal epithelial cells. Comparisons were made using the Kruskal-Wallis and the Mann–Whitney U test. Results The BP group presented both smaller epithelial defect areas at 12, 24 and 48 h and fewer corneal infiltrating neutrophils at 24 and 48 h (P < 0.01) than the VH group. These effects were associated with more pervasive Ki-67 staining in the BP group at 12 and 24 h (P < 0.05). Conclusions Topically applied BP accelerated wound healing and reduced the inflammatory response to silver nitrate-induced corneal alkali burns in rats.
Resumo:
[EN]The use of large corpora in the study of languages is a well established tradition. In the same vein, scholarship is also well represented in the case of the study of corpora for making grammars of languages. This is the case of the COBUILD grammar and dictionary and the case of the Longman Grammar of Spoken and Written English. This means that corpora have been analyzed in order to identify patterns in languages that can be later practised by learners following those patterns described and exemplified with real instances.
Resumo:
From the moment of their birth, a person's life is determined by their sex. Ms. Goroshko wants to know why this difference is so striking, why society is so concerned to sustain it, and how it is able to persist even when certain national or behavioural stereotypes are erased between people. She is convinced of the existence of not only social, but biological differences between men and women, and set herself the task, in a manuscript totalling 126 pages, written in Ukrainian and including extensive illustrations, of analysing these distinctions as they are manifested in language. She points out that, even before 1900, certain stylistic differences between the ways that men and women speak had been noted. Since then it has become possible, for instance in the case of Japanese, to point to examples of male and female sub-languages. In general, one can single out the following characteristics. Males tend to write with less fluency, to refer to events in a verb-phrase, to be time-oriented, to involve themselves more in their references to events, to locate events in their personal sphere of activity, and to refer less to others. Therefore, concludes Ms Goroshko, the male is shown to be more active, more ego-involved in what he does, and less concerned about others. Women, in contrast, were more fluent, referred to events in a noun-phrase, were less time-oriented, tended to be less involved in their event-references, locate events within their interactive community and refer more to others. They spent much more time discussing personal and domestic subjects, relationship problems, family, health and reproductive matters, weight, food and clothing, men, and other women. As regards discourse strategies, Ms Goroshko notes the following. Men more often begin a conversation, they make more utterances, these utterances are longer, they make more assertions, speak less carefully, generally determine the topic of conversation, speak more impersonally, use more vulgar expressions, and use fewer diminutives and more imperatives. Women's speech strategies, apart from being the opposite of those enumerated above, also contain more euphemisms, polite forms, apologies, laughter and crying. All of the above leads Ms. Goroshko to conclude that the differences between male and female speech forms are more striking than the similarities. Furthermore she is convinced that the biological divergence between the sexes is what generates the verbal divergence, and that social factors can only intensify or diminish the differentiation in verbal behaviour established by the sex of a person. Bearing all this in mind, Ms Goroshko set out to construct a grammar of male and female styles of speaking within Russian. One of her most important research tools was a certain type of free association test. She took a list comprising twelve stimuli (to love, to have, to speak, to fuck, a man, a woman, a child, the sky, a prayer, green, beautiful) and gave it to a group of participants specially selected, according to a preliminary psychological testing, for the high levels of masculinity or femininity they displayed. Preliminary responses revealed that the female reactions were more diverse than the male ones, there were more sentences and word combinations in the female reactions, men gave more negative responses to the stimulus and sometimes didn't want to react at all, women reacted more to adjectives and men to nouns, and that, surprisingly, women coloured more negatively their reactions to the words man, to love and a child (Ms. Goroshko is inclined to attribute this to the present economic situation in Russia). Another test performed by Ms. Goroshko was the so-called "defective text" developed by A.A. Brudny. All participants were distributed with packets of complete sentences, which had been taken from a text and then mixed at random. The task was to reconstruct the original text. There were three types of test, the first descriptive, the second narrative, and the third logical. Ms. Goroshko created computer programmes to analyse the results. She found that none of the reconstructed texts was coincident with the original, differing both from the original text and amongst themselves and that there were many more disparities in the male than the female texts. In the descriptive and logical texts the differences manifested themselves more clearly in the male texts, and in the narrative texts in the female texts. The widest dispersal of values was observed at the outset, while the female text ending was practically coincident with the original (in contrast to the male ending). The greatest differences in text reconstruction for both males and females were registered in the middle of the texts. Women, Ms. Goroshko claims, were more sensitive to the semantic structure of the texts, since they assembled the narrative text much more accurately than the other two, while the men assembled more accurately the logical text. Texts written by women were assembled more accurately by women and texts by men by men. On the basis of computer analysis, Ms. Goroshko found that female speech was substantially more emotional. It was expressed by various means, hyperbole, metaphor, comparisons, epithets, ways of enumeration, and with the aid of interjections, rhetorical questions, exclamations. The level of literacy was higher for female speech, and there were fewer mistakes in grammar and spelling in female texts. The last stage of Ms Goroshko's research concerned the social stereotypes of beliefs about men and women in Russian society today. A large number of respondents were asked questions such as "What merits must a woman possess?", "What are male vices and virtues?", etc. After statistical manipulation, an image of modern man and woman, as it exists in the minds of modern Russian men and women, emerged. Ms. Goroshko believes that her findings are significant not only within the field of linguistics. She has already successfully worked on anonymous texts and been able to decide on the sex of the author and consequently believes that in the future her research may even be of benefit to forensic science.
Resumo:
Mr. Kubon's project was inspired by the growing need for an automatic, syntactic analyser (parser) of Czech, which could be used in the syntactic processing of large amounts of texts. Mr. Kubon notes that such a tool would be very useful, especially in the field of corpus linguistics, where creating a large-scale "tree bank" (a collection of syntactic representations of natural language sentences) is a very important step towards the investigation of the properties of a given language. The work involved in syntactically parsing a whole corpus in order to get a representative set of syntactic structures would be almost inconceivable without the help of some kind of robust (semi)automatic parser. The need for the automatic natural language parser to be robust increases with the size of the linguistic data in the corpus or in any other kind of text which is going to be parsed. Practical experience shows that apart from syntactically correct sentences, there are many sentences which contain a "real" grammatical error. These sentences may be corrected in small-scale texts, but not generally in the whole corpus. In order to be able to complete the overall project, it was necessary to address a number of smaller problems. These were; 1. the adaptation of a suitable formalism able to describe the formal grammar of the system; 2. the definition of the structure of the system's dictionary containing all relevant lexico-syntactic information, and the development of a formal grammar able to robustly parse Czech sentences from the test suite; 3. filling the syntactic dictionary with sample data allowing the system to be tested and debugged during its development (about 1000 words); 4. the development of a set of sample sentences containing a reasonable amount of grammatical and ungrammatical phenomena covering some of the most typical syntactic constructions being used in Czech. Number 3, building a formal grammar, was the main task of the project. The grammar is of course far from complete (Mr. Kubon notes that it is debatable whether any formal grammar describing a natural language may ever be complete), but it covers the most frequent syntactic phenomena, allowing for the representation of a syntactic structure of simple clauses and also the structure of certain types of complex sentences. The stress was not so much on building a wide coverage grammar, but on the description and demonstration of a method. This method uses a similar approach as that of grammar-based grammar checking. The problem of reconstructing the "correct" form of the syntactic representation of a sentence is closely related to the problem of localisation and identification of syntactic errors. Without a precise knowledge of the nature and location of syntactic errors it is not possible to build a reliable estimation of a "correct" syntactic tree. The incremental way of building the grammar used in this project is also an important methodological issue. Experience from previous projects showed that building a grammar by creating a huge block of metarules is more complicated than the incremental method, which begins with the metarules covering most common syntactic phenomena first, and adds less important ones later, especially from the point of view of testing and debugging the grammar. The sample of the syntactic dictionary containing lexico-syntactical information (task 4) now has slightly more than 1000 lexical items representing all classes of words. During the creation of the dictionary it turned out that the task of assigning complete and correct lexico-syntactic information to verbs is a very complicated and time-consuming process which would itself be worth a separate project. The final task undertaken in this project was the development of a method allowing effective testing and debugging of the grammar during the process of its development. The problem of the consistency of new and modified rules of the formal grammar with the rules already existing is one of the crucial problems of every project aiming at the development of a large-scale formal grammar of a natural language. This method allows for the detection of any discrepancy or inconsistency of the grammar with respect to a test-bed of sentences containing all syntactic phenomena covered by the grammar. This is not only the first robust parser of Czech, but also one of the first robust parsers of a Slavic language. Since Slavic languages display a wide range of common features, it is reasonable to claim that this system may serve as a pattern for similar systems in other languages. To transfer the system into any other language it is only necessary to revise the grammar and to change the data contained in the dictionary (but not necessarily the structure of primary lexico-syntactic information). The formalism and methods used in this project can be used in other Slavic languages without substantial changes.
Resumo:
This introductory chapter briefly introduces a few milestones in the voluminous previous literature on semantic roles, and charts the territory in which the papers of this volume aim to make a contribution. This territory is characterized by fairly disparate conceptualizations of semantic roles and their status in theories of grammar and the lexicon, as well as by diverse and probably complementary ways of deriving or identifying them based on linguistic data. Particular attention is given to the question of how selected roles appear to relate to each other, and we preliminarily address the issue of how roles, subroles, and role complexes are best thought of in general.