943 resultados para Textual complexity for Romanian language
Resumo:
We define a multi-modal version of Computation Tree Logic (ctl) by extending the language with path quantifiers E and A where d denotes one of finitely many dimensions, interpreted over Kripke structures with one total relation for each dimension. As expected, the logic is axiomatised by taking a copy of a ctl axiomatisation for each dimension. Completeness is proved by employing the completeness result for ctl to obtain a model along each dimension in turn. We also show that the logic is decidable and that its satisfiability problem is no harder than the corresponding problem for ctl. We then demonstrate how Normative Systems can be conceived as a natural interpretation of such a multi-dimensional ctl logic. © 2009 Springer Science+Business Media B.V.
Resumo:
Decision making is an important element throughout the life-cycle of large-scale projects. Decisions are critical as they have a direct impact upon the success/outcome of a project and are affected by many factors including the certainty and precision of information. In this paper we present an evidential reasoning framework which applies Dempster-Shafer Theory and its variant Dezert-Smarandache Theory to aid decision makers in making decisions where the knowledge available may be imprecise, conflicting and uncertain. This conceptual framework is novel as natural language based information extraction techniques are utilized in the extraction and estimation of beliefs from diverse textual information sources, rather than assuming these estimations as already given. Furthermore we describe an algorithm to define a set of maximal consistent subsets before fusion occurs in the reasoning framework. This is important as inconsistencies between subsets may produce results which are incorrect/adverse in the decision making process. The proposed framework can be applied to problems involving material selection and a Use Case based in the Engineering domain is presented to illustrate the approach. © 2013 Elsevier B.V. All rights reserved.
Resumo:
This chapter examines the ramifications of continental travel and associated epistolary communication for English poets of the period. It argues that recourse to neo-Latin, the universal language of diplomacy, served not only to establish a sense of shared space—linguistic, cultural, generic—between England and the continent, but also to signal self-conscious differences (climatic, geographical, historical, political) between England and her continental peers. Through an investigation of a range of ‘performances’ on stages that were ‘academic’, poetic, autobiographical, and epistolographic, it assesses the central role of neo-Latin as a language that underwent a series of textual itineraries. These ‘itineraries’ manifest themselves in a number of ways. Neo-Latin as a shared linguistic medium can facilitate, and quite uniquely so, intertextual engagement with the classics, but now ancient Rome, its language, its mythology, its hierarchy of genres, are viewed through a seventeenth-century lens and appropriated by poets in both England and Italy to describe contemporary events, whether personal, or political. Close examination of the neo-Latin poetry of Milton and Marvell reveals, it is argued, a self-fashioning coloured by such textual itineraries and interchanges. The absorption and replication of continental literary and linguistic methodologies (the academic debate; the etymological play of Marinism; the hybridity of neo-Latin and Italian voices) reveal in short a linguistic and textual reciprocity that gave birth to something very new.
Resumo:
The past decade had witnessed an unprecedented growth in the amount of available digital content, and its volume is expected to continue to grow the next few years. Unstructured text data generated from web and enterprise sources form a large fraction of such content. Many of these contain large volumes of reusable data such as solutions to frequently occurring problems, and general know-how that may be reused in appropriate contexts. In this work, we address issues around leveraging unstructured text data from sources as diverse as the web and the enterprise within the Case-based Reasoning framework. Case-based Reasoning (CBR) provides a framework and methodology for systematic reuse of historical knowledge that is available in the form of problemsolution
pairs, in solving new problems. Here, we consider possibilities of enhancing Textual CBR systems under three main themes: procurement, maintenance and retrieval. We adapt and build upon the stateof-the-art techniques from data mining and natural language processing in addressing various challenges therein. Under procurement, we investigate the problem of extracting cases (i.e., problem-solution pairs) from data sources such as incident/experience
reports. We develop case-base maintenance methods specifically tuned to text targeted towards retaining solutions such that the utility of the filtered case base in solving new problems is maximized. Further, we address the problem of query suggestions for textual case-bases and show that exploiting the problem-solution partition can enhance retrieval effectiveness by prioritizing more useful query suggestions. Additionally, we illustrate interpretable clustering as a tool to drill-down to domain specific text collections (since CBR systems are usually very domain specific) and develop techniques for improved similarity assessment in social media sources such as microblogs. Through extensive empirical evaluations, we illustrate the improvements that we are able to
achieve over the state-of-the-art methods for the respective tasks.
Resumo:
Language experience clearly affects the perception of speech, but little is known about whether these differences in perception extend to non-speech sounds. In this study, we investigated rhythmic perception of non-linguistic sounds in speakers of French and German using a grouping task, in which complexity (variability in sounds, presence of pauses) was manipulated. In this task, participants grouped sequences of auditory chimeras formed from musical instruments. These chimeras mimic the complexity of speech without being speech. We found that, while showing the same overall grouping preferences, the German speakers showed stronger biases than the French speakers in grouping complex sequences. Sound variability reduced all participants' biases, resulting in the French group showing no grouping preference for the most variable sequences, though this reduction was attenuated by musical experience. In sum, this study demonstrates that linguistic experience, musical experience, and complexity affect rhythmic grouping of non-linguistic sounds and suggests that experience with acoustic cues in a meaningful context (language or music) is necessary for developing a robust grouping preference that survives acoustic variability.
Resumo:
Tese de doutoramento, Linguística (Linguística Aplicada), Universidade de Lisboa, Faculdade de Letras, 2015
Resumo:
The long term goal of this research is to develop a program able to produce an automatic segmentation and categorization of textual sequences into discourse types. In this preliminary contribution, we present the construction of an algorithm which takes a segmented text as input and attempts to produce a categorization of sequences, such as narrative, argumentative, descriptive and so on. Also, this work aims at investigating a possible convergence between the typological approach developed in particular in the field of text and discourse analysis in French by Adam (2008) and Bronckart (1997) and unsupervised statistical learning.
Resumo:
This study was undertaken to investigate any textual differences and similarities within essays written with a word processing program and an e-mail editor by non-native writers. It arose from many contradictions and a paucity of empirical research within the field of second language learning and electronic technology. To further explore these contradictory observations, 3 classes of intermediate level ESL (English as a Second Language) students v^ote 6 essays, alternating between a word processing program and an e-mail editor. Prior to the data collection, students read brief texts and responded to questions that focused upon three formal topics: immigration, economics, and multiculturalism. Data were examined for (a) the differences in the frequency counts of 12 cohesive devices, (b) sentence complexity, which focused upon the occurrences of simple and complex sentences, (c) the number of words within the writings, (d) the method of contextualization preferred by writers, and (e) any variations in the final grades of the students' texts that resulted from holistic rating. Results of analysis indicated that there were no statistically significant differences in the frequency counts of the linguistic features. Sentence complexity did not vary within the off-line and on-line essays. The average number of words found within the off-line essays was approximately 20% greater than within on-line essays. Contextualization methods were not different within word-processed or e-mailed essays. Finally, there was no difference in the quality of the texts when holistically rated.
Resumo:
The purpose of this study was to examine the disability discourses present in Ontario elementary schools curriculum. The study used a critical social analysis perspective to employ a textual discourse analysis on the Planning [title of subject] Programs for Students with Special Education Needs (PPSSEN) section of the curriculum. The present study utilized Parker's (1992) seven criteria for distinguishing discourses and discovered five main discourses; Independent, dependent, legal, scientific and agency discourses. The second step to this research was the placement and discussion of these five discourses on three diverse texts, Paulo Freire's (2008) Pedagogy o/ the Oppressed, Psychiatry Inside Out, Selected writings of Franco Basaglia, written by Scheper-Huges and Lovell (1987) and Aronowitz and Giroux's (1985) Education Under Siege: The Conservative, Liberal and Radical Debate over Schooling. These unique perspectives were used as methods of analysis tools to further analyze the dominate disability discourses. The texts provided textual support in three major areas; dialectics, critical education and structural conditions of power and language of traditional roles and responsibilities. The findings and discussions presented in this project contain significant implications for anyone involved with students with disabilities in any education system.
Resumo:
Les systèmes statistiques de traduction automatique ont pour tâche la traduction d’une langue source vers une langue cible. Dans la plupart des systèmes de traduction de référence, l'unité de base considérée dans l'analyse textuelle est la forme telle qu’observée dans un texte. Une telle conception permet d’obtenir une bonne performance quand il s'agit de traduire entre deux langues morphologiquement pauvres. Toutefois, ceci n'est plus vrai lorsqu’il s’agit de traduire vers une langue morphologiquement riche (ou complexe). Le but de notre travail est de développer un système statistique de traduction automatique comme solution pour relever les défis soulevés par la complexité morphologique. Dans ce mémoire, nous examinons, dans un premier temps, un certain nombre de méthodes considérées comme des extensions aux systèmes de traduction traditionnels et nous évaluons leurs performances. Cette évaluation est faite par rapport aux systèmes à l’état de l’art (système de référence) et ceci dans des tâches de traduction anglais-inuktitut et anglais-finnois. Nous développons ensuite un nouvel algorithme de segmentation qui prend en compte les informations provenant de la paire de langues objet de la traduction. Cet algorithme de segmentation est ensuite intégré dans le modèle de traduction à base d’unités lexicales « Phrase-Based Models » pour former notre système de traduction à base de séquences de segments. Enfin, nous combinons le système obtenu avec des algorithmes de post-traitement pour obtenir un système de traduction complet. Les résultats des expériences réalisées dans ce mémoire montrent que le système de traduction à base de séquences de segments proposé permet d’obtenir des améliorations significatives au niveau de la qualité de la traduction en terme de le métrique d’évaluation BLEU (Papineni et al., 2002) et qui sert à évaluer. Plus particulièrement, notre approche de segmentation réussie à améliorer légèrement la qualité de la traduction par rapport au système de référence et une amélioration significative de la qualité de la traduction est observée par rapport aux techniques de prétraitement de base (baseline).
Resumo:
Analysis by reduction is a method used in linguistics for checking the correctness of sentences of natural languages. This method is modelled by restarting automata. All types of restarting automata considered in the literature up to now accept at least the deterministic context-free languages. Here we introduce and study a new type of restarting automaton, the so-called t-RL-automaton, which is an RL-automaton that is rather restricted in that it has a window of size one only, and that it works under a minimal acceptance condition. On the other hand, it is allowed to perform up to t rewrite (that is, delete) steps per cycle. Here we study the gap-complexity of these automata. The membership problem for a language that is accepted by a t-RL-automaton with a bounded number of gaps can be solved in polynomial time. On the other hand, t-RL-automata with an unbounded number of gaps accept NP-complete languages.
Resumo:
Fine-grained parallel machines have the potential for very high speed computation. To program massively-concurrent MIMD machines, programmers need tools for managing complexity. These tools should not restrict program concurrency. Concurrent Aggregates (CA) provides multiple-access data abstraction tools, Aggregates, which can be used to implement abstractions with virtually unlimited potential for concurrency. Such tools allow programmers to modularize programs without reducing concurrency. I describe the design, motivation, implementation and evaluation of Concurrent Aggregates. CA has been used to construct a number of application programs. Multi-access data abstractions are found to be useful in constructing highly concurrent programs.
Resumo:
The central thesis of this report is that human language is NP-complete. That is, the process of comprehending and producing utterances is bounded above by the class NP, and below by NP-hardness. This constructive complexity thesis has two empirical consequences. The first is to predict that a linguistic theory outside NP is unnaturally powerful. The second is to predict that a linguistic theory easier than NP-hard is descriptively inadequate. To prove the lower bound, I show that the following three subproblems of language comprehension are all NP-hard: decide whether a given sound is possible sound of a given language; disambiguate a sequence of words; and compute the antecedents of pronouns. The proofs are based directly on the empirical facts of the language user's knowledge, under an appropriate idealization. Therefore, they are invariant across linguistic theories. (For this reason, no knowledge of linguistic theory is needed to understand the proofs, only knowledge of English.) To illustrate the usefulness of the upper bound, I show that two widely-accepted analyses of the language user's knowledge (of syntactic ellipsis and phonological dependencies) lead to complexity outside of NP (PSPACE-hard and Undecidable, respectively). Next, guided by the complexity proofs, I construct alternate linguisitic analyses that are strictly superior on descriptive grounds, as well as being less complex computationally (in NP). The report also presents a new framework for linguistic theorizing, that resolves important puzzles in generative linguistics, and guides the mathematical investigation of human language.
Resumo:
Abarcar la enseñanza de la redacción en inglés como segunda lengua para fines académicos y profesionales en la universidad española. En primer lugar, se establece un marco teórico para la pedagogía de la redacción a base del entendimiento del texto escrito como nexo en una red compleja de relaciones sociales y negociaciones culturales. Luego se lleva a cabo un estudio de la práctica de la redacción en el contexto de la universidad española, con un análisis a fondo de los escritores y sus actitudes y expectativas, por un lado, y sus textos (un ensayo y un informe), por otro. Se analizan los textos usando técnicas cualitativas y cuantitativas. A partir de este estudio inicial, se diseña un proyecto de investigación-acción, en el que dos grupos paralelos de alumnos siguen dos programas diferentes en que se plasman dos aproximaciones distintas a la pedagogía de la redacción: el análisis textual, siguiendo la tradición del inglés para fines específicos y la escuela del género, y el análisis contextual, influenciado por los planteamientos y los procedimientos de la nueva retórica. Los textos resultantes son analizados mediante unas escalas detalladas de evaluación desarrolladas a base de los resultados del primer estudio. Los resultados de los dos programas son positivos, aunque el grupo de análisis contextual demuestra una mejora superior. Para concluir, se esboza una serie de principios que deberán servir de guía para el diseño de los futuros programas de redacción para universitarios españoles.
Resumo:
The thesis L’ús dels clítics pronominals del català i la seva adquisició per parlants de romanès i de tagal [The use of pronominal clitics in Catalan and their acquisition by Romanian and Tagalog speakers] analyzes the mechanisms of transfer from the L1 in the process of acquisition of Catalan (L2) in two groups of learners, one of which has Romanian and the other Tagalog as their native language. Our study lends support to the idea of transfer from the L1 to a second language in general, and, in particular, within the process of acquisition of pronominal clitics from a Romance language (Catalan). The results show that the differences between the two groups are statistically significant and are attributable to the characteristics of the L1. Moreover, starting from a detailed description of the grammar of pronominal clitics in the three languages involved, we define the specific grammatical aspects of the Tagalog and Romanian languages that can have an influence on certain productions and on certain errors in the use of pronominal clitics in Catalan, within the process of acquisition of this Romance language as L2. In the theoretical domain, we started from studies on functional markedness to determine four reference terms that allowed us to carry out a systematized study of the difficulties in acquisition of the use of Catalan clitic pronouns according to their complexity and their degree of grammaticalization.