873 resultados para scholarly text editing
Resumo:
[ES] Este Trabajo de Fin de Grado es un servicio basado en tecnologías web. El objetivo principal es ofrecer un servicio de creación y gestión de actas para el Ayuntamiento de Las Palmas de Gran Canaria. Para ello, consta de dos módulos principales, uno para “crear actas” y otro para “editar actas”. También se ha desarrollado otro módulo llamado plantillas donde se genera un PDF a partir de una plantilla preestablecida. Esta aplicación ha sido dividida en diferentes partes. La primera parte consistió en generar todas las configuraciones de base de datos necesarias para el funcionamiento de la aplicación. Después generamos todos los ficheros HTML y las interconexiones entre ellos. Finalmente, dotamos a esos HTML estáticos de un estilo mucho más claro y organizado, dando a la aplicación una apariencia mucho más bonita. Una vez finalizada la parte frontal de la aplicación, empezamos a implementar la lógica detrás de la aplicación. Los módulos de “crear” y “editar” se hicieron utilizando formularios HTML y combinando la información obtenida de esos formularios con unas plantillas HTML generadas por nosotros. Toda esa información obtenida de los formularios se guarda en unos ficheros .txt para poder ser utilizados por el módulo editar. El módulo de plantillas nos muestra un editor HTML rellenado con una plantilla que ha sido previamente seleccionada por el usuario. Los ficheros pdf de este módulo no pueden editados con posterioridad por lo que no se generan ficheros .txt. Por último, hay dos módulos que nos permiten ver todas las actas generadas por la aplicación. El primero de los dos módulos es el módulo de búsqueda, que nos permite buscar una palabra clave dentro de todos los ficheros pdf. El otro módulo nos muestra todas las actas que han sido marcadas como “cerradas”. Esta aplicación ha sido diseñada de forma modular, de manera que podemos añadir o quitar módulos de manera sencilla.
Resumo:
The need for a convergence between semi-structured data management and Information Retrieval techniques is manifest to the scientific community. In order to fulfil this growing request, W3C has recently proposed XQuery Full Text, an IR-oriented extension of XQuery. However, the issue of query optimization requires the study of important properties like query equivalence and containment; to this aim, a formal representation of document and queries is needed. The goal of this thesis is to establish such formal background. We define a data model for XML documents and propose an algebra able to represent most of XQuery Full-Text expressions. We show how an XQuery Full-Text expression can be translated into an algebraic expression and how an algebraic expression can be optimized.
Resumo:
The Ph chromosome is the most frequent cytogenetic aberration associated with adult ALL and it represents the single most significant adverse prognostic marker. Despite imatinib has led to significant improvements in the treatment of patients with Ph+ ALL, in the majority of cases resistance developed quickly and disease progressed. Some mechanisms of resistance have been widely described but the full knowledge of contributing factors, driving both the disease and resistance, remains to be defined. The observation of rapid development of lymphoblastic leukemia in mice expressing altered Ikaros (Ik) isoforms represented the background of this study. Ikaros is a zinc finger transcription factor required for normal hemopoietic differentiation and proliferation, particularly in the lymphoid lineages. By means of alternative splicing, Ikaros encodes several proteins that differ in their abilities to bind to a consensus DNA-binding site. Shorter, DNA nonbinding isoforms exert a dominant negative effect, inhibiting the ability of longer heterodimer partners to bind DNA. The differential expression pattern of Ik isoforms in Ph+ ALL patients was analyzed in order to determine if molecular abnormalities involving the Ik gene could associate with resistance to imatinib and dasatinib. Bone marrow and peripheral blood samples from 46 adult patients (median age 55 yrs, 18-76) with Ph+ ALL at diagnosis and during treatment with imatinib (16 pts) or dasatinib (30 pts) were collected. We set up a fast, high-throughput method based on capillary electrophoresis technology to detect and quantify splice variants. 41% Ph+ ALL patients expressed high levels of the non DNA-binding dominant negative Ik6 isoform lacking critical N-terminal zinc-fingers which display abnormal subcellular compartmentalization pattern. Nuclear extracts from patients expressed Ik6 failed to bind DNA in mobility shift assay using a DNA probe containing an Ikaros-specific DNA binding sequence. In 59% Ph+ ALL patients there was the coexistence in the same PCR sample and at the same time of many splice variants corresponded to Ik1, Ik2, Ik4, Ik4A, Ik5A, Ik6, Ik6 and Ik8 isoforms. In these patients aberrant full-length Ikaros isoforms in Ph+ ALL characterized by a 60-bp insertion immediately downstream of exon 3 and a recurring 30-bp in-frame deletion at the end of exon 7 involving most frequently the Ik2, Ik4 isoforms were also identified. Both the insertion and deletion were due to the selection of alternative splice donor and acceptor sites. The molecular monitoring of minimal residual disease showed for the first time in vivo that the Ik6 expression strongly correlated with the BCR-ABL transcript levels suggesting that this alteration could depend on the Bcr-Abl activity. Patient-derived leukaemia cells expressed dominant-negative Ik6 at diagnosis and at the time of relapse, but never during remission. In order to mechanistically demonstrated whether in vitro the overexpression of Ik6 impairs the response to tyrosine kinase inhibitors (TKIs) and contributes to resistance, an imatinib-sensitive Ik6-negative Ph+ ALL cell line (SUP-B15) was transfected with the complete Ik6 DNA coding sequence. The expression of Ik6 strongly increased proliferation and inhibited apoptosis in TKI sensitive cells establishing a previously unknown link between specific molecular defects that involve the Ikaros gene and the resistance to TKIs in Ph+ ALL patients. Amplification and genomic sequence analysis of the exon splice junction regions showed the presence of 2 single nucleotide polymorphisms (SNPs): rs10251980 [A/G] in the exon2/3 splice junction and of rs10262731 [A/G] in the exon 7/8 splice junction in 50% and 36% of patients, respectively. A variant of the rs11329346 [-/C], in 16% of patients was also found. Other two different single nucleotide substitutions not recognized as SNP were observed. Some mutations were predicted by computational analyses (RESCUE approach) to alter cis-splicing elements. In conclusion, these findings demonstrated that the post-transcriptional regulation of alternative splicing of Ikaros gene is defective in the majority of Ph+ ALL patients treated with TKIs. The overexpression of Ik6 blocking B-cell differentiation could contribute to resistance opening a time frame, during which leukaemia cells acquire secondary transforming events that confer definitive resistance to imatinib and dasatinib.
Resumo:
This study aims to the elaboration of juridical and administrative terminology in Ladin language, actually on the Ladin idiom spoken in Val Badia. The necessity of this study is strictly connected to the fact that in South Tyrol the Ladin language is not just safeguarded, but the editing of administrative and normative text is guaranteed by law. This means that there is a need for a unique terminology in order to support translators and editors of specialised texts. The starting point of this study are, on one side the need of a unique terminology, and on the other side the translation work done till now from the employees of the public administration in Ladin language. In order to document their efforts a corpus made up of digitalized administrative and normative documents was build. The first two chapters focuses on the state of the art of projects on terminology and corpus linguistics for lesser used languages. The information were collected thanks to the help of institutes, universities and researchers dealing with lesser used languages. The third chapter focuses on the development of administrative language in Ladin language and the fourth chapter focuses on the creation of the trilingual Italian – German – Ladin corpus made up of administrative and normative documents. The last chapter deals with the methodologies applied in order to elaborate the terminology entries in Ladin language though the use of the trilingual corpus. Starting from the terminology entry all steps are described, from term extraction, to the extraction of equivalents, contexts and definitions and of course also of the elaboration of translation proposals for not found equivalences. Finally the problems referring to the elaboration of terminology in Ladin language are illustrated.
Resumo:
The purpose of this thesis is to enhance the functionalities of GAFFE, a flexible, interactive and user-friendly application for editing metadata in office documents by supporting different ontologies stored inside and outside of the digital document, by adding new views and forms and by improving its ease of use.
Resumo:
Ontology design and population -core aspects of semantic technologies- re- cently have become fields of great interest due to the increasing need of domain-specific knowledge bases that can boost the use of Semantic Web. For building such knowledge resources, the state of the art tools for ontology design require a lot of human work. Producing meaningful schemas and populating them with domain-specific data is in fact a very difficult and time-consuming task. Even more if the task consists in modelling knowledge at a web scale. The primary aim of this work is to investigate a novel and flexible method- ology for automatically learning ontology from textual data, lightening the human workload required for conceptualizing domain-specific knowledge and populating an extracted schema with real data, speeding up the whole ontology production process. Here computational linguistics plays a fundamental role, from automati- cally identifying facts from natural language and extracting frame of relations among recognized entities, to producing linked data with which extending existing knowledge bases or creating new ones. In the state of the art, automatic ontology learning systems are mainly based on plain-pipelined linguistics classifiers performing tasks such as Named Entity recognition, Entity resolution, Taxonomy and Relation extraction [11]. These approaches present some weaknesses, specially in capturing struc- tures through which the meaning of complex concepts is expressed [24]. Humans, in fact, tend to organize knowledge in well-defined patterns, which include participant entities and meaningful relations linking entities with each other. In literature, these structures have been called Semantic Frames by Fill- 6 Introduction more [20], or more recently as Knowledge Patterns [23]. Some NLP studies has recently shown the possibility of performing more accurate deep parsing with the ability of logically understanding the structure of discourse [7]. In this work, some of these technologies have been investigated and em- ployed to produce accurate ontology schemas. The long-term goal is to collect large amounts of semantically structured information from the web of crowds, through an automated process, in order to identify and investigate the cognitive patterns used by human to organize their knowledge.