4 resultados para Document Model
em AMS Tesi di Dottorato - Alm@DL - Università di Bologna
Resumo:
This thesis proposes a new document model, according to which any document can be segmented in some independent components and transformed in a pattern-based projection, that only uses a very small set of objects and composition rules. The point is that such a normalized document expresses the same fundamental information of the original one, in a simple, clear and unambiguous way. The central part of my work consists of discussing that model, investigating how a digital document can be segmented, and how a segmented version can be used to implement advanced tools of conversion. I present seven patterns which are versatile enough to capture the most relevant documents’ structures, and whose minimality and rigour make that implementation possible. The abstract model is then instantiated into an actual markup language, called IML. IML is a general and extensible language, which basically adopts an XHTML syntax, able to capture a posteriori the only content of a digital document. It is compared with other languages and proposals, in order to clarify its role and objectives. Finally, I present some systems built upon these ideas. These applications are evaluated in terms of users’ advantages, workflow improvements and impact over the overall quality of the output. In particular, they cover heterogeneous content management processes: from web editing to collaboration (IsaWiki and WikiFactory), from e-learning (IsaLearning) to professional printing (IsaPress).
Resumo:
The need for a convergence between semi-structured data management and Information Retrieval techniques is manifest to the scientific community. In order to fulfil this growing request, W3C has recently proposed XQuery Full Text, an IR-oriented extension of XQuery. However, the issue of query optimization requires the study of important properties like query equivalence and containment; to this aim, a formal representation of document and queries is needed. The goal of this thesis is to establish such formal background. We define a data model for XML documents and propose an algebra able to represent most of XQuery Full-Text expressions. We show how an XQuery Full-Text expression can be translated into an algebraic expression and how an algebraic expression can be optimized.
Resumo:
Since the publication of the book of Russell and Burch in 1959, scientific research has never stopped improving itself with regard to the important issue of animal experimentation. The European Directive 2010/63/EU “On the protection of animals used for scientific purposes” focuses mainly on the animal welfare, fixing the Russell and Burch’s 3Rs principles as the foundations of the document. In particular, the legislator clearly states the responsibility of the scientific community to improve the number of alternative methods to animal experimentation. The swine is considered a species of relevant interest for translational research and medicine due to its biological similarities with humans. The surgical community has, in fact, recognized the swine as an excellent model replicating the human cardiovascular system. There have been several wild-type and transgenic porcine models which were produced for biomedicine and translational research. Among these, the cardiovascular ones are the most represented. The continuous involvement of the porcine animal model in the biomedical research, as the continuous advances achieved using swine in translational medicine, support the need for alternative methods to animal experimentation involving pigs. The main purpose of the present work was to develop and characterize novel porcine alternative methods for cardiovascular translational biology/medicine. The work was mainly based on two different models: the first consisted in an ex vivo culture of porcine aortic cylinders and the second consisted in an in vitro culture of porcine aortic derived progenitor cells. Both the models were properly characterized and results indicated that they could be useful to the study of vascular biology. Nevertheless, both the models aim to reduce the use of experimental animals and to refine animal based-trials. In conclusion, the present research aims to be a small, but significant, contribution to the important and necessary field of study of alternative methods to animal experimentation.
Resumo:
This thesis aims at investigating a new approach to document analysis based on the idea of structural patterns in XML vocabularies. My work is founded on the belief that authors do naturally converge to a reasonable use of markup languages and that extreme, yet valid instances are rare and limited. Actual documents, therefore, may be used to derive classes of elements (patterns) persisting across documents and distilling the conceptualization of the documents and their components, and may give ground for automatic tools and services that rely on no background information (such as schemas) at all. The central part of my work consists in introducing from the ground up a formal theory of eight structural patterns (with three sub-patterns) that are able to express the logical organization of any XML document, and verifying their identifiability in a number of different vocabularies. This model is characterized by and validated against three main dimensions: terseness (i.e. the ability to represent the structure of a document with a small number of objects and composition rules), coverage (i.e. the ability to capture any possible situation in any document) and expressiveness (i.e. the ability to make explicit the semantics of structures, relations and dependencies). An algorithm for the automatic recognition of structural patterns is then presented, together with an evaluation of the results of a test performed on a set of more than 1100 documents from eight very different vocabularies. This language-independent analysis confirms the ability of patterns to capture and summarize the guidelines used by the authors in their everyday practice. Finally, I present some systems that work directly on the pattern-based representation of documents. The ability of these tools to cover very different situations and contexts confirms the effectiveness of the model.