Biblioteca Digital

877 results for annotation sémantique

Video metadata extraction in a videoMail system

Relevance:

10.00%

Abstract:

The world is adapting swiftly to visual communication. Online services such as YouTube and Vine show that video is no longer the domain of broadcast television alone. Video is used for purposes such as entertainment, information, education and communication. The rapid growth of today's video archives, which carry only sparse editorial data, makes retrieval a serious problem. Humans perceive a video as a complex interplay of cognitive concepts, so a bridge is needed between numeric values and semantic concepts; this connection makes it easier for humans to retrieve videos. The critical element of this bridge is video annotation, which can be done manually or automatically. Manual annotation is tedious, subjective and expensive, so automatic annotation is being actively studied. This thesis focuses on the automatic annotation of multimedia content, namely the use of analysis techniques for information retrieval to extract metadata automatically from video in a videomail system. This includes the identification of text, people, actions, spaces and objects, including animals and plants. It then becomes possible to align multimedia content with the text of the email message and to create applications for semantic video database indexing and retrieval.

Genome-scale metabolic network reconstruction of Polaromonas sp. strain JS666: analysis of cDCE degradation rates and design of experiments for bioremediation improvement

Relevance:

10.00%

Abstract:

Release of chloroethene compounds into the environment often results in groundwater contamination, which puts people at risk of exposure through drinking contaminated water. Accumulation of cDCE (cis-1,2-dichloroethene) in subsurface environments is a common environmental problem caused by stagnation and partial degradation of precursor chloroethene species. Polaromonas sp. strain JS666 apparently requires no exotic growth factors to be used as a bioaugmentation agent for aerobic cDCE degradation. Although it is the only suitable microorganism found so far with this capability, further studies are needed to improve the intrinsic bioremediation rates and to fully understand the metabolic processes involved. To this end, a metabolic model, iJS666, was reconstructed from genome annotation and available bibliographic data. FVA (Flux Variability Analysis) and FBA (Flux Balance Analysis) techniques were used to validate the predictive capabilities of the iJS666 model satisfactorily. The iJS666 model predicted biomass growth under different previously tested conditions, supported the design of key experiments for further model improvement, and produced viable predictions for the use of biostimulant metabolites in cDCE biodegradation.

Modeling microbes: New methods for integrated metabolic and regulatory network reconstruction

Relevance:

10.00%

Abstract:

PhD Thesis in Bioengineering

Integrating data from heterogeneous DNA microarray platforms

Relevance:

10.00%

Abstract:

DNA microarrays are among the most widely used technologies for gene expression measurement. However, there are several distinct microarray platforms from different manufacturers, each with its own measurement protocol, resulting in data that can hardly be compared or directly integrated. Integrating data from multiple sources aims to improve the reliability of statistical tests by reducing the data dimensionality problem. The integration of heterogeneous DNA microarray platforms comprises a set of tasks that range from the re-annotation of the features used in gene expression to data normalization and batch effect elimination. In this work, a complete methodology for gene expression data integration and application is proposed, which comprises a transcript-based re-annotation process and several methods for batch effect attenuation. The integrated data will be used to select the best feature set and learning algorithm for a brain tumor classification case study. The integration will consider data from heterogeneous Agilent and Affymetrix platforms, collected from public gene expression databases such as The Cancer Genome Atlas and Gene Expression Omnibus.
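
As an illustration of what batch effect attenuation can mean in its simplest form, the sketch below centres one gene's expression values on the mean of each batch (here, each platform). This is only a minimal example under stated assumptions: the abstract does not say which attenuation methods were used, and the function name `center_by_batch` and the sample values are hypothetical.

```python
from statistics import mean


def center_by_batch(values, batches):
    """Subtract each batch's mean from the values measured in that batch.

    values  -- expression values for one gene, one per sample
    batches -- batch label (e.g. platform name) for each sample
    """
    # Group the values by batch label.
    by_batch = {}
    for value, batch in zip(values, batches):
        by_batch.setdefault(batch, []).append(value)

    # Compute one mean per batch, then centre every sample on its batch mean.
    batch_means = {batch: mean(vals) for batch, vals in by_batch.items()}
    return [value - batch_means[batch] for value, batch in zip(values, batches)]


# Hypothetical values: the second platform reads systematically higher.
expr = [5.0, 7.0, 9.0, 11.0]
platform = ["agilent", "agilent", "affymetrix", "affymetrix"]
centered = center_by_batch(expr, platform)
print(centered)
```

After centring, the systematic offset between the two platforms is gone while the within-platform differences are preserved. Real pipelines usually also adjust variance and protect biological covariates, which this sketch does not attempt.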

Tapping the wealth of microbial data in high-throughput metabolic model reconstruction

Relevance:

10.00%

Abstract:

Genome-scale metabolic models are valuable tools in the metabolic engineering process, based on the ability of these models to integrate diverse sources of data to produce global predictions of organism behavior. At the most basic level, these models require only a genome sequence to construct, and once built, they may be used to predict essential genes, culture conditions, pathway utilization, and the modifications required to enhance a desired organism behavior. In this chapter, we address two key challenges associated with the reconstruction of metabolic models: (a) leveraging existing knowledge of microbiology, biochemistry, and available omics data to produce the best possible model; and (b) applying available tools and data to automate the reconstruction process. We consider these challenges as we progress through the model reconstruction process, beginning with genome assembly, and culminating in the integration of constraints to capture the impact of transcriptional regulation. We divide the reconstruction process into ten distinct steps: (1) genome assembly from sequenced reads; (2) automated structural and functional annotation; (3) phylogenetic tree-based curation of genome annotations; (4) assembly and standardization of biochemistry database; (5) genome-scale metabolic reconstruction; (6) generation of core metabolic model; (7) generation of biomass composition reaction; (8) completion of draft metabolic model; (9) curation of metabolic model; and (10) integration of regulatory constraints. Each of these ten steps is documented in detail.

Personalisierung digitaler Dokumente

Relevance:

10.00%

Abstract:

Personalization, interaction, annotation, reading, handwriting

Embedding metadata in computer graphics for interaction

Relevance:

10.00%

Abstract:

Illustration Watermarks, Image annotation, Virtual data exploration, Interaction techniques

Minimal-invasive provenance integration into data-intensive systems

Relevance:

10.00%

Abstract:

Dissertation, University of Magdeburg, Faculty of Computer Science, 2014

Behind the Scenes of an Ant Genome Project

Relevance:

10.00%

Abstract:

Dramatic improvements in DNA sequencing technologies have led to a more than 1,000-fold reduction in sequencing costs over the past five years. Genome-wide research approaches can thus now be applied beyond medically relevant questions to examine the molecular-genetic basis of behavior, development and unique life histories in almost any organism. A first step for an emerging model organism is usually establishing a reference genome sequence. I offer insight gained from the fire ant genome project. First, I detail how the project came to be and how sequencing, assembly and annotation strategies were chosen. Subsequently, I describe some of the issues linked to working with data from recently sequenced genomes. Finally, I discuss an approach undertaken in a follow-up project based on the fire ant genome sequence.

Sistemas de detección y extracción semiautomática de siglas: estado de la cuestión

Relevance:

10.00%

Abstract:

Research report produced during a stay at the Équipe de Recherche en Syntaxe et Sémantique of the Université de Toulouse-Le Mirail, France, between July and September 2006. Several online acronym dictionaries currently exist. The most prominent are Acronym Finder, Abbreviations.com and Acronyma, all devoted mainly to English acronyms. Like paper dictionaries, these resources fall out of date because of the large number of acronyms coined every day. For example, a 2001 study by Pustejovsky et al. showed that about 12,000 new acronyms appeared in Medline abstracts every month. These resources are updated through user submissions of new acronyms. However, this technique has the drawback that editing the information is very slow and costly. One example is Abbreviations.com, which in October 2006 had around 100,000 acronyms awaiting editing and final incorporation. As a solution to this kind of problem, the design of systems for the automatic detection and extraction of acronyms from corpora is proposed. The detection process involves two steps: first, identifying the acronyms within a corpus, and second, disambiguation, that is, selecting the appropriate expanded form of an acronym in a given context. Current acronym detection systems use methods based on patterns, statistics, machine learning, or combinations of these. This study analyses the main acronym detection and disambiguation systems and the methods they use. 
Each is evaluated in terms of performance, measured as precision (the percentage of correct acronyms relative to the total number of acronyms extracted by the system) and recall (the percentage of correct acronyms identified by the system relative to the total number of acronyms in the corpus). As a result, criteria are presented for the design of a future acronym detection system for Spanish.
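
The two evaluation measures defined in this abstract, precision and recall (exhaustividad), can be computed as in the short sketch below. It assumes that the extracted acronyms and the gold-standard acronyms of the corpus are available as plain sets of strings; the function name `precision_recall` and the sample acronyms are hypothetical and do not come from the report.

```python
def precision_recall(extracted, gold):
    """Return (precision, recall) for a set of extracted acronyms.

    precision -- correct acronyms / all acronyms extracted by the system
    recall    -- correct acronyms / all acronyms present in the corpus
    """
    extracted = set(extracted)
    gold = set(gold)
    correct = extracted & gold

    # Guard against empty sets to avoid division by zero.
    precision = len(correct) / len(extracted) if extracted else 0.0
    recall = len(correct) / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical example: 4 acronyms extracted, 3 of them correct,
# out of 5 acronyms actually present in the corpus.
extracted = {"ONU", "OTAN", "UE", "XYZ"}
gold = {"ONU", "OTAN", "UE", "FMI", "OMS"}
p, r = precision_recall(extracted, gold)
print(f"precision = {p:.2f}, recall = {r:.2f}")
```

In this example three of the four extracted acronyms are correct, and three of the five acronyms in the corpus are found. The abstract states both measures as percentages, so multiply by 100 to match its convention.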

Expression analysis of the AtPHO1 gene family in Arabidopsis thaliana and the involvement of AtPHO1 in the regulation of stomatal movements

Relevance:

10.00%

Abstract:

Abstract (translated from the French): Phosphate is transferred from the roots to the leaves through the xylem. The AtPHO1 protein was previously shown to be indispensable for the transfer of phosphate into the root xylem vessels of the model plant Arabidopsis thaliana. Sequencing and annotation of the Arabidopsis genome identified ten sequences with a significant level of similarity to the AtPHO1 gene, which constitute a new gene family called the AtPHO1 family. Based on a molecular and genetic study, this thesis provides elements toward determining the role, unknown to date, of the AtPHO1 family members in Arabidopsis. First, a bioinformatic analysis of the protein sequences of the AtPHO1 family members revealed, in their N-terminal region, a domain named SPX. This domain is conserved among many proteins involved in phosphate homeostasis in yeast, which strengthens the hypothesis that the AtPHO1 family members, like AtPHO1, play a role in phosphate balance in the plant. In parallel, the tissue localization of AtPHO gene expression in Arabidopsis was identified by analysing transgenic plants expressing the uidA reporter gene under the control of the respective AtPHO promoters. An expression profile of each AtPHO gene during plant development was obtained. Expression was predominant in the vascular tissues of roots, leaves, stems and flowers, suggesting that the AtPHO genes could have redundant functions in phosphate transfer within the vascular cylinder of these organs. However, several AtPHO promoter regions also drive a non-vascular GUS expression profile, indicating a putative role for the AtPHO genes in phosphate acquisition or recycling in the plant. Second, analysis of AtPHO gene expression during phosphate deficiency established that only the expression of AtPHO1, AtPHO1;H1 and AtPHO1;H10 is regulated by this deficiency. A detailed study of their expression in response to treatments affecting phosphate homeostasis in the plant then demonstrated that they are regulated by different signalling pathways. Next, a detailed analysis of the regulation of AtPHO1;H10 expression in wounded or dehydrated Arabidopsis leaves revealed that this gene is the first marker gene of a new signalling pathway that is induced by OPDA, not by JA, and depends on the COI1 protein. These results demonstrate for the first time that OPDA and JA can activate different genes via COI1-dependent signalling pathways. Finally, this thesis identifies a new role for the AtPHO1 protein in regulating ABA action during stomatal closure and seed germination in Arabidopsis. Although the exact functions of the AtPHO proteins remain to be determined, this thesis work suggests that they are involved in propagating different signals in the plant by modulating membrane potential and/or affecting the ion composition of cells, as many ion transporters or regulators of ion transport do.

Summary: Phosphate is transferred from the roots to the shoot via the xylem. The requirement for the AtPHO1 protein to transfer phosphate to the xylem vessels of the root has been previously demonstrated in Arabidopsis thaliana. 
The sequencing and annotation of the Arabidopsis genome allowed the identification of ten sequences that show a significant level of similarity with the AtPHO1 gene. These 10 genes, of unknown function, constitute a new gene family called the AtPHO1 gene family. Based on a molecular and genetic study, this thesis provides some of the information needed to understand the role of the AtPHO1 family members in the plant Arabidopsis. First, a bioinformatics study revealed that the AtPHO sequences contain, in the N-terminal hydrophilic region, a motif called SPX that is conserved among multiple proteins involved in phosphate homeostasis in yeast. This finding reinforces the hypothesis that all AtPHO1 family members have, like AtPHO1, a role in phosphate homeostasis. In parallel, we identified the pattern of expression of AtPHO genes in Arabidopsis via analysis of transgenic plants expressing the uidA reporter gene under the control of the respective AtPHO promoter regions. The results show a predominant expression of AtPHO genes in vascular tissues of all organs of the plant, implying that these AtPHO genes could have redundant functions in the transfer of phosphate to the vascular cylinder of various organs. GUS expression for several AtPHO promoter regions was also detected in non-vascular tissue, indicating a broad role of AtPHO genes in the acquisition or recycling of phosphate in the plant. In a second step, the analysis of the expression of AtPHO genes during phosphate starvation established that only the expression of the AtPHO1, AtPHO1;H1 and AtPHO1;H10 genes was regulated by Pi starvation. Interestingly, different signalling pathways appeared to regulate these three genes during various treatments affecting Pi homeostasis in the plant. The third chapter presents a detailed analysis of the signalling pathways regulating the expression of the AtPHO1;H10 gene in Arabidopsis leaves during wounding and dehydration stress. 
Surprisingly, the expression of AtPHO1;H10 was found to be regulated by OPDA (the precursor of JA) but not by JA itself, and via the COI1 protein (the central regulator of the JA signalling pathway). These results demonstrated for the first time that OPDA and JA could activate distinct genes via COI1-dependent pathways. Finally, this thesis presents the identification of a novel role of the AtPHO1 protein in the regulation of ABA action in Arabidopsis guard cells and during seed germination. Although the exact role and function of AtPHO1 still need to be determined, these last findings suggest that AtPHO1, and by extension other AtPHO proteins, could mediate the propagation of various signals in the plant by modulating the membrane potential and/or by affecting cellular ion composition, as is the case for many ion transporters or regulators of ion transport.

Structured RNAs in the ENCODE selected regions of the human genome.

Relevance:

10.00%

Abstract:

Functional RNA structures play an important role both in the context of noncoding RNA transcripts and as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack characteristic signals in primary sequence, comparative approaches evaluating evolutionary conservation of structures are most promising. We have used three recently introduced programs based on either phylogenetic-stochastic context-free grammar (EvoFold) or energy directed folding (RNAz and AlifoldZ), yielding several thousand candidate structures (corresponding to approximately 2.7% of the ENCODE regions). EvoFold has its highest sensitivity in highly conserved and relatively AU-rich regions, while RNAz favors slightly GC-rich regions, resulting in a relatively small overlap between methods. Comparison with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3'-UTRs. While we estimate a significant false discovery rate of approximately 50%-70%, many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz and EvoFold, and an additional 239 RNAz or EvoFold predictions are supported by the (more stringent) AlifoldZ algorithm. Five hundred seventy RNAz structure predictions fall into regions that show signs of selection pressure also on the sequence level (i.e., conserved elements). More than 700 predictions overlap with noncoding transcripts detected by oligonucleotide tiling arrays. One hundred seventy-five selected candidates were tested by RT-PCR in six tissues, and expression could be verified in 43 cases (24.6%).

Plataforma de adquisición de imágenes en escenarios virtuales para uso en sistemas de visión

Relevance:

10.00%

Abstract:

In this project we present a method for generating pedestrian image databases, required for training or validating example-based learning systems, in a virtual environment. A platform has been developed that simulates camera navigation through a virtual scene and retrieves the video stream together with its ground truth. Using this platform removes the annotation process, which is needed to obtain ground truth in real environments, and reduces costs by working in a virtual environment.

Luc-Actes entre Jérusalem et Rome. Un procédé lucanien de double signification

Relevance:

10.00%

Abstract:

The author of Luke-Acts uses a rhetorical device of double meaning, which orients the reading both toward a culture steeped in the LXX and toward a Greco-Roman conceptual framework. Examples of this semantic ambivalence: Lk 23.47; Acts 4.32-4; 17.16-34; 18.13; 27.1-28, 10, etc. This device reflects a desire to establish Christian identity between Jerusalem and Rome, presenting Christianity both as the continuation of a history of salvation and as a response to the Greco-Roman religious quest. The theological programme that thus emerges brings Luke close to Flavius Josephus.

Annotating the human genome

Relevance:

10.00%

Abstract:

Abstract (translated from the French): Sequencing the human genome is a fundamental prerequisite for understanding human biology. With that project complete, scientists faced an equally important task: understanding the sequence of 3 billion letters that makes up our genome. The ENCODE (ENCyclopedia Of DNA Elements) consortium was formed as a logical follow-up to the human genome project. Its role is to identify all the functional elements of our genome, including transcribed regions, transcription factor binding sites, DNase I hypersensitive sites and histone modification marks. During my doctoral thesis I took part in two ENCODE sub-projects. First, I developed and optimized a high-throughput experimental validation technique for gene models, which allowed me to estimate the quality of the most recent manual annotation. This new validation process is far more effective than the RNA-seq technique that is currently becoming the norm. This RT-PCR-based technique notably allowed me to discover new exons in 10% of the regions interrogated. Second, I took part in a study aiming to identify the ends of all genes on human chromosomes 21 and 22. This study enabled the large-scale identification of chimeric transcripts comprising sequences from two distinct genes that can lie far apart from each other.

In English: The completion of the human genome sequence is the prerequisite to fully understanding the biology of human beings. With this project achieved, scientists had to face another challenging task: understanding the meaning of the 3 billion letters composing this genome. As a logical continuation of the human genome project, the ENCODE (ENCyclopedia Of DNA Elements) consortium was formed with the aim of annotating all its functional elements. 
These elements include transcribed regions, transcription factor binding sites, DNase I hypersensitive sites and histone modification marks. Within my PhD thesis, I was involved in two sub-projects of ENCODE. First, I developed and optimized a high-throughput method to validate gene models, which allowed me to assess the quality of the most recent manually curated annotation. This novel experimental validation pipeline is extremely effective, far more so than transcriptome profiling through RNA sequencing, which is becoming the norm. This targeted RT-PCR-seq approach is likewise particularly efficient at identifying novel exons, as we discovered unannotated exons in about 10% of loci. Second, I participated in a study aiming to identify the boundaries of all genes on human chromosomes 21 and 22. This study led to the identification of chimeric transcripts that are composed of sequences coming from two distinct genes that can map far away from each other.
