697 resultados para Annotation informatisée
Resumo:
International audience
Dinoflagellate Genomic Organization and Phylogenetic Marker Discovery Utilizing Deep Sequencing Data
Resumo:
Dinoflagellates possess large genomes in which most genes are present in many copies. This has made studies of their genomic organization and phylogenetics challenging. Recent advances in sequencing technology have made deep sequencing of dinoflagellate transcriptomes feasible. This dissertation investigates the genomic organization of dinoflagellates to better understand the challenges of assembling dinoflagellate transcriptomic and genomic data from short read sequencing methods, and develops new techniques that utilize deep sequencing data to identify orthologous genes across a diverse set of taxa. To better understand the genomic organization of dinoflagellates, a genomic cosmid clone of the tandemly repeated gene Alchohol Dehydrogenase (AHD) was sequenced and analyzed. The organization of this clone was found to be counter to prevailing hypotheses of genomic organization in dinoflagellates. Further, a new non-canonical splicing motif was described that could greatly improve the automated modeling and annotation of genomic data. A custom phylogenetic marker discovery pipeline, incorporating methods that leverage the statistical power of large data sets was written. A case study on Stramenopiles was undertaken to test the utility in resolving relationships between known groups as well as the phylogenetic affinity of seven unknown taxa. The pipeline generated a set of 373 genes useful as phylogenetic markers that successfully resolved relationships among the major groups of Stramenopiles, and placed all unknown taxa on the tree with strong bootstrap support. This pipeline was then used to discover 668 genes useful as phylogenetic markers in dinoflagellates. Phylogenetic analysis of 58 dinoflagellates, using this set of markers, produced a phylogeny with good support of all branches. The Suessiales were found to be sister to the Peridinales. The Prorocentrales formed a monophyletic group with the Dinophysiales that was sister to the Gonyaulacales. The Gymnodinales was found to be paraphyletic, forming three monophyletic groups. While this pipeline was used to find phylogenetic markers, it will likely also be useful for finding orthologs of interest for other purposes, for the discovery of horizontally transferred genes, and for the separation of sequences in metagenomic data sets.
Resumo:
[EU]Testu bat koherente egiten duten arrazoiak ulertzea oso baliagarria da testuaren beraren ulermenerako, koherentzia eta koherentzia-erlazioak testu bat edo gehiago koherente diren ondorioztatzen laguntzen baitigu. Lan honetan gai bera duten testu ezberdinen arteko koherentziazko 3 Cross Document Structure Theory edo CST (Radev, 2000) erlazio aztertu eta sailkatu dira. Hori egin ahal izateko, euskaraz idatziriko gai berari buruzko testuak segmentatzeko eta beraien arteko erlazioak etiketatzeko gidalerroak proposatzen dira. 10 testuz osaturiko corpusa etiketatu da; horietako 3 cluster bi etiketatzailek aztertu dute. Etiketatzaileen arteko adostasunaren berri ematen dugu. Koherentzia-erlazioak garatzea oso garrantzitsua da Hizkuntzaren Prozesamenduko hainbat sistementzat, hala nola, informazioa erauzteko sistementzat, itzulpen automatikoarentzat, galde-erantzun sistementzat eta laburpen automatikoarentzat. Etorkizunean CSTko erlazio guztiak corpus esanguratsuan aztertuko balira, testuen arteko koherentzia- erlazioak euskarazko testuen prozesaketa automatikoa bideratzeko lehenengo pausua litzateke hemen egindakoa.
Resumo:
Tese (doutorado)—Universidade de Brasília, Instituto de Ciências Biológicas, Programa de Pós-Graduação em Biologia Molecular, 2016.
Resumo:
Background: Statistical analysis of DNA microarray data provides a valuable diagnostic tool for the investigation of genetic components of diseases. To take advantage of the multitude of available data sets and analysis methods, it is desirable to combine both different algorithms and data from different studies. Applying ensemble learning, consensus clustering and cross-study normalization methods for this purpose in an almost fully automated process and linking different analysis modules together under a single interface would simplify many microarray analysis tasks. Results: We present ArrayMining.net, a web-application for microarray analysis that provides easy access to a wide choice of feature selection, clustering, prediction, gene set analysis and cross-study normalization methods. In contrast to other microarray-related web-tools, multiple algorithms and data sets for an analysis task can be combined using ensemble feature selection, ensemble prediction, consensus clustering and cross-platform data integration. By interlinking different analysis tools in a modular fashion, new exploratory routes become available, e.g. ensemble sample classification using features obtained from a gene set analysis and data from multiple studies. The analysis is further simplified by automatic parameter selection mechanisms and linkage to web tools and databases for functional annotation and literature mining. Conclusion: ArrayMining.net is a free web-application for microarray analysis combining a broad choice of algorithms based on ensemble and consensus methods, using automatic parameter selection and integration with annotation databases.
Resumo:
Background The Grooved Carpet shell clam Ruditapes decussatus is the autochthonous European clam and the most appreciated from a gastronomic and economic point of view. The production is in decline due to several factors such as Perkinsiosis and habitat invasion and competition by the introduced exotic species, the manila clam Ruditapes philippinarum. After we sequenced R. decussatus transcriptome we have designed an oligo microarray capable of contributing to provide some clues on molecular response of the clam to Perkinsiosis. Results A database consisting of 41,119 unique transcripts was constructed, of which 12,479 (30.3%) were annotated by similarity. An oligo-DNA microarray platform was then designed and applied to profile gene expression in R. decussatus heavily infected by Perkinsus olseni. Functional annotation of differentially expressed genes between those two conditionswas performed by gene set enrichment analysis. As expected, microarrays unveil genes related with stress/infectious agents such as hydrolases, proteases and others. The extensive role of innate immune system was also analyzed and effect of parasitosis upon expression of important molecules such as lectins reviewed. Conclusions This study represents a first attempt to characterize Ruditapes decussatus transcriptome, an important marine resource for the European aquaculture. The trancriptome sequencing and consequent annotation will increase the available tools and resources for this specie, introducing the possibility of high throughput experiments such as microarrays analysis. In this specific case microarray approach was used to unveil some important aspects of host-parasite interaction between the Carpet shell clam and Perkinsus, two non-model species, highlighting some genes associated with this interaction. Ample information was obtained to identify biological processes significantly enriched among differentially expressed genes in Perkinsus infected versus non-infected gills. An overview on the genes related with the immune system on R. decussatus transcriptome is also reported.
Resumo:
Dissertação de Mestrado, Processamento de Linguagem Natural e Indústrias da Língua, Faculdade de Ciências Humanas e Sociais, Universidade do Algarve, 2014
Resumo:
In this article we describe a semantic localization dataset for indoor environments named ViDRILO. The dataset provides five sequences of frames acquired with a mobile robot in two similar office buildings under different lighting conditions. Each frame consists of a point cloud representation of the scene and a perspective image. The frames in the dataset are annotated with the semantic category of the scene, but also with the presence or absence of a list of predefined objects appearing in the scene. In addition to the frames and annotations, the dataset is distributed with a set of tools for its use in both place classification and object recognition tasks. The large number of labeled frames in conjunction with the annotation scheme make this dataset different from existing ones. The ViDRILO dataset is released for use as a benchmark for different problems such as multimodal place classification and object recognition, 3D reconstruction or point cloud data compression.
Resumo:
Background: Copy number variations (CNVs) have been shown to account for substantial portions of observed genomic variation and have been associated with qualitative and quantitative traits and the onset of disease in a number of species. Information from high-resolution studies to detect, characterize and estimate population-specific variant frequencies will facilitate the incorporation of CNVs in genomic studies to identify genes affecting traits of importance. Results: Genome-wide CNVs were detected in high-density single nucleotide polymorphism (SNP) genotyping data from 1,717 Nelore (Bos indicus) cattle, and in NGS data from eight key ancestral bulls. A total of 68,007 and 12,786 distinct CNVs were observed, respectively. Cross-comparisons of results obtained for the eight resequenced animals revealed that 92 % of the CNVs were observed in both datasets, while 62 % of all detected CNVs were observed to overlap with previously validated cattle copy number variant regions (CNVRs). Observed CNVs were used for obtaining breed-specific CNV frequencies and identification of CNVRs, which were subsequently used for gene annotation. A total of 688 of the detected CNVRs were observed to overlap with 286 non-redundant QTLs associated with important production traits in cattle. All of 34 CNVs previously reported to be associated with milk production traits in Holsteins were also observed in Nelore cattle. Comparisons of estimated frequencies of these CNVs in the two breeds revealed 14, 13, 6 and 14 regions in high (>20 %), low (<20 %) and divergent (NEL > HOL, NEL < HOL) frequencies, respectively. Conclusions: Obtained results significantly enriched the bovine CNV map and enabled the identification of variants that are potentially associated with traits under selection in Nelore cattle, particularly in genome regions harboring QTLs affecting production traits.
Resumo:
Pour respecter les droits d’auteur, la version électronique de ce mémoire a été dépouillée d'un document visuel. La version intégrale du mémoire a été déposée au Service de la gestion des documents et des archives de l'Université de Montréal.
Resumo:
Pour respecter les droits d’auteur, la version électronique de ce mémoire a été dépouillée d'un document visuel. La version intégrale du mémoire a été déposée au Service de la gestion des documents et des archives de l'Université de Montréal.
Resumo:
In this seminar, I will share my experience in the early process of becoming an entrepreneur from a research background. Since 2008, I have been working with Prof. Mike Wald on an innovative video annotation tool called Synote. After about eight years of research around Synote, I have applied for the Royal Acadamy of Engineering Enterprise Fellowship in order to focus on developing Synote for real clients and making Synote sustainable and profitable. Now, it is already eight months into the fellowship, which has totally changed my life. It is very exciting, but at the same time I'm struggling all the time. The seminar will briefly go through my experience so far on the way of commercializing Synote from a research background. I will also discuss the valuable resources you can get from RAEng Enterprise Hub and Future Worlds, which is a Southampton based organization to help startups. If you are a Ph.D. student or research fellow in the University, and you want to start your own business, this is the seminar you want to attend.
Resumo:
Resumo: 1 – Sumário do Acórdão do Supremo Tribunal de Justiça, de 19 de Abril de 2012; 2 – Texto completo do Acórdão do Supremo Tribunal de Justiça, de 19 de Abril de 2012: cfr. http://www.dgsi.pt/jstj.nsf/954f0ce6ad9dd8b980256b5f003fa814/fc664c231f3e73cf802579ea003d91d2?OpenDocument&Highlight=0,polui%C3%A7%C3%A3o , 2 de Junho de 2012; 3 – Anotação sintética; 3.1 – Introdução à anotação sintética e suas características neste caso concreto; 4 – Algumas referências constitucionais centrais em relação a Direitos humanos e, nomeadamente, a um Direito humano a um meio-ambiente sadio, saudável em todas as suas vertentes e sentidos – o exemplo central do artigo 9.º da CRP; 4.1 – Algumas referências constitucionais centrais em relação a Direitos humanos e, nomeadamente, a um Direito humano a um meio-ambiente sadio, saudável em todas as suas vertentes e sentidos – o exemplo central do artigo 66.º da CRP e o Regime Geral do Ruído; 5 – O direito humano ao descanso e à saúde, rectius o direito ao ambiente sadio vs o direito ao lazer e/ou exploração económica de indústrias de diversão, rectius o direito à liberdade de iniciativa económica privada; 6 – A violação do direito humano, de personalidade, ao descanso e à saúde, rectius o direito a um ambiente sadio, numa perspectiva de Direito privado e Direito civil; 7 – A criminalização da poluição, designadamente a criminalização da poluição sonora – uma perspectiva de Direito público e Direito penal; 8 - A necessidade duma adequada política tributária que compatibilize desenvolvimento sustentado com a protecção dum meio ambiente sadio e com qualidade de vida; 9 – Conclusões. Palavras-chave: Direitos Humanos; Direito constitucional; Direito público; Direito penal; Direito privado; Direito civil; Direito ambiental; meio ambiente sadio; Direito ao descanso; Direito à saúde; Direito ao lazer e/ou exploração económica de indústrias de diversão; direito à liberdade de iniciativa económica privada; Direito tributário; Direito fiscal; Direito aduaneiro. Abstract: 1 - Summary of the Judgment of the Supreme Court of April 19, 2012, 2 - Complete text of the Judgment of the Supreme Court of April 19, 2012: cf. http://www.dgsi.pt/jstj.nsf/954f0ce6ad9dd8b980256b5f003fa814/fc664c231f3e73cf802579ea003d91d2?OpenDocument&Highlight=0,polui%C3%A7%C3%A3o , June 2, 2012, 3 - Synthetic Note: 3.1 - Introduction to synthetic annotation and its characteristics in this case 4 - Some references constitutional power over human rights and in particular to a human right to a healthy environment, healthy in all its forms and meanings - the central example of Article 9. of CRP; 4.1 - Some references constitutional power over human rights and in particular to a human right to a healthy environment, healthy in all its forms and meanings - the central example of Article 66. No of CRP and the General Noise; 5 - the human right to rest and health, rectius the right to healthy environment vs. the right to leisure and / or economic exploitation of industries fun, rectius the right to freedom of private economic initiative; 6 - the violation of human personality, to rest and health, rectius the right to a healthy environment, a perspective of private law and civil law; 7 - criminalization of pollution, including the criminalization of noise - a perspective of public law and criminal law; 8 - the need for appropriate tax policy that reconciles sustainable development with the protection of a healthy environment and quality of life; 9 - Conclusions.
Resumo:
This article presents the results of a systematic critical review of interdisciplinary literature concerned with digital text (or e-text) uses in education and proposes recommendations for how e-texts can be implemented for impactful learning. A variety of e-texts can be found in the repertoire of educational resources accessible to students, and in the constantly changing terrain of educational technologies, they are rapidly evolving, presenting new opportunities and affordances for student learning. We highlight some of the ways in which academic studies have examined e-texts as part of teaching and learning practices, placing a particular emphasis on aspects of learning such as recall, comprehension, retention of information and feedback. We also review diverse practices associated with uses of e-text tools such as note-taking, annotation, bookmarking, hypertexts and highlighting. We argue that evidence-based studies into e-texts are overwhelmingly structured around reinforcing the existing dichotomy pitting print-based (‘traditional’) texts against e-texts. In this article, we query this approach and instead propose to focus on factors such as students’ level of awareness of their options in accessing learning materials and whether they are instructed and trained in how to take full advantage of the capabilities of e-texts, both of which have been found to affect learning performance.
Resumo:
This research project is based on the Multimodal Corpus of Chinese Court Interpreting (MUCCCI [mutʃɪ]), a small-scale multimodal corpus on the basis of eight authentic court hearings with Chinese-English interpreting in Mainland China. The corpus has approximately 92,500 word tokens in total. Besides the transcription of linguistic and para-linguistic features, utilizing the facial expression classification rules suggested by Black and Yacoob (1995), MUCCCI also includes approximately 1,200 annotations of facial expressions linked to the six basic types of human emotions, namely, anger, disgust, happiness, surprise, sadness, and fear (Black & Yacoob, 1995). This thesis is an example of conducting qualitative analysis on interpreter-mediated courtroom interactions through a multimodal corpus. In particular, miscommunication events (MEs) and the reasons behind them were investigated in detail. During the analysis, although queries were conducted based on non-verbal annotations when searching for MEs, both verbal and non-verbal features were considered indispensable parts contributing to the entire context. This thesis also includes a detailed description of the compilation process of MUCCCI utilizing ELAN, from data collection to transcription, POS tagging and non-verbal annotation. The research aims at assessing the possibility and feasibility of conducting qualitative analysis through a multimodal corpus of court interpreting. The concept of integrating both verbal and non-verbal features to contribute to the entire context is emphasized. The qualitative analysis focusing on MEs can provide an inspiration for improving court interpreters’ performances. All the constraints and difficulties presented can be regarded as a reference for similar research in the future.