39 resultados para annotation sémantique

em BORIS: Bern Open Repository and Information System - Berna - Suiça


Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: The mollicute Mycoplasma conjunctivae is the etiological agent leading to infectious keratoconjunctivitis (IKC) in domestic sheep and wild caprinae. Although this pathogen is relatively benign for domestic animals treated by antibiotics, it can lead wild animals to blindness and death. This is a major cause of death in the protected species in the Alps (e.g., Capra ibex, Rupicapra rupicapra). METHODS: The genome was sequenced using a combined technique of GS-FLX (454) and Sanger sequencing, and annotated by an automatic pipeline that we designed using several tools interconnected via PERL scripts. The resulting annotations are stored in a MySQL database. RESULTS: The annotated sequence is deposited in the EMBL database (FM864216) and uploaded into the mollicutes database MolliGen http://cbi.labri.fr/outils/molligen/ allowing for comparative genomics. CONCLUSION: We show that our automatic pipeline allows for annotating a complete mycoplasma genome and present several examples of analysis in search for biological targets (e.g., pathogenic proteins).

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes methods and results for the annotation of two discourse-level phenomena, connectives and pronouns, over a multilingual parallel corpus. Excerpts from Europarl in English and French have been annotated with disambiguation information for connectives and pronouns, for about 3600 tokens. This data is then used in several ways: for cross-linguistic studies, for training automatic disambiguation software, and ultimately for training and testing discourse-aware statistical machine translation systems. The paper presents the annotation procedures and their results in detail, and overviews the first systems trained on the annotated resources and their use for machine translation.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Enterococcus hirae ATCC 9790 is a Gram-positive lactic acid bacterium that has been used in basic research for over 4 decades. Here we report the sequence and annotation of the 2.8-Mb genome of E. hirae and its endemic 29-kb plasmid pTG9790.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

With the advent of high through-put sequencing (HTS), the emerging science of metagenomics is transforming our understanding of the relationships of microbial communities with their environments. While metagenomics aims to catalogue the genes present in a sample through assessing which genes are actively expressed, metatranscriptomics can provide a mechanistic understanding of community inter-relationships. To achieve these goals, several challenges need to be addressed from sample preparation to sequence processing, statistical analysis and functional annotation. Here we use an inbred non-obese diabetic (NOD) mouse model in which germ-free animals were colonized with a defined mixture of eight commensal bacteria, to explore methods of RNA extraction and to develop a pipeline for the generation and analysis of metatranscriptomic data. Applying the Illumina HTS platform, we sequenced 12 NOD cecal samples prepared using multiple RNA-extraction protocols. The absence of a complete set of reference genomes necessitated a peptide-based search strategy. Up to 16% of sequence reads could be matched to a known bacterial gene. Phylogenetic analysis of the mapped ORFs revealed a distribution consistent with ribosomal RNA, the majority from Bacteroides or Clostridium species. To place these HTS data within a systems context, we mapped the relative abundance of corresponding Escherichia coli homologs onto metabolic and protein-protein interaction networks. These maps identified bacterial processes with components that were well-represented in the datasets. In summary this study highlights the potential of exploiting the economy of HTS platforms for metatranscriptomics.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A porcine BAC clone harboring the tightly linked IFNAR1 and IFNGR2 genes was identified by comparative analysis of the publicly available porcine BAC end sequences. The complete 168,835 bp insert sequence of this clone was determined. Sequence comparisons of the genomic sequence with EST sequences from public databases were performed and allowed a detailed annotation of the IFNAR1 and IFNGR2 genes. The analyzed genes showed a conserved genomic organization with their known mammalian orthologs, however the sequence conservation of these genes across species was relatively low. In addition to the IFNAR1 and IFNGR2 genes, which were completely sequenced, the analyzed BAC clone also contained parts of an orphan gene encoding a putative transmembrane protein (TMEM50B). In contrast to the IFNAR1 and IFNGR2 genes the sequence conservation of the TMEM50B gene across different mammalian species was extremely high.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

MicroRNAs (miRNAs) are an abundant class of non-coding RNAs that are believed to be important in many biological processes through regulation of gene expression. The precise molecular function of miRNAs in mammals is largely unknown and a better understanding will require loss-of-function studies in vivo. Here we show that a novel class of chemically engineered oligonucleotides, termed 'antagomirs', are efficient and specific silencers of endogenous miRNAs in mice. Intravenous administration of antagomirs against miR-16, miR-122, miR-192 and miR-194 resulted in a marked reduction of corresponding miRNA levels in liver, lung, kidney, heart, intestine, fat, skin, bone marrow, muscle, ovaries and adrenals. The silencing of endogenous miRNAs by this novel method is specific, efficient and long-lasting. The biological significance of silencing miRNAs with the use of antagomirs was studied for miR-122, an abundant liver-specific miRNA. Gene expression and bioinformatic analysis of messenger RNA from antagomir-treated animals revealed that the 3' untranslated regions of upregulated genes are strongly enriched in miR-122 recognition motifs, whereas downregulated genes are depleted in these motifs. Analysis of the functional annotation of downregulated genes specifically predicted that cholesterol biosynthesis genes would be affected by miR-122, and plasma cholesterol measurements showed reduced levels in antagomir-122-treated mice. Our findings show that antagomirs are powerful tools to silence specific miRNAs in vivo and may represent a therapeutic strategy for silencing miRNAs in disease.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Much of the knowledge about software systems is implicit, and therefore difficult to recover by purely automated techniques. Architectural layers and the externally visible features of software systems are two examples of information that can be difficult to detect from source code alone, and that would benefit from additional human knowledge. Typical approaches to reasoning about data involve encoding an explicit meta-model and expressing analyses at that level. Due to its informal nature, however, human knowledge can be difficult to characterize up-front and integrate into such a meta-model. We propose a generic, annotation-based approach to capture such knowledge during the reverse engineering process. Annotation types can be iteratively defined, refined and transformed, without requiring a fixed meta-model to be defined in advance. We show how our approach supports reverse engineering by implementing it in a tool called Metanool and by applying it to (i) analyzing architectural layering, (ii) tracking reengineering tasks, (iii) detecting design flaws, and (iv) analyzing features.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Avibacterium paragallinarum is an important pathogen of chicken livestock causing infectious coryza. Here, we report the draft genome sequence of the virulent A. paragallinarum serotype A strain JF4211 (2.8 Mbp and G+C content of 41%) and the two toxin operons discovered from the annotation of the genome.