128 results for Genome annotation
in BORIS: Bern Open Repository and Information System - Bern - Switzerland
Abstract:
BACKGROUND A novel Gram-negative, non-haemolytic, non-motile, rod-shaped bacterium was discovered in the lungs of a dead parakeet (Melopsittacus undulatus) that was kept in captivity in a pet shop in Basel, Switzerland. The organism is described with a chemotaxonomic profile and the nearly complete genome sequence obtained through the assembly of short sequence reads. RESULTS Genome sequence analysis and characterization of respiratory quinones, fatty acids, polar lipids, and biochemical phenotype are presented here. Comparison of gene sequences revealed that the most similar species is Pelistega europaea, with BLAST identities of only 93% to the 16S rDNA gene and 76% to the rpoB gene, and a GC content (~43%) similar to that of the organism isolated from the parakeet, DSM 24701 (40%). The closest full genome sequences are those of Bordetella spp. and Taylorella spp. High-throughput sequencing reads from the Illumina-Solexa platform were assembled with the Edena de novo assembler to form 195 contigs comprising the ~2 Mb genome. Genome annotation with RAST, construction of phylogenetic trees with the 16S rDNA (rrs) gene sequence and the rpoB gene, and phylogenetic placement using other highly conserved marker genes with ML Tree all suggest that the bacterial species belongs to the Alcaligenaceae family. Analysis of samples from cages with healthy parakeets suggested that the newly discovered bacterial species is not widespread in parakeet living quarters. CONCLUSIONS Classification of this organism in the current taxonomy system requires the formation of a new genus and species. We designate the new genus Basilea and the new species psittacipulmonis. The type strain of Basilea psittacipulmonis is DSM 24701 (= CIP 110308 T, 16S rDNA gene sequence GenBank accession number JX412111 and GI 406042063).
Abstract:
Complete transcriptomic data at high resolution are available only for a few model organisms with medical importance. The gene structures of non-model organisms are mostly computationally predicted based on comparative genomics with other species. As a result, more than half of the horse gene models are known only by projection. Experimental data supporting these gene models are scarce. Moreover, most of the annotated equine genes are single-transcript genes. With RNA sequencing (RNA-seq), the experimental validation of predicted transcriptomes has become accessible at reasonable cost. To improve the horse genome annotation, we performed RNA-seq on 561 samples of peripheral blood mononuclear cells (PBMCs) derived from 85 Warmblood horses. The mapped sequencing reads were used to build a new transcriptome assembly. The new assembly revealed many alternative isoforms associated with known genes or with those predicted by the Ensembl and/or Gnomon pipelines. We also identified 7,531 transcripts not associated with any horse gene annotated in public databases. Of these, 3,280 transcripts did not have a homologous match to any sequence deposited in the NCBI EST database, suggesting horse specificity. The unknown transcripts were categorized as coding and noncoding based on predicted coding potential scores. Among them, 230 transcripts had a high coding potential score, at least 2 exons, and an open reading frame of at least 300 nt. We experimentally validated 9 new equine coding transcripts using RT-PCR and Sanger sequencing. Our results provide valuable detailed information on many transcripts yet to be annotated in the horse genome.
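The three criteria named in the abstract above (high coding potential score, at least 2 exons, open reading frame of at least 300 nt) can be illustrated with a minimal sketch. The `Transcript` record, the function names, and the score cutoff of 0.5 are all hypothetical; the abstract does not state which score or threshold the authors used.

```python
from dataclasses import dataclass


@dataclass
class Transcript:
    """Hypothetical record for one assembled transcript."""
    transcript_id: str
    exon_count: int
    orf_length_nt: int
    coding_score: float  # predicted coding potential; scale is an assumption


def select_candidates(transcripts, min_exons=2, min_orf_nt=300, min_score=0.5):
    """Keep transcripts that pass all three filters.

    min_exons and min_orf_nt follow the abstract; min_score is an
    illustrative placeholder, not a value reported by the authors.
    """
    return [
        t for t in transcripts
        if t.exon_count >= min_exons
        and t.orf_length_nt >= min_orf_nt
        and t.coding_score >= min_score
    ]
```

Keeping the thresholds as keyword arguments makes it easy to see how sensitive the candidate set is to each cutoff.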
Abstract:
BACKGROUND: The mollicute Mycoplasma conjunctivae is the etiological agent of infectious keratoconjunctivitis (IKC) in domestic sheep and wild Caprinae. Although this pathogen is relatively benign for domestic animals treated with antibiotics, it can lead to blindness and death in wild animals. This is a major cause of death among protected species in the Alps (e.g., Capra ibex, Rupicapra rupicapra). METHODS: The genome was sequenced using a combination of GS-FLX (454) and Sanger sequencing, and annotated by an automatic pipeline that we designed using several tools interconnected via Perl scripts. The resulting annotations are stored in a MySQL database. RESULTS: The annotated sequence is deposited in the EMBL database (FM864216) and uploaded into the mollicutes database MolliGen http://cbi.labri.fr/outils/molligen/ allowing for comparative genomics. CONCLUSION: We show that our automatic pipeline allows for annotating a complete mycoplasma genome and present several examples of analysis in search of biological targets (e.g., pathogenic proteins).
Abstract:
Enterococcus hirae ATCC 9790 is a Gram-positive lactic acid bacterium that has been used in basic research for over 4 decades. Here we report the sequence and annotation of the 2.8-Mb genome of E. hirae and its endemic 29-kb plasmid pTG9790.
Abstract:
Avibacterium paragallinarum is an important pathogen of chicken livestock causing infectious coryza. Here, we report the draft genome sequence of the virulent A. paragallinarum serotype A strain JF4211 (2.8 Mbp and G+C content of 41%) and the two toxin operons discovered from the annotation of the genome.
Abstract:
Clostridium chauvoei is the etiological agent of blackleg, a disease of cattle and sheep with high mortality rates, causing severe economic losses in livestock production. Here, we report the draft genome sequence of the virulent C. chauvoei strain JF4335 (2.8 Mbp and 28% G+C content) and the annotation of the genome.
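G+C content figures such as the 28% reported above are simple base-composition percentages. The following is a minimal sketch of that calculation, assuming a plain nucleotide string as input; the function name is hypothetical, and excluding ambiguous bases such as N from the denominator is one common convention, not necessarily the one the authors applied.

```python
def gc_content(sequence: str) -> float:
    """Return G+C content as a percentage of unambiguous bases (A, C, G, T)."""
    seq = sequence.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no unambiguous bases")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / counted
```

For a draft genome split across contigs, the same function can be applied to the concatenation of all contig sequences.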
Abstract:
BACKGROUND The free-living amoeba Naegleria fowleri is the causative agent of the rapidly progressing and typically fatal primary amoebic meningoencephalitis (PAM) in humans. Despite the devastating nature of this disease, which results in > 97% mortality, knowledge of the pathogenic mechanisms of the amoeba is incomplete. This work presents a comparative proteomic approach based on an experimental model in which the pathogenic potential of N. fowleri trophozoites is influenced by the compositions of different media. RESULTS As a scaffold for proteomic analysis, we sequenced the genome and transcriptome of N. fowleri. Since the sequence similarity to the recently published genome of Naegleria gruberi was far lower than the close taxonomic relationship of these species would suggest, a de novo sequencing approach was chosen. After excluding cell regulatory mechanisms originating from different media compositions, we identified 22 proteins with a potential role in the pathogenesis of PAM. Functional annotation of these proteins revealed that the membrane is the major location where the amoeba exerts its pathogenic potential, possibly involving actin-dependent processes such as intracellular trafficking via vesicles. CONCLUSION This study describes for the first time the 30-Mb genome and the transcriptome sequence of N. fowleri and provides the basis for the further definition of effective intervention strategies against the rare but highly fatal form of amoebic meningoencephalitis.
Abstract:
PURPOSE Whole saliva comprises components of the salivary pellicle that spontaneously forms on surfaces of implants and teeth. However, there are no studies that functionally link the salivary pellicle with a possible change in gene expression. MATERIALS AND METHODS This study examined the genetic response of oral fibroblasts exposed to the salivary pellicle and whole saliva. Oral fibroblasts were seeded onto a salivary pellicle and the respective untreated surface. Oral fibroblasts were also exposed to freshly harvested sterile-filtered whole saliva. A genome-wide microarray of oral fibroblasts was performed, followed by gene ontology screening with DAVID functional annotation clustering, KEGG pathway analysis, and the STRING functional protein association network. RESULTS Exposure of oral fibroblasts to saliva caused 61 genes to be differentially expressed (P < .05). Gene ontology screening assigned the respective genes into 262 biologic processes, 3 cellular components, 13 molecular functions, and 7 pathways. Most remarkable was the enrichment in the inflammatory response. None of the genes regulated by whole saliva was significantly changed when cells were placed onto a salivary pellicle. CONCLUSION The salivary pellicle per se does not provoke a significant inflammatory response of oral fibroblasts in vitro, whereas sterile-filtered whole saliva does produce a strong inflammatory response.
Abstract:
Marginal zone B-cell lymphomas (MZLs) have been divided into 3 distinct subtypes (extranodal MZLs of mucosa-associated lymphoid tissue [MALT] type, nodal MZLs, and splenic MZLs). Nevertheless, the relationship between the subtypes is still unclear. We performed a comprehensive analysis of genomic DNA copy number changes in a very large series of MZL cases with the aim of addressing this question. Samples from 218 MZL patients (25 nodal, 57 MALT, 134 splenic, and 2 not better specified MZLs) were analyzed with the Affymetrix Human Mapping 250K SNP arrays, and the data were combined with matched gene expression in 33 of 218 cases. MALT lymphoma significantly more frequently presented gains at 3p, 6p, and 18p, and del(6q23) (TNFAIP3/A20), whereas splenic MZLs were associated with del(7q31) and del(8p). Nodal MZLs did not show statistically significant differences compared with MALT lymphoma while lacking the splenic MZL-related 7q losses. Gains of 3q and 18q were common to all 3 subtypes. del(8p) was often present together with del(17p) (TP53). Although del(17p) did not determine a worse outcome and del(8p) was only of borderline significance, the presence of both deletions had a highly significant negative impact on the outcome of splenic MZLs.
Abstract:
We undertook a meta-analysis of six Crohn's disease genome-wide association studies (GWAS) comprising 6,333 affected individuals (cases) and 15,056 controls and followed up the top association signals in 15,694 cases, 14,026 controls and 414 parent-offspring trios. We identified 30 new susceptibility loci meeting genome-wide significance (P < 5 x 10(-8)). A series of in silico analyses highlighted particular genes within these loci and, together with manual curation, implicated functionally interesting candidate genes including SMAD3, ERAP2, IL10, IL2RA, TYK2, FUT2, DNMT3A, DENND1B, BACH2 and TAGAP. Combined with previously confirmed loci, these results identify 71 distinct loci with genome-wide significant evidence for association with Crohn's disease.
Abstract:
Saccular intracranial aneurysms are balloon-like dilations of the intracranial arterial wall; their hemorrhage commonly results in severe neurologic impairment and death. We report a second genome-wide association study with discovery and replication cohorts from Europe and Japan comprising 5,891 cases and 14,181 controls with approximately 832,000 genotyped and imputed SNPs across discovery cohorts. We identified three new loci showing strong evidence for association with intracranial aneurysms in the combined dataset, including intervals near RBBP8 on 18q11.2 (odds ratio (OR) = 1.22, P = 1.1 x 10(-12)), STARD13-KL on 13q13.1 (OR = 1.20, P = 2.5 x 10(-9)) and a gene-rich region on 10q24.32 (OR = 1.29, P = 1.2 x 10(-9)). We also confirmed prior associations near SOX17 (8q11.23-q12.1; OR = 1.28, P = 1.3 x 10(-12)) and CDKN2A-CDKN2B (9p21.3; OR = 1.31, P = 1.5 x 10(-22)). It is noteworthy that several putative risk genes play a role in cell-cycle progression, potentially affecting the proliferation and senescence of progenitor-cell populations that are responsible for vascular formation and repair.
Abstract:
Hepatitis C virus (HCV) induces chronic infection in 50% to 80% of infected persons; approximately 50% of these do not respond to therapy. We performed a genome-wide association study to screen for host genetic determinants of HCV persistence and response to therapy.
Abstract:
Narcolepsy is a rare sleep disorder with the strongest human leukocyte antigen (HLA) association ever reported. Since the associated HLA-DRB1*1501-DQB1*0602 haplotype is common in the general population (15-25%), it has been suggested that it is almost necessary but not sufficient for developing narcolepsy. To further define the genetic basis of narcolepsy risk, we performed a genome-wide association study (GWAS) in 562 European individuals with narcolepsy (cases) and 702 ethnically matched controls, with independent replication in 370 cases and 495 controls, all heterozygous for DRB1*1501-DQB1*0602. We found association with a protective variant near HLA-DQA2 (rs2858884; P < 3 x 10(-8)). Further analysis revealed that rs2858884 is strongly linked to DRB1*03-DQB1*02 (P < 4 x 10(-43)) and DRB1*1301-DQB1*0603 (P < 3 x 10(-7)). Cases almost never carried a trans DRB1*1301-DQB1*0603 haplotype (odds ratio = 0.02; P < 6 x 10(-14)). This unexpected protective HLA haplotype suggests a virtually causal involvement of the HLA region in narcolepsy susceptibility.
Abstract:
Outcrossing Arabidopsis species that diverged from their inbreeding relative Arabidopsis thaliana 5 million yr ago and display a biogeographical pattern of interspecific sympatry vs intraspecific allopatry provide an ideal model for studying the impacts of gene introgression and polyploidization on species diversification. Flow cytometry analyses detected ploidy polymorphisms of 2x and 4x in Arabidopsis lyrata ssp. kamchatica of Taiwan. Genomic divergence between species/subspecies was estimated based on 98 randomly chosen nuclear genes. Multilocus analyses revealed a mosaic genome in diploid A. l. kamchatica composed of Arabidopsis halleri-like and A. lyrata-like alleles. Coalescent analyses suggest that the segregation of ancestral polymorphisms alone cannot explain the high inconsistency between gene trees across loci, and that gene introgression via diploid A. l. kamchatica likely distorts the molecular phylogenies of Arabidopsis species. However, not all genes migrated across species freely. Gene ontology analyses suggested that some nonmigrating genes were constrained by natural selection. High levels of estimated ancestral polymorphisms between A. halleri and A. lyrata suggest that gene flow between these species has not completely ceased since their initial isolation. Polymorphism data of extant populations also imply recent gene flow between the species. Our study reveals that interspecific gene flow affects genome evolution in Arabidopsis.