23 resultados para bacteria genome nucleotide usage
em Chinese Academy of Sciences Institutional Repositories Grid Portal
Resumo:
Background: Serine/threonine kinases (STKs) have been found in an increasing number of prokaryotes, showing important roles in signal transduction that supplement the well known role of two-component system. Cyanobacteria are photoautotrophic prokaryotes able to grow in a wide range of ecological environments, and their signal transduction systems are important in adaptation to the environment. Sequence information from several cyanobacterial genomes offers a unique opportunity to conduct a comprehensive comparative analysis of this kinase family. In this study, we extracted information regarding Ser/Thr kinases from 21 species of sequenced cyanobacteria and investigated their diversity, conservation, domain structure, and evolution. Results: 286 putative STK homologues were identified. STKs are absent in four Prochlorococcus strains and one marine Synechococcus strain and abundant in filamentous nitrogen-fixing cyanobacteria. Motifs and invariant amino acids typical in eukaryotic STKs were conserved well in these proteins, and six more cyanobacteria- or bacteria-specific conserved residues were found. These STK proteins were classified into three major families according to their domain structures. Fourteen types and a total of 131 additional domains were identified, some of which are reported to participate in the recognition of signals or substrates. Cyanobacterial STKs show rather complicated phylogenetic relationships that correspond poorly with phylogenies based on 16S rRNA and those based on additional domains. Conclusion: The number of STK genes in different cyanobacteria is the result of the genome size, ecophysiology, and physiological properties of the organism. Similar conserved motifs and amino acids indicate that cyanobacterial STKs make use of a similar catalytic mechanism as eukaryotic STKs. Gene gain-and-loss is significant during STK evolution, along with domain shuffling and insertion. This study has established an overall framework of sequence-structure-function interactions for the STK gene family, which may facilitate further studies of the role of STKs in various organisms.
Resumo:
Background: Serine/threonine kinases (STKs) have been found in an increasing number of prokaryotes, showing important roles in signal transduction that supplement the well known role of two-component system. Cyanobacteria are photoautotrophic prokaryotes able to grow in a wide range of ecological environments, and their signal transduction systems are important in adaptation to the environment. Sequence information from several cyanobacterial genomes offers a unique opportunity to conduct a comprehensive comparative analysis of this kinase family. In this study, we extracted information regarding Ser/Thr kinases from 21 species of sequenced cyanobacteria and investigated their diversity, conservation, domain structure, and evolution. Results: 286 putative STK homologues were identified. STKs are absent in four Prochlorococcus strains and one marine Synechococcus strain and abundant in filamentous nitrogen-fixing cyanobacteria. Motifs and invariant amino acids typical in eukaryotic STKs were conserved well in these proteins, and six more cyanobacteria- or bacteria-specific conserved residues were found. These STK proteins were classified into three major families according to their domain structures. Fourteen types and a total of 131 additional domains were identified, some of which are reported to participate in the recognition of signals or substrates. Cyanobacterial STKs show rather complicated phylogenetic relationships that correspond poorly with phylogenies based on 16S rRNA and those based on additional domains. Conclusion: The number of STK genes in different cyanobacteria is the result of the genome size, ecophysiology, and physiological properties of the organism. Similar conserved motifs and amino acids indicate that cyanobacterial STKs make use of a similar catalytic mechanism as eukaryotic STKs. Gene gain-and-loss is significant during STK evolution, along with domain shuffling and insertion. This study has established an overall framework of sequence-structure-function interactions for the STK gene family, which may facilitate further studies of the role of STKs in various organisms.
Resumo:
We analyzed n-mers (n=3-8) in the local environment of 8,249,446 human SNPs and compared their distribution with that in the genome reference sequences. The results revealed that the short sequences, which contained at least one CpG dinucleotide, occurred
Resumo:
A genome-wide view of sequence mutability in mice is still limited, although biologists usually assume the same scenario for mice as for humans. In this study, we examined the sequence context in the local environment of 482,528 mouse single nucleotide po
Resumo:
The complete mitochondrial genome sequence of the Chinese hook snout carp, Opsariichthys bidens, was newly determined using the long and accurate polymerase chain reaction method. The 16,611-nucleotide mitogenome contains 13 protein-coding genes, two rRNA genes (12S, 16S) 22 tRNA genes, and a noncoding control region. We use these data and homologous sequence data from multiple other ostariophysan fishes in a phylogenetic evaluation to test hypothesis pertaining to codon usage pattern of O. bidens mitochondrial protein genes as well as to re-examine the ostariophysan phylogeny. The mitochondrial genome of O. bidens reveals an alternative pattern of vertebrate mitochondrial evolution. For the mitochondrial protein genes of O. bidens, the most frequently used codon generally ends with either A or C, with C preferred over A for most fourfold degenerate codon families; the relative synonymous codon usage of G-ending codons is greatly elevated in all categories. The codon usage pattern of O. bidens mitochondrial protein genes is remarkably different from the general pattern found previously in the relatively closely 9 related zebrafish and most other vertebrate mitochondria. Nucleotide bias at third codon positions is the main cause of codon bias in the mitochondrial protein genes of O. bidens, as it is biased particularly in favor of C over A. Bayesian analysis of 12 concatenated mitochondrial protein sequences for O. bidens and 46 other teleostean taxa supports the monophyly of Cypriniformes and Otophysi and results in a robust estimate of the otophysan phylogeny. (C) 2007 Published by Elsevier B.V.
Resumo:
Hemorrhagic disease, caused by the grass carp reovirus (GCRV), is one of the major diseases of grass carp in China. Little is known about the structure and function of the gene segments of this reovirus. The S10 genome segment of GCRV was cloned and the complete nucleotide sequence is reported here. The S10 is 909 nucleotides long and contains a large open reading frame (ORF) encoding a protein of 276 amino acids with a deduced molecular weight of approximately 29.7 kDa. Comparisons of the deduced amino acid sequence of GCRV S10 with those of other reoviruses revealed no significant homologies. However, GCRV S10 shared a putative zinc-finger sequence and a similar distribution of hydrophilic motifs with the outer capsid proteins encoded by Coho salmon aquareovirus (SCSV) S10, striped bass reovirus (SBRV) S10, and mammalian reovirus (MRV) S4. It was predicted that this segment gene encodes an outer capsid protein.
Resumo:
The mitochondrial DNA of the rice frog, Fejervarya limnocharis (Amphibia, Anura), was obtained using long-and-accurate polymerase chain reaction (LA-PCR) combining with subcloning method. The complete nucleotide sequence (17,717 bp) of mitochondrial genome was determined subsequently. This mitochondrial genome is characterized by four distinctive features: the translocation of ND5 gene, a cluster of rearranged tRNA genes (tRNA(Thr), tRNA(Pro), tRNA(Leu) ((CUN))) a tandem duplication of tRNA(Mer) gene, and eight large 89-bp tandem repeats in the control region, as well as three short noncoding regions containing two repeated motifs existing in the gene cluster of ND5/tRNA(Thr)/tRNA(Pro)/tRNA(Leu)/tRNA(Phe). The tandem duplication of gene regions followed by deletions of supernumerary genes can be invoked to explain the shuffling of tRNAM(Met) and a cluster of tRNA and ND5 genes, as observed in this study. Both ND5 gene translocation and tandem duplication of tRNA(Met) were first observed in the vertebrate mitochondrial genomes. (c) 2004 Elsevier B.V. All rights reserved.
Resumo:
Advances in genome technology have facilitated a new understanding of the historical and genetic processes crucial to rapid phenotypic evolution under domestication(1,2). To understand the process of dog diversification better, we conducted an extensive genome-wide survey of more than 48,000 single nucleotide polymorphisms in dogs and their wild progenitor, the grey wolf. Here we show that dog breeds share a higher proportion of multi-locus haplotypes unique to grey wolves from the Middle East, indicating that they are a dominant source of genetic diversity for dogs rather than wolves from east Asia, as suggested by mitochondrial DNA sequence data(3). Furthermore, we find a surprising correspondence between genetic and phenotypic/functional breed groupings but there are exceptions that suggest phenotypic diversification depended in part on the repeated crossing of individuals with novel phenotypes. Our results show that Middle Eastern wolves were a critical source of genome diversity, although interbreeding with local wolf populations clearly occurred elsewhere in the early history of specific lineages. More recently, the evolution of modern dog breeds seems to have been an iterative process that drew on a limited genetic toolkit to create remarkable phenotypic diversity.
Resumo:
Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes.
Resumo:
SNPNB is a user-friendly and platform-independent application for analyzing Single Nucleotide Polymorphism NeighBoring sequence context and nucleotide bias patterns, and subsequently evaluating the effective SNP size for the bias patterns observed from the whole data. It was implemented by Java and Perl. SNPNB can efficiently handle genome-wide or chromosome-wide SNP data analysis in a PC or a workstation. It provides visualizations of the bias patterns for SNPs or each type of SNPs.
Resumo:
Background: Cytochrome P450 monooxygenases play key roles in the metabolism of a wide variety of substrates and they are closely associated with endocellular physiological processes or detoxification metabolism under environmental exposure. To date, however, none has been systematically characterized in the phylum Ciliophora. T. thermophila possess many advantages as a eukaryotic model organism and it exhibits rapid and sensitive responses to xenobiotics, making it an ideal model system to study the evolutionary and functional diversity of the P450 monooxygenase gene family. Results: A total of 44 putative functional cytochrome P450 genes were identified and could be classified into 13 families and 21 sub-families according to standard nomenclature. The characteristics of both the conserved intron-exon organization and scaffold localization of tandem repeats within each P450 family clade suggested that the enlargement of T. thermophila P450 families probably resulted from recent separate small duplication events. Gene expression patterns of all T. thermophila P450s during three important cell physiological stages (vegetative growth, starvation and conjugation) were analyzed based on EST and microarray data, and three main categories of expression patterns were postulated. Evolutionary analysis including codon usage preference, sit-especific selection and gene-expression evolution patterns were investigated and the results indicated remarkable divergences among the T. thermophila P450 genes. Conclusion: The characterization, expression and evolutionary analysis of T. thermophila P450 monooxygenase genes in the current study provides useful information for understanding the characteristics and diversities of the P450 genes in the Ciliophora, and provides the baseline for functional analyses of individual P450 isoforms in this model ciliate species.
Resumo:
The complete genome of spring viraemia of carp virus (SVCV) strain A-1 isolated from cultured common carp (Cyprinus carpio) in China was sequenced and characterized. Reverse transcription-polymerase chain reaction (RT-PCR) derived clones were constructed and the DNA was sequenced. It showed that the entire genome of SVCV A-1 consists of 11,100 nucleotide base pairs, the predicted size of the viral RNA of rhabdoviruses. However, the additional insertions in bp 4633-4676 and bp 4684-4724 of SVCV A-1 were different from the other two published SVCV complete genomes. Five open reading frames (ORFs) of SVCV A-1 were identified and further confirmed by RT-PCR and DNA sequencing of their respective RT-PCR products. The 5 structural proteins encoded by the viral RNA were ordered 3'-N-P-M-G-L-5'. This is the first report of a complete genome sequence of SVCV isolated from cultured carp in China. Phylogenetic analysis indicates that SVCV A-1 is closely related to the members of the genus Vesiculovirus, family Rhabdoviridae.
Resumo:
The complete sequence of the 16,539 nucleotide mitochondrial genome from the single species of the catfish family Cranoglanididae, the helmet catfish Cranoglanis bouderius, was determined using the long and accurate polymerase chain reaction (LA PCR) method. The nucleotide sequences of C. bouderius mitochondrial DNA have been compared with those of three other catfish species in the same order. The contents of the C. bouderius mitochondrial genome are 13 protein-coding genes, two ribosomal RNA and 22 transfer RNA genes, and a non-coding control region, the gene order of which is identical to that observed in most other vertebrates. Phylogenetic analyses for 13 otophysan fishes were performed using Bayesian method based on the concatenated mtDNA protein-coding gene sequence and the individual protein-coding gene sequence data set. The competing otophysan topologies were then tested by using the approximately unbiased test, the Kishino-Hasegawa test, and the Shimodaira-Hasegawa test. The results show that the grouping ((((Characifonnes, Gymnotiformes), Siluriformes), Cyprinifionnes), outgroup) is the most likely but there is no significant difference between this one and the other alternative hypotheses. In addition, the phylogenetic placement of the family Cranoglanididae among siluriform families was also discussed. (c) 2006 Elsevier B.V. All rights reserved.
Resumo:
The complete nucleotide sequence of the genome segment S8 of grass carp hemorrhage virus (GCHV) was determined from cDNA corresponding to the viral genomic RNA. It is 1,287 nucleotides in length and contains a large open reading frame that could encode a protein of 409 amino acids with a predicted molecular mass of 44 kD. The S8 was expressed using the pET fusion protein vector and detected by Western blotting analysis using the chicken egg IgY against intact GCHV particles, indicating that S8 encodes a virion protein. Amino acid sequence comparisons revealed that the protein encoded by S8 is closely related to protein alpha2 of mammalian reovirus, suggesting that the deduced protein of S8 is an inner capsid protein. Copyright (C) 2001 S. Karger AG, Basel.