831 resultados para Klebsiella pneumoniae genome sequence


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Advances in genome technology have facilitated a new understanding of the historical and genetic processes crucial to rapid phenotypic evolution under domestication(1,2). To understand the process of dog diversification better, we conducted an extensive genome-wide survey of more than 48,000 single nucleotide polymorphisms in dogs and their wild progenitor, the grey wolf. Here we show that dog breeds share a higher proportion of multi-locus haplotypes unique to grey wolves from the Middle East, indicating that they are a dominant source of genetic diversity for dogs rather than wolves from east Asia, as suggested by mitochondrial DNA sequence data(3). Furthermore, we find a surprising correspondence between genetic and phenotypic/functional breed groupings but there are exceptions that suggest phenotypic diversification depended in part on the repeated crossing of individuals with novel phenotypes. Our results show that Middle Eastern wolves were a critical source of genome diversity, although interbreeding with local wolf populations clearly occurred elsewhere in the early history of specific lineages. More recently, the evolution of modern dog breeds seems to have been an iterative process that drew on a limited genetic toolkit to create remarkable phenotypic diversity.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

High-throughput DNA sequencing (HTS) instruments today are capable of generating millions of sequencing reads in a short period of time, and this represents a serious challenge to current bioinformatics pipeline in processing such an enormous amount of data in a fast and economical fashion. Modern graphics cards are powerful processing units that consist of hundreds of scalar processors in parallel in order to handle the rendering of high-definition graphics in real-time. It is this computational capability that we propose to harness in order to accelerate some of the time-consuming steps in analyzing data generated by the HTS instruments. We have developed BarraCUDA, a novel sequence mapping software that utilizes the parallelism of NVIDIA CUDA graphics cards to map sequencing reads to a particular location on a reference genome. While delivering a similar mapping fidelity as other mainstream programs , BarraCUDA is a magnitude faster in mapping throughput compared to its CPU counterparts. The software is also capable of supporting multiple CUDA devices in parallel to further accelerate the mapping throughput. BarraCUDA is designed to take advantage of the parallelism of GPU to accelerate the mapping of millions of sequencing reads generated by HTS instruments. By doing this, we could, at least in part streamline the current bioinformatics pipeline such that the wider scientific community could benefit from the sequencing technology. BarraCUDA is currently available at http://seqbarracuda.sf.net

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: Cytochrome P450 monooxygenases play key roles in the metabolism of a wide variety of substrates and they are closely associated with endocellular physiological processes or detoxification metabolism under environmental exposure. To date, however, none has been systematically characterized in the phylum Ciliophora. T. thermophila possess many advantages as a eukaryotic model organism and it exhibits rapid and sensitive responses to xenobiotics, making it an ideal model system to study the evolutionary and functional diversity of the P450 monooxygenase gene family. Results: A total of 44 putative functional cytochrome P450 genes were identified and could be classified into 13 families and 21 sub-families according to standard nomenclature. The characteristics of both the conserved intron-exon organization and scaffold localization of tandem repeats within each P450 family clade suggested that the enlargement of T. thermophila P450 families probably resulted from recent separate small duplication events. Gene expression patterns of all T. thermophila P450s during three important cell physiological stages (vegetative growth, starvation and conjugation) were analyzed based on EST and microarray data, and three main categories of expression patterns were postulated. Evolutionary analysis including codon usage preference, sit-especific selection and gene-expression evolution patterns were investigated and the results indicated remarkable divergences among the T. thermophila P450 genes. Conclusion: The characterization, expression and evolutionary analysis of T. thermophila P450 monooxygenase genes in the current study provides useful information for understanding the characteristics and diversities of the P450 genes in the Ciliophora, and provides the baseline for functional analyses of individual P450 isoforms in this model ciliate species.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The mitochondrial genome complete sequence of Achalinus meiguensis was reported for the first time in the present study. The complete mitochondrial genome of A. meiguensis is 17239 bp in length and contains 13 protein-coding genes, 22 tRNA, 2 rRNA, and 2 non-coding regions (Control regions). On the basis of comparison with the other complete mitochondrial sequences reported, we explored the characteristic of structure and evolution. For example, duplication control regions independently occurred in the evolutionary history of reptiles; the pseudo-tRNA of snakes occurred in the Caenophidia; snake is shorter than other vertebrates in the length of tRNA because of the truncations of T psi C arm (less than 5 bp) and "DHU" arm. The phylogenic analysis by MP and BI analysis showed that the phylogenetic position of A. meiguensis was placed in Caenophidia as a sister group to other advanced snakes with the exclusion of Acrochordus granulatus which was rooted in the Caenophidia. Therefore we suggested that the subfamily Xenodermatinae, which contains A. meiguensis, should be raised to a family rank or higher rank. At the same time, based on the phylogenic statistic test, the tree of Bayesian was used for estimating the divergence time. The results showed that the divergence time between Henophidia and Caenophidia was 109.50 Mya; 106.18 Mya for divergence between Acrochordus granulatus and the other snakes of the Caenophidia; the divergence time of A. meiguensis was 103 Mya, and Viperidae diverged from the unilateral of Elapidae and Colubridae was 96.06 Mya.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A PCR survey for Sox genes in a young tetraploid fish Tor douronensis (Teleostei: Cyprinidae) was performed to access the evolutionary fates of important functional genes after genome duplication caused by polyploidization event. Totally 13 Sox genes were obtained in Tor douronensis, which represent SoxB, SoxC and SoxE groups. Phylogenetic analysis of Sox genes in Tor douronensis provided evidence for fish-specific genome duplication, and suggested that Sox19 might be a teleost specific Sox gene member. Sequence analysis revealed most of the nucleotide substitutions between duplicated copies of Sox genes caused by tetraploidization event or their orthologues in other species are silent substitutions. It would appear that the sequences are under purifying selective pressure, strongly suggesting that they represent functional genes and supporting selection against all null allele at either of two duplicated loci of Sox4a, Sox9a and Sox9b. Surprising variations of the intron length and similarities of two duplicated copies of Sox9a and Sox9b, suggest that Tor douronensis might be an allotetraploidy.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The complete genome of mandarin fish Siniperca chuatsi rhabdovirus (SCRV) was cloned and sequenced. It comprises 11,545 nucleotides and contains five genes encoding the nucleoprotein N, the phosphoprotein P, the matrix protein M, the glycoprotein G, and the RNA-dependent RNA polymerase protein L. At the 3' and 5' termini of SCRV genome, leader and trailer sequences show inverse complementarity. The N, P, M and G proteins share the highest sequence identities (ranging from 14.8 to 41.5%) with the respective proteins of rhabdovirus 903/87, the L protein has the highest identity with those of vesiculoviruses, especially with Chandipura virus (44.7%). Phylogenetic analysis of L proteins showed that SCRV clustered with spring vireamia of carp virus (SVCV) and was most closely related to viruses in the genus Vesiculovirus. In addition, an overlapping open reading frame (ORF) predicted to encode a protein similar to vesicular stomatitis virus C protein is present within the P gene of SCRV. Furthermore, an unoverlapping small ORF downstream of M ORF within M gene is predicted (tentatively called orf4). Therefore, the genomic organization of SCRV can be proposed as 3' leader-N-P/C-M-(orf4)-G-L-trailer 5'. Orf4 transcription or translation products could not be detected by northern or Western blot, respectively, though one similar mRNA band to M mRNA was found. This is the first report on one small unoverlapping ORF in M gene of a fish rhabdovirus. (c) 2007 Elsevier B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The complete sequence of the 16,539 nucleotide mitochondrial genome from the single species of the catfish family Cranoglanididae, the helmet catfish Cranoglanis bouderius, was determined using the long and accurate polymerase chain reaction (LA PCR) method. The nucleotide sequences of C. bouderius mitochondrial DNA have been compared with those of three other catfish species in the same order. The contents of the C. bouderius mitochondrial genome are 13 protein-coding genes, two ribosomal RNA and 22 transfer RNA genes, and a non-coding control region, the gene order of which is identical to that observed in most other vertebrates. Phylogenetic analyses for 13 otophysan fishes were performed using Bayesian method based on the concatenated mtDNA protein-coding gene sequence and the individual protein-coding gene sequence data set. The competing otophysan topologies were then tested by using the approximately unbiased test, the Kishino-Hasegawa test, and the Shimodaira-Hasegawa test. The results show that the grouping ((((Characifonnes, Gymnotiformes), Siluriformes), Cyprinifionnes), outgroup) is the most likely but there is no significant difference between this one and the other alternative hypotheses. In addition, the phylogenetic placement of the family Cranoglanididae among siluriform families was also discussed. (c) 2006 Elsevier B.V. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Antimicrobial peptides (AMPs) are important components of the host innate immune response against microbial invasion. In addition to the previously known four classes of antimicrobial peptides, a fifth class of antimicrobial peptides has been recently identified to include NK-lysins that have a globular three-dimensional structure and are larger with 74-78 amino acid residues. NK-lysin has been shown to harbor antimicrobial activities against a wide spectrum of microorganisms including bacteria, fungi, protozoa, and parasites. To date, NK-lysin genes have been reported from only a limited number of organisms. We previously identified a NK-lysin cDNA in channel catfish. Here we report the identification of two noveltypes of NK-lysin transcripts in channel catfish. Altogether, three distinct NK-lysin transcripts exist in channel catfish. In this work, their encoding genes were identified, sequenced, and characterized. We provide strong evidence that the catfish NK-lysin gene is tripled in the same genomic neighborhood. All three catfish NK-lysin genes are present in the same genomic region and are tightly linked on the same chromosome, as the same BAC clones harbor all three copies of the NK-lysin genes. All three NK-lysin genes are expressed, but exhibit distinct expression profiles in various tissues. In spite of the existence of a single copy of NK-lysin gene in the human genome, and only a single hit from the pufferfish,genome, there are two tripled clusters of NK-lysin genes on chromosome 17 of zebrafish in addition to one more copy on its chromosome 5. The similarity in the genomic arrangement of the tripled NK-lysin genes in channel catfish and zebrafish suggest similar evolution of NK-lysin genes. (c) 2005 Elsevier Ltd. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Full-length and partial genome sequences of four members of the genus Aquareovirus, family Reoviridae (Golden shiner reovirus, Grass carp reovirus, Striped bass reovirus and golden ide reovirus) were characterized. Based on sequence comparison, the unclassified Grass carp reovirus was shown to be a member of the species Aquareovirus C The status of golden ide reovirus, another unclassified aquareovirus, was also examined. Sequence analysis showed that it did not belong to the species Aquareovirus A or C, but assessment of its relationship to the species Aquareovirus B, D, E and F was hampered by the absence of genetic data from these species. In agreement with previous reports of ultrastructural resemblance between aquareoviruses and orthoreoviruses, genetic analysis revealed homology in the genes of the two groups. This homology concerned eight of the 11 segments of the aquareovirus genome (amino acid identity 17-42%), and similar genetic organization was observed in two other segments. The conserved terminal sequences in the genomes of members of the two groups were also similar. These data are undoubtedly an indication of the common evolutionary origin of these viruses. This clear genetic relatedness between members of distinct genera is unique within the family Reoviridae. Such a genetic relationship is usually observed between members of a single genus. However, the current taxonomic classification of aquareoviruses and orthoreoviruses in two different genera is supported by a number of characteristics, including their distinct G+C contents, unequal numbers of genome segments, absence of an antigenic relationship, different cytopathic effects and specific econiches.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The complete nucleotide sequence of the genome segment S8 of grass carp hemorrhage virus (GCHV) was determined from cDNA corresponding to the viral genomic RNA. It is 1,287 nucleotides in length and contains a large open reading frame that could encode a protein of 409 amino acids with a predicted molecular mass of 44 kD. The S8 was expressed using the pET fusion protein vector and detected by Western blotting analysis using the chicken egg IgY against intact GCHV particles, indicating that S8 encodes a virion protein. Amino acid sequence comparisons revealed that the protein encoded by S8 is closely related to protein alpha2 of mammalian reovirus, suggesting that the deduced protein of S8 is an inner capsid protein. Copyright (C) 2001 S. Karger AG, Basel.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: Serine/threonine kinases (STKs) have been found in an increasing number of prokaryotes, showing important roles in signal transduction that supplement the well known role of two-component system. Cyanobacteria are photoautotrophic prokaryotes able to grow in a wide range of ecological environments, and their signal transduction systems are important in adaptation to the environment. Sequence information from several cyanobacterial genomes offers a unique opportunity to conduct a comprehensive comparative analysis of this kinase family. In this study, we extracted information regarding Ser/Thr kinases from 21 species of sequenced cyanobacteria and investigated their diversity, conservation, domain structure, and evolution. Results: 286 putative STK homologues were identified. STKs are absent in four Prochlorococcus strains and one marine Synechococcus strain and abundant in filamentous nitrogen-fixing cyanobacteria. Motifs and invariant amino acids typical in eukaryotic STKs were conserved well in these proteins, and six more cyanobacteria- or bacteria-specific conserved residues were found. These STK proteins were classified into three major families according to their domain structures. Fourteen types and a total of 131 additional domains were identified, some of which are reported to participate in the recognition of signals or substrates. Cyanobacterial STKs show rather complicated phylogenetic relationships that correspond poorly with phylogenies based on 16S rRNA and those based on additional domains. Conclusion: The number of STK genes in different cyanobacteria is the result of the genome size, ecophysiology, and physiological properties of the organism. Similar conserved motifs and amino acids indicate that cyanobacterial STKs make use of a similar catalytic mechanism as eukaryotic STKs. Gene gain-and-loss is significant during STK evolution, along with domain shuffling and insertion. This study has established an overall framework of sequence-structure-function interactions for the STK gene family, which may facilitate further studies of the role of STKs in various organisms.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: Serine/threonine kinases (STKs) have been found in an increasing number of prokaryotes, showing important roles in signal transduction that supplement the well known role of two-component system. Cyanobacteria are photoautotrophic prokaryotes able to grow in a wide range of ecological environments, and their signal transduction systems are important in adaptation to the environment. Sequence information from several cyanobacterial genomes offers a unique opportunity to conduct a comprehensive comparative analysis of this kinase family. In this study, we extracted information regarding Ser/Thr kinases from 21 species of sequenced cyanobacteria and investigated their diversity, conservation, domain structure, and evolution. Results: 286 putative STK homologues were identified. STKs are absent in four Prochlorococcus strains and one marine Synechococcus strain and abundant in filamentous nitrogen-fixing cyanobacteria. Motifs and invariant amino acids typical in eukaryotic STKs were conserved well in these proteins, and six more cyanobacteria- or bacteria-specific conserved residues were found. These STK proteins were classified into three major families according to their domain structures. Fourteen types and a total of 131 additional domains were identified, some of which are reported to participate in the recognition of signals or substrates. Cyanobacterial STKs show rather complicated phylogenetic relationships that correspond poorly with phylogenies based on 16S rRNA and those based on additional domains. Conclusion: The number of STK genes in different cyanobacteria is the result of the genome size, ecophysiology, and physiological properties of the organism. Similar conserved motifs and amino acids indicate that cyanobacterial STKs make use of a similar catalytic mechanism as eukaryotic STKs. Gene gain-and-loss is significant during STK evolution, along with domain shuffling and insertion. This study has established an overall framework of sequence-structure-function interactions for the STK gene family, which may facilitate further studies of the role of STKs in various organisms.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Cyanobacteria are an ancient group of gram-negative bacteria with strong genome size variation ranging from 1.6 to 9.1 Mb. Here, we first retrieved all the putative restriction-modification (RM) genes in the draft genome of Spirulina and then performed a range of comparative and bioinformatic analyses on RM genes from unicellular and filamentous cyanobacterial genomes. We have identified 6 gene clusters containing putative Type I RMs and 11 putative Type II RMs or the solitary methyltransferases (MTases). RT-PCR analysis reveals that 6 of 18 MTases are not expressed in Spirulina, whereas one hsdM gene, with a mutated cognate hsdS, was detected to be expressed. Our results indicate that the number of RM genes in filamentous cyanobacteria is significantly higher than in unicellular species, and this expansion of RM systems in filamentous cyanobacteria may be related to their wide range of ecological tolerance. Furthermore, a coevolutionary pattern is found between hsdM and hsdR, with a large number of site pairs positively or negatively correlated, indicating the functional importance of these pairing interactions between their tertiary structures. No evidence for positive selection is found for the majority of RMs, e. g., hsdM, hsdS, hsdR, and Type II restriction endonuclease gene families, while a group of MTases exhibit a remarkable signature of adaptive evolution. Sites and genes identified here to have been under positive selection would provide targets for further research on their structural and functional evaluations.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Through random sequencing, we found a total of 884000 base-pairs (bp) of random genomic sequences in the genome of Chinese shrimp (Fenneropenaeus chinensis). Using bio-soft Tandem Repeat Finder (TRF) software, 2159 tandem repeats were found, in which there were 1714 microsatellites and 445 minisatellites, accounting for 79.4% and 20.6% of repeat sequences, respectively. The cumulative length of repeat sequences was found to be 116685 bp, accounting for 13.2% of the total DNA sequence; the cumulative length of microsatellites occupied 9.78% of the total DNA sequence, and that of minisatellites occupied 3.42%. In decreasing order, the 20 most abundant repeat sequence classes were as follows: AT (557), AC (471), AG (274), AAT (92), A (56), AAG (28), ATC (27), ATAG (27), AGG (18), ACT (15), C (11), AAC (11), ACAT (11), CAGA (10), AGAA (9), AGGG (7), CAAA (7), CGCA (6), ATAA (6), AGAGAA (6). Dinucleotide repeats, not only in the aspect of the number, but also in cumulative length, were the preponderant repeat type. There were few classes and low copy numbers of repeat units of the pentanucleotide repeat type, which included only three classes: AGAGA, GAGGC and AAAGA. The classes and copy numbers of heptanucleotide, eleven-nucleotide and thirteen-nucleotide primer-number-composed repeats were distinctly less than that of repeat types beside them.