888 results for Genome Sequence
Abstract:
The complete sequence of the 16,539 nucleotide mitochondrial genome from the single species of the catfish family Cranoglanididae, the helmet catfish Cranoglanis bouderius, was determined using the long and accurate polymerase chain reaction (LA PCR) method. The nucleotide sequences of C. bouderius mitochondrial DNA have been compared with those of three other catfish species in the same order. The C. bouderius mitochondrial genome contains 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and a non-coding control region, the gene order of which is identical to that observed in most other vertebrates. Phylogenetic analyses for 13 otophysan fishes were performed using a Bayesian method based on the concatenated mtDNA protein-coding gene sequence and the individual protein-coding gene sequence data sets. The competing otophysan topologies were then tested by using the approximately unbiased test, the Kishino-Hasegawa test, and the Shimodaira-Hasegawa test. The results show that the grouping ((((Characiformes, Gymnotiformes), Siluriformes), Cypriniformes), outgroup) is the most likely, but there is no significant difference between this one and the other alternative hypotheses. In addition, the phylogenetic placement of the family Cranoglanididae among siluriform families was also discussed. (c) 2006 Elsevier B.V. All rights reserved.
Abstract:
Antimicrobial peptides (AMPs) are important components of the host innate immune response against microbial invasion. In addition to the previously known four classes of antimicrobial peptides, a fifth class of antimicrobial peptides has been recently identified to include NK-lysins that have a globular three-dimensional structure and are larger with 74-78 amino acid residues. NK-lysin has been shown to harbor antimicrobial activities against a wide spectrum of microorganisms including bacteria, fungi, protozoa, and parasites. To date, NK-lysin genes have been reported from only a limited number of organisms. We previously identified an NK-lysin cDNA in channel catfish. Here we report the identification of two novel types of NK-lysin transcripts in channel catfish. Altogether, three distinct NK-lysin transcripts exist in channel catfish. In this work, their encoding genes were identified, sequenced, and characterized. We provide strong evidence that the catfish NK-lysin gene is tripled in the same genomic neighborhood. All three catfish NK-lysin genes are present in the same genomic region and are tightly linked on the same chromosome, as the same BAC clones harbor all three copies of the NK-lysin genes. All three NK-lysin genes are expressed, but exhibit distinct expression profiles in various tissues. In spite of the existence of a single copy of the NK-lysin gene in the human genome, and only a single hit from the pufferfish genome, there are two tripled clusters of NK-lysin genes on chromosome 17 of zebrafish in addition to one more copy on its chromosome 5. The similarity in the genomic arrangement of the tripled NK-lysin genes in channel catfish and zebrafish suggests similar evolution of NK-lysin genes. (c) 2005 Elsevier Ltd. All rights reserved.
Abstract:
Full-length and partial genome sequences of four members of the genus Aquareovirus, family Reoviridae (Golden shiner reovirus, Grass carp reovirus, Striped bass reovirus and golden ide reovirus) were characterized. Based on sequence comparison, the unclassified Grass carp reovirus was shown to be a member of the species Aquareovirus C. The status of golden ide reovirus, another unclassified aquareovirus, was also examined. Sequence analysis showed that it did not belong to the species Aquareovirus A or C, but assessment of its relationship to the species Aquareovirus B, D, E and F was hampered by the absence of genetic data from these species. In agreement with previous reports of ultrastructural resemblance between aquareoviruses and orthoreoviruses, genetic analysis revealed homology in the genes of the two groups. This homology concerned eight of the 11 segments of the aquareovirus genome (amino acid identity 17-42%), and similar genetic organization was observed in two other segments. The conserved terminal sequences in the genomes of members of the two groups were also similar. These data are undoubtedly an indication of the common evolutionary origin of these viruses. This clear genetic relatedness between members of distinct genera is unique within the family Reoviridae. Such a genetic relationship is usually observed between members of a single genus. However, the current taxonomic classification of aquareoviruses and orthoreoviruses in two different genera is supported by a number of characteristics, including their distinct G+C contents, unequal numbers of genome segments, absence of an antigenic relationship, different cytopathic effects and specific econiches.
Abstract:
The complete nucleotide sequence of the genome segment S8 of grass carp hemorrhage virus (GCHV) was determined from cDNA corresponding to the viral genomic RNA. It is 1,287 nucleotides in length and contains a large open reading frame that could encode a protein of 409 amino acids with a predicted molecular mass of 44 kD. The S8 was expressed using the pET fusion protein vector and detected by Western blotting analysis using the chicken egg IgY against intact GCHV particles, indicating that S8 encodes a virion protein. Amino acid sequence comparisons revealed that the protein encoded by S8 is closely related to protein alpha2 of mammalian reovirus, suggesting that the deduced protein of S8 is an inner capsid protein. Copyright (C) 2001 S. Karger AG, Basel.
Abstract:
Background: Serine/threonine kinases (STKs) have been found in an increasing number of prokaryotes, showing important roles in signal transduction that supplement the well-known role of two-component systems. Cyanobacteria are photoautotrophic prokaryotes able to grow in a wide range of ecological environments, and their signal transduction systems are important in adaptation to the environment. Sequence information from several cyanobacterial genomes offers a unique opportunity to conduct a comprehensive comparative analysis of this kinase family. In this study, we extracted information regarding Ser/Thr kinases from 21 species of sequenced cyanobacteria and investigated their diversity, conservation, domain structure, and evolution. Results: 286 putative STK homologues were identified. STKs are absent in four Prochlorococcus strains and one marine Synechococcus strain and abundant in filamentous nitrogen-fixing cyanobacteria. Motifs and invariant amino acids typical in eukaryotic STKs were well conserved in these proteins, and six more cyanobacteria- or bacteria-specific conserved residues were found. These STK proteins were classified into three major families according to their domain structures. Fourteen types and a total of 131 additional domains were identified, some of which are reported to participate in the recognition of signals or substrates. Cyanobacterial STKs show rather complicated phylogenetic relationships that correspond poorly with phylogenies based on 16S rRNA and those based on additional domains. Conclusion: The number of STK genes in different cyanobacteria is the result of the genome size, ecophysiology, and physiological properties of the organism. Similar conserved motifs and amino acids indicate that cyanobacterial STKs make use of a catalytic mechanism similar to that of eukaryotic STKs. Gene gain-and-loss is significant during STK evolution, along with domain shuffling and insertion.
This study has established an overall framework of sequence-structure-function interactions for the STK gene family, which may facilitate further studies of the role of STKs in various organisms.
Genome-wide analysis of restriction-modification system in unicellular and filamentous cyanobacteria
Abstract:
Cyanobacteria are an ancient group of gram-negative bacteria with strong genome size variation ranging from 1.6 to 9.1 Mb. Here, we first retrieved all the putative restriction-modification (RM) genes in the draft genome of Spirulina and then performed a range of comparative and bioinformatic analyses on RM genes from unicellular and filamentous cyanobacterial genomes. We have identified 6 gene clusters containing putative Type I RMs and 11 putative Type II RMs or solitary methyltransferases (MTases). RT-PCR analysis reveals that 6 of 18 MTases are not expressed in Spirulina, whereas one hsdM gene, with a mutated cognate hsdS, was found to be expressed. Our results indicate that the number of RM genes in filamentous cyanobacteria is significantly higher than in unicellular species, and this expansion of RM systems in filamentous cyanobacteria may be related to their wide range of ecological tolerance. Furthermore, a coevolutionary pattern is found between hsdM and hsdR, with a large number of site pairs positively or negatively correlated, indicating the functional importance of these pairing interactions between their tertiary structures. No evidence for positive selection is found for the majority of RMs, e.g., the hsdM, hsdS, hsdR, and Type II restriction endonuclease gene families, while a group of MTases exhibits a remarkable signature of adaptive evolution. Sites and genes identified here to have been under positive selection would provide targets for further research on their structural and functional evaluations.
Abstract:
Through random sequencing, we obtained a total of 884000 base pairs (bp) of random genomic sequences from the genome of Chinese shrimp (Fenneropenaeus chinensis). Using the Tandem Repeat Finder (TRF) software, 2159 tandem repeats were found, of which 1714 were microsatellites and 445 were minisatellites, accounting for 79.4% and 20.6% of repeat sequences, respectively. The cumulative length of repeat sequences was found to be 116685 bp, accounting for 13.2% of the total DNA sequence; the cumulative length of microsatellites occupied 9.78% of the total DNA sequence, and that of minisatellites occupied 3.42%. In decreasing order, the 20 most abundant repeat sequence classes were as follows: AT (557), AC (471), AG (274), AAT (92), A (56), AAG (28), ATC (27), ATAG (27), AGG (18), ACT (15), C (11), AAC (11), ACAT (11), CAGA (10), AGAA (9), AGGG (7), CAAA (7), CGCA (6), ATAA (6), AGAGAA (6). Dinucleotide repeats were the predominant repeat type, both in number and in cumulative length. The pentanucleotide repeat type had few classes and low copy numbers of repeat units, comprising only three classes: AGAGA, GAGGC and AAAGA. The classes and copy numbers of heptanucleotide, eleven-nucleotide and thirteen-nucleotide repeats, whose unit lengths are prime numbers, were distinctly fewer than those of the repeat types adjacent to them.
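The proportions reported in this abstract follow directly from its counts. The sketch below simply recomputes them from the figures quoted above; the variable and function names are illustrative assumptions, not part of the original study.

```python
# Recompute the repeat statistics quoted in the abstract.
# All numbers are taken from the abstract; names are illustrative only.

TOTAL_BP = 884_000          # total random genomic sequence (bp)
N_MICROSATELLITES = 1714    # microsatellite count
N_MINISATELLITES = 445      # minisatellite count
REPEAT_BP = 116_685         # cumulative length of all tandem repeats (bp)


def percent(part: float, whole: float, digits: int = 1) -> float:
    """Return part/whole as a percentage rounded to `digits` decimals."""
    return round(100.0 * part / whole, digits)


n_repeats = N_MICROSATELLITES + N_MINISATELLITES
micro_share = percent(N_MICROSATELLITES, n_repeats)
mini_share = percent(N_MINISATELLITES, n_repeats)
repeat_fraction = percent(REPEAT_BP, TOTAL_BP)

print(n_repeats, micro_share, mini_share, repeat_fraction)
```

The recomputed values agree with the 2159 repeats, 79.4%, 20.6% and 13.2% stated in the abstract.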
Abstract:
A highly repetitive satellite sequence was previously identified in the Pacific oyster Crassostrea gigas Thunberg. The sequence has 168 bp per unit, is present in tandem repeats, and accounts for 1% to 4% of the genome. We studied the chromosomal location of this satellite sequence by fluorescence in situ hybridization (FISH). A probe was made by polymerase chain reaction and incorporation of digoxigenin-11-dUTP. Hybridization was detected with fluorescein-labeled antidigoxigenin antibodies. FISH signals were located at centromeric regions of 7 pairs of the Pacific oyster chromosomes. No interstitial site was found. Signals were strong and consistent on chromosomes 1, 2, 4, and 7, but weak or variable on chromosomes 5, 8, and 10. No signal was observed on chromosomes 3, 6, and 9. Our results showed that this sequence is clearly a centromeric satellite, disputing its previous assignment to the telomeric and submetacentric regions of 2 chromosomes. No signal was detected in the American oyster (Crassostrea virginica Gmelin).
Abstract:
Arthrospira (Spirulina) (Setchell & Gardner) is an important cyanobacterium, not only for its nutritional potential but also for its special biological characteristics. An unbiased fosmid library of Arthrospira maxima FACHB438 that contains 4300 clones was constructed. The insert fragments range in size from 15.5 to 48.9 kb, with an average size of 37.6 kb. The recombination frequency is 100%. The library therefore represents 29.9 equivalents of the Arthrospira genome size of 5.4 Mb. A total of 719 sample clones were randomly chosen from the library, yielding 602 usable sequences that comprised 307,547 bases and covered 5.70% of the whole genome. The codon usage of A. maxima was not strongly biased. GC content at the first position of codons (46.9%) was higher than at the second (39.8%) and the third (45.5%) positions. GC content of the genome was 43.6%. Of these sequences, 287 (47.7%) showed high similarities to known genes, 63 (10.5%) to hypothetical genes and the remaining 252 (41.8%) had no significant similarities. The assigned genes were classified into 22 categories with respect to different biological roles. Remarkably, the high presence of 25 sequences (4.2%) encoding reverse transcriptase indicates that the RT gene may have multiple copies in the A. maxima genome and might play an important role in the evolutionary history and metabolic regulation. In addition, the sequences encoding the ATP-binding cassette transport system and the two-component signal transduction system were the second and third most frequent genes, respectively. These genomic features provide some clues as to the mechanisms by which this organism adapts to the high concentration of bicarbonate and to the high pH environment.
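The library coverage and sampled fraction quoted in this abstract can be checked with simple arithmetic. The following is a minimal sketch assuming coverage is computed as (number of clones × average insert size) / genome size, which is the usual definition but is not spelled out in the abstract; all names are illustrative.

```python
# Check the fosmid library figures quoted in the abstract.
# Assumption: coverage = clones * mean insert size / genome size.

N_CLONES = 4300              # clones in the library
MEAN_INSERT_KB = 37.6        # average insert size (kb)
GENOME_SIZE_KB = 5400.0      # Arthrospira genome size, 5.4 Mb expressed in kb
SAMPLED_BASES = 307_547      # bases in the 602 usable sequences


def genome_equivalents(n_clones: int, insert_kb: float, genome_kb: float) -> float:
    """Return how many genome equivalents the library represents."""
    return n_clones * insert_kb / genome_kb


coverage = round(genome_equivalents(N_CLONES, MEAN_INSERT_KB, GENOME_SIZE_KB), 1)
sampled_percent = round(100.0 * SAMPLED_BASES / (GENOME_SIZE_KB * 1000.0), 2)

print(coverage, sampled_percent)
```

Under that assumption the results match the 29.9 genome equivalents and 5.70% sampled fraction given above.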
Abstract:
A large number of polymorphic simple sequence repeats (SSRs) or microsatellites are needed to develop a genetic map for shrimp. However, developing an SSR map is very time-consuming and expensive, and most SSRs are not specifically linked to gene loci of immediate interest. We report here on our strategy to develop polymorphic markers using expressed sequence tags (ESTs) by designing primers flanking single or multiple SSRs with three or more repeats. A subtracted cDNA library was prepared using RNA from specific pathogen-free (SPF) Litopenaeus vannamei juveniles (~1 g) collected before (0 h) and after (48 h) inoculation with the China isolate of white spot syndrome virus (WSSV). A total of 224 clones were sequenced, 194 of which were useful for homology comparisons against annotated genes in NCBI nonredundant (nr) and protein databases, providing 179 sequences encoded by nuclear DNA, 4 by mitochondrial DNA, and 11 that were similar to portions of the WSSV genome. The nuclear sequences clustered in 43 groups, 11 of which were homologous to various ESTs of unknown function, 4 had no homology to any sequence, and 28 showed similarities to known genes of invertebrates and vertebrates, representatives of cellular metabolic processes such as calcium ion balance, cytoskeleton mRNAs, and protein synthesis. A few sequences were homologous to immune system-related (allergens) genes and two were similar to motifs of the sex-lethal gene of Drosophila. A large number of EST sequences were similar to domains of the EF-hand superfamily (Ca2+ binding motif and FRQ protein domain of myosin light chains). Single or multiple SSRs with three or more repeats were found in approximately 61% of the 179 nuclear sequences. Primer sets were designed from 28 sequences representing 19 known or putative genes and tested for polymorphism (EST-SSR marker) in a small test panel containing 16 individuals.
Ten (53%) of the 19 putative or unknown function genes were polymorphic, 4 were monomorphic, and 3 either failed to satisfactorily amplify genomic DNA or require further optimization of the allele amplification conditions. Five polymorphic ESTs were genotyped with the entire reference mapping family. Two of them (actin, accession #CX535973, and shrimp allergen arginine kinase, accession #CX535999) did not amplify with all offspring of the IRMF panel, suggesting the presence of null alleles, and three of them amplified in most of the IRMF offspring and were used for linkage analysis. The EF-hand motif of myosin light chain (accession #CX535935) was placed in ShrimpMap's linkage group 7, whereas ribosomal protein S5 (accession #CX535957) and troponin I (accession #CX535976) remained unassigned. Results indicate that (a) a large number of ESTs isolated from this cDNA library are similar to cytoskeleton mRNAs and may reflect a normal pathway of the cellular response after infection with WSSV, and (b) primers flanking single or multiple SSRs with three or more repeats from shrimp ESTs could be an efficient approach to develop polymorphic markers useful for linkage mapping. Work is underway to map additional SSR-containing ESTs from this and other cDNA libraries as a plausible strategy to increase marker density in ShrimpMap.
Abstract:
The complete mitochondrial (mt) DNA sequence was determined for a ridgetail white prawn, Exopalaemon carinicauda Holthuis, 1950 (Crustacea: Decapoda: Palaemonidae). The mt genome is 15,730 bp in length, encoding a standard set of 13 protein-coding genes, 2 ribosomal RNA genes, and 22 transfer RNA genes, which is typical for metazoans. The majority strand consists of 33.6% A, 23.0% C, 13.4% G, and 30.0% T bases (AT skew = 0.057; GC skew = -0.264). A total of 1045 bp of non-coding nucleotides were observed in 16 intergenic regions, including a major A+T-rich (79.7%) noncoding region (886 bp). A novel translocation of tRNA(Pro) and tRNA(Thr) was found when comparing this genome with the pancrustacean ground pattern, indicating that gene order is not conserved among caridean mitochondria. Furthermore, the Ka/Ks ratio in the 13 protein-coding genes among three caridean species is much less than 1, which indicates strong purifying selection within this group. To investigate the phylogenetic relationships within Malacostraca, phylogenetic trees based on currently available malacostracan complete mitochondrial sequences were built with the maximum likelihood and Bayesian models. All analyses based on nucleotide and amino acid data strongly support the monophyly of Decapoda. The Penaeidae, Reptantia, Caridea, and Meiura clades were also recovered as monophyletic groups with strong statistical support. However, the phylogenetic relationships within Pleocyemata are unstable, as represented by the inclusion or exclusion of Caridea. (C) 2009 Elsevier B.V. All rights reserved.
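The skew values quoted in this abstract can be reproduced from the reported base composition. The sketch below uses the standard definitions AT skew = (A - T) / (A + T) and GC skew = (G - C) / (G + C); the abstract does not state its formula, so treating these as the definitions used is an assumption, and the function name is illustrative.

```python
# Reproduce the AT and GC skew from the majority-strand base composition.
# Assumption: skew(x, y) = (x - y) / (x + y), the commonly used definition.

BASE_PERCENT = {"A": 33.6, "C": 23.0, "G": 13.4, "T": 30.0}  # from the abstract


def skew(first: float, second: float) -> float:
    """Return the compositional skew (first - second) / (first + second)."""
    return (first - second) / (first + second)


at_skew = round(skew(BASE_PERCENT["A"], BASE_PERCENT["T"]), 3)
gc_skew = round(skew(BASE_PERCENT["G"], BASE_PERCENT["C"]), 3)

print(at_skew, gc_skew)
```

With these definitions the computed values agree with the reported AT skew of 0.057 and GC skew of -0.264.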
Abstract:
The x- and y-type high molecular weight (HMW) glutenin subunits are conserved seed storage proteins in wheat and related species. Here we describe investigations on the HMW glutenin subunits from several Pseudoroegneria accessions. The electrophoretic mobilities of the HMW glutenin subunits from Pd. stipifolia, Pd. tauri and Pd. strigosa were much faster than those of orthologous wheat subunits, indicating that their protein size may be smaller than that of wheat subunits. The coding sequence of the Glu-1St1 subunit (encoded by the Pseudoroegneria stipifolia accession PI325181) was isolated, and found to represent the native open reading frame (ORF) by in vitro expression. The deduced amino acid sequence of Glu-1St1 matched that determined from the native subunit by mass spectrometric analysis. The domain organization in Glu-1St1 showed high similarity with that of typical HMW glutenin subunits. However, Glu-1St1 exhibited several distinct characteristics. First, the length of its repetitive domain was substantially smaller than that of conventional subunits, which explains its much faster electrophoretic mobility in SDS-PAGE. Second, although the N-terminal domain of Glu-1St1 resembled that of the y-type subunit, its C-terminal domain was more similar to that of the x-type subunit. Third, the N- and C-terminal domains of Glu-1St1 shared conserved features with those of barley D-hordein, but the repeat motifs and the organization of its repetitive domain were more similar to those of HMW glutenin subunits than to D-hordein. We conclude that Glu-1St1 is a novel variant of HMW glutenin subunits. The analysis of Glu-1St1 may provide new insight into the evolution of HMW glutenin subunits in Triticeae species. (C) 2007 Elsevier Ltd. All rights reserved.
Abstract:
Matthew J. Nicholson, Michael K. Theodorou and Jayne L. Brookman. (2005). Molecular analysis of the anaerobic rumen fungus Orpinomyces - insights into an AT-rich genome. Microbiology, 151 (1), 121-133. Sponsorship: BBSRC RAE2008
Abstract:
BACKGROUND: Blood lipid levels including low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), and triglycerides (TG) are highly heritable. Genome-wide association is a promising approach to map genetic loci related to these heritable phenotypes. METHODS: In 1087 Framingham Heart Study Offspring cohort participants (mean age 47 years, 52% women), we conducted genome-wide analyses (Affymetrix 100K GeneChip) for fasting blood lipid traits. Total cholesterol, HDL-C, and TG were measured by standard enzymatic methods and LDL-C was calculated using the Friedewald formula. The long-term averages of up to seven measurements of LDL-C, HDL-C, and TG over a ~30 year span were the primary phenotypes. We used generalized estimating equations (GEE), family-based association tests (FBAT) and variance components linkage to investigate the relationships between SNPs (on autosomes, with minor allele frequency ≥ 10%, genotypic call rate ≥ 80%, and Hardy-Weinberg equilibrium p ≥ 0.001) and multivariable-adjusted residuals. We pursued a three-stage replication strategy of the GEE association results with 287 SNPs (P < 0.001 in Stage I) tested in Stage II (n ~1450 individuals) and 40 SNPs (P < 0.001 in joint analysis of Stages I and II) tested in Stage III (n ~6650 individuals). RESULTS: Long-term averages of LDL-C, HDL-C, and TG were highly heritable (h^2 = 0.66, 0.69, 0.58, respectively; each P < 0.0001). Of 70,987 tests for each of the phenotypes, two SNPs had p < 10^-5 in GEE results for LDL-C, four for HDL-C, and one for TG. For each multivariable-adjusted phenotype, the number of SNPs with association p < 10^-4 ranged from 13 to 18 and with p < 10^-3, from 94 to 149. Some results confirmed previously reported associations with candidate genes including variation in the lipoprotein lipase gene (LPL) and HDL-C and TG (rs7007797; P = 0.0005 for HDL-C and 0.002 for TG).
The full set of GEE, FBAT and linkage results is posted at the database of Genotype and Phenotype (dbGaP). After three stages of replication, there was no convincing statistical evidence for association (i.e., combined P < 10^-5 across all three stages) between any of the tested SNPs and lipid phenotypes. CONCLUSION: Using a 100K genome-wide scan, we have generated a set of putative associations for common sequence variants and lipid phenotypes. Validation of selected hypotheses in additional samples did not identify any new loci underlying variability in blood lipids. Lack of replication may be due to inadequate statistical power to detect modest quantitative trait locus effects (i.e., < 1% of trait variance explained) or reduced genomic coverage of the 100K array. GWAS in FHS using a denser genome-wide genotyping platform and a better-powered replication strategy may identify novel loci underlying blood lipids.
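The abstract states that LDL-C was calculated using the Friedewald formula. As a rough illustration, the sketch below applies the commonly cited form of that formula for values in mg/dL, LDL-C = total cholesterol - HDL-C - TG/5. The function name, the input values, and the 400 mg/dL triglyceride cutoff are assumptions for illustration and are not taken from the study.

```python
# Minimal sketch of the Friedewald estimate of LDL-C (all values in mg/dL).
# The example inputs are hypothetical, not data from the study.


def friedewald_ldl(total_chol: float, hdl: float, triglycerides: float) -> float:
    """Estimate LDL-C as total cholesterol - HDL-C - TG/5 (mg/dL)."""
    # The estimate is generally considered unreliable at high triglyceride
    # levels; 400 mg/dL is the commonly cited cutoff (assumed here).
    if triglycerides >= 400:
        raise ValueError("Friedewald estimate is not valid for TG >= 400 mg/dL")
    return total_chol - hdl - triglycerides / 5.0


example_ldl = friedewald_ldl(total_chol=200.0, hdl=50.0, triglycerides=150.0)
print(example_ldl)
```

How the study handled participants with high triglycerides is not stated in the abstract, so the cutoff check here is only one plausible convention.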