831 resultados para Klebsiella pneumoniae genome sequence
Resumo:
Complementary DNAs covering the entire RNA genome of soybean dwarf luteovirus (SDV) were cloned and sequenced. Computer analysis of the 5861 nucleotide sequence revealed five major open reading frames (ORFs) possessing conservation of sequence and organisation with known luteovirus sequences. Comparative analyses of the genome structure show that SDV shares sequence homology and features of gene organisation with barley yellow dwarf virus (PAV isolate) in the 5' half of the genome, yet is more closely related to potato leafroll virus in its 3' coding regions. In addition, SDV differs from other known luteoviruses in possessing an exceptionally long 3' terminal sequence with no apparent coding capacity. We conclude from these data that the SDV genome represents a third variant genome type in the luteovirus group.
Resumo:
Bats account for one-fifth of mammalian species, are the only mammals with powered flight, and are among the few animals that echolocate. The insect-eating Brandt’s bat (Myotis brandtii) is the longest-lived bat species known to date (lifespan exceeds 40 years) and, at 4–8 g adult body weight, is the most extreme mammal with regard to disparity between body mass and longevity. Here we report sequencing and analysis of the Brandt’s bat genome and transcriptome, which suggest adaptations consistent with echolocation and hibernation, as well as altered metabolism, reproduction and visual function. Unique sequence changes in growth hormone and insulin-like growth factor 1 receptors are also observed. The data suggest that an altered growth hormone/insulin-like growth factor 1 axis, which may be common to other long-lived bat species, together with adaptations such as hibernation and low reproductive rate, contribute to the exceptional lifespan of the Brandt’s bat.
Resumo:
The woodland strawberry, Fragaria vesca (2n = 2x = 14), is a versatile experimental plant system. This diminutive herbaceous perennial has a small genome (240 Mb), is amenable to genetic transformation and shares substantial sequence identity with the cultivated strawberry (Fragaria Ã- ananassa) and other economically important rosaceous plants. Here we report the draft F. vesca genome, which was sequenced to ×-39 coverage using second-generation technology, assembled de novo and then anchored to the genetic linkage map into seven pseudochromosomes. This diploid strawberry sequence lacks the large genome duplications seen in other rosids. Gene prediction modeling identified 34,809 genes, with most being supported by transcriptome mapping. Genes critical to valuable horticultural traits including flavor, nutritional value and flowering time were identified. Macrosyntenic relationships between Fragaria and Prunus predict a hypothetical ancestral Rosaceae genome that had nine chromosomes. New phylogenetic analysis of 154 protein-coding genes suggests that assignment of Populus to Malvidae, rather than Fabidae, is warranted.
Resumo:
Over the past decade the mitochondrial (mt) genome has become the most widely used genomic resource available for systematic entomology. While the availability of other types of ‘–omics’ data – in particular transcriptomes – is increasing rapidly, mt genomes are still vastly cheaper to sequence and are far less demanding of high quality templates. Furthermore, almost all other ‘–omics’ approaches also sequence the mt genome, and so it can form a bridge between legacy and contemporary datasets. Mitochondrial genomes have now been sequenced for all insect orders, and in many instances representatives of each major lineage within orders (suborders, series or superfamilies depending on the group). They have also been applied to systematic questions at all taxonomic scales from resolving interordinal relationships (e.g. Cameron et al., 2009; Wan et al., 2012; Wang et al., 2012), through many intraordinal (e.g. Dowton et al., 2009; Timmermans et al., 2010; Zhao et al. 2013a) and family-level studies (e.g. Nelson et al., 2012; Zhao et al., 2013b) to population/biogeographic studies (e.g. Ma et al., 2012). Methodological issues around the use of mt genomes in insect phylogenetic analyses and the empirical results found to date have recently been reviewed by Cameron (2014), yet the technical aspects of sequencing and annotating mt genomes were not covered. Most papers which generate new mt genome report their methods in a simplified form which can be difficult to replicate without specific knowledge of the field. Published studies utilize a sufficiently wide range of approaches, usually without justification for the one chosen, that confusion about commonly used jargon such as ‘long PCR’ and ‘primer walking’ could be a serious barrier to entry. Furthermore, sequenced mt genomes have been annotated (gene locations defined) to wildly varying standards and improving data quality through consistent annotation procedures will benefit all downstream users of these datasets. The aims of this review are therefore to: 1. Describe in detail the various sequencing methods used on insect mt genomes; 2. Explore the strengths/weakness of different approaches; 3. Outline the procedures and software used for insect mt genome annotation, and; 4. Highlight quality control steps used for new annotations, and to improve the re-annotation of previously sequenced mt genomes used in systematic or comparative research.
Resumo:
We determined the nucleotide sequence of the mitochondrial genome (mtgenome) of Spilonota lechriaspis Meyrick (Lepidoptera: Tortricidae). The entire closed circular molecule is 15,368 bp and contains 37 genes with the typical gene complement and order for lepidopteran mtgenomes. All tRNAs except tRNASer(AGN) can be folded into the typical cloverleaf secondary structures. The protein-coding genes (PCGs) have typical mitochondrial start codons, with the exception of COI, which uses the unusual CGA one as is found in all other Lepidoptera sequenced to date. In addition, six of 13 PCGs harbor the incomplete termination codons, a single T. The A+T-rich region contains some conserved structures that are similar to those found in other lepidopteran mtgenomes, including a structure combining the motif 'ATAGA', a 19-bp poly(T) stretch and three microsatellite (AT)n elements which are part of larger 122+ bp macrorepeats. This is the first report of macrorepeats in a lepidopteran mtgenome.
Resumo:
Coleoptera is the most diverse group of insects with over 360,000 described species divided into four suborders: Adephaga, Archostemata, Myxophaga, and Polyphaga. In this study, we present six new complete mitochondrial genome (mtgenome) descriptions, including a representative of each suborder, and analyze the evolution of mtgenomes from a comparative framework using all available coleopteran mtgenomes. We propose a modification of atypical cox1 start codons based on sequence alignment to better reflect the conservation observed across species as well as findings of TTG start codons in other genes. We also analyze tRNA-Ser(AGN) anticodons, usually GCU in arthropods, and report a conserved UCU anticodon as a possible synapomorphy across Polyphaga. We further analyze the secondary structure of tRNA-Ser(AGN) and present a consensus structure and an updated covariance model that allows tRNAscan-SE (via the COVE software package) to locate and fold these atypical tRNAs with much greater consistency. We also report secondary structure predictions for both rRNA genes based on conserved stems. All six species of beetle have the same gene order as the ancestral insect. We report noncoding DNA regions, including a small gap region of about 20 bp between tRNA-Ser(UCN) and nad1 that is present in all six genomes, and present results of a base composition analysis.
Resumo:
Autotransporter (AT) proteins are found in all Escherichia coli pathotypes and are often associated with virulence. In this study we took advantage of the large number of available E. coli genome sequences to perform an in-depth bioinformatic analysis of AT-encoding genes. Twenty-eight E. coli genome sequences were probed using an iterative approach, which revealed a total of 215 AT-encoding sequences that represented three major groups of distinct domain architecture: (i) serine protease AT proteins, (ii) trimeric AT adhesins and (iii) AIDA-I-type AT proteins. A number of subgroups were identified within each broad category, and most subgroups contained at least one characterized AT protein; however, seven subgroups contained no previously described proteins. The AIDA-I-type AT proteins represented the largest and most diverse group, with up to 16 subgroups identified from sequence-based comparisons. Nine of the AIDA-I-type AT protein subgroups contained at least one protein that possessed functional properties associated with aggregation and/or biofilm formation, suggesting a high degree of redundancy for this phenotype. The Ag43, YfaL/EhaC, EhaB/UpaC and UpaG subgroups were found in nearly all E. coli strains. Among the remaining subgroups, there was a tendency for AT proteins to be associated with individual E. coli pathotypes, suggesting that they contribute to tissue tropism or symptoms specific to different disease outcomes.
Resumo:
The chlamydiae are obligate intracellular parasites that have evolved specific interactions with their various hosts and host cell types to ensure their successful survival and consequential pathogenesis. The species Chlamydia pneumoniae is ubiquitous, with serological studies showing that most humans are infected at some stage in their lifetime. While most human infections are asymptomatic, C. pneumoniae can cause more-severe respiratory disease and pneumonia and has been linked to chronic diseases such as asthma, atherosclerosis, and even Alzheimer's disease. The widely dispersed animal-adapted C. pneumoniae strains cause an equally wide range of diseases in their hosts. It is emerging that the ability of C. pneumoniae to survive inside its target cells, including evasion of the host's immune attack mechanisms, is linked to the acquisition of key metabolites. Tryptophan and arginine are key checkpoint compounds in this host-parasite battle. Interestingly, the animal strains of C. pneumoniae have a slightly larger genome, enabling them to cope better with metabolite restrictions. It therefore appears that as the evolutionarily more ancient animal strains have evolved to infect humans, they have selectively become more "susceptible" to the levels of key metabolites, such as tryptophan. While this might initially appear to be a weakness, it allows these human C. pneumoniae strains to exquisitely sense host immune attack and respond by rapidly reverting to a persistent phase. During persistence, they reduce their metabolic levels, halting progression of their developmental cycle, waiting until the hostile external conditions have passed before they reemerge.
Resumo:
Streptococcus pneumoniae is a potentially deadly human pathogen associated with high morbidity, mortality and global economic burden. The universally used bacterial genotyping methods are multilocus sequence typing and pulsed field gel electrophoresis. However, another highly discriminatory, rapid and less expensive genotyping technique,multilocus variable number of tandem repeat analysis (MLVA), has been developed. Unfortunately, no universal MLVA protocol exists, and some MLVA protocols do not amplify certain loci for all pneumococcal serotypes, leaving genotyping profiles incomplete. A number of other genotyping or characterization methods have been developed and will be discussed. This review examines the various protocols for genotyping S. pneumoniae and highlights the current direction technology and research is heading to understand this bacterium.
Resumo:
Background Globally, over 800 000 children under five die each year from infectious diseases caused by Streptococcus pneumoniae. To understand genetic relatedness between isolates, study transmission routes, assess the impact of human interventions e.g. vaccines, and determine infection sources, genotyping methods are required. The ‘gold standard’ genotyping method, Multi-Locus Sequence Typing (MLST), is useful for long-term and global studies. Another genotyping method, Multi-Locus Variable Number of Tandem Repeat Analysis (MLVA), has emerged as a more discriminatory, inexpensive and faster technique; however there is no universally accepted method and it is currently suitable for short-term and localised epidemiology studies. Currently Australia has no national MLST database, nor has it adopted any MLVA method for short-term or localised studies. This study aims to improve S. pneumoniae genotyping methods by modifying the existing MLVA techniques to be more discriminatory, faster, cheaper and technically less demanding than previously published MLVA methods and MLST. Methods Four different MLVA protocols, including a modified method, were applied to 317 isolates of serotyped invasive S. pneumoniae isolated from sterile body sites of Queensland children under 15 years from 2007–2012. MLST was applied to 202 isolates for comparison. Results The modified MLVA4 is significantly more discriminatory than the ‘gold standard’ MLST method. MLVA4 has similar discrimination compared to other MLVA techniques in this study). The failure to amplify particular loci in previous MLVA methods were minimised in MLVA4. Failure to amplify BOX-13 and Spneu19 were found to be serotype specific. Conclusion We have modified a highly discriminatory MLVA technique for genotyping Queensland invasive S. pneumoniae. MLVA4 has the ability to enhance our understanding of the pneumococcal epidemiology and the changing genetics of the pneumococcus in localised and short-term studies.
Resumo:
It's akin to the old Spanish, English and Portuguese explorers. They would take their boats until they found some edge of land, then they would go up and plant the flag of their king or queen. They didn't know what they'd discovered; how big it is, where it goes to - but they would claim it anyway. David Korn of the Association of American Medical Colleges This article analyses recent litigation over patent law and expressed sequence tags (ESTs). In the case of In re Fisher, the United States Court of Appeals for the Federal Circuit engaged in judicial consideration of the revised utility guidelines of the United States Patent and Trademark Office (USPTO). In this matter, the agricultural biotechnology company Monsanto sought to patent ESTs in maize plants. A patent examiner and the Board of Patent Appeals and Interferences had doubted whether the patent application was useful. Monsanto appealed against the rulings of the USPTO. A number of amicus curiae intervened in the matter in support of the USPTO - including Genentech, Affymetrix, Dow AgroSciences, Eli Lilly, the National Academy of Sciences, and the Association of American Medical Colleges. The majority of the Court of Appeals for the Federal Circuit supported the position of the USPTO, and rejected the patent application on the grounds of utility. The split decision highlighted institutional tensions over the appropriate thresholds for patent criteria - such as novelty, non-obviousness, and utility. The litigation raised larger questions about the definition of research tools, the incremental nature of scientific progress, and the role of patent law in innovation policy. The decision of In re Fisher will have significant ramifications for gene patents, in the wake of the human genome project. Arguably, the USPTO utility guidelines need to be reinforced by a tougher application of the standards of novelty and non-obviousness in respect of gene patents.
Resumo:
Objective. To undertake a systematic wholegenome screen to identify regions exhibiting genetic linkage to rheumatoid arthritis (RA). Methods. Two hundred fifty-two RA-affected sibling pairs from 182 UK families were genotyped using 365 highly informative microsatellite markers. Microsatellite genotyping was performed using fluorescent polymerase chain reaction primers and semiautomated DNA sequencing technology. Linkage analysis was undertaken using MAPMAKER/SIBS for single-point and multipoint analysis. Results. Significant linkage (maximum logarithm of odds score 4.7 [P = 0.000003] at marker D6S276, 1 cM from HLA-DRB1) was identified around the major histocompatibility complex (MHC) region on chromosome 6. Suggestive linkage (P < 7.4 × 10-4) was identified on chromosome 6q by single- and multipoint analysis. Ten other sites of nominal linkage (P < 0.05) were identified on chromosomes 3p, 4q, 7p, 2 regions of 10q, 2 regions of 14q, 16p, 21q, and Xq by single-point analysis and on 3 sites (1q, 14q, and 14q) by multipoint analysis. Conclusion. Linkage to the MHC region was confirmed. Eleven non-HLA regions demonstrated evidence of suggestive or nominal linkage, but none reached the genome-wide threshold for significant linkage (P = 2.2 × 10-5). Results of previous genome screens have suggested that 6 of these regions may be involved in RA susceptibility.
Resumo:
Molecular phylogenetic studies of homologous sequences of nucleotides often assume that the underlying evolutionary process was globally stationary, reversible, and homogeneous (SRH), and that a model of evolution with one or more site-specific and time-reversible rate matrices (e.g., the GTR rate matrix) is enough to accurately model the evolution of data over the whole tree. However, an increasing body of data suggests that evolution under these conditions is an exception, rather than the norm. To address this issue, several non-SRH models of molecular evolution have been proposed, but they either ignore heterogeneity in the substitution process across sites (HAS) or assume it can be modeled accurately using the distribution. As an alternative to these models of evolution, we introduce a family of mixture models that approximate HAS without the assumption of an underlying predefined statistical distribution. This family of mixture models is combined with non-SRH models of evolution that account for heterogeneity in the substitution process across lineages (HAL). We also present two algorithms for searching model space and identifying an optimal model of evolution that is less likely to over- or underparameterize the data. The performance of the two new algorithms was evaluated using alignments of nucleotides with 10 000 sites simulated under complex non-SRH conditions on a 25-tipped tree. The algorithms were found to be very successful, identifying the correct HAL model with a 75% success rate (the average success rate for assigning rate matrices to the tree's 48 edges was 99.25%) and, for the correct HAL model, identifying the correct HAS model with a 98% success rate. Finally, parameter estimates obtained under the correct HAL-HAS model were found to be accurate and precise. The merits of our new algorithms were illustrated with an analysis of 42 337 second codon sites extracted from a concatenation of 106 alignments of orthologous genes encoded by the nuclear genomes of Saccharomyces cerevisiae, S. paradoxus, S. mikatae, S. kudriavzevii, S. castellii, S. kluyveri, S. bayanus, and Candida albicans. Our results show that second codon sites in the ancestral genome of these species contained 49.1% invariable sites, 39.6% variable sites belonging to one rate category (V1), and 11.3% variable sites belonging to a second rate category (V2). The ancestral nucleotide content was found to differ markedly across these three sets of sites, and the evolutionary processes operating at the variable sites were found to be non-SRH and best modeled by a combination of eight edge-specific rate matrices (four for V1 and four for V2). The number of substitutions per site at the variable sites also differed markedly, with sites belonging to V1 evolving slower than those belonging to V2 along the lineages separating the seven species of Saccharomyces. Finally, sites belonging to V1 appeared to have ceased evolving along the lineages separating S. cerevisiae, S. paradoxus, S. mikatae, S. kudriavzevii, and S. bayanus, implying that they might have become so selectively constrained that they could be considered invariable sites in these species.
Resumo:
To gain insight into the mechanisms by which the Myb transcription factor controls normal hematopoiesis and particularly, how it contributes to leukemogenesis, we mapped the genome-wide occupancy of Myb by chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) in ERMYB myeloid progenitor cells. By integrating the genome occupancy data with whole genome expression profiling data, we identified a Myb-regulated transcriptional program. Gene signatures for leukemia stem cells, normal hematopoietic stem/progenitor cells and myeloid development were overrepresented in 2368 Myb regulated genes. Of these, Myb bound directly near or within 793 genes. Myb directly activates some genes known critical in maintaining hematopoietic stem cells, such as Gfi1 and Cited2. Importantly, we also show that, despite being usually considered as a transactivator, Myb also functions to repress approximately half of its direct targets, including several key regulators of myeloid differentiation, such as Sfpi1 (also known as Pu.1), Runx1, Junb and Cebpb. Furthermore, our results demonstrate that interaction with p300, an established coactivator for Myb, is unexpectedly required for Myb-mediated transcriptional repression. We propose that the repression of the above mentioned key pro-differentiation factors may contribute essentially to Myb's ability to suppress differentiation and promote self-renewal, thus maintaining progenitor cells in an undifferentiated state and promoting leukemic transformation. © 2011 The Author(s).
Resumo:
Familial juvenile hyperuricaemic (gouty) nephropathy (FJHN), is an autosomal dominant disease associated with a reduced fractional excretion of urate, and progressive renal failure. FJHN is genetically heterogeneous and due to mutations of three genes: uromodulin (UMOD), renin (REN) and hepatocyte nuclear factor-1beta (HNF-1β) on chromosomes 16p12, 1q32.1, and 17q12, respectively. However, UMOD, REN or HNF-1β mutations are found in only ~45% of FJHN probands, indicating the involvement of other genetic loci in ~55% of probands. To identify other FJHN loci, we performed a single nucleotide polymorphism (SNP)-based genome-wide linkage analysis, in six FJHN families in whom UMOD, HNF-1β and REN mutations had been excluded. Parametric linkage analysis using a 'rare dominant' model established linkage in five of the six FJHN families, with a LOD score >+3, at 0% recombination, between FJHN and SNPs at chromosome 2p22.1-p21. Analysis of individual recombinants in two unrelated affected individuals defined a ~5.5 Mbp interval, flanked telomerically by SNP RS372139 and centromerically by RS896986 that contained the locus, designated FJHN3. The interval contains 28 genes, and DNA sequence analysis of the most likely candidate, solute carrier family 8 member 1 (SLC8A1), did not identify any abnormalities in the FJHN3 probands. FJHN3 is likely located within a ~5.5 Mbp interval on chromosome 2p22.1-p21, and identifying the genetic abnormality will help to further elucidate mechanisms predisposing to gout and renal failure.