84 results for Genome-specific Sequence


Relevance:

30.00%

Publisher:

Abstract:

Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence.
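The AUGHAH motif is written in the IUPAC nucleotide code, where H stands for A, C, or U (any base except G). As a minimal sketch of how such a motif could be located in an RNA sequence, the Python snippet below translates the motif into a regular expression and reports every match. The function names and the example sequence are illustrative assumptions and are not taken from the published analysis.

```python
import re

# Subset of the IUPAC nucleotide code for RNA; H means "A, C, or U".
IUPAC_RNA = {
    "A": "A", "C": "C", "G": "G", "U": "U",
    "H": "[ACU]", "N": "[ACGU]",
}

def motif_to_regex(motif):
    """Translate an IUPAC motif string into a regular expression."""
    return "".join(IUPAC_RNA[base] for base in motif)

def find_motif(sequence, motif="AUGHAH"):
    """Return 0-based start positions of all matches, including overlapping ones."""
    # A lookahead is used so that overlapping occurrences are also reported.
    pattern = re.compile("(?=(" + motif_to_regex(motif) + "))")
    return [m.start() for m in pattern.finditer(sequence.upper())]

# Hypothetical sequence, made up for illustration only.
example = "GGAUGUAAccAUGGAUAUGCACAA"
print(find_motif(example))
```

In the example, the AUG at position 10 is not reported because it is followed by G, which H excludes.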

Relevance:

30.00%

Publisher:

Abstract:

Human Leukocyte Antigen (HLA) plays an important role in presenting foreign pathogens to our immune system, thereby eliciting early immune responses. HLA genes are highly polymorphic, giving rise to diverse antigen presentation capability. An important factor contributing to the enormous variation in individual responses to diseases is the difference in HLA profiles. The heterogeneity in allele-specific disease responses decides the overall epidemiological outcome of a disease. Here we propose an agent-based computational framework, capable of incorporating allele-specific information, to analyze disease epidemiology. This framework assumes an SIR model to estimate the average disease transmission and recovery rates. Using an epitope prediction tool, it performs sequence-based epitope detection for a given pathogen genome and derives an allele-specific disease susceptibility index that depends on the epitope detection efficiency. The resulting allele-specific disease transmission rate is then fed to the agent-based epidemiology model to analyze the disease outcome. The methodology presented here has potential use in understanding how a disease spreads and in identifying effective measures to control it.
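To make the idea of feeding an allele-specific transmission rate into an agent-based SIR model concrete, here is a minimal Python sketch. It is not the authors' framework: the well-mixed contact structure, the rule that an agent's infection probability is the average transmission rate `beta` scaled by a per-allele susceptibility index, and all names and numbers (`simulate`, `allele_low`, `allele_high`, the index values) are assumptions chosen for illustration.

```python
import random

def simulate(agent_alleles, susceptibility, beta, gamma, steps, n_seed=1, seed=0):
    """Well-mixed agent-based SIR model with allele-specific susceptibility.

    agent_alleles  : list with one allele label per agent
    susceptibility : dict mapping allele label -> index in [0, 1] (assumed scale)
    beta, gamma    : average transmission and recovery rates per time step
    Returns a list of (S, I, R) counts, one tuple per time step.
    """
    rng = random.Random(seed)
    n = len(agent_alleles)
    state = ["S"] * n
    for i in rng.sample(range(n), n_seed):
        state[i] = "I"

    history = []
    for _ in range(steps):
        infectious = state.count("I")
        new_state = list(state)
        for i, s in enumerate(state):
            if s == "S":
                # Per-contact infection probability, scaled by the agent's allele.
                p_contact = min(beta * susceptibility[agent_alleles[i]] / n, 1.0)
                # Probability of being infected by at least one infectious agent.
                p_infect = 1.0 - (1.0 - p_contact) ** infectious
                if rng.random() < p_infect:
                    new_state[i] = "I"
            elif s == "I" and rng.random() < gamma:
                new_state[i] = "R"
        state = new_state
        history.append((state.count("S"), state.count("I"), state.count("R")))
    return history

# Hypothetical population: two placeholder alleles with made-up indices.
population = ["allele_low"] * 100 + ["allele_high"] * 100
index = {"allele_low": 0.2, "allele_high": 1.0}
final_s, final_i, final_r = simulate(population, index, beta=0.5, gamma=0.1, steps=100)[-1]
print(final_s, final_i, final_r)
```

Changing the share of agents carrying each placeholder allele, or the index assigned to it, is the kind of experiment such a model allows; in the framework described above the index would come from epitope prediction rather than from hand-picked values.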

Relevance:

30.00%

Publisher:

Abstract:

The significance of G-quadruplexes and the helicases that resolve G4 structures in prokaryotes is poorly understood. The Mycobacterium tuberculosis genome is GC-rich and contains >10,000 sequences that have the potential to form G4 structures. In Escherichia coli, RecQ helicase unwinds G4 structures. However, RecQ is absent in M. tuberculosis, and the helicase that participates in G4 resolution in M. tuberculosis is obscure. Here, we show that M. tuberculosis DinG (MtDinG) exhibits high affinity for ssDNA and ssDNA translocation with a 5' -> 3' polarity. Interestingly, MtDinG unwinds overhangs, flap structures, and forked duplexes but fails to unwind linear duplex DNA. Our data with DNase I footprinting provide mechanistic insights and suggest that MtDinG is a 5' -> 3' polarity helicase. Notably, in contrast to E. coli DinG, MtDinG catalyzes unwinding of replication fork and Holliday junction structures. Strikingly, we find that MtDinG resolves intermolecular G4 structures. These data suggest that MtDinG is a multifunctional structure-specific helicase that unwinds model structures of DNA replication, repair, and recombination as well as G4 structures. Finally, we demonstrate that promoter sequences of the M. tuberculosis PE_PGRS2, mce1R, and moeB1 genes contain G4 structures, implying that G4 structures may regulate gene expression in M. tuberculosis. We discuss these data and propose that targeting G4 structures and the DinG helicase in M. tuberculosis could be a novel therapeutic strategy for terminating infection with this pathogen.
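Sequences with the potential to form G4 structures are often flagged with a simple pattern: four runs of at least three guanines separated by loops of one to seven bases. The abstract does not say which prediction method was used, so the Python sketch below should be read as one common heuristic under that assumption, with hypothetical function names and a made-up test sequence. It scans both strands by also searching the reverse complement.

```python
import re

# Heuristic pattern: G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+ (assumed, not from the abstract).
G4_PATTERN = re.compile(r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def find_putative_g4(seq):
    """Return (strand, start, end) tuples for non-overlapping pattern matches.

    Coordinates are 0-based, half-open, and always given on the forward strand.
    """
    seq = seq.upper()
    n = len(seq)
    hits = [("+", m.start(), m.end()) for m in G4_PATTERN.finditer(seq)]
    for m in G4_PATTERN.finditer(reverse_complement(seq)):
        # Convert reverse-strand coordinates back to forward-strand coordinates.
        hits.append(("-", n - m.end(), n - m.start()))
    return hits

# Hypothetical sequence, made up for illustration only.
print(find_putative_g4("TTGGGAGGGTGGGAGGGTT"))
```

A pattern match only indicates potential; whether a G4 actually forms has to be shown experimentally, as the abstract describes for the PE_PGRS2, mce1R, and moeB1 promoters.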

Relevance:

30.00%

Publisher:

Abstract:

Background: mIHF belongs to a subfamily of proteins, distinct from E. coli IHF. Results: Functionally important amino acids of mIHF and the mechanism(s) underlying DNA binding, DNA bending, and site-specific recombination are distinct from those of E. coli IHF. Conclusion: mIHF functions could contribute beyond nucleoid compaction. Significance: Because mIHF is essential for growth, the molecular mechanisms identified here can be exploited in drug screening efforts. The annotated whole-genome sequence of Mycobacterium tuberculosis revealed that Rv1388 (Mtihf) is likely to encode a putative 20-kDa integration host factor (mIHF). However, very little is known about the functional properties of mIHF or the organization of the mycobacterial nucleoid. Molecular modeling of the mIHF three-dimensional structure, based on the cocrystal structure of Streptomyces coelicolor IHF-duplex DNA, a bona fide relative of mIHF, revealed the presence of Arg-170, Arg-171, and Arg-173, which might be involved in DNA binding, and a conserved proline (Pro-150) in the tight turn. The phenotypic sensitivity of Escherichia coli ΔihfA and ΔihfB strains to UV and methyl methanesulfonate could be complemented with the wild-type Mtihf but not its alleles bearing mutations in the DNA-binding residues. Protein-DNA interaction assays revealed that wild-type mIHF, but not its DNA-binding variants, binds with high affinity to fragments containing attB and attP sites and curved DNA. Strikingly, the functionally important amino acid residues of mIHF and the mechanism(s) underlying its binding to DNA, DNA bending, and site-specific recombination are fundamentally different from those of E. coli IHF. Furthermore, we reveal novel insights into IHF-mediated DNA compaction depending on the placement of its preferred binding sites; mIHF promotes DNA compaction into nucleoid-like or higher order filamentous structures. We therefore propose that mIHF is a distinct member of a subfamily of proteins that serve as essential cofactors in site-specific recombination and nucleoid organization and that these findings represent a significant advance in our understanding of the role(s) of nucleoid-associated proteins.

Relevance:

30.00%

Publisher:

Abstract:

The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug-resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms, resulting in an enhanced annotation of the proteome in which 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes the association of 244 proteins with 205 domain families and a separate set of new associations of folds with 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. A total of 183 gene products unique to mycobacteria had no identifiable function. Of these, 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain-assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery. © 2014 Elsevier Ltd. All rights reserved.

Relevance:

30.00%

Publisher:

Abstract:

DNA sequence and structure play a key role in imparting fragility to different regions of the genome. Recent studies have shown that non-B DNA structures contribute to genomic instability, apart from their physiological roles at telomeres and promoters. Structures such as G-quadruplexes, cruciforms, and triplexes have been implicated in making DNA susceptible to breakage, resulting in genomic rearrangements. Hence, techniques that aid in the easy identification of such non-B DNA motifs will prove to be very useful in determining factors responsible for genomic instability. In this study, we provide evidence for the use of primer extension as a sensitive and specific tool to detect such altered DNA structures. We have used the G-quadruplex motif recently characterized at the BCL2 major breakpoint region as a proof of principle to demonstrate the advantages of the technique. Our results show that pause sites corresponding to the non-B DNA are specific, since they are absent when the G-quadruplex motif is mutated and their positions change in tandem with that of the primers. The efficiency of primer extension pause sites varied according to the concentration of the monovalent cations tested, which support G-quadruplex formation. Overall, our results demonstrate that primer extension is a strong in vitro tool to detect non-B DNA structures such as G-quadruplexes on plasmid DNA, which can be further adapted to identify non-B DNA structures, even at the genomic level.

Relevance:

30.00%

Publisher:

Abstract:

Background: The heterotrimeric M. tuberculosis RecBCD complex, or each of its individual subunits, remains uncharacterized. Results: MtRecD exists as a homodimer in solution, catalyzes ssDNA-dependent ATP hydrolysis, unwinding of DNA replication/recombination intermediates, and interacts with RecA. Conclusion: MtRecD possesses strong 5' -> 3' and weak 3' -> 5' helicase activities. Significance: These findings provide insights into the mechanism underlying DSB repair and homologous recombination in mycobacteria. The annotated whole-genome sequence of Mycobacterium tuberculosis revealed the presence of a putative recD gene; however, the biochemical characteristics of its encoded protein product (MtRecD) remain largely unknown. Here, we show that MtRecD exists in solution as a stable homodimer. Protein-DNA binding assays revealed that MtRecD binds efficiently to single-stranded DNA and linear duplexes containing 5' overhangs relative to the 3' overhangs but not to blunt-ended duplex. Furthermore, MtRecD bound more robustly to a variety of Y-shaped DNA structures having 18-nucleotide overhangs but not to a similar substrate containing 5-nucleotide overhangs. MtRecD formed more salt-tolerant complexes with Y-shaped structures compared with linear duplex having 3' overhangs. The intrinsic ATPase activity of MtRecD was stimulated by single-stranded DNA. Site-specific mutagenesis of Lys-179 in motif I abolished the ATPase activity of MtRecD. Interestingly, although MtRecD-catalyzed unwinding showed a markedly higher preference for duplex substrates with 5' overhangs, it could also catalyze significant unwinding of substrates containing 3' overhangs. These results support the notion that MtRecD is a bipolar helicase with strong 5' -> 3' and weak 3' -> 5' unwinding activities. The extent of unwinding of Y-shaped DNA structures was approximately 3-fold lower compared with duplexes with 5' overhangs. Notably, direct interaction between MtRecD and its cognate RecA led to inhibition of DNA strand exchange promoted by RecA. Altogether, these studies provide the first detailed characterization of MtRecD and present important insights into the type of DNA structure the enzyme is likely to act upon during the processes of DNA repair or homologous recombination.

Relevance:

30.00%

Publisher:

Abstract:

The annotated whole-genome sequence of Mycobacterium tuberculosis indicated that Rv1388 (Mtihf) likely encodes a putative 20 kDa integration host factor (mIHF). However, very little is known about the functional properties of mIHF or the organization of the mycobacterial nucleoid. Molecular modeling of the mIHF three-dimensional structure, based on the cocrystal structure of Streptomyces coelicolor IHF-duplex DNA, a bona fide relative of mIHF, revealed the presence of Arg170, Arg171, and Arg173, which might be involved in DNA binding, and a conserved proline (Pro150) in the tight turn. The phenotypic sensitivity of Escherichia coli ΔihfA and ΔihfB strains to UV and methyl methanesulfonate could be complemented with the wild-type Mtihf, but not its alleles bearing mutations in the DNA-binding residues. Protein-DNA interaction assays revealed that wild-type mIHF, but not its DNA-binding variants, binds with high affinity to fragments containing attB and attP sites and curved DNA. Strikingly, the functionally important amino acid residues of mIHF and the mechanism(s) underlying its binding to DNA, DNA bending, and site-specific recombination are fundamentally different from those of E. coli IHFαβ. Furthermore, we reveal novel insights into IHF-mediated DNA compaction depending on the placement of its preferred binding sites; mIHF promotes compaction of DNA into nucleoid-like or higher-order filamentous structures. We hence propose that mIHF is a distinct member of a subfamily of proteins that serve as essential cofactors in site-specific recombination and nucleoid organization and that these findings represent a significant advance in our understanding of the role(s) of nucleoid-associated proteins.

Relevance:

30.00%

Publisher:

Abstract:

Integrity in entirety is the preferred state of any organism. The temporal and spatial integrity of the genome ensures continued survival of a cell. DNA breakage is the first step towards creation of chromosomal translocations. In this review, we highlight the factors contributing towards the breakage of chromosomal DNA. It has been well-established that the structure and sequence of DNA play a critical role in selective fragility of the genome. Several non-B-DNA structures such as Z-DNA, cruciform DNA, G-quadruplexes, R loops and triplexes have been implicated in generation of genomic fragility leading to translocations. Similarly, specific sequences targeted by proteins such as Recombination Activating Genes and Activation Induced Cytidine Deaminase are involved in translocations. Processes that ensure the integrity of the genome through repair may lead to persistence of breakage and eventually translocations if their actions are anomalous. An insufficient supply of nucleotides and chromatin architecture may also play a critical role. This review focuses on a range of events with the potential to threaten the genomic integrity of a cell, leading to cancer.