963 resultados para Genome-specific Sequence


Relevância:

30.00% 30.00%

Publicador:

Resumo:

The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery. (C) 2014 Elsevier Ltd. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

DNA sequence and structure play a key role in imparting fragility to different regions of the genome. Recent studies have shown that non-B DNA structures play a key role in causing genomic instability, apart from their physiological roles at telomeres and promoters. Structures such as G-quadruplexes, cruciforms, and triplexes have been implicated in making DNA susceptible to breakage, resulting in genomic rearrangements. Hence, techniques that aid in the easy identification of such non-B DNA motifs will prove to be very useful in determining factors responsible for genomic instability. In this study, we provide evidence for the use of primer extension as a sensitive and specific tool to detect such altered DNA structures. We have used the G-quadruplex motif, recently characterized at the BCL2 major breakpoint region as a proof of principle to demonstrate the advantages of the technique. Our results show that pause sites corresponding to the non-B DNA are specific, since they are absent when the G-quadruplex motif is mutated and their positions change in tandem with that of the primers. The efficiency of primer extension pause sites varied according to the concentration of monovalant cations tested, which support G-quadruplex formation. Overall, our results demonstrate that primer extension is a strong in vitro tool to detect non-B DNA structures such as G-quadruplex on a plasmid DNA, which can be further adapted to identify non-B DNA structures, even at the genomic level.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: The heterotrimeric M. tuberculosis RecBCD complex, or each of its individual subunits, remains uncharacterized. Results: MtRecD exists as a homodimer in solution, catalyzes ssDNA-dependent ATP hydrolysis, unwinding of DNA replication/recombination intermediates, and interacts with RecA. Conclusion: MtRecD possesses strong 5 3- and weak 3 5-helicase activities. Significance: These findings provide insights into the mechanism underlying DSB repair and homologous recombination in mycobacteria. The annotated whole-genome sequence of Mycobacterium tuberculosis revealed the presence of a putative recD gene; however, the biochemical characteristics of its encoded protein product (MtRecD) remain largely unknown. Here, we show that MtRecD exists in solution as a stable homodimer. Protein-DNA binding assays revealed that MtRecD binds efficiently to single-stranded DNA and linear duplexes containing 5 overhangs relative to the 3 overhangs but not to blunt-ended duplex. Furthermore, MtRecD bound more robustly to a variety of Y-shaped DNA structures having 18-nucleotide overhangs but not to a similar substrate containing 5-nucleotide overhangs. MtRecD formed more salt-tolerant complexes with Y-shaped structures compared with linear duplex having 3 overhangs. The intrinsic ATPase activity of MtRecD was stimulated by single-stranded DNA. Site-specific mutagenesis of Lys-179 in motif I abolished the ATPase activity of MtRecD. Interestingly, although MtRecD-catalyzed unwinding showed a markedly higher preference for duplex substrates with 5 overhangs, it could also catalyze significant unwinding of substrates containing 3 overhangs. These results support the notion that MtRecD is a bipolar helicase with strong 5 3 and weak 3 5 unwinding activities. The extent of unwinding of Y-shaped DNA structures was approximate to 3-fold lower compared with duplexes with 5 overhangs. Notably, direct interaction between MtRecD and its cognate RecA led to inhibition of DNA strand exchange promoted by RecA. Altogether, these studies provide the first detailed characterization of MtRecD and present important insights into the type of DNA structure the enzyme is likely to act upon during the processes of DNA repair or homologous recombination.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The annotated whole-genome sequence of Mycobacterium tuberculosis indicated that Rv1388 (Mtihf) likely encodes a putative 20 kDa integration host factor (mIHF). However, very little is known about the functional properties of mIHF or organization of mycobacterial nucleoid. Molecular modeling of the mIHF three-dimensional structure, based on the cocrystal structure of Streptomyces coelicolor IHF-duplex DNA, a bona fide relative of mIHF, revealed the presence of Arg170, Arg171, and Arg173, which might be involved in DNA binding, and a conserved proline (P150) in the tight turn. The phenotypic sensitivity of Escherichia coli Delta ihfA and Delta ihfB strains to UV and methylmethanesulfonate could be complemented with the wild-type Mtihf, but not its alleles bearing mutations in the DNA-binding residues. Protein DNA interaction assays revealed that wild-type mIHF, but not its DNA-binding variants, bind with high affinity to fragments containing attB and attP sites and curved DNA. Strikingly, the functionally important amino acid residues of mIHF and the mechanism(s) underlying its binding to DNA, DNA bending, and site-specific recombination are fundamentally different from that of E. coli IHF alpha beta. Furthermore, we reveal novel insights into IHF-mediated DNA compaction depending on the placement of its preferred binding sites; mIHF promotes compaction of DNA into nucleoid-like or higher-order filamentous structures. We hence propose that mIHF is a distinct member of a subfamily of proteins that serve as essential cofactors in site-specific recombination and nucleoid organization and that these findings represent a significant advance in our understanding of the role(s) of nucleoid-associated proteins.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Integrity in entirety is the preferred state of any organism. The temporal and spatial integrity of the genome ensures continued survival of a cell. DNA breakage is the first step towards creation of chromosomal translocations. In this review, we highlight the factors contributing towards the breakage of chromosomal DNA. It has been well-established that the structure and sequence of DNA play a critical role in selective fragility of the genome. Several non-B-DNA structures such as Z-DNA, cruciform DNA, G-quadruplexes, R loops and triplexes have been implicated in generation of genomic fragility leading to translocations. Similarly, specific sequences targeted by proteins such as Recombination Activating Genes and Activation Induced Cytidine Deaminase are involved in translocations. Processes that ensure the integrity of the genome through repair may lead to persistence of breakage and eventually translocations if their actions are anomalous. An insufficient supply of nucleotides and chromatin architecture may also play a critical role. This review focuses on a range of events with the potential to threaten the genomic integrity of a cell, leading to cancer.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

RNA interference (RNAi) is a powerful biological pathway allowing for sequence-specific knockdown of any gene of interest. While RNAi is a proven tool for probing gene function in biological circuits, it is limited by being constitutively ON and executes the logical operation: silence gene Y. To provide greater control over post-transcriptional gene silencing, we propose engineering a biological logic gate to implement “conditional RNAi.” Such a logic gate would silence gene Y only upon the expression of gene X, a completely unrelated gene, executing the logic: if gene X is transcribed, silence independent gene Y. Silencing of gene Y could be confined to a specific time and/or tissue by appropriately selecting gene X.

To implement the logic of conditional RNAi, we present the design and experimental validation of three nucleic acid self-assembly mechanisms which detect a sub-sequence of mRNA X and produce a Dicer substrate specific to gene Y. We introduce small conditional RNAs (scRNAs) to execute the signal transduction under isothermal conditions. scRNAs are small RNAs which change conformation, leading to both shape and sequence signal transduction, in response to hybridization to an input nucleic acid target. While all three conditional RNAi mechanisms execute the same logical operation, they explore various design alternatives for nucleic acid self-assembly pathways, including the use of duplex and monomer scRNAs, stable versus metastable reactants, multiple methods of nucleation, and 3-way and 4-way branch migration.

We demonstrate the isothermal execution of the conditional RNAi mechanisms in a test tube with recombinant Dicer. These mechanisms execute the logic: if mRNA X is detected, produce a Dicer substrate targeting independent mRNA Y. Only the final Dicer substrate, not the scRNA reactants or intermediates, is efficiently processed by Dicer. Additional work in human whole-cell extracts and a model tissue-culture system delves into both the promise and challenge of implementing conditional RNAi in vivo.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The sea urchin embryonic skeleton, or spicule, is deposited by mesenchymal progeny of four precursor cells, the micromeres, which are determined to the skeletogenic pathway by a process known as cytoplasmic localization. A gene encoding one of the major products of the skeletogenic mesenchyme, a prominent 50 kD protein of the spicule matrix, has been characterized in detail. cDNA clones were first isolated by antibody screening of a phage expression library, followed by isolation of homologous genomic clones. The gene, known as SM50, is single copy in the sea urchin genome, is divided into two exons of 213 and 1682 bp, and is expressed only in skeletogenic cells. Transcripts are first detectable at the 120 cell stage, shortly after the segregation of the skeletogenic precursors from the rest of the embryo. The SM50 open reading frame begins within the first exon, is 450 amino acids in length, and contains a loosely repeated 13 amino acid motif rich in acidic residues which accounts for 45% of the protein and which is possibly involved in interaction with the mineral phase of the spicule.

The important cis-acting regions of the SM50 gene necessary for proper regulation of expression were identified by gene transfer experiments. A 562 bp promoter fragment, containing 438 bp of 5' promoter sequence and 124 bp of the SM50 first exon (including the SM50 initiation codon), was both necessary and sufficient to direct high levels of expression of the bacterial chloramphenicol acetyltransferase (CAT) reporter gene specifically in the skeletogenic cells. Removal of promoter sequences between positions -2200 and -438, and of transcribed regions downstream of +124 (including the SM50 intron), had no effect on the spatial or transcriptional activity of the transgenes.

Regulatory proteins that interact with the SM50 promoter were identified by the gel retardation assay, using bulk embryo mesenchyme blastula stage nuclear proteins. Five protein binding sites were identified and mapped to various degrees of resolution. Two sites are homologous, may be enhancer elements, and at least one is required for expression. Two additional sites are also present in the promoter of the aboral ectoderm specific cytoskeletal actin gene CyIIIa; one of these is a CCAA T element, the other a putative repressor element. The fifth site overlaps the binding site of the putative repressor and may function as a positive regulator by interfering with binding of the repressor. All of the proteins are detectable in nuclear extracts prepared from 64 cell stage embryos, a stage just before expression of SM50 is initiated, as well as from blastula and gastrula stage; the putative enhancer binding protein may be maternal as well.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Small molecules that bind to any predetermined DNA sequence in the human genome are potentially useful tools for molecular biology and human medicine. Polyamides containing N-methylimidazole (Im) N-methylpyrrole (Py) are cell permeable small molecules that bind DNA according to a set of "pairing rules" with affinities and specificities similar to many naturally occurring DNA binding proteins. Py-Im polyamides offer a general approach to the chemical regulation of gene expression. We demonstrate here that polyamide containing a DNA alkylating moiety seco-CBI can specifically direct sequence specific DNA alkylation. We can also control the strand of DNA that is alkylated, depending on the enantiomer of seco-CBI used and the orientation of the polyamide relative to the alkylation site (Chapter 2). This class of molecules has been applied to a gene repair system in collaboration with the Baltimore group at Caltech (Chapter 3). Also reported are additional seco-CBI polyamide conjugates synthesized to study other systems (HIV-1 and COX-2) (Appendix 1).

Relevância:

30.00% 30.00%

Publicador:

Resumo:

With recent advances in high-throughput sequencing, mapping of genome-wide transcription factor occupancy has become feasible. To advance the understanding of skeletal muscle differentiation specifically and transcriptional regulation in general, I determined the genome-wide occupancy map for myogenin in differentiating C2C12 myocyte cells. I then analyzed the myogenin map for underlying sequence content and the association between occupied elements and expression trajectories of adjacent genes. Having determined that myogenin primarily associates with expressed genes, I performed a similar analysis on occupancy maps of other transcription factors active during skeletal muscle differentiation, including an extensive analysis of co-occupancy. This analysis provided strong motif evidence for protein-protein interactions as the primary driving force in the formation of Myogenin / Mef2 and MyoD / AP-1 complexes at jointly-occupied sites. Finally, factor occupancy analysis was extended to include bHLH transcription factors in tissues other than skeletal muscle. The cross-tissue analysis led to the emergence of a motif structure used by bHLH TFs to encode either tissue-specific or "general" (public) access in a variety of lineages.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The genomes of many positive stranded RNA viruses and of all retroviruses are translated as large polyproteins which are proteolytically processed by cellular and viral proteases. Viral proteases are structurally related to two families of cellular proteases, the pepsin-like and trypsin-like proteases. This thesis describes the proteolytic processing of several nonstructural proteins of dengue 2 virus, a representative member of the Flaviviridae, and describes methods for transcribing full-length genomic RNA of dengue 2 virus. Chapter 1 describes the in vitro processing of the nonstructural proteins NS2A, NS2B and NS3. Chapter 2 describes a system that allows identification of residues within the protease that are directly or indirectly involved with substrate recognition. Chapter 3 describes methods to produce genome length dengue 2 RNA from cDNA templates.

The nonstructural protein NS3 is structurally related to viral trypsinlike proteases from the alpha-, picorna-, poty-, and pestiviruses. The hypothesis that the flavivirus nonstructural protein NS3 is a viral proteinase that generates the termini of several nonstructural proteins was tested using an efficient in vitro expression system and antisera specific for the nonstructural proteins NS2B and NS3. A series of cDNA constructs was transcribed using T7 RNA polymerase and the RNA translated in reticulocyte lysates. Proteolytic processing occurred in vitro to generate NS2B and NS3. The amino termini of NS2B and NS3 produced in vitro were found to be the same as the termini of NS2B and NS3 isolated from infected cells. Deletion analysis of cDNA constructs localized the protease domain necessary and sufficient for correct cleavage to the first 184 amino acids of NS3. Kinetic analysis of processing events in vitro and experiments to examine the sensitivity of processing to dilution suggested that an intramolecular cleavage between NS2A and NS2B preceded an intramolecular cleavage between NS2B and NS3. The data from these expression experiments confirm that NS3 is the viral proteinase responsible for cleavage events generating the amino termini of NS2B and NS3 and presumably for cleavages generating the termini of NS4A and NS5 as well.

Biochemical and genetic experiments using viral proteinases have defined the sequence requirements for cleavage site recognition, but have not identified residues within proteinases that interact with substrates. A biochemical assay was developed that could identify residues which were important for substrate recognition. Chimeric proteases between yellow fever and dengue 2 were constructed that allowed mapping of regions involved in substrate recognition, and site directed mutagenesis was used to modulate processing efficiency.

Expression in vitro revealed that the dengue protease domain efficiently processes the yellow fever polyprotein between NS2A and NS2B and between NS2B and NS3, but that the reciprocal construct is inactive. The dengue protease processes yellow fever cleavage sites more efficiently than dengue cleavage sites, suggesting that suboptimal cleavage efficiency may be used to increase levels of processing intermediates in vivo. By mutagenizing the putative substrate binding pocket it was possible to change the substrate specificity of the yellow fever protease; changing a minimum of three amino acids in the yellow fever protease enabled it to recognize dengue cleavage sites. This system allows identification of residues which are directly or indirectly involved with enzyme-substrate interaction, does not require a crystal structure, and can define the substrate preferences of individual members of a viral proteinase family.

Full-length cDNA clones, from which infectious RNA can be transcribed, have been developed for a number of positive strand RNA viruses, including the flavivirus type virus, yellow fever. The technology necessary to transcribe genomic RNA of dengue 2 virus was developed in order to better understand the molecular biology of the dengue subgroup. A 5' structural region clone was engineered to transcribe authentic dengue RNA that contains an additional 1 or 2 residues at the 5' end. A 3' nonstructural region clone was engineered to allow production of run off transcripts, and to allow directional ligation with the 5' structural region clone. In vitro ligation and transcription produces full-length genomic RNA which is noninfectious when transfected into mammalian tissue culture cells. Alternative methods for constructing cDNA clones and recovering live dengue virus are discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Part I. The regions of sequence homology and non-homology between the DNA molecules of T2, T4, and T6 have been mapped by the electron microscopic heteroduplex method. The heteroduplex maps have been oriented with respect to the T4 genetic map. They show characteristic, reproducible patterns of substitution and deletion loops. All heteroduplex molecules show more than 85% homology. Some of the loop patterns in T2/T4 heteroduplexes are similar to those in T4/T6.

We find that the rII, the lysozyme and ac genes, the D region, and gene 52 are homologous in T2, T4, and T6. Genes 43 and 47 are probably homologous between T2 and T4. The region of greatest homology is that bearing the late genes. The host range region, which comprises a part of gene 37 and all of gene 38, is heterologous in T2, T4, and T6. The remainder of gene 37 is partially homologous in the T2/T4 heteroduplex (Beckendorf, Kim and Lielausis, 1972) but it is heterologous in T4/T6 and in T2/T6. Some of the tRNA genes are homologous and some are not. The internal protein genes in general seem to be non-homologous.

The molecular lengths of the T-even DNAs are the same within the limit of experimental error; their calculated molecular weights are correspondingly different due to unequal glucosylation. The size of the T2 genome is smaller than that of T4 or T6, but the terminally repetitious region in T2 is larger. There is a length distribution of the terminal repetition for any one phage DNA, indicating a variability in length of the DNA molecules packaged within the phage.

Part II. E. coli cells infected with phage strains carrying extensive deletions encompassing the gene for the phage ser-tRNA are missing the phage tRNAs normally present in wild type infected cells. By DNA-RNA hybridization we have demonstrated that the DNA complementary to the missing tRNAs is also absent in such deletion mutants. Thus the genes for these tRNAs must be clustered in the same region of the genome as the ser-tRNA gene. Physical mapping of several deletions of the ser-tRNA and lysozyme genes, by examination of heteroduplex DNA in the electron microscope, has enabled us to locate the cluster, to define its maximum size, and to order a few of the tRNA genes within it. That such deletions can be isolated indicates that the phage-specific tRNAs from this cluster are dispensable.

Part III. Genes 37 and 38 between closely related phages T2 and T4 have been compared by genetic, biochemical, and hetero-duplex studies. Homologous, partially homologous and non-homologous regions of the gene 37 have been mapped. The host range determinant which interacts with the gene 38 product is identified.

Part IV. A population of double-stranded ØX-RF DNA molecules carrying a deletion of about 9% of the wild-type DNA has been discovered in a sample cultivated under conditions where the phage lysozyme gene is nonessential. The structures of deleted monomers, dimers, and trimers have been studied by the electron microscope heteroduplex method. The dimers and trimers are shown to be head-to-tail repeats of the deleted monomers. Some interesting examples of the dynamical phenomenon of branch migration in vitro have been observed in heteroduplexes of deleted dimer and trimer strands with undeleted wild-type monomer viral strands.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Desde a década de 1990, os esforços internacionais para a obtenção de genomas completos levaram à determinação do genoma de inúmeros organismos. Isto, aliado ao grande avanço da computação, tem permitido o uso de abordagens inovadoras no estudo da estrutura, organização e evolução dos genomas e na predição e classificação funcional de genes. Entre os métodos mais comumente empregados nestas análises está a busca por similaridades entre sequências biológicas. Análises comparativas entre genomas completamente sequenciados indicam que cada grupo taxonômico estudado até o momento contém de 10 a 20% de genes sem homólogos reconhecíveis em outras espécies. Acredita-se que estes genes taxonomicamente restritos (TRGs) tenham um papel importante na adaptação a nichos ecológicos particulares, podendo estar envolvidos em importantes processos evolutivos. Entretanto, seu reconhecimento não é simples, sendo necessário distingui-los de ORFs não-funcionais espúrias e/ou artefatos derivados dos processos de anotação gênica. Além disso, genes espécie- ou gêneroespecíficos podem representar uma oportunidade para o desenvolvimento de métodos de identificação e/ou tipagem, tarefa relativamente complicada no caso dos procariotos, onde o método padrão-ouro na atualidade envolve a análise de um grupo de vários genes (MultiLocus Sequence Typing MLST). Neste trabalho utilizamos dados produzidos através de análises comparativas de genomas e de sequências para identificar e caracterizar genes espécie- e gênero-específicos, os quais possam auxiliar no desenvolvimento de novos métodos para identificação e/ou tipagem, além de poderem lançar luz em importantes processos evolutivos (tais como a perda e ou origem de genes em linhagens particulares, bem como a expansão de famílias de genes em linhagens específicas) nos organismos estudados.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Molecular-based approaches for shark species identification have been driven largely by issues specific to the fishery. In an effort to establish a more comprehensive identification data set, we investigated DNA sequence variation of a 1.4-kb region from the mitochondrial genome covering partial sequences from the 12S rDNA, 16S rDNA, and the complete valine tRNA from 35 shark species from the Atlantic fishery. Generally, within-species variability was low in relation to interspecific divergence because species haloptypes formed monophyletic groups. Phylogenetic analyses resolved ordinal relationships among Carcharhiniformes and Lamniformes, and revealed support for the families Sphyrnidae and Triakidae (within Carcharhiniformes) and Lamnidae and Alopidae (within Lamniformes). The combination of limited intraspecific variability and sufficient between-species divergence indicates that this locus is suitable for species identification.