163 resultados para genome structure

em Indian Institute of Science - Bangalore - Índia


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The current explosion of DNA sequence information has generated increasing evidence for the claim that noncoding repetitive DNA sequences present within and around different genes could play an important role in genetic control processes, although the precise role and mechanism by which these sequences function are poorly understood. Several of the simple repetitive sequences which occur in a large number of loci throughout the human and other eukaryotic genomes satisfy the sequence criteria for forming non-B DNA structures in vitro. We have summarized some of the features of three different types of simple repeats that highlight the importance of repetitive DNA in the control of gene expression and chromatin organization. (i) (TG/CA)n repeats are widespread and conserved in many loci. These sequences are associated with nucleosomes of varying linker length and may play a role in chromatin organization. These Z-potential sequences can help absorb superhelical stress during transcription and aid in recombination. (ii) Human telomeric repeat (TTAGGG)n adopts a novel quadruplex structure and exhibits unusual chromatin organization. This unusual structural motif could explain chromosome pairing and stability. (iii) Intragenic amplification of (CTG)n/(CAG)n trinucleotide repeat, which is now known to be associated with several genetic disorders, could down-regulate gene expression in vivo. The overall implications of these findings vis-à-vis repetitive sequences in the genome are summarized.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The nucleotide sequence of cosmid B1790, carrying the Rif-Str regions of the Mycobacterium leprae chromosome, has been determined. Twelve open reading frames were identified in the 36716bp sequence, representing 40% of the coding capacity. Five ribosomal proteins, two elongation factors and the β and β'subunits of RNA polymerase have been characterized and two novel genes were found. One of these encodes a member of the so-called ABC family of ATP-binding proteins while the other appears to encode an enzyme involved in repairing genomic lesions caused by free radicals. This finding may well be significant as M. leprae, an intracellular pathogen, lives within macrophages.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background The genome of a wide variety of prokaryotes contains the luxS gene homologue, which encodes for the protein S-ribosylhomocysteinelyase (LuxS). This protein is responsible for the production of the quorum sensing molecule, AI-2 and has been implicated in a variety of functions such as flagellar motility, metabolic regulation, toxin production and even in pathogenicity. A high structural similarity is present in the LuxS structures determined from a few species. In this study, we have modelled the structures from several other species and have investigated their dimer interfaces. We have attempted to correlate the interface features of LuxS with the phenotypic nature of the organisms. Results The protein structure networks (PSN) are constructed and graph theoretical analysis is performed on the structures obtained from X-ray crystallography and on the modelled ones. The interfaces, which are known to contain the active site, are characterized from the PSNs of these homodimeric proteins. The key features presented by the protein interfaces are investigated for the classification of the proteins in relation to their function. From our analysis, structural interface motifs are identified for each class in our dataset, which showed distinctly different pattern at the interface of LuxS for the probiotics and some extremophiles. Our analysis also reveals potential sites of mutation and geometric patterns at the interface that was not evident from conventional sequence alignment studies. Conclusion The structure network approach employed in this study for the analysis of dimeric interfaces in LuxS has brought out certain structural details at the side-chain interaction level, which were elusive from the conventional structure comparison methods. The results from this study provide a better understanding of the relation between the luxS gene and its functional role in the prokaryotes. This study also makes it possible to explore the potential direction towards the design of inhibitors of LuxS and thus towards a wide range of antimicrobials.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The genomic sequences of several RNA plant viruses including cucumber mosaic virus, brome mosaic virus, alfalfa mosaic virus and tobacco mosaic virus have become available recently. The former two viruses are icosahedral while the latter two are bullet and rod shaped, respectively in particle morphology. The non-structural 3a proteins of cucumber mosaic virus and brome mosaic virus have an amino acid sequence homology of 35% and hence are evolutionarily related. In contrast, the coat proteins exhibit little homology, although the circular dichroism spectrum of these viruses are similar. The non-coding regions of the genome also exhibit variable but extensive homology. Comparison of the brome mosaic virus and alfalfa mosaic virus sequences reveals that they are probably related although with a much larger evolutionary distance. The polypeptide folds of the coat protein of three biologically distinct isometric plant viruses, tomato bushy stunt virus, southern bean mosaic virus and satellite tobacco necrosis virus have been shown to display a striking resemblance. All of them consist of a topologically similar 8-standard β-barrel. The implications of these studies to the understanding of the evolution of plant viruses will be discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The complete genome of the baker's yeast S. cerevisiae was analyzed for the presence of polypurine/polypyrimidine (poly[pu/py]) repeats and their occurrences were classified on the basis of their location within and outside open reading frames (ORFs). The analysis reveals that such sequence motifs are present abundantly both in coding as well as noncoding regions. Clear positional preferences are seen when these tracts occur in noncoding regions. These motifs appear to occur predominantly at a unit nucleosomal length both upstream and downstream of ORFs. Moreover, there is a biased distribution of polypurines in the coding strands when these motifs occur within open reading frames. The significance of the biased distribution is discussed with reference to the occurrence of these motifs in other known mRNA sequences and expressed sequence tags. A model for cis regulation of gene expression is proposed based on the ability of these motifs to form an intermolecular triple helix structure when present within the coding region and/or to modulate nucleosome positioning via enhanced histone affinity when present outside coding regions.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A second DNA binding protein from stationary-phase cells of Mycobacterium smegmatis (MsDps2) has been identified from the bacterial genome. It was cloned, expressed and characterised and its crystal structure was determined. The core dodecameric structure of MsDps2 is the same as that of the Dps from the organism described earlier (MsDps1). However, MsDps2 possesses a long N-terminal tail instead of the C-terminal tail in MsDps1. This tail appears to be involved in DNA binding. It is also intimately involved in stabilizing the dodecamer. Partly on account of this factor, MsDps2 assembles straightway into the dodecamer, while MsDps1 does so on incubation after going through an intermediate trimeric stage. The ferroxidation centre is similar in the two proteins, while the pores leading to it exhibit some difference. The mode of sequestration of DNA in the crystalline array of molecules, as evidenced by the crystal structures, appears to be different in MsDps1 and MsDps2, highlighting the variability in the mode of Dps–DNA complexation. A sequence search led to the identification of 300 Dps molecules in bacteria with known genome sequences. Fifty bacteria contain two or more types of Dps molecules each, while 195 contain only one type. Some bacteria, notably some pathogenic ones, do not contain Dps. A sequence signature for Dps could also be derived from the analysis.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Takifugu rubripes is teleost fish widely used in comparative genomics to understand the human system better due to its similarities both in number of genes and structure of genes. In this work we survey the fugu genome, and, using sensitive computational approaches, we identify the repertoire of putative protein kinases and classify them into groups and subfamilies. The fugu genome encodes 519 protein kinase-like sequences and this number of putative protein kinases is comparable closely to that of human. However, in spite of its similarities to human kinases at the group level, there are differences at the subfamily level as noted in the case of KIS and DYRK subfamilies which contribute to differences which are specific to the adaptation of the organism. Also, certain unique domain combination of galectin domain and YkA domain suggests alternate mechanisms for immune response and binding to lipoproteins. Lastly, an overall similarity with the MAPK pathway of humans suggests its importance to understand signaling mechanisms in humans. Overall the fugu serves as a good model organism to understand roles of human kinases as far as kinases such as LRRK and IRAK and their associated pathways are concerned.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

An analysis of the Mycobacterium smegmatis genome suggests that it codes for several thiolases and thiolase-like proteins. Thiolases are an important family of enzymes that are involved in fatty acid metabolism. They occur as either dimers or tetramers. Thiolases catalyze the Claisen condensation of two acetyl-Coenzyme A molecules in the synthetic direction and the thiolytic cleavage of 3-ketoacyl-Coenzyme A molecules in the degradative direction. Some of the M. smegmatis genes have been annotated as thiolases of the poorly characterized SCP2-thiolase subfamily. The mammalian SCP2-thiolase consists of an N-terminal thiolase domain followed by an additional C-terminal domain called sterol carrier protein-2 or SCP2. The M. smegmatis protein selected in the present study, referred to here as the thiolase-like protein type 1 (MsTLP1), has been biochemically and structurally characterized. Unlike classical thiolases, MsTLP1 is a monomer in solution. Its structure has been determined at 2.7 angstrom resolution by the single wavelength anomalous dispersion method. The structure of the protomer confirms that the N-terminal domain has the thiolase fold. An extra C-terminal domain is indeed observed. Interestingly, it consists of six beta-strands forming an anti-parallel beta-barrel which is completely different from the expected SCP2-fold. Detailed sequence and structural comparisons with thiolases show that the residues known to be essential for catalysis are not conserved in MsTLP1. Consistent with this observation, activity measurements show that MsTLP1 does not catalyze the thiolase reaction. This is the first structural report of a monomeric thiolase-like protein from any organism. These studies show that MsTLP1 belongs to a new group of thiolase related proteins of unknown function.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We report the crystal structure of the first prokaryotic aspartic proteinase-like domain identified in the genome of Mycobacterium tuberculosis. A search in the genomes of Mycobacterium species showed that the C-terminal domains of some of the PE family proteins contain two classic DT/SG motifs of aspartic proteinases with a low overall sequence similarity to HIV proteinase. The three-dimensional structure of one of them, Rv0977 (PE_PGRS16) of M. tuberculosis revealed the characteristic pepsinf-old and catalytic site architecture. However, the active site was completely blocked by the N-terminal His-tag. Surprisingly, the enzyme was found to be inactive even after the removal of the N-terminal His-tag. A comparison of the structure with pepsins showed significant differences in the critical substrate binding residues and in the flap tyrosine conformation that could contribute to the lack of proteolytic activity of Rv0977. (C) 2013 The Authors. Published by Elsevier B.V. on behalf of Federation of European Biochemical Societies. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The significance of G-quadruplexes and the helicases that resolve G4 structures in prokaryotes is poorly understood. The Mycobacterium tuberculosis genome is GC-rich and contains >10,000 sequences that have the potential to form G4 structures. In Escherichia coli, RecQ helicase unwinds G4 structures. However, RecQ is absent in M. tuberculosis, and the helicase that participates in G4 resolution in M. tuberculosis is obscure. Here, we show that M. tuberculosis DinG (MtDinG) exhibits high affinity for ssDNA and ssDNA translocation with a 5' -> 3' polarity. Interestingly, MtDinG unwinds overhangs, flap structures, and forked duplexes but fails to unwind linear duplex DNA. Our data with DNase I footprinting provide mechanistic insights and suggest that MtDinG is a 5' -> 3' polarity helicase. Notably, in contrast to E. coli DinG, MtDinG catalyzes unwinding of replication fork and Holliday junction structures. Strikingly, we find that MtDinG resolves intermolecular G4 structures. These data suggest that MtDinG is a multifunctional structure-specific helicase that unwinds model structures of DNA replication, repair, and recombination as well as G4 structures. We finally demonstrate that promoter sequences of M. tuberculosis PE_PGRS2, mce1R, and moeB1 genes contain G4 structures, implying that G4 structures may regulate gene expression in M. tuberculosis. We discuss these data and implicate targeting G4 structures and DinG helicase in M. tuberculosis could be a novel therapeutic strategy for culminating the infection with this pathogen.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Streptococcus pneumoniae causes pneumonia, septicemia and meningitis. S. pneumoniae is responsible for significant mortality both in children and in the elderly. In recent years, the whole genome sequencing of various S. pneumoniae strains have increased manifold and there is an urgent need to provide organism specific annotations to the scientific community. This prompted us to develop the Streptococcus pneumoniae Genome Database (SPGDB) to integrate and analyze the completely sequenced and available S. pneumoniae genome sequences. Further, links to several tools are provided to compare the pool of gene and protein sequences, and proteins structure across different strains of S. pneumoniae. SPGDB aids in the analysis of phenotypic variations as well as to perform extensive genomics and evolutionary studies with reference to S. pneumoniae. (C) 2014 Elsevier Inc. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery. (C) 2014 Elsevier Ltd. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The genome of Leishmania major encodes a type II fatty acid biosynthesis pathway for which no structural or biochemical information exists. Here, for the first time, we have characterized the central player of the pathway, the acyl carrier protein (LmACP), using nuclear magnetic resonance (NMR). Structurally, the LmACP molecule is similar to other type II ACPs, comprising a four-helix bundle, enclosing a hydrophobic core. Dissimilarities in sequence, however, exist in helix II (recognition helix) of the protein. The enzymatic conversion of apo-LmACP into the holo form using type I (Escherichia coli AcpS) and type II (Sfp type) phosphopantetheinyl transferases (PPTs) is relatively slow. Mutagenesis studies underscore the importance of the residues present at the protein protein interaction interface of LmACP in modulating the activity of PPTs. Interestingly, the cognate PPT for this ACP, the L. major 4'-phosphopantetheinyl transferase (LmPPT), does not show any enzymatic activity toward it, though it readily converts other type I and type II ACPs into their holo forms. NMR chemical shift perturbation studies suggest a moderately tight complex between LmACP and its cognate PPT, suggesting inhibition. We surmise that the unique surface of LmACP might have evolved to complement its cognate enzyme (LmPPT), possibly for the purpose of regulation.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Trypsin-treated rat brain myelin was subjected to biochemical and X-ray studies. Untreated myelin gave rise to a pattern of three rings with a fundamental repeat period of 155 Angstrom consisting of two bilayers per repeat period, whereas myelin treated with trypsin showed a fundamental repeat period of 75 Angstrom with one bilayer per repeat period. The integrated raw intensity of the h=4 reflection with respect to the h=2 reflection is 0.38 for untreated myelin. The corresponding value reduced to 0.23, 0.18, 0.17 for myelin treated with 5, 10, 40 units of trypsin per mg of myelin, respectively, for 30 min at 30 degrees C. The decrease in relative raw intensity of the higher-order reflection relative to the lower-order reflection is suggestive of a disordering of the phosphate groups upon trypsin treatment or an increased mosaicity of the membrane or a combination of both these effects, However, trypsin treatment does not lead to a complete breakdown of the membrane, The integrated intensity of the h=1 reflection, though weak, is above the measurable threshold for untreated myelin, whereas the corresponding intensity is below the measurable threshold for trypsin-treated myelin, indicating a possible asymmetric to symmetric transition of the myelin bilayer structure about its centre after trypsin treatment.