166 resultados para Sequence Motifs
Resumo:
The occurrence of DNA architectural proteins containing two functional domains derived from two different architectural proteins is an interesting emerging research theme in the field of nucleoid structure and function. Mycobacterium tuberculosis HupB, unlike Escherichia coli HU, is a two-domain protein that, in the N-terminal region, shows broad sequence homology with bacterial HU. The long C-terminal extension, on the other hand, contains seven PAKK/KAAK motifs, which are characteristic of the histone H1/H5 family of proteins. In this article, we describe several aspects of HupB function, in comparison with its truncated derivatives lacking either the C-terminus or N-terminus. We found that HupB binds a variety of DNA repair and replication intermediates with K(d) values in the nanomolar range. By contrast, the N-terminal fragment of M. tuberculosis HupB (HupB(MtbN)) showed diminished DNA-binding activity, with K(d) values in the micromolar range, and the C-terminal domain was completely devoid of DNA-binding activity. Unlike HupB(MtbN), HupB was able to constrain DNA in negative supercoils and introduce negative superhelical turns into relaxed DNA. Similarly, HupB exerted a robust inhibitory effect on DNA strand exchange promoted by cognate and noncognate RecA proteins, whereas HupB(MtbN), even at a 50-fold molar excess, had no inhibitory effect. Considered together, these results suggest that synergy between the N-terminal and C-terminal domains of HupB is essential for its DNA-binding ability, and to modulate the topological features of DNA, which has implications for processes such as DNA compaction, gene regulation, homologous recombination, and DNA repair.
Resumo:
Crossover motifs are integral components for designing DNA-based nanostructures and nanomechanical devices due to their enhanced rigidity compared to the normal B-DNA. Although the structural rigidity of the double helix B-DNA has been investigated extensively using both experimental and theoretical tools, to date there is no quantitative information about structural rigidity and the mechanical strength of parallel crossover DNA motifs. We have used fully atomistic molecular dynamics simulations in explicit solvent to get the force-extension curve of parallel DNA nanostructures to characterize their mechanical rigidity. In the presence of monovalent Na(+) ions, we find that the stretch modulus (gamma(1)) of the paranemic crossover and its topoisomer JX DNA structure is significantly higher (similar to 30%) compared to normal B-DNA of the same sequence and length. However, this is in contrast to the original expectation that these motifs are almost twice as rigid compared to the double-stranded B-DNA. When the DNA motif is surrounded by a solvent with Mg(2+) counterions, we find an enhanced rigidity compared to Na(+) environment due to the electrostatic screening effects arising from the divalent nature of Mg(2+) ions. To our knowledge, this is the first direct determination of the mechanical strength of these crossover motifs, which can be useful for the design of suitable DNA for DNA-based nanostructures and nanomechanical devices with improved structural rigidity.
Resumo:
In this article, we consider the single-machine scheduling problem with past-sequence-dependent (p-s-d) setup times and a learning effect. The setup times are proportional to the length of jobs that are already scheduled; i.e. p-s-d setup times. The learning effect reduces the actual processing time of a job because the workers are involved in doing the same job or activity repeatedly. Hence, the processing time of a job depends on its position in the sequence. In this study, we consider the total absolute difference in completion times (TADC) as the objective function. This problem is denoted as 1/LE, (Spsd)/TADC in Kuo and Yang (2007) ('Single Machine Scheduling with Past-sequence-dependent Setup Times and Learning Effects', Information Processing Letters, 102, 22-26). There are two parameters a and b denoting constant learning index and normalising index, respectively. A parametric analysis of b on the 1/LE, (Spsd)/TADC problem for a given value of a is applied in this study. In addition, a computational algorithm is also developed to obtain the number of optimal sequences and the range of b in which each of the sequences is optimal, for a given value of a. We derive two bounds b* for the normalising constant b and a* for the learning index a. We also show that, when a < a* or b > b*, the optimal sequence is obtained by arranging the longest job in the first position and the rest of the jobs in short processing time order.
Crystallization and preliminary X-ray diffraction studies of sortase A from Streptococcus pneumoniae
Resumo:
Sortases are cell-membrane-anchored cysteine transpeptidases that are essential for the assembly and anchoring of cell-surface adhesins in Gram-positive bacteria. Thus, they play critical roles in virulence, infection and colonization by pathogens. Sortases have been classified into four types based on their primary sequence and the target-protein motifs that they recognize. All Gram-positive bacteria express a class A housekeeping sortase (SrtA). Sortase A from Streptococcus pneumoniae (NP_358691) has been crystallized in two crystal forms. Diamond-shaped crystals of Delta N(59)SrtA diffracted to 4.0 angstrom resolution and belonged to a tetragonal system with unit-cell parameters a = b = 122.8, c = 86.5 angstrom, alpha = beta = gamma = 90 degrees, while rod-shaped crystals of Delta N(81)SrtA diffracted to 2.91 angstrom resolution and belonged to the monoclinic space group P2(1) with unit-cell parameters a = 66.8, b = 103.47, c = 74.79 angstrom, alpha = gamma = 90, beta = 115.65 degrees. The Matthews coefficient (V(M) = 2.77 angstrom(3) Da(-1)) with similar to 56% solvent content suggested the presence of four molecules in the asymmetric unit for Delta N(81)SrtA. Also, a multi-copy search using a monomer as a probe in the molecular-replacement method resulted in the successful location of four sortase molecules in the asymmetric unit, with statistics R = 41.61, R(free) = 46.44, correlation coefficient (CC) = 64.31, CC(free) = 57.67.
Resumo:
Of the similar to 4000 ORFs identified through the genome sequence of Mycobacterium tuberculosis (TB) H37Rv, experimentally determined structures are available for 312. Since knowledge of protein structures is essential to obtain a high-resolution understanding of the underlying biology, we seek to obtain a structural annotation for the genome, using computational methods. Structural models were obtained and validated for similar to 2877 ORFs, covering similar to 70% of the genome. Functional annotation of each protein was based on fold-based functional assignments and a novel binding site based ligand association. New algorithms for binding site detection and genome scale binding site comparison at the structural level, recently reported from the laboratory, were utilized. Besides these, the annotation covers detection of various sequence and sub-structural motifs and quaternary structure predictions based on the corresponding templates. The study provides an opportunity to obtain a global perspective of the fold distribution in the genome. The annotation indicates that cellular metabolism can be achieved with only 219 folds. New insights about the folds that predominate in the genome, as well as the fold-combinations that make up multi-domain proteins are also obtained. 1728 binding pockets have been associated with ligands through binding site identification and sub-structure similarity analyses. The resource (http://proline.physics.iisc.ernet.in/Tbstructuralannotation), being one of the first to be based on structure-derived functional annotations at a genome scale, is expected to be useful for better understanding of TB and for application in drug discovery. The reported annotation pipeline is fairly generic and can be applied to other genomes as well.
Resumo:
During V(D)J recombination, RAG (recombination-activating gene) complex cleaves DNA based on sequence specificity. Besides its physiological function, RAG has been shown to act as a structure-specific nuclease. Recently, we showed that the presence of cytosine within the single-stranded region of heteroduplex DNA is important when RAGs cleave on DNA structures. In the present study, we report that heteroduplex DNA containing a bubble region can be cleaved efficiently when present along with a recombination signal sequence (RSS) in cis or trans configuration. The sequence of the bubble region influences RAG cleavage at RSS when present in cis. We also find that the kinetics of RAG cleavage differs between RSS and bubble, wherein RSS cleavage reaches maximum efficiency faster than bubble cleavage. In addition, unlike RSS, RAG cleavage at bubbles does not lead to cleavage complex formation. Finally, we show that the ``nonamer binding region,'' which regulates RAG cleavage on RSS, is not important during RAG activity in non-B DNA structures. Therefore, in the current study, we identify the possible mechanism by which RAG cleavage is regulated when it acts as a structure-specific nuclease. (C) 2011 Elsevier Ltd. All rights reserved.
Resumo:
Short range side chain-backbone hydrogen bonded motifs involving Asn and Gln residues have been identified from a data set of 1370 protein crystal structures (resolution = 1.5 angstrom). Hydrogen bonds involving residues i - 5 to i + 5 have been considered. Out of 12,901 Asn residues, 3403 residues (26.4%) participate in such interactions, while out of 10,934 Gln residues, 1780 Gln residues (16.3%) are involved in these motifs. Hydrogen bonded ring sizes (Cn, where n is the number of atoms involved), directionality and internal torsion angles are used to classify motifs. The occurrence of the various motifs in the contexts of protein structure is illustrated. Distinct differences are established between the nature of motifs formed by Asn and Gln residues. For Asn, the most highly populated motifs are the C10 (COdi .NHi + 2), C13 (COdi .NHi + 3) and C17 (NdHi .COi - 4) structures. In contrast, Gln predominantly forms C16 (COei .NHi - 3), C12 (NeHi .COi - 2), C15 (NeHi .COi - 3) and C18 (NeHi .COi - 4) motifs, with only the C18motif being analogous to the Asn C17structure. Specific conformational types are established for the Asn containing motifs, which mimic backbone beta-turns and a-turns. Histidine residues are shown to serve as a mimic for Asn residues in side chain-backbone hydrogen bonded ring motifs. Illustrative examples from protein structures are considered. Proteins 2012; (c) 2011 Wiley Periodicals, Inc.
Resumo:
Phosphoinositide-specific phospholipase C (PLC) is involved in Ca2+ mediated signalling events that lead to altered cellular status. Using various sequence-analysis methods, we identified two conserved motifs in known PLC sequences. The identified motifs are located in the C2 domain of plant PLCs and are not found in any other protein. These motifs are specifically found in the Ca2+ binding loops and form adjoining beta strands. Further, we identified certain conserved residues that are highly distinct from corresponding residues of animal PLCs. The motifs reported here could be used to annotate plant-specific phospholipase C sequences. Furthermore, we demonstrated that the C2 domain alone is capable of targeting PLC to the membrane in response to a Ca2+ signal. We also showed that the binding event results from a change in the hydrophobicity of the C2 domain upon Ca2+ binding. Bioinformatic analyses revealed that all PLCs from Arabidopsis and rice lack a transmembrane domain, myristoylation and GPI-anchor protein modifications. Our bioinformatic study indicates that plant PLCs are located in the cytoplasm, the nucleus and the mitochondria. Our results suggest that there are no distinct isoforms of plant PLCs, as have been proposed to exist in the soluble and membrane associated fractions. The same isoform could potentially be present in both subcellular fractions, depending on the calcium level of the cytosol. Overall, these data suggest that the C2 domain of PLC plays a vital role in calcium signalling.
Resumo:
Quest for new drug targets in Plasmodium sp. has underscored malonyl CoA:ACP transacylase (PfFabD) of fatty acid biosynthetic pathway in apicoplast. In this study, a piggyback approach was employed for the receptor deorphanization using inhibitors of bacterial FabD enzymes. Due to the lack of crystal structure, theoretical model was constructed using the structural details of homologous enzymes. Sequence and structure analysis has localized the presence of two conserved pentapeptide motifs: GQGXG and GXSXG and five key invariant residues viz., Gln109, Ser193, Arg218, His305 and Gln354 characteristic of FabD enzyme. Active site mapping of PfFabD using substrate molecules has disclosed the spatial arrangement of key residues in the cavity. As structurally similar molecules exhibit similar biological activities, signature pharmacophore fingerprints of FabD antagonists were generated using 0D-3D descriptors for molecular similarity-based cluster analysis and to correlate with their binding profiles. It was observed that antagonists showing good geometrical fitness score were grouped in cluster-1, whereas those exhibiting high binding affinities in cluster-2. This study proves important to shed light on the active site environment to reveal the hotspot for binding with higher affinity and to narrow down the virtual screening process by searching for close neighbors of the active compounds.
Resumo:
The rapidly growing structure databases enhance the probability of finding identical sequences sharing structural similarity. Structure prediction methods are being used extensively to abridge the gap between known protein sequences and the solved structures which is essential to understand its specific biochemical and cellular functions. In this work, we plan to study the ambiguity between sequence-structure relationships and examine if sequentially identical peptide fragments adopt similar three-dimensional structures. Fragments of varying lengths (five to ten residues) were used to observe the behavior of sequence and its three-dimensional structures. The STAMP program was used to superpose the three-dimensional structures and the two parameters (Sequence Structure Similarity Score (Sc) and Root Mean Square Deviation value) were employed to classify them into three categories: similar, intermediate and dissimilar structures. Furthermore, the same approach was carried out on all the three-dimensional protein structures solved in the two organisms, Mycobacterium tuberculosis and Plasmodium falciparum to validate our results.
Resumo:
The regulation of phospholipid biosynthesis in Saccharomyces cerevisiae through cis-acting upstream activating sequence inositol (UAS(ino)) and trans-acting elements, such as the INO2-INO4 complex and OPI1 by inositol supplementation in growth is thoroughly studied. In this study, we provide evidence for the regulation of lipid biosynthesis by phosphatidylinositol-specific phospholipase C (PLC) through UAS(ino) and the trans-acting elements. Gene expression analysis and radiolabelling experiments demonstrated that the overexpression of rice PLC in yeast cells altered phospholipid biosynthesis at the levels of transcriptional and enzyme activity. This is the first report implicating PLC in the direct regulation of lipid biosynthesis. (C) 2012 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Resumo:
We report the draft genome sequence of an ST772 Staphylococcus aureus disease isolate carrying staphylococcal cassette chromosome mec (SCCmec) type V from a pyomyositis patient. Our de novo short read assembly is similar to 2.8 Mb and encodes a unique Panton-Valentine leukocidin (PVL) phage with structural genes similar to those of phi 7247PVL and novel lysogenic genes at the N termini.
Resumo:
Over the past two decades, many ingenious efforts have been made in protein remote homology detection. Because homologous proteins often diversify extensively in sequence, it is challenging to demonstrate such relatedness through entirely sequence-driven searches. Here, we describe a computational method for the generation of `protein-like' sequences that serves to bridge gaps in protein sequence space. Sequence profile information, as embodied in a position-specific scoring matrix of multiply aligned sequences of bona fide family members, serves as the starting point in this algorithm. The observed amino acid propensity and the selection of a random number dictate the selection of a residue for each position in the sequence. In a systematic manner, and by applying a `roulette-wheel' selection approach at each position, we generate parent family-like sequences and thus facilitate an enlargement of sequence space around the family. When generated for a large number of families, we demonstrate that they expand the utility of natural intermediately related sequences in linking distant proteins. In 91% of the assessed examples, inclusion of designed sequences improved fold coverage by 5-10% over searches made in their absence. Furthermore, with several examples from proteins adopting folds such as TIM, globin, lipocalin and others, we demonstrate that the success of including designed sequences in a database positively sensitized methods such as PSI-BLAST and Cascade PSI-BLAST and is a promising opportunity for enormously improved remote homology recognition using sequence information alone.
Resumo:
Comparison of multiple protein structures has a broad range of applications in the analysis of protein structure, function and evolution. Multiple structure alignment tools (MSTAs) are necessary to obtain a simultaneous comparison of a family of related folds. In this study, we have developed a method for multiple structure comparison largely based on sequence alignment techniques. A widely used Structural Alphabet named Protein Blocks (PBs) was used to transform the information on 3D protein backbone conformation as a ID sequence string. A progressive alignment strategy similar to CLUSTALW was adopted for multiple PB sequence alignment (mulPBA). Highly similar stretches identified by the pairwise alignments are given higher weights during the alignment. The residue equivalences from PB based alignments are used to obtain a three dimensional fit of the structures followed by an iterative refinement of the structural superposition. Systematic comparisons using benchmark datasets of MSTAs underlines that the alignment quality is better than MULTIPROT, MUSTANG and the alignments in HOMSTRAD, in more than 85% of the cases. Comparison with other rigid-body and flexible MSTAs also indicate that mulPBA alignments are superior to most of the rigid-body MSTAs and highly comparable to the flexible alignment methods. (C) 2012 Elsevier Masson SAS. All rights reserved.