167 resultados para SEQUENCE ALIGNMENT
em Indian Institute of Science - Bangalore - Índia
Resumo:
With the immense growth in the number of available protein structures, fast and accurate structure comparison has been essential. We propose an efficient method for structure comparison, based on a structural alphabet. Protein Blocks (PBs) is a widely used structural alphabet with 16 pentapeptide conformations that can fairly approximate a complete protein chain. Thus a 3D structure can be translated into a 1D sequence of PBs. With a simple Needleman-Wunsch approach and a raw PB substitution matrix, PB-based structural alignments were better than many popular methods. iPBA web server presents an improved alignment approach using (i) specialized PB Substitution Matrices (SM) and (ii) anchor-based alignment methodology. With these developments, the quality of similar to 88% of alignments was improved. iPBA alignments were also better than DALI, MUSTANG and GANGSTA(+) in > 80% of the cases. The webserver is designed to for both pairwise comparisons and database searches. Outputs are given as sequence alignment and superposed 3D structures displayed using PyMol and Jmol. A local alignment option for detecting subs-structural similarity is also embedded. As a fast and efficient `sequence-based' structure comparison tool, we believe that it will be quite useful to the scientific community. iPBA can be accessed at http://www.dsimb.inserm.fr/dsimb_tools/ipba/.
Resumo:
Comparison of multiple protein structures has a broad range of applications in the analysis of protein structure, function and evolution. Multiple structure alignment tools (MSTAs) are necessary to obtain a simultaneous comparison of a family of related folds. In this study, we have developed a method for multiple structure comparison largely based on sequence alignment techniques. A widely used Structural Alphabet named Protein Blocks (PBs) was used to transform the information on 3D protein backbone conformation as a ID sequence string. A progressive alignment strategy similar to CLUSTALW was adopted for multiple PB sequence alignment (mulPBA). Highly similar stretches identified by the pairwise alignments are given higher weights during the alignment. The residue equivalences from PB based alignments are used to obtain a three dimensional fit of the structures followed by an iterative refinement of the structural superposition. Systematic comparisons using benchmark datasets of MSTAs underlines that the alignment quality is better than MULTIPROT, MUSTANG and the alignments in HOMSTRAD, in more than 85% of the cases. Comparison with other rigid-body and flexible MSTAs also indicate that mulPBA alignments are superior to most of the rigid-body MSTAs and highly comparable to the flexible alignment methods. (C) 2012 Elsevier Masson SAS. All rights reserved.
Resumo:
The discovery of GH (Glycoside Hydrolase) 19 chitinases in Streptomyces sp. raises the possibility of the presence of these proteins in other bacterial species, since they were initially thought to be confined to higher plants. The present study mainly concentrates on the phylogenetic distribution and homology conservation in GH19 family chitinases. Extensive database searches are performed to identify the presence of GH19 family chitinases in the three major super kingdoms of life. Multiple sequence alignment of all the identified GH19 chitinase family members resulted in the identification of globally conserved residues. We further identified conserved sequence motifs across the major sub groups within the family. Estimation of evolutionary distance between the various bacterial and plant chitinases are carried out to better understand the pattern of evolution. Our study also supports the horizontal gene transfer theory, which states that GH19 chitinase genes are transferred from higher plants to bacteria. Further, the present study sheds light on the phylogenetic distribution and identifies unique sequence signatures that define GH19 chitinase family of proteins. The identified motifs could be used as markers to delineate uncharacterized GH19 family chitinases. The estimation of evolutionary distance between chitinase identified in plants and bacteria shows that the flowering plants are more related to chitinase in actinobacteria than that of identified in purple bacteria. We propose a model to elucidate the natural history of GH19 family chitinases.
Resumo:
The increasing number of available protein structures requires efficient tools for multiple structure comparison. Indeed, multiple structural alignments are essential for the analysis of function, evolution and architecture of protein structures. For this purpose, we proposed a new web server called multiple Protein Block Alignment (mulPBA). This server implements a method based on a structural alphabet to describe the backbone conformation of a protein chain in terms of dihedral angles. This sequence-like' representation enables the use of powerful sequence alignment methods for primary structure comparison, followed by an iterative refinement of the structural superposition. This approach yields alignments superior to most of the rigid-body alignment methods and highly comparable with the flexible structure comparison approaches. We implement this method in a web server designed to do multiple structure superimpositions from a set of structures given by the user. Outputs are given as both sequence alignment and superposed 3D structures visualized directly by static images generated by PyMol or through a Jmol applet allowing dynamic interaction. Multiple global quality measures are given. Relatedness between structures is indicated by a distance dendogram. Superimposed structures in PDB format can be also downloaded, and the results are quickly obtained. mulPBA server can be accessed at www.dsimb.inserm.fr/dsimb_tools/mulpba/.
Resumo:
Human La protein is known to be an essential host factor for translation and replication of hepatitis C virus (HCV) RNA. Previously, we have demonstrated that residues responsible for interaction of human La protein with the HCV internal ribosomal entry site (IRES) around the initiator AUG within stem-loop IV form a beta-turn in the RNA recognition motif (RRM) structure. In this study, sequence alignment and mutagenesis suggest that the HCV RNA-interacting beta-turn is conserved only in humans and chimpanzees, the species primarily known to be infected by HCV. A 7-mer peptide corresponding to the HCV RNA-interacting region of human La inhibits HCV translation, whereas another peptide corresponding to the mouse La sequence was unable to do so. Furthermore, IRES-mediated translation was found to be significantly high in the presence of recombinant human La protein in vitro in rabbit reticulocyte lysate. We observed enhanced replication with HCV subgenomic and full-length replicons upon overexpression of either human La protein or a chimeric mouse La protein harboring a human La beta-turn sequence in mouse cells. Taken together, our results raise the possibility of creating an immunocompetent HCV mouse model using human-specific cell entry factors and a humanized form of La protein.
Resumo:
Background The genome of a wide variety of prokaryotes contains the luxS gene homologue, which encodes for the protein S-ribosylhomocysteinelyase (LuxS). This protein is responsible for the production of the quorum sensing molecule, AI-2 and has been implicated in a variety of functions such as flagellar motility, metabolic regulation, toxin production and even in pathogenicity. A high structural similarity is present in the LuxS structures determined from a few species. In this study, we have modelled the structures from several other species and have investigated their dimer interfaces. We have attempted to correlate the interface features of LuxS with the phenotypic nature of the organisms. Results The protein structure networks (PSN) are constructed and graph theoretical analysis is performed on the structures obtained from X-ray crystallography and on the modelled ones. The interfaces, which are known to contain the active site, are characterized from the PSNs of these homodimeric proteins. The key features presented by the protein interfaces are investigated for the classification of the proteins in relation to their function. From our analysis, structural interface motifs are identified for each class in our dataset, which showed distinctly different pattern at the interface of LuxS for the probiotics and some extremophiles. Our analysis also reveals potential sites of mutation and geometric patterns at the interface that was not evident from conventional sequence alignment studies. Conclusion The structure network approach employed in this study for the analysis of dimeric interfaces in LuxS has brought out certain structural details at the side-chain interaction level, which were elusive from the conventional structure comparison methods. The results from this study provide a better understanding of the relation between the luxS gene and its functional role in the prokaryotes. This study also makes it possible to explore the potential direction towards the design of inhibitors of LuxS and thus towards a wide range of antimicrobials.
Resumo:
Mycobacterium leprae recA harbors an in-frame insertion sequence that encodes an intein homing endonuclease (PI-MleI). Most inteins (intein endonucleases) possess two conserved LAGLIDADG (DOD) motifs at their ctive center. A common feature of LAGLIDADG-type homing endonucleases is that they recognize and cleave the same or very similar DNA sequences. However, PI-MleI is distinctive from other members of the family of LAGLIDADG-type HEases for its modular structure with functionally separable domains for DNA-binding and cleavage, each with distinct sequence preferences. Sequence alignment analyses of PI-MleI revealed three putative LAGLIDADG motifs; however, there is conflicting bioinformatics data in regard to their identity and specific location within the intein polypeptide. To resolve this conflict and to determine the active-site residues essential for DNA target site recognition and double-stranded DNA cleavage, we performed site-directed mutagenesis of presumptive catalytic residues in the LAGLIDADG motifs. Analysis of target DNA recognition and kinetic parameters of the wild-type PI-MleI and its variants disclosed that the two amino acid residues, Asp(122) (in Block C) and Asp(193) (in functional Block E), are crucial to the double-stranded DNA endonuclease activity, whereas Asp(218) (in pseudo-Block E) is not. However, despite the reduced catalytic activity, the PI-MleI variants, like the wild-type PI-MleI, generated a footprint of the same length around the insertion site. The D122T variant showed significantly reduced catalytic activity, and D122A and D193A mutations although failed to affect their DNA-binding affinities, but abolished the double-stranded DNA cleavage activity. On the other hand, D122C variant showed approximately twofold higher double-stranded DNA cleavage activity, compared with the wild-type PI-MleI. These results provide compelling evidence that Asp(122) and Asp(193) in DOD motif I and II, respectively, are bona fide active-site residues essential for DNA cleavage activity. The implications of these results are discussed in this report.
Resumo:
Background:Overwhelming majority of the Serine/Threonine protein kinases identified by gleaning archaeal and eubacterial genomes could not be classified into any of the well known Hanks and Hunter subfamilies of protein kinases. This is owing to the development of Hanks and Hunter classification scheme based on eukaryotic protein kinases which are highly divergent from their prokaryotic homologues. A large dataset of prokaryotic Serine/Threonine protein kinases recognized from genomes of prokaryotes have been used to develop a classification framework for prokaryotic Ser/Thr protein kinases. Methodology/Principal Findings: We have used traditional sequence alignment and phylogenetic approaches and clustered the prokaryotic kinases which represent 72 subfamilies with at least 4 members in each. Such a clustering enables classification of prokaryotic Ser/Thr kinases and it can be used as a framework to classify newly identified prokaryotic Ser/Thr kinases. After series of searches in a comprehensive sequence database we recognized that 38 subfamilies of prokaryotic protein kinases are associated to a specific taxonomic level. For example 4, 6 and 3 subfamilies have been identified that are currently specific to phylum proteobacteria, cyanobacteria and actinobacteria respectively. Similarly subfamilies which are specific to an order, sub-order, class, family and genus have also been identified. In addition to these, we also identify organism-diverse subfamilies. Members of these clusters are from organisms of different taxonomic levels, such as archaea, bacteria, eukaryotes and viruses.Conclusion/Significance: Interestingly, occurrence of several taxonomic level specific subfamilies of prokaryotic kinases contrasts with classification of eukaryotic protein kinases in which most of the popular subfamilies of eukaryotic protein kinases occur diversely in several eukaryotes. Many prokaryotic Ser/Thr kinases exhibit a wide variety of modular organization which indicates a degree of complexity and protein-protein interactions in the signaling pathways in these microbes.
Resumo:
Pulicat Lake sediments are often severely polluted with the toxic heavy metal mercury. Several mercury-resistant strains of Bacillus species were isolated from the sediments and all the isolates exhibited broad spectrum resistance (resistance to both organic and inorganic mercuric compounds). Plasmid curing assay showed that all the isolated Bacillus strains carry chromosomally borne mercury resistance. Polymerase chain reaction and southern hybridization analyses using merA and merB3 gene primers/probes showed that five of the isolated Bacillus strains carry sequences similar to known merA and merB3 genes. Results of multiple sequence alignment revealed 99% similarity with merA and merB3 of TnMERI1 (class II transposons). Other mercury resistant Bacillus species lacking homology to these genes were not able to volatilize mercuric chloride, indicating the presence of other modes of resistance to mercuric compounds.
Resumo:
Background: HU a small, basic, histone like protein is a major component of the bacterial nucleoid. E. coli has two subunits of HU coded by hupA and hupB genes whereas Mycobacterium tuberculosis (Mtb) has only one subunit of HU coded by ORF Rv2986c (hupB gene). One noticeable feature regarding Mtb HupB, based on sequence alignment of HU orthologs from different bacteria, was that HupB(Mtb) bears at its C-terminal end, a highly basic extension and this prompted an examination of its role in Mtb HupB function. Methodology/Principal Findings: With this objective two clones of Mtb HupB were generated; one expressing full length HupB protein (HupB(Mtb)) and another which expresses only the N terminal region (first 95 amino acid) of hupB (HupB(MtbN)). Gel retardation assays revealed that HupBMtbN is almost like E. coli HU (heat stable nucleoid protein) in terms of its DNA binding, with a binding constant (K-d) for linear dsDNA greater than 1000 nM, a value comparable to that obtained for the HU alpha alpha and HU alpha beta forms. However CTR (C-terminal Region) of HupB(Mtb) imparts greater specificity in DNA binding. HupB(Mtb) protein binds more strongly to supercoiled plasmid DNA than to linear DNA, also this binding is very stable as it provides DNase I protection even up to 5 minutes. Similar results were obtained when the abilities of both proteins to mediate protection against DNA strand cleavage by hydroxyl radicals generated by the Fenton's reaction, were compared. It was also observed that both the proteins have DNA binding preference for A: T rich DNA which may occur at the regulatory regions of ORFs and the oriC region of Mtb. Conclusions/Significance: These data thus point that HupB(Mtb) may participate in chromosome organization in-vivo, it may also play a passive, possibly an architectural role.
Resumo:
Pulicat Lake sediments are often severely polluted with the toxic heavy metal mercury. Several mercury-resistant strains of Bacillus species were isolated from the sediments and all the isolates exhibited broad spectrum resistance (resistance to both organic and inorganic mercuric compounds). Plasmid curing assay showed that all the isolated Bacillus strains carry chromosomally borne mercury resistance. Polymerase chain reaction and southern hybridization analyses using merA and merB3 gene primers/probes showed that five of the isolated Bacillus strains carry sequences similar to known merA and merB3 genes. Results of multiple sequence alignment revealed 99% similarity with merA and merB3 of TnMERI1 (class II transposons). Other mercury resistant Bacillus species lacking homology to these genes were not able to volatilize mercuric chloride, indicating the presence of other modes of resistance to mercuric compounds.
Resumo:
The Basic Local Alignment Search Tool (BLAST) is one of the most widely used sequence alignment programs with which similarity searches, for both protein and nucleic acid sequences, can be performed against large databases at high speed. A large number of tools exist for processing BLAST output, but none of them provide three-dimensional structure visualization. This shortcoming has been addressed in the proposed tool BLAST Server for Structural Biologists (BSSB), which maps a BLAST output onto the three-dimensional structure of the subject protein. The three-dimensional structure of the subject protein is represented using a three-color coding scheme (identical: red; similar: yellow; and mismatch: white) based on the pairwise alignment obtained. Thus, the user will be able to visualize a possible three-dimensional structure for the query protein sequence. This information can be used to gain a deeper insight into the sequence-structure correlation. Furthermore, the additional structure-level information enables the user to make coherent and logical decisions regarding the type of input model structure or fragment that can be used for molecular replacement calculations. This tool is freely available to all users at http://bioserver1.physics.iisc.ernet.in/bssb/.
Resumo:
The three dimensional structure of a protein provides major insights into its function. Protein structure comparison has implications in functional and evolutionary studies. A structural alphabet (SA) is a library of local protein structure prototypes that can abstract every part of protein main chain conformation. Protein Blocks (PBS) is a widely used SA, composed of 16 prototypes, each representing a pentapeptide backbone conformation defined in terms of dihedral angles. Through this description, the 3D structural information can be translated into a 1D sequence of PBs. In a previous study, we have used this approach to compare protein structures encoded in terms of PBs. A classical sequence alignment procedure based on dynamic programming was used, with a dedicated PB Substitution Matrix (SM). PB-based pairwise structural alignment method gave an excellent performance, when compared to other established methods for mining. In this study, we have (i) refined the SMs and (ii) improved the Protein Block Alignment methodology (named as iPBA). The SM was normalized in regards to sequence and structural similarity. Alignment of protein structures often involves similar structural regions separated by dissimilar stretches. A dynamic programming algorithm that weighs these local similar stretches has been designed. Amino acid substitutions scores were also coupled linearly with the PB substitutions. iPBA improves (i) the mining efficiency rate by 6.8% and (ii) more than 82% of the alignments have a better quality. A higher efficiency in aligning multi-domain proteins could be also demonstrated. The quality of alignment is better than DALI and MUSTANG in 81.3% of the cases. Thus our study has resulted in an impressive improvement in the quality of protein structural alignment. (C) 2011 Elsevier Masson SAS. All rights reserved.
Resumo:
The role of FIC (Filamentation induced by cAMP)(2) domain containing proteins in the regulation of many vital pathways, mostly through the transfer of NMPs from NTPs to specific target proteins (NMPylation), in microorganisms, higher eukaryotes, and plants is emerging. The identity and function of FIC domain containing protein of the human pathogen, Mycobacterium tuberculosis, remains unknown. In this regard, M. tuberculosis fic gene (Mtfic) was cloned, overexpressed, and purified to homogeneity for its biochemical characterisation. It has the characteristic FIC motif, HPFREGNGRSTR (HPFxxGNGRxxR), spanning 144th to 155th residue. Neither the His-tagged nor the GST-tagged MtFic protein, overexpressed in Escherichia coil, nor expression of Mtfic in Mycobacterium smegmatis, yielded the protein in the soluble fraction. However, the maltose binding protein (MBP) tagged MtFic (MBP-MtFic) could be obtained partly in the soluble fraction. The cloned, overexpressed, and purified recombinant MBP-MtFic showed conversion of ATP, GTP, CTP, and UTP into AMP. GMP, CMP, and UMP, respectively. Sequence alignment with several FIC motif containing proteins, complemented with homology modeling on the FIC motif containing protein, VbhT of Bartonella schoenbuchensis as the template, showed conservation and interaction of residues constituting the FIC domain. Site-specific mutagenesis of the His144, or Glu148, or Asn150 of the FIC motif, or of Arg87 residue that constitutes the FIC domain, or complete deletion of the FIC motif, abolished the NTP to NMP conversion activity. The design of NMP formation assay using the recombinant, soluble MtFic would enable identification of its target substrate for NMPylation. (C) 2012 Elsevier Inc. All rights reserved.
Resumo:
Background: Interaction of non-structural protein 5A (NS5A) of Hepatitis C virus (HCV) with human kinases namely, casein kinase 1 alpha (ck1 alpha) and protein kinase R (PKR) have different functional implications such as regulation of viral replication and evasion of interferon induced immune response respectively. Understanding the structural and molecular basis of interactions of the viral protein with two different human kinases can be useful in developing strategies for treatment against HCV. Results: Serine 232 of NS5A is known to be phosphorylated by human ck1 alpha. A structural model of NS5A peptide containing phosphoacceptor residue Serine 232 bound to ck1 alpha has been generated using the known 3-D structures of kinase-peptide complexes. The substrate interacting residues in ck1 alpha has been identified from the model and these are found to be conserved well in the ck1 family. ck1 alpha - substrate peptide complex has also been used to understand the structural basis of association between ck1 alpha and its other viral stress induced substrate, tumour suppressor p53 transactivation domain which has a crystal structure available. Interaction of NS5A with another human kinase PKR is primarily genotype specific. NS5A from genotype 1b has been shown to interact and inhibit PKR whereas NS5A from genotype 2a/3a are unable to bind and inhibit PKR efficiently. This is one of the main reasons for the varied response to interferon therapy in HCV patients across different genotypes. Using PKR crystal structure, sequence alignment and evolutionary trace analysis some of the critical residues responsible for the interaction of NS5A 1b with PKR have been identified. Conclusions: The substrate interacting residues in ck1 alpha have been identified using the structural model of kinase substrate peptide. The PKR interacting NS5A 1b residues have also been predicted using PKR crystal structure, NS5A sequence analysis along with known experimental results. Functional significance and nature of interaction of interferon sensitivity determining region and variable region 3 of NS5A in different genotypes with PKR which was experimentally shown are also supported by the findings of evolutionary trace analysis. Designing inhibitors to prevent this interaction could enable the HCV genotype 1 infected patients respond well to interferon therapy.