958 resultados para Protein Structure, Multifractal Analysis, 6 Letter Model


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The recombinant heat shock protein (18 kDa-hsp) from Mycobacterium leprae was studied as a T-epitope model for vaccine development. We present a structural analysis of the stability of recombinant 18 kDa-hsp during different processing steps. Circular dichroism and ELISA were used to monitor protein structure after thermal stress, lyophilization and chemical modification. We observed that the 18 kDa-hsp is extremely resistant to a wide range of temperatures (60% of activity is retained at 80ºC for 20 min). N-Acylation increased its ordered structure by 4% and decreased its ß-T1 structure by 2%. ELISA demonstrated that the native conformation of the 18 kDa-hsp was preserved after hydrophobic modification by acylation. The recombinant 18 kDa-hsp resists to a wide range of temperatures and chemical modifications without loss of its main characteristic, which is to be a source of T epitopes. This resistance is probably directly related to its lack of organization at the level of tertiary and secondary structures.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The vast majority of known proteins have not yet been experimentally characterized and little is known about their function. The design and implementation of computational tools can provide insight into the function of proteins based on their sequence, their structure, their evolutionary history and their association with other proteins. Knowledge of the three-dimensional (3D) structure of a protein can lead to a deep understanding of its mode of action and interaction, but currently the structures of <1% of sequences have been experimentally solved. For this reason, it became urgent to develop new methods that are able to computationally extract relevant information from protein sequence and structure. The starting point of my work has been the study of the properties of contacts between protein residues, since they constrain protein folding and characterize different protein structures. Prediction of residue contacts in proteins is an interesting problem whose solution may be useful in protein folding recognition and de novo design. The prediction of these contacts requires the study of the protein inter-residue distances related to the specific type of amino acid pair that are encoded in the so-called contact map. An interesting new way of analyzing those structures came out when network studies were introduced, with pivotal papers demonstrating that protein contact networks also exhibit small-world behavior. In order to highlight constraints for the prediction of protein contact maps and for applications in the field of protein structure prediction and/or reconstruction from experimentally determined contact maps, I studied to which extent the characteristic path length and clustering coefficient of the protein contacts network are values that reveal characteristic features of protein contact maps. Provided that residue contacts are known for a protein sequence, the major features of its 3D structure could be deduced by combining this knowledge with correctly predicted motifs of secondary structure. In the second part of my work I focused on a particular protein structural motif, the coiled-coil, known to mediate a variety of fundamental biological interactions. Coiled-coils are found in a variety of structural forms and in a wide range of proteins including, for example, small units such as leucine zippers that drive the dimerization of many transcription factors or more complex structures such as the family of viral proteins responsible for virus-host membrane fusion. The coiled-coil structural motif is estimated to account for 5-10% of the protein sequences in the various genomes. Given their biological importance, in my work I introduced a Hidden Markov Model (HMM) that exploits the evolutionary information derived from multiple sequence alignments, to predict coiled-coil regions and to discriminate coiled-coil sequences. The results indicate that the new HMM outperforms all the existing programs and can be adopted for the coiled-coil prediction and for large-scale genome annotation. Genome annotation is a key issue in modern computational biology, being the starting point towards the understanding of the complex processes involved in biological networks. The rapid growth in the number of protein sequences and structures available poses new fundamental problems that still deserve an interpretation. Nevertheless, these data are at the basis of the design of new strategies for tackling problems such as the prediction of protein structure and function. Experimental determination of the functions of all these proteins would be a hugely time-consuming and costly task and, in most instances, has not been carried out. As an example, currently, approximately only 20% of annotated proteins in the Homo sapiens genome have been experimentally characterized. A commonly adopted procedure for annotating protein sequences relies on the "inheritance through homology" based on the notion that similar sequences share similar functions and structures. This procedure consists in the assignment of sequences to a specific group of functionally related sequences which had been grouped through clustering techniques. The clustering procedure is based on suitable similarity rules, since predicting protein structure and function from sequence largely depends on the value of sequence identity. However, additional levels of complexity are due to multi-domain proteins, to proteins that share common domains but that do not necessarily share the same function, to the finding that different combinations of shared domains can lead to different biological roles. In the last part of this study I developed and validate a system that contributes to sequence annotation by taking advantage of a validated transfer through inheritance procedure of the molecular functions and of the structural templates. After a cross-genome comparison with the BLAST program, clusters were built on the basis of two stringent constraints on sequence identity and coverage of the alignment. The adopted measure explicity answers to the problem of multi-domain proteins annotation and allows a fine grain division of the whole set of proteomes used, that ensures cluster homogeneity in terms of sequence length. A high level of coverage of structure templates on the length of protein sequences within clusters ensures that multi-domain proteins when present can be templates for sequences of similar length. This annotation procedure includes the possibility of reliably transferring statistically validated functions and structures to sequences considering information available in the present data bases of molecular functions and structures.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Site-directed mutagenesis and combinatorial libraries are powerful tools for providing information about the relationship between protein sequence and structure. Here we report two extensions that expand the utility of combinatorial mutagenesis for the quantitative assessment of hypotheses about the determinants of protein structure. First, we show that resin-splitting technology, which allows the construction of arbitrarily complex libraries of degenerate oligonucleotides, can be used to construct more complex protein libraries for hypothesis testing than can be constructed from oligonucleotides limited to degenerate codons. Second, using eglin c as a model protein, we show that regression analysis of activity scores from library data can be used to assess the relative contributions to the specific activity of the amino acids that were varied in the library. The regression parameters derived from the analysis of a 455-member sample from a library wherein four solvent-exposed sites in an α-helix can contain any of nine different amino acids are highly correlated (P < 0.0001, R2 = 0.97) to the relative helix propensities for those amino acids, as estimated by a variety of biophysical and computational techniques.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Bacterial type III secretion systems deliver protein virulence factors to host cells. Here we characterize the interaction between HrpB2, a small protein secreted by the Xanthomonas citri subsp. citri type III secretion system, and the cytosolic domain of the inner membrane protein HrcU, a paralog of the flagellar protein FlhB. We show that a recombinant fragment corresponding to the C-terminal cytosolic domain of HrcU produced in E. coli suffers cleavage within a conserved Asn264-Pro265-Thr266-His267 (NPTH) sequence. A recombinant HrcU cytosolic domain with N264A, P265A, T266A mutations at the cleavage site (HrcU(AAAH)) was not cleaved and interacted with HrpB2. Furthermore, a polypeptide corresponding to the sequence following the NPTH cleavage site also interacted with HrpB2 indicating that the site for interaction is located after the NPTH site. Non-polar deletion mutants of the hrcU and hrpB2 genes resulted in a total loss of pathogenicity in susceptible citrus plants and disease symptoms could be recovered by expression of HrpB2 and HrcU from extrachromossomal plasmids. Complementation of the Delta hrcU mutant with HrcU(AAAH) produced canker lesions similar to those observed when complemented with wild-type HrcU. HrpB2 secretion however, was significantly reduced in the Delta hrcU mutant complemented with HrcU(AAAH), suggesting that an intact and cleavable NPTH site in HrcU is necessary for total functionally of T3SS in X. citri subsp. citri. Complementation of the Delta hrpB2 X. citri subsp. citri strain with a series of hrpB2 gene mutants revealed that the highly conserved HrpB2 C-terminus is essential for T3SS-dependent development of citrus canker symptoms in planta.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The transcription factors TFIIB, Brf1, and Brf2 share related N-terminal zinc ribbon and core domains. TFIIB bridges RNA polymerase II (Pol II) with the promoter-bound preinitiation complex, whereas Brf1 and Brf2 are involved, as part of activities also containing TBP and Bdp1 and referred to here as Brf1-TFIIIB and Brf2-TFIIIB, in the recruitment of Pol III. Brf1-TFIIIB recruits Pol III to type 1 and 2 promoters and Brf2-TFIIIB to type 3 promoters such as the human U6 promoter. Brf1 and Brf2 both have a C-terminal extension absent in TFIIB, but their C-terminal extensions are unrelated. In yeast Brf1, the C-terminal extension interacts with the TBP/TATA box complex and contributes to the recruitment of Bdp1. Here we have tested truncated Brf2, as well as Brf2/TFIIB chimeric proteins for U6 transcription and for assembly of U6 preinitiation complexes. Our results characterize functions of various human Brf2 domains and reveal that the C-terminal domain is required for efficient association of the protein with U6 promoter-bound TBP and SNAP(c), a type 3 promoter-specific transcription factor, and for efficient recruitment of Bdp1. This in turn suggests that the C-terminal extensions in Brf1 and Brf2 are crucial to specific recruitment of Pol III over Pol II.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

It has long been known that amino acids are the building blocks for proteins and govern their folding into specific three-dimensional structures. However, the details of this process are still unknown and represent one of the main problems in structural bioinformatics, which is a highly active research area with the focus on the prediction of three-dimensional structure and its relationship to protein function. The protein structure prediction procedure encompasses several different steps from searches and analyses of sequences and structures, through sequence alignment to the creation of the structural model. Careful evaluation and analysis ultimately results in a hypothetical structure, which can be used to study biological phenomena in, for example, research at the molecular level, biotechnology and especially in drug discovery and development. In this thesis, the structures of five proteins were modeled with templatebased methods, which use proteins with known structures (templates) to model related or structurally similar proteins. The resulting models were an important asset for the interpretation and explanation of biological phenomena, such as amino acids and interaction networks that are essential for the function and/or ligand specificity of the studied proteins. The five proteins represent different case studies with their own challenges like varying template availability, which resulted in a different structure prediction process. This thesis presents the techniques and considerations, which should be taken into account in the modeling procedure to overcome limitations and produce a hypothetical and reliable three-dimensional structure. As each project shows, the reliability is highly dependent on the extensive incorporation of experimental data or known literature and, although experimental verification of in silico results is always desirable to increase the reliability, the presented projects show that also the experimental studies can greatly benefit from structural models. With the help of in silico studies, the experiments can be targeted and precisely designed, thereby saving both money and time. As the programs used in structural bioinformatics are constantly improved and the range of templates increases through structural genomics efforts, the mutual benefits between in silico and experimental studies become even more prominent. Hence, reliable models for protein three-dimensional structures achieved through careful planning and thoughtful executions are, and will continue to be, valuable and indispensable sources for structural information to be combined with functional data.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

MOTIVATION: The accurate prediction of the quality of 3D models is a key component of successful protein tertiary structure prediction methods. Currently, clustering or consensus based Model Quality Assessment Programs (MQAPs) are the most accurate methods for predicting 3D model quality; however they are often CPU intensive as they carry out multiple structural alignments in order to compare numerous models. In this study, we describe ModFOLDclustQ - a novel MQAP that compares 3D models of proteins without the need for CPU intensive structural alignments by utilising the Q measure for model comparisons. The ModFOLDclustQ method is benchmarked against the top established methods in terms of both accuracy and speed. In addition, the ModFOLDclustQ scores are combined with those from our older ModFOLDclust method to form a new method, ModFOLDclust2, that aims to provide increased prediction accuracy with negligible computational overhead. RESULTS: The ModFOLDclustQ method is competitive with leading clustering based MQAPs for the prediction of global model quality, yet it is up to 150 times faster than the previous version of the ModFOLDclust method at comparing models of small proteins (<60 residues) and over 5 times faster at comparing models of large proteins (>800 residues). Furthermore, a significant improvement in accuracy can be gained over the previous clustering based MQAPs by combining the scores from ModFOLDclustQ and ModFOLDclust to form the new ModFOLDclust2 method, with little impact on the overall time taken for each prediction. AVAILABILITY: The ModFOLDclustQ and ModFOLDclust2 methods are available to download from: http://www.reading.ac.uk/bioinf/downloads/ CONTACT: l.j.mcguffin@reading.ac.uk.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Recent improvements of a hierarchical ab initio or de novo approach for predicting both α and β structures of proteins are described. The united-residue energy function used in this procedure includes multibody interactions from a cumulant expansion of the free energy of polypeptide chains, with their relative weights determined by Z-score optimization. The critical initial stage of the hierarchical procedure involves a search of conformational space by the conformational space annealing (CSA) method, followed by optimization of an all-atom model. The procedure was assessed in a recent blind test of protein structure prediction (CASP4). The resulting lowest-energy structures of the target proteins (ranging in size from 70 to 244 residues) agreed with the experimental structures in many respects. The entire experimental structure of a cyclic α-helical protein of 70 residues was predicted to within 4.3 Å α-carbon (Cα) rms deviation (rmsd) whereas, for other α-helical proteins, fragments of roughly 60 residues were predicted to within 6.0 Å Cα rmsd. Whereas β structures can now be predicted with the new procedure, the success rate for α/β- and β-proteins is lower than that for α-proteins at present. For the β portions of α/β structures, the Cα rmsd's are less than 6.0 Å for contiguous fragments of 30–40 residues; for one target, three fragments (of length 10, 23, and 28 residues, respectively) formed a compact part of the tertiary structure with a Cα rmsd less than 6.0 Å. Overall, these results constitute an important step toward the ab initio prediction of protein structure solely from the amino acid sequence.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A simulation-based modelling approach is used to examine the effects of stratified seed dispersal (representing the distribution of the majority of dispersal around the maternal parent and also rare long-distance dispersal) on the genetic structure of maternally inherited genomes and the colonization rate of expanding plant populations. The model is parameterized to approximate postglacial oak colonization in the UK, but is relevant to plant populations that exhibit stratified seed dispersal. The modelling approach considers the colonization of individual plants over a large area (three 500 km x 10 km rolled transects are used to approximate a 500 km x 300 km area). Our approach shows how the interaction of plant population dynamics with stratified dispersal can result in a spatially patchy haplotype structure. We show that while both colonization speeds and the resulting genetic structure are influenced by the characteristics of the dispersal kernel, they are robust to changes in the periodicity of long-distance events, provided the average number of long-distance dispersal events remains constant. We also consider the effects of additional physical and environmental mechanisms on plant colonization. Results show significant changes in genetic structure when the initial colonization of different haplotypes is staggered over time and when a barrier to colonization is introduced. Environmental influences on survivorship and fecundity affect both the genetic structure and the speed of colonization. The importance of these mechanisms in relation to the postglacial spread and genetic structure of oak in the UK is discussed.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The role of receptor activity modifying protein 1 (RAMP1) in forming receptors with the calcitonin receptor-like receptor (CLR) and the calcitonin receptor (CTR) was examined by producing chimeras between RAMP1 and RAMP3. RAMPs have three extracellular helices. Exchange of helix 1 of the RAMPs or residues 62-69 in helix 2 greatly reduced CLR trafficking (a marker for CLR association). Modeling suggests that these exchanges alter the CLR recognition site on RAMP1, which is more exposed than on RAMP3. Exchange of residues 86-89 of RAMP1 had no effect on the trafficking of CLR but reduced the potency of human (h) alphaCGRP and adrenomedullin. However, these alterations to RAMP1 had no effect on the potency of hbetaCGRP. These residues of RAMP1 lie at the junction of helix 3 and its connecting loop with helix 2. Modeling suggests that the loop is more exposed in RAMP1 than RAMP3; it may play an important role in peptide binding, either directly or indirectly. Exchange of residues 90-94 of RAMP1 caused a modest reduction in CLR expression and a 15-fold decrease in CGRP potency. It is unlikely that the decrease in expression is enough to explain the reduction in potency, and so these may have dual roles in recognizing CLR and CGRP. For CTR, only 6 out of 26 chimeras covering the extracellular part of RAMP1 did not reduce agonist potency. Thus the association of CTR with RAMP1 seems more sensitive to changes in RAMP1 structure induced by the chimeras than is CLR.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Objective In this study, we have used a chemometrics-based method to correlate key liposomal adjuvant attributes with in-vivo immune responses based on multivariate analysis. Methods The liposomal adjuvant composed of the cationic lipid dimethyldioctadecylammonium bromide (DDA) and trehalose 6,6-dibehenate (TDB) was modified with 1,2-distearoyl-sn-glycero-3-phosphocholine at a range of mol% ratios, and the main liposomal characteristics (liposome size and zeta potential) was measured along with their immunological performance as an adjuvant for the novel, postexposure fusion tuberculosis vaccine, Ag85B-ESAT-6-Rv2660c (H56 vaccine). Partial least square regression analysis was applied to correlate and cluster liposomal adjuvants particle characteristics with in-vivo derived immunological performances (IgG, IgG1, IgG2b, spleen proliferation, IL-2, IL-5, IL-6, IL-10, IFN-γ). Key findings While a range of factors varied in the formulations, decreasing the 1,2-distearoyl-sn-glycero-3-phosphocholine content (and subsequent zeta potential) together built the strongest variables in the model. Enhanced DDA and TDB content (and subsequent zeta potential) stimulated a response skewed towards a cell mediated immunity, with the model identifying correlations with IFN-γ, IL-2 and IL-6. Conclusion This study demonstrates the application of chemometrics-based correlations and clustering, which can inform liposomal adjuvant design.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Self-incompatibility RNases (S-RNases) are an allelic series of style glycoproteins associated with rejection of self-pollen in solanaceous plants. The nucleotide sequences of S-RNase alleles from several genera have been determined, but the structure of the gene products has only been described for those from Nicotiana alata. We report on the N-glycan structures and the disulfide bonding of the S-3-RNase from wild tomato (Lycopersicon peruvianum) and use this and other information to construct a model of this molecule. The S-3-RNase has a single N-glycosylation site (Asn-28) to which one of three N-glycans is attached. S-3-RNase has seven Cys residues; six are involved in disulfide linkages (Cys-16-Cys-21, Cys-46-Cys-91, and Cys-166-Cys-177), and one has a free thiol group (Cys-150). The disulfide-bonding pattern is consistent with that observed in RNase Rh, a related RNase for which radiographic-crystallographic information is available. A molecular model of the S-3-RNase shows that four of the most variable regions of the S-RNases are clustered on one surface of the molecule. This is discussed in the context of recent experiments that set out to determine the regions of the S-RNase important for recognition during the self-incompatibility response.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The principal topic of this work is the application of data mining techniques, in particular of machine learning, to the discovery of knowledge in a protein database. In the first chapter a general background is presented. Namely, in section 1.1 we overview the methodology of a Data Mining project and its main algorithms. In section 1.2 an introduction to the proteins and its supporting file formats is outlined. This chapter is concluded with section 1.3 which defines that main problem we pretend to address with this work: determine if an amino acid is exposed or buried in a protein, in a discrete way (i.e.: not continuous), for five exposition levels: 2%, 10%, 20%, 25% and 30%. In the second chapter, following closely the CRISP-DM methodology, whole the process of construction the database that supported this work is presented. Namely, it is described the process of loading data from the Protein Data Bank, DSSP and SCOP. Then an initial data exploration is performed and a simple prediction model (baseline) of the relative solvent accessibility of an amino acid is introduced. It is also introduced the Data Mining Table Creator, a program developed to produce the data mining tables required for this problem. In the third chapter the results obtained are analyzed with statistical significance tests. Initially the several used classifiers (Neural Networks, C5.0, CART and Chaid) are compared and it is concluded that C5.0 is the most suitable for the problem at stake. It is also compared the influence of parameters like the amino acid information level, the amino acid window size and the SCOP class type in the accuracy of the predictive models. The fourth chapter starts with a brief revision of the literature about amino acid relative solvent accessibility. Then, we overview the main results achieved and finally discuss about possible future work. The fifth and last chapter consists of appendices. Appendix A has the schema of the database that supported this thesis. Appendix B has a set of tables with additional information. Appendix C describes the software provided in the DVD accompanying this thesis that allows the reconstruction of the present work.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

All life forms need to monitor carbon and energy availability to survive and this is especially true for plants which must integrate unavoidable environmental conditions with metabolism for cellular homeostasis maintenance. Sugars, in the heart of metabolism, are now recognized as crucial signaling molecules that translate those conditions. One such signal is trehalose 6- phosphate (T6P), a phosphorylated dimer of glucose molecules which levels correlate well with those of sucrose (Suc). Central integrators of stress and energy regulation include the conserved plant Snf1-related kinase1 (SnRK1) which respond to low cellular energy levels by up-regulating energy conserving and catabolic metabolism and down-regulating energy consuming processes. In 2009 T6P was shown to inhibit SnRK1. The in vitro inhibition of SnRK1 by T6P was confirmed in vivo through the observation that genes normally induced by SnRK1 were repressed by T6P and vice-versa, promoting growth processes. These observations provided a model for the regulation of growth by sugar.(...)