43 results for secondary structure analysis
Abstract:
Insoluble expression of heterologous proteins in Escherichia coli is a major bottleneck of many structural genomics and high-throughput protein biochemistry projects. Many of these proteins may be amenable to refolding, but their identification is hampered by a lack of high-throughput methods. We have developed a matrix-assisted refolding approach in which correctly folded proteins are distinguished from misfolded proteins by their elution from affinity resin under nondenaturing conditions. Misfolded proteins remain adhered to the resin, presumably via hydrophobic interactions. The assay can be applied to insoluble proteins on an individual basis but is particularly well suited for high-throughput applications because it is rapid, automatable and has no rigorous sample preparation requirements. The efficacy of the screen is demonstrated on small-scale expression samples for 15 proteins. Refolding is then validated by large-scale expressions using SEC and circular dichroism.
Abstract:
Alfuy virus (ALFV) is classified as a subtype of the flavivirus Murray Valley encephalitis virus (MVEV); however, despite preliminary reports of antigenic and ecological similarities with MVEV, ALFV has not been associated with human disease. Here, it was shown that ALFV is at least 10^4-fold less neuroinvasive than MVEV after peripheral inoculation of 3-week-old Swiss outbred mice, but ALFV demonstrates similar neurovirulence. In addition, it was shown that ALFV is partially attenuated in mice that are deficient in alpha/beta interferon responses, in contrast to MVEV, which is uniformly lethal in these mice. To assess the antigenic relationship between these viruses, a panel of monoclonal antibodies was tested for the ability to bind to ALFV and MVEV in ELISA. Although the majority of monoclonal antibodies recognized both viruses, confirming their antigenic similarity, several discriminating antibodies were identified. Finally, the entire genome of the prototype strain of ALFV (MRM3929) was sequenced and phylogenetically analysed. Nucleotide (73%) and amino acid sequence (83%) identity between ALFV and MVEV confirmed previous reports of their close relationship. Several nucleotide and amino acid deletions and/or substitutions with putative functional significance were identified in ALFV, including the abolition of a conserved glycosylation site in the envelope protein and the deletion of the terminal dinucleotide 5'-CU-OH-3' found in all other members of the genus. These findings confirm previous reports that ALFV is closely related to MVEV, but also highlight significant antigenic, genetic and phenotypic divergence from MVEV. Accordingly, the data suggest that ALFV is a distinct species within the serogroup Japanese encephalitis virus.
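The identity percentages quoted above are the kind of figure obtained by counting matching columns in a pairwise alignment. The following is a minimal sketch of that calculation, assuming the two sequences are already aligned and that gapped columns are excluded from the denominator (conventions differ between tools, and the abstract does not say which was used). The function name and the toy fragments are hypothetical and are not ALFV or MVEV data.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # assumption: gapped columns are not counted
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned fragments (hypothetical, for illustration only).
print(round(percent_identity("AUGGC-AUCC", "AUGACGAUCU"), 1))
```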
Abstract:
High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms. (c) 2005 Elsevier Ltd. All rights reserved.
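The abstract does not name the information-theoretic measure used. As one plausible illustration, the sketch below computes the mutual information between the cysteine codon (TGT or TGC) and a flanking residue from a list of observed pairs. The choice of mutual information, the function name, and the toy observations are all assumptions, not the authors' method or data.

```python
from collections import Counter
from math import log2


def mutual_information(pairs):
    """Mutual information in bits between two discrete variables,
    estimated from observed (x, y) pairs by plug-in frequencies."""
    pairs = list(pairs)
    n = len(pairs)
    if n == 0:
        raise ValueError("need at least one observation")
    joint = Counter(pairs)
    count_x = Counter(x for x, _ in pairs)
    count_y = Counter(y for _, y in pairs)
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) / (p(x) p(y)) simplifies to c * n / (count_x * count_y)
        mi += (c / n) * log2(c * n / (count_x[x] * count_y[y]))
    return mi


# Hypothetical observations: (cysteine codon, residue at position +1).
observations = [("TGT", "A"), ("TGT", "A"), ("TGC", "G"), ("TGC", "G")]
print(mutual_information(observations))
```

A value near zero would suggest that codon choice is independent of the neighbour, while larger values indicate dependence. With real data, small-sample bias in the plug-in estimate would need to be addressed.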
Abstract:
A 13-residue peptide sequence from a respiratory syncytial virus fusion protein was constrained in an alpha-helical conformation by fusing two back-to-back cyclic alpha-turn mimetics. The resulting peptide, Ac-(3→7; 8→12)-bicyclo-FP[KDEFD][KSIRD]V-NH2, was highly alpha-helical in water by CD and NMR spectroscopy, correctly positioning crucial binding residues (F488, I491, V493) on one face of the helix and side chain-side chain linkers on a noninteracting face of the helix. This compound displayed potent activity in both a recombinant fusion assay and an RSV antiviral assay (IC50 = 36 nM) and demonstrates for the first time that back-to-back modular alpha-helix mimetics can produce functional antagonists of important protein-protein interactions.
Abstract:
Purple acid phosphatases are a family of binuclear metallohydrolases that have been identified in plants, animals and fungi. Only one isoform of ~35 kDa has been isolated from animals, where it is associated with bone resorption and microbial killing through its phosphatase activity and hydroxyl radical production, respectively. Using the sensitive PSI-BLAST search method, sequences representing new purple acid phosphatase-like proteins have been identified in mammals, insects and nematodes. These new putative isoforms are closely related to the ~55 kDa purple acid phosphatase characterized from plants. Secondary structure prediction of the new human isoform further confirms its similarity to a purple acid phosphatase from the red kidney bean. A structural model for the human enzyme was constructed based on the red kidney bean purple acid phosphatase structure. This model shows that the catalytic centre observed in other purple acid phosphatases is also present in this new isoform. These observations suggest that the sequences identified in this study represent a novel subfamily of plant-like purple acid phosphatases in animals and humans. (c) 2006 Elsevier B.V. All rights reserved.
Abstract:
Hydrophobins are small (~100 aa) proteins that have an important role in the growth and development of mycelial fungi. They are surface active and, after secretion by the fungi, self-assemble into amphipathic membranes at hydrophobic/hydrophilic interfaces, reversing the hydrophobicity of the surface. In this study, molecular dynamics simulation techniques have been used to model the process by which a specific class I hydrophobin, SC3, binds to a range of hydrophobic/hydrophilic interfaces. The structure of SC3 used in this investigation was modeled based on the crystal structure of the class II hydrophobin HFBII using the assumption that the disulfide pairings of the eight conserved cysteine residues are maintained. The proposed model for SC3 in aqueous solution is compact and globular, containing primarily β-strand and coil structures. The behavior of this model of SC3 was investigated at an air/water, an oil/water, and a hydrophobic solid/water interface. It was found that SC3 preferentially binds to the interfaces via the loop region between the third and fourth cysteine residues and that binding is associated with an increase in α-helix formation, in qualitative agreement with experiment. Based on a combination of the available experimental data and the current simulation studies, we propose a possible model for SC3 self-assembly on a hydrophobic solid/water interface.
Abstract:
Heterogeneous nuclear ribonucleoprotein (hnRNP) A2 is a multitasking protein involved in RNA packaging, alternative splicing of pre-mRNA, telomere maintenance, cytoplasmic RNA trafficking, and translation. It binds short segments of single-stranded nucleic acids, including the A2RE11 RNA element that is necessary and sufficient for cytoplasmic transport of a subset of mRNAs in oligodendrocytes and neurons. We have explored the structures of hnRNP A2, its RNA recognition motifs (RRMs) and Gly-rich module, and the RRM complexes with A2RE11. Circular dichroism spectroscopy showed that the secondary structure of the first 189 residues of hnRNP A2 parallels that of the tandem βαββαβ RRMs of its paralogue, hnRNP A1, previously deduced from X-ray diffraction studies. The unusual Gly-rich domain (GRD) was shown to have substantial β-sheet and β-turn structure. Sedimentation equilibrium and circular dichroism results were consistent with the tandem RRM region being monomeric and supported earlier evidence for the binding of two A2RE11 oligoribonucleotides to this domain, in contrast to the protein dimer formed by the complex of hnRNP A1 with the telomeric ssDNA repeat. A three-dimensional structure for the N-terminal, two-RRM-containing segment of hnRNP A2 was derived by homology modeling. This structure was used to derive a model for the complex with A2RE11 using the previously described interaction of pairs of stacked nucleotides with aromatic residues on the RRM β-sheet platforms, conserved in other RRM-RNA complexes, together with biochemical data and molecular dynamics-based observations of inter-RRM mobility.
Abstract:
Kunjin virus is a member of the Flavivirus genus and is an Australian variant of West Nile virus. The C-terminal domain of the Kunjin virus NS3 protein displays helicase activity. The protein is thought to separate daughter and template RNA strands, assisting the initiation of replication by unwinding RNA secondary structure in the 3' nontranslated region. Expression, purification and preliminary crystallographic characterization of the NS3 helicase domain are reported. It is shown that Kunjin virus helicase may adopt a dimeric assembly in the absence of nucleic acids, oligomerization being a means to provide the helicases with multiple nucleic acid-binding capability, facilitating translocation along the RNA strands. The Kunjin virus NS3 helicase domain is an attractive model for studying the molecular mechanisms of flavivirus replication, while simultaneously providing a new basis for the rational development of anti-flaviviral compounds.
Abstract:
Background: The residue-wise contact order (RWCO) describes the sequence separations between a residue of interest and its contacting residues in a protein sequence. It is a new kind of one-dimensional protein structure that represents the extent of long-range contacts and is considered a generalization of contact order. Together with secondary structure, accessible surface area, the B factor, and contact number, RWCO provides comprehensive and indispensable information for reconstructing the protein three-dimensional structure from a set of one-dimensional structural properties. Accurately predicting RWCO values could have many important applications in protein three-dimensional structure prediction and protein folding rate prediction, and give deep insights into protein sequence-structure relationships. Results: We developed a novel approach to predict residue-wise contact order values in proteins based on support vector regression (SVR), starting from primary amino acid sequences. We explored seven different sequence encoding schemes to examine their effects on the prediction performance, including local sequence in the form of PSI-BLAST profiles, local sequence plus amino acid composition, local sequence plus molecular weight, local sequence plus secondary structure predicted by PSIPRED, local sequence plus molecular weight and amino acid composition, local sequence plus molecular weight and predicted secondary structure, and local sequence plus molecular weight, amino acid composition and predicted secondary structure. When using local sequences with multiple sequence alignments in the form of PSI-BLAST profiles, we could predict the RWCO distribution with a Pearson correlation coefficient (CC) between the predicted and observed RWCO values of 0.55, and root mean square error (RMSE) of 0.82, based on a well-defined dataset with 680 protein sequences.
Moreover, by incorporating global features such as molecular weight and amino acid composition we could further improve the prediction performance, raising the CC to 0.57 and lowering the RMSE to 0.79. In addition, incorporating the secondary structure predicted by PSIPRED was found to significantly improve the prediction performance and yielded the best prediction accuracy, with a CC of 0.60 and RMSE of 0.78, a performance at least comparable to that of the other existing methods. Conclusion: The SVR method shows a prediction performance competitive with or at least comparable to the previously developed linear regression-based methods for predicting RWCO values. In contrast to support vector classification (SVC), SVR is very good at estimating the raw value profiles of the samples. The successful application of the SVR approach in this study reinforces the fact that support vector regression is a powerful tool in extracting the protein sequence-structure relationship and in estimating the protein structural profiles from amino acid sequences.
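The two evaluation measures reported in this abstract, the Pearson correlation coefficient (CC) and the root mean square error (RMSE), can be sketched as follows. This is a plain-Python illustration of the standard definitions. The function names and the toy values are hypothetical, and the abstract does not state whether the RWCO values were normalized before scoring.

```python
from math import sqrt


def pearson_cc(predicted, observed):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(predicted) != len(observed) or len(predicted) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    if var_p == 0 or var_o == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_p * var_o)


def rmse(predicted, observed):
    """Root mean square error between predicted and observed values."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("need two non-empty sequences of equal length")
    return sqrt(sum((p - o) ** 2 for p, o in zip(predicted, observed)) / len(predicted))


# Toy per-residue values (hypothetical, not from the 680-protein dataset).
pred = [0.2, 0.5, 0.9, 1.4]
obs = [0.1, 0.7, 1.0, 1.2]
print(pearson_cc(pred, obs), rmse(pred, obs))
```

The two measures are complementary: CC captures whether the predicted profile tracks the shape of the observed one, while RMSE penalizes absolute deviations, so a predictor can score well on one and poorly on the other.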
Abstract:
Aims: To estimate (i) Australian government taxation revenue collected from the consumption of alcohol by adolescents and (ii) the amount spent by the government on interventions aimed at educating adolescents about the potential dangers of alcohol use. Design: Secondary data analysis. Setting: Australia. Findings: Australian adolescents (aged between 12 and 17 years, inclusive) spent approximately $217 million on alcoholic beverages in 2002, netting the Australian government approximately $112 million in tax revenue. This resulted in an average of $195 earned in tax per adolescent drinker. It is estimated that the Government spent approximately $17 million on adolescent drinking interventions in 2002, equating to an expenditure of about $10.51 per adolescent on the delivery of alcohol interventions. For every dollar spent on alcohol interventions aimed at adolescents, it is estimated that the government receives around $7 in alcohol tax revenue. Conclusions: A substantial disparity exists between the amount of tax revenue received by the Australian Government from adolescent drinkers and the overall amount spent in attempting to prevent and relieve some of the problems associated with adolescent problem drinking. (c) 2006 Elsevier Ltd. All rights reserved.
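The headline ratio in this abstract can be checked with simple arithmetic on the quoted figures. The sketch below is a back-of-envelope calculation only. The two "implied" head counts are derived here by division and are not figures reported in the abstract.

```python
# Figures quoted in the abstract (Australian dollars, 2002).
tax_revenue = 112e6            # tax collected from adolescent drinking
intervention_spend = 17e6      # government spend on adolescent interventions
tax_per_drinker = 195.0        # average tax per adolescent drinker
spend_per_adolescent = 10.51   # intervention spend per adolescent

# Revenue received per dollar spent on interventions (abstract: "around $7").
revenue_per_dollar_spent = tax_revenue / intervention_spend

# Derived, not reported: the head counts implied by the per-person figures.
implied_drinkers = tax_revenue / tax_per_drinker
implied_adolescents = intervention_spend / spend_per_adolescent

print(f"revenue per intervention dollar: {revenue_per_dollar_spent:.2f}")
print(f"implied adolescent drinkers:     {implied_drinkers:,.0f}")
print(f"implied adolescent population:   {implied_adolescents:,.0f}")
```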
Abstract:
Background: Designing novel proteins with site-directed recombination has enormous prospects. By locating effective recombination sites for swapping sequence parts, the probability that hybrid sequences have the desired properties is increased dramatically. The prohibitive requirements for applying current tools led us to investigate machine learning to assist in finding useful recombination sites from amino acid sequence alone. Results: We present STAR, Site Targeted Amino acid Recombination predictor, which produces a score indicating the structural disruption caused by recombination, for each position in an amino acid sequence. Example predictions contrasted with those of alternative tools illustrate STAR's utility to assist in determining useful recombination sites. Overall, the correlation coefficient between the output of the experimentally validated protein design algorithm SCHEMA and the prediction of STAR is very high (0.89). Conclusion: STAR allows the user to explore useful recombination sites in amino acid sequences with unknown structure and unknown evolutionary origin. The predictor service is available from http://pprowler.itee.uq.edu.au/star.
Abstract:
We present a machine learning model that predicts a structural disruption score from a protein’s primary structure. SCHEMA was introduced by Frances Arnold and colleagues as a method for determining putative recombination sites of a protein on the basis of the full (PDB) description of its structure. The present method provides an alternative to SCHEMA that is able to determine the same score from sequence data only. Circumventing the need for resolving the full structure enables the exploration of as yet unresolved and even hypothetical sequences for protein design efforts. Deriving the SCHEMA score from a primary structure is achieved using a two-step approach: first predicting a secondary structure from the sequence and then predicting the SCHEMA score from the predicted secondary structure. The correlation coefficient for the prediction is 0.88 and indicates the feasibility of replacing SCHEMA with little loss of precision. ©2005 IEEE
Abstract:
The DNA microarray is a powerful tool for measuring the levels of a mixed population of nucleic acids simultaneously, and it has had great impact on many areas of life sciences research. To distinguish nucleic acids of very similar composition by hybridization, it is necessary to design probes with high specificity (i.e., uniqueness) and high sensitivity (i.e., a suitable melting temperature and no secondary structure). We use available biology tools to obtain the necessary sequence information for human chromosome 12 and combine it with an evolutionary strategy (ES) to find unique subsequences representing all predicted exons. The results are presented and discussed.
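Two of the probe criteria named in this abstract, uniqueness and a suitable melting temperature, can be illustrated with a simple candidate filter. The sketch below uses the Wallace rule (2 °C per A/T, 4 °C per G/C), a rough estimate intended for short oligonucleotides, and exact-match counting for uniqueness. The abstract does not describe the authors' scoring, so the function names, the temperature window, and the toy sequences are assumptions. The secondary-structure criterion and the evolutionary search itself are not modelled here.

```python
def wallace_tm(probe: str) -> int:
    """Wallace-rule melting temperature estimate in degrees Celsius."""
    probe = probe.upper()
    at = probe.count("A") + probe.count("T")
    gc = probe.count("G") + probe.count("C")
    if at + gc != len(probe):
        raise ValueError("probe must contain only A, C, G, T")
    return 2 * at + 4 * gc


def count_occurrences(probe: str, sequence: str) -> int:
    """Count exact, possibly overlapping, occurrences of probe in sequence."""
    probe, sequence = probe.upper(), sequence.upper()
    return sum(
        sequence.startswith(probe, i)
        for i in range(len(sequence) - len(probe) + 1)
    )


def is_candidate(probe, target, background, tm_range=(50, 60)):
    """True if probe occurs once in target, never in background,
    and its estimated Tm falls inside tm_range (hypothetical window)."""
    unique = (
        count_occurrences(probe, target) == 1
        and count_occurrences(probe, background) == 0
    )
    return unique and tm_range[0] <= wallace_tm(probe) <= tm_range[1]


# Toy sequences (hypothetical, not chromosome 12 data).
probe = "ACGTACGTACGTACGTAC"
print(wallace_tm(probe), is_candidate(probe, "TTTT" + probe + "GGGG", "GGGGCCCC"))
```

In a search such as the one described, a filter of this kind could serve as part of the fitness function that scores candidate subsequences. A production design would normally replace the Wallace rule with a nearest-neighbour thermodynamic model and use an alignment tool rather than exact matching to assess cross-hybridization.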