39 results for secondary structure detection
in University of Queensland eSpace - Australia
Abstract:
Motivation: Conformational flexibility is essential to the function of many proteins, e.g. catalytic activity. To assist efforts in determining and exploring the functional properties of a protein, it is desirable to automatically identify regions that are prone to undergo conformational changes. It was recently shown that a probabilistic predictor of continuum secondary structure is more accurate than categorical predictors for structurally ambivalent sequence regions, suggesting that such models are suited to characterize protein flexibility. Results: We develop a computational method for identifying regions that are prone to conformational change directly from the amino acid sequence. The method uses the entropy of the probabilistic output of an 8-class continuum secondary structure predictor. Results for 171 unique amino acid sequences with well-characterized variable structure (identified in the 'Macromolecular movements database') indicate that the method is highly sensitive at identifying flexible protein regions, but false positives remain a problem. The method can be used to explore conformational flexibility of proteins (including hypothetical or synthetic ones) whose structure is yet to be determined experimentally.
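The entropy score described in this abstract can be illustrated with a minimal sketch. It assumes the predictor yields one 8-element probability vector per residue. The function names and the 1.5-bit cutoff are hypothetical choices made for illustration and are not taken from the paper.

```python
import math


def residue_entropy(probs):
    """Shannon entropy, in bits, of one residue's class-probability vector."""
    total = sum(probs)
    if total <= 0:
        raise ValueError("probabilities must sum to a positive value")
    entropy = 0.0
    for p in probs:
        q = p / total  # renormalise to guard against rounding in the input
        if q > 0:
            entropy -= q * math.log2(q)
    return entropy


def flag_flexible(prob_rows, threshold=1.5):
    """Indices of residues whose entropy exceeds `threshold`.

    The threshold is an illustrative assumption, not a published value.
    """
    return [i for i, row in enumerate(prob_rows)
            if residue_entropy(row) > threshold]


# Toy input: one confidently predicted residue, one fully ambivalent residue.
confident = [0.93, 0.01, 0.01, 0.01, 0.01, 0.01, 0.01, 0.01]
ambivalent = [0.125] * 8
flagged = flag_flexible([confident, ambivalent])
```

A uniform distribution over 8 classes gives the maximum entropy of 3 bits, so only the ambivalent residue is flagged here. Lowering the threshold raises sensitivity at the cost of more false positives, which is the trade-off the abstract reports.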
Abstract:
Background: The structure of proteins may change as a result of the inherent flexibility of some protein regions. We develop and explore probabilistic machine learning methods for predicting a continuum secondary structure, i.e. assigning probabilities to the conformational states of a residue. We train our methods using data derived from high-quality NMR models. Results: Several probabilistic models not only successfully estimate the continuum secondary structure, but also provide a categorical output on par with models directly trained on categorical data. Importantly, models trained on the continuum secondary structure are also better than their categorical counterparts at identifying the conformational state for structurally ambivalent residues. Conclusion: Cascaded probabilistic neural networks trained on the continuum secondary structure exhibit better accuracy in structurally ambivalent regions of proteins, while sustaining an overall classification accuracy on par with standard, categorical prediction methods.
Abstract:
Our previous studies using trans-complementation analysis of Kunjin virus (KUN) full-length cDNA clones harboring in-frame deletions in the NS3 gene demonstrated the inability of these defective complemented RNAs to be packaged into virus particles (W. J. Liu, P. L. Sedlak, N. Kondratieva, and A. A. Khromykh, J. Virol. 76:10766-10775). In this study we aimed to establish whether this requirement for NS3 in RNA packaging is determined by the secondary RNA structure of the NS3 gene or by the essential role of the translated NS3 gene product. Multiple silent mutations of three computer-predicted stable RNA structures in the NS3 coding region of KUN replicon RNA aimed at disrupting RNA secondary structure without affecting amino acid sequence did not affect RNA replication and packaging into virus-like particles in the packaging cell line, thus demonstrating that the predicted conserved RNA structures in the NS3 gene do not play a role in RNA replication and/or packaging. In contrast, double frameshift mutations in the NS3 coding region of full-length KUN RNA, producing scrambled NS3 protein but retaining secondary RNA structure, resulted in the loss of ability of these defective RNAs to be packaged into virus particles in complementation experiments in KUN replicon-expressing cells. Furthermore, the more robust complementation-packaging system based on established stable cell lines producing large amounts of complemented replicating NS3-deficient replicon RNAs and infection with KUN virus to provide structural proteins also failed to detect any secreted virus-like particles containing packaged NS3-deficient replicon RNAs. These results have now firmly established the requirement of KUN NS3 protein translated in cis for genome packaging into virus particles.
Abstract:
The structure of a novel plant defensin isolated from the flowers of Petunia hybrida has been determined by 1H NMR spectroscopy. P. hybrida defensin 1 (PhD1) is a basic, cysteine-rich, antifungal protein of 47 residues and is the first example of a new subclass of plant defensins with five disulfide bonds whose structure has been determined. PhD1 has the fold of the cysteine-stabilized alpha-beta motif, consisting of an alpha-helix and a triple-stranded antiparallel beta-sheet, except that it contains a fifth disulfide bond from the first loop to the alpha-helix. The additional disulfide bond is accommodated in PhD1 without any alteration of its tertiary structure with respect to other plant defensins. Comparison of its structure with those of classic, four-disulfide defensins has allowed us to identify a previously unrecognized hydrogen bond network that is integral to structure stabilization in the family.
Abstract:
We discuss recent progress towards the establishment of important structure-property-function relationships in eumelanins, key functional bio-macromolecular systems responsible for photoprotection and immune response in humans and implicated in the development of melanoma skin cancer. We focus on the link between eumelanin's secondary structure and optical properties such as broadband UV-visible absorption and strong non-radiative relaxation, both key features of the photo-protective function. We emphasise the insights gained through a holistic approach combining optical spectroscopy with first principles quantum chemical calculations, and advance the hypothesis that the robust functionality characteristic of eumelanin is related to extreme chemical and structural disorder at the secondary level. This inherent disorder is a low-cost natural resource, and it is interesting to speculate as to whether it may play a role in other functional bio-macromolecular systems.
Abstract:
The caseins (alpha(s1), alpha(s2), beta, and kappa) are phosphoproteins present in bovine milk that have been studied for over a century and whose structures remain obscure. Here we describe the chemical synthesis and structure elucidation of the N-terminal segment (1-44) of bovine kappa-casein, the protein which maintains the micellar structure of the caseins. Kappa-casein (1-44) was synthesised by highly optimised Boc solid-phase peptide chemistry and characterised by mass spectrometry. Structure elucidation was carried out by circular dichroism (CD) and nuclear magnetic resonance (NMR) spectroscopy. CD analysis demonstrated that the segment was ill-defined in aqueous medium but in 30% trifluoroethanol it exhibited considerable helical structure. Further, NMR analysis showed the presence of a helical segment containing 26 residues which extends from Pro(8) to Arg(34). This is the first report which demonstrates extensive secondary structure within the casein class of proteins. (c) 2006 Elsevier Inc. All rights reserved.
Abstract:
Plant resistance proteins (R proteins) recognize corresponding pathogen avirulence (Avr) proteins either indirectly through detection of changes in their host protein targets or through direct R-Avr protein interaction. Although indirect recognition imposes selection against Avr effector function, pathogen effector molecules recognized through direct interaction may overcome resistance through sequence diversification rather than loss of function. Here we show that the flax rust fungus AvrL567 genes, whose products are recognized by the L5, L6, and L7 R proteins of flax, are highly diverse, with 12 sequence variants identified from six rust strains. Seven AvrL567 variants derived from Avr alleles induce necrotic responses when expressed in flax plants containing corresponding resistance genes (R genes), whereas five variants from avr alleles do not. Differences in recognition specificity between AvrL567 variants and evidence for diversifying selection acting on these genes suggest they have been involved in a gene-specific arms race with the corresponding flax R genes. Yeast two-hybrid assays indicate that recognition is based on direct R-Avr protein interaction and recapitulate the interaction specificity observed in planta. Biochemical analysis of Escherichia coli-produced AvrL567 proteins shows that variants that escape recognition nevertheless maintain a conserved structure and stability, suggesting that the amino acid sequence differences directly affect the R-Avr protein interaction. We suggest that direct recognition associated with high genetic diversity at corresponding R and Avr gene loci represents an alternative outcome of plant-pathogen coevolution to indirect recognition associated with simple balanced polymorphisms for functional and nonfunctional R and Avr genes.
Abstract:
The albA gene from Klebsiella oxytoca encodes a protein that binds albicidin phytotoxins and antibiotics with high affinity. Previously, it has been shown that shifting pH from 6 to 4 reduces binding activity of AlbA by about 30%, indicating that histidine residues might be involved in substrate binding. In this study, molecular analysis of the albA coding region revealed sequence discrepancies with the albA sequence reported previously, which were probably due to sequencing errors. The albA gene was subsequently cloned from K. oxytoca ATCC 13182(T) to establish the revised sequence. Biochemical and molecular approaches were used to determine the functional role of four histidine residues (His(78), His(125), His(141) and His(189)) in the corrected sequence for AlbA. Treatment of AlbA with diethyl pyrocarbonate (DEPC), a histidine-specific alkylating reagent, reduced binding activity by about 95%. DEPC treatment increased absorbance at 240-244 nm by an amount indicating conversion to N-carbethoxyhistidine of a single histidine residue per AlbA molecule. Pretreatment with albicidin protected AlbA against modification by DEPC, with a 1:1 molar ratio of albicidin to the protected histidine residues. Based on protein secondary structure and amino acid surface probability indices, it is predicted that His125 might be the residue required for albicidin binding. Mutation of His125 to either alanine or leucine resulted in about 32% loss of binding activity, and deletion of His125 totally abolished binding activity. Mutation of His125 to arginine and tyrosine had no effect. These results indicate that His125 plays a key role either in an electrostatic interaction between AlbA and albicidin or in the conformational dynamics of the albicidin-binding site.
Abstract:
Limited but significant sequence similarity has been observed between an uncharacterized human protein, SIN1, and the S. pombe SIN1, Dictyostelium RIP3 and S. cerevisiae AVO1 proteins. The human Sin1 gene has been automatically predicted (MAPKAP1; GenBank accession number NM_024117); however, this sequence appears to be incomplete. In this study, we have cloned and characterized the full-length human Sin1 mRNA and identified a highly conserved domain that defines the family of SIN1 orthologues, members of which are widely distributed in the fungal and metazoan kingdoms. We demonstrate that Sin1 transcripts can use alternative polyadenylation signals and describe a number of Sin1 splice variants that potentially encode functionally different isoforms. (C) 2004 Elsevier B.V. All rights reserved.
Abstract:
The DNA microarray is a powerful tool for measuring the levels of a mixed population of nucleic acids at one time, and it has had great impact in many aspects of life sciences research. In order to distinguish nucleic acids with very similar composition by hybridization, it is necessary to design microarray probes with high specificity and sensitivity. Highly specific probes are those with unique DNA sequences, whereas highly sensitive probes are those with a melting temperature within a desired range and no secondary structure. The selection of these probes from a set of functional DNA sequences (exons) constitutes a computationally expensive discrete non-linear search problem. We delegate the search task to a simple yet effective Evolution Strategy algorithm. The computational efficiency is also greatly improved by making use of an available bioinformatics tool.
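The probe-selection search can be sketched as follows. The abstract does not say which melting-temperature model, secondary-structure check, or Evolution Strategy variant the authors used, so everything here is an assumption: the Wallace rule stands in for the melting temperature, a crude reverse-complement stem test stands in for secondary-structure detection, and a (1+lambda) strategy searches over probe start positions. Probe uniqueness (specificity) is not modelled, and all names and default values are hypothetical.

```python
import random


def wallace_tm(seq):
    """Wallace-rule melting temperature estimate in deg C (rough, short oligos)."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def reverse_complement(seq):
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def has_hairpin(seq, stem=4, min_loop=3):
    """Crude test: can any stem-length word pair with a downstream segment?"""
    for i in range(len(seq) - stem + 1):
        word = seq[i:i + stem]
        if reverse_complement(word) in seq[i + stem + min_loop:]:
            return True
    return False


def probe_cost(probe, tm_lo, tm_hi, hairpin_penalty=10.0):
    """Lower is better: distance outside the Tm window plus a hairpin penalty."""
    tm = wallace_tm(probe)
    out_of_range = max(tm_lo - tm, 0, tm - tm_hi)
    return out_of_range + (hairpin_penalty if has_hairpin(probe) else 0.0)


def evolve_probe(exon, length=20, tm_range=(50, 60),
                 offspring=5, generations=50, seed=0):
    """(1+lambda) Evolution Strategy over the probe start position."""
    max_start = len(exon) - length
    if max_start < 0:
        raise ValueError("exon is shorter than the requested probe length")
    rng = random.Random(seed)
    sigma = max(1, max_start // 4)  # mutation step size, an arbitrary choice

    def cost(start):
        return probe_cost(exon[start:start + length], *tm_range)

    parent = rng.randint(0, max_start)
    for _ in range(generations):
        children = [min(max_start, max(0, parent + round(rng.gauss(0, sigma))))
                    for _ in range(offspring)]
        best = min(children, key=cost)
        if cost(best) <= cost(parent):
            parent = best
    return parent, exon[parent:parent + length]
```

Calling `evolve_probe(exon_sequence)` returns a start position and the probe at that position. A real pipeline would add a uniqueness check against the other target sequences and a thermodynamically grounded structure prediction.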
Abstract:
Short peptides corresponding to two to four α-helical turns of proteins are not thermodynamically stable helices in water. Unstructured octapeptide Ac-His1*-Ala2-Ala3-His4*-His5*-Glu6-Leu7-His8*-NH2 (1) reacts with two [Pd(15NH2(CH2)2 15NH2)(NO3)2] in water to form a kinetically stable intermediate, [{Pden}2-{(1,4)(5,8)-peptide}] (2), in which two 19-membered metallocyclic rings stabilize two peptide turns. Slow subsequent folding to a thermodynamically more stable two-turn α-helix drives the equilibrium to [{Pden}2-{(1,5)(4,8)-peptide}] (3), featuring two 22-membered rings. This transformation from unstructured peptide via turns to an α-helix suggests that metal clips might be useful probes for investigating peptide folding.
Abstract:
We describe a new method for using neural networks to predict residue contact pairs in a protein. The main inputs to the neural network are a set of 25 measures of correlated mutation between all pairs of residues in two windows of size 5 centered on the residues of interest. While the individual pair-wise correlations are a relatively weak predictor of contact, by training the network on windows of correlation the accuracy of prediction is significantly improved. The neural network is trained on a set of 100 proteins and then tested on a disjoint set of 1033 proteins of known structure. An average predictive accuracy of 21.7% is obtained taking the best L/2 predictions for each protein, where L is the sequence length. Taking the best L/10 predictions gives an average accuracy of 30.7%. The predictor is also tested on a set of 59 proteins from the CASP5 experiment. The accuracy is found to be relatively consistent across different sequence lengths, but to vary widely according to the secondary structure. Predictive accuracy is also found to improve by using multiple sequence alignments containing many sequences to calculate the correlations. (C) 2004 Wiley-Liss, Inc.
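The input encoding and the evaluation protocol in this abstract can be sketched as below. The sketch assumes a precomputed square matrix of correlated-mutation scores, `corr`, indexed by residue position. The zero padding at the sequence ends and the function names are hypothetical, since the abstract does not describe how edges are handled.

```python
def window_features(corr, i, j, half=2, pad=0.0):
    """25 correlation values between a size-5 window around residue i
    and a size-5 window around residue j (row-major order).

    Positions that fall off the ends of the sequence are filled with `pad`,
    which is an assumption made for this sketch.
    """
    n = len(corr)
    feats = []
    for a in range(i - half, i + half + 1):
        for b in range(j - half, j + half + 1):
            if 0 <= a < n and 0 <= b < n:
                feats.append(corr[a][b])
            else:
                feats.append(pad)
    return feats


def top_k_accuracy(scored_pairs, contacts, k):
    """Fraction of the k highest-scoring pairs that are true contacts.

    `scored_pairs` is a list of ((i, j), score) tuples and `contacts`
    is a set of (i, j) tuples.
    """
    ranked = sorted(scored_pairs, key=lambda item: item[1], reverse=True)[:k]
    if not ranked:
        return 0.0
    hits = sum(1 for pair, _ in ranked if pair in contacts)
    return hits / len(ranked)
```

For a protein of length L, the figures quoted in the abstract correspond to calling `top_k_accuracy` with `k = L // 2` and `k = L // 10` on the network's scored residue pairs.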
Abstract:
Cystic fibrosis is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene, which encodes a chloride channel present in many cells. We report that, in cardiomyocytes, multiple exon 1 usage and alternative splicing produce four CFTR transcripts with different 5'-untranslated regions: CFTRTRAD-139, CFTR-1C/-1A, CFTR-1C, and CFTR-1B. CFTR transcripts containing the novel upstream exons (exons -1C, -1B, and -1A) represent more than 90% of cardiac expressed CFTR mRNA. Regulation of cardiac CFTR expression, in response to developmental and pathological stimuli, is exclusively due to the modulation of CFTR-1C and CFTR-1C/-1A expression. Upstream open reading frames have been identified in the 5'-untranslated regions of all CFTR transcripts that, in conjunction with adjacent stem-loop structures, modulate the efficiency of translation initiation at the AUG codon of the main CFTR coding region in CFTRTRAD-139 and CFTR-1C/-1A transcripts. Exon(-1A), only present in CFTR-1C/-1A transcripts, encodes an AUG codon that is in-frame with the main CFTR open reading frame, the efficient translation of which produces a novel CFTR protein isoform with a curtailed amino terminus. As the expression of this CFTR transcript parallels the spatial and temporal distribution of the cAMP-activated whole-cell current density in normal and diseased hearts, we suggest that CFTR-1C/-1A provides the molecular basis for the cardiac cAMP-activated chloride channel. Our findings provide further insight into the complex nature of in vivo CFTR expression, to which multiple mRNA transcripts, protein isoforms, and post-transcriptional regulatory mechanisms are now added.
Abstract:
Eukaryotic gene expression, reflected in the amount of steady-state mRNA, is regulated at the post-transcriptional level. The 5'-untranslated regions (5'-UTRs) of some transcripts contain cis-acting elements, including upstream open reading frames (uORFs), that have been identified as being fundamental in modulating translation efficiency and mRNA stability. Previously, we demonstrated that uORFs present in the 5'-UTR of cystic fibrosis transmembrane conductance regulator (CFTR) transcripts expressed in the heart were able to modulate translation efficiency of the main CFTR ORF. Here, we show that the same 5'-UTR elements are associated with the differential stability of the 5'-UTR compared to the main coding region of CFTR transcripts. Furthermore, these post-transcriptional mechanisms are important factors governing regulated CFTR expression in the heart, in response to developmental and pathophysiological stimuli. (C) 2004 Elsevier Inc. All rights reserved.