11 resultados para hybrid prediction method
em National Center for Biotechnology Information - NCBI
Resumo:
GeneSplicer is a new, flexible system for detecting splice sites in the genomic DNA of various eukaryotes. The system has been tested successfully using DNA from two reference organisms: the model plant Arabidopsis thaliana and human. It was compared to six programs representing the leading splice site detectors for each of these species: NetPlantGene, NetGene2, HSPL, NNSplice, GENIO and SpliceView. In each case GeneSplicer performed comparably to the best alternative, in terms of both accuracy and computational efficiency.
Resumo:
A method is presented for computing the average solution of problems that are too complicated for adequate resolution, but where information about the statistics of the solution is available. The method involves computing average derivatives by interpolation based on linear regression, and an updating of a measure constrained by the available crude information. Examples are given.
Resumo:
Interaction between a peptide hormone and extracellular domains of its receptor is a crucial step for initiation of hormone action. We have developed a modification of the yeast two-hybrid system to study this interaction and have used it to characterize the interaction of insulin-like growth factor 1 (IGF-1) with its receptor by using GAL4 transcriptional regulation with a β-galactosidase assay as readout. In this system, IGF-1 and proIGF-1 bound to the cysteine-rich domain, extracellular domain, or entire IGF-1 proreceptor. This interaction was specific. Thus, proinsulin showed no significant interaction with the IGF-1 receptor, while a chimeric proinsulin containing the C-peptide of IGF-1 had an intermediate interaction, consistent with its affinity for the IGF-1 receptor. Over 2000 IGF-1 mutants were generated by PCR and screened for interaction with the color assay. About 40% showed a strong interaction, 20% showed an intermediate interaction, and 40% give little or no signal. Of 50 mutants that were sequenced, several (Leu-5 → His, Glu-9 → Val, Arg-37 → Gly, and Met-59 → Leu) appeared to enhance receptor association, others resulted in weaker receptor interaction (Tyr-31 → Phe and Ile-43 → Phe), and two gave no detectable signal (Leu-14 → Arg and Glu-46 → Ala). Using PCR-based mutagenesis with proinsulin, we also identified a gain of function mutant (proinsulin Leu-17 → Pro) that allowed for a strong IGF-1–receptor interaction. These data demonstrate that the specificity of the interaction between a hormone and its receptor can be characterized with high efficiency in the two-hybrid system and that novel hormone analogues may be found by this method.
Resumo:
In this study, we estimate the statistical significance of structure prediction by threading. We introduce a single parameter ɛ that serves as a universal measure determining the probability that the best alignment is indeed a native-like analog. Parameter ɛ takes into account both length and composition of the query sequence and the number of decoys in threading simulation. It can be computed directly from the query sequence and potential of interactions, eliminating the need for sequence reshuffling and realignment. Although our theoretical analysis is general, here we compare its predictions with the results of gapless threading. Finally we estimate the number of decoys from which the native structure can be found by existing potentials of interactions. We discuss how this analysis can be extended to determine the optimal gap penalties for any sequence-structure alignment (threading) method, thus optimizing it to maximum possible performance.
Resumo:
Operon structure is an important organization feature of bacterial genomes. Many sets of genes occur in the same order on multiple genomes; these conserved gene groupings represent candidate operons. This study describes a computational method to estimate the likelihood that such conserved gene sets form operons. The method was used to analyze 34 bacterial and archaeal genomes, and yielded more than 7600 pairs of genes that are highly likely (P ≥ 0.98) to belong to the same operon. The sensitivity of our method is 30–50% for the Escherichia coli genome. The predicted gene pairs are available from our World Wide Web site http://www.tigr.org/tigr-scripts/operons/operons.cgi.
Resumo:
Recent improvements of a hierarchical ab initio or de novo approach for predicting both α and β structures of proteins are described. The united-residue energy function used in this procedure includes multibody interactions from a cumulant expansion of the free energy of polypeptide chains, with their relative weights determined by Z-score optimization. The critical initial stage of the hierarchical procedure involves a search of conformational space by the conformational space annealing (CSA) method, followed by optimization of an all-atom model. The procedure was assessed in a recent blind test of protein structure prediction (CASP4). The resulting lowest-energy structures of the target proteins (ranging in size from 70 to 244 residues) agreed with the experimental structures in many respects. The entire experimental structure of a cyclic α-helical protein of 70 residues was predicted to within 4.3 Å α-carbon (Cα) rms deviation (rmsd) whereas, for other α-helical proteins, fragments of roughly 60 residues were predicted to within 6.0 Å Cα rmsd. Whereas β structures can now be predicted with the new procedure, the success rate for α/β- and β-proteins is lower than that for α-proteins at present. For the β portions of α/β structures, the Cα rmsd's are less than 6.0 Å for contiguous fragments of 30–40 residues; for one target, three fragments (of length 10, 23, and 28 residues, respectively) formed a compact part of the tertiary structure with a Cα rmsd less than 6.0 Å. Overall, these results constitute an important step toward the ab initio prediction of protein structure solely from the amino acid sequence.
Resumo:
A method for the quantitative estimation of instability with respect to deamidation of the asparaginyl (Asn) residues in proteins is described. The procedure involves the observation of several simple aspects of the three-dimensional environment of each Asn residue in the protein and a calculation that includes these observations, the primary amino acid residue sequence, and the previously reported complete set of sequence-dependent rates of deamidation for Asn pentapeptides. This method is demonstrated and evaluated for 23 proteins in which 31 unstable and 167 stable Asn residues have been reported and for 7 unstable and 63 stable Asn residues that have been reported in 61 human hemoglobin variants. The relative importance of primary structure and three-dimensional structure in Asn deamidation is estimated.
Resumo:
RNA-protein interactions are pivotal in fundamental cellular processes such as translation, mRNA processing, early development, and infection by RNA viruses. However, in spite of the central importance of these interactions, few approaches are available to analyze them rapidly in vivo. We describe a yeast genetic method to detect and analyze RNA-protein interactions in which the binding of a bifunctional RNA to each of two hybrid proteins activates transcription of a reporter gene in vivo. We demonstrate that this three-hybrid system enables the rapid, phenotypic detection of specific RNA-protein interactions. As examples, we use the binding of the iron regulatory protein 1 (IRP1) to the iron response element (IRE), and of HIV trans-activator protein (Tat) to the HIV trans-activation response element (TAR) RNA sequence. The three-hybrid assay we describe relies only on the physical properties of the RNA and protein, and not on their natural biological activities; as a result, it may have broad application in the identification of RNA-binding proteins and RNAs, as well as in the detailed analysis of their interactions.
Resumo:
The diffusion equation method of global minimization is applied to compute the crystal structure of S6, with no a priori knowledge about the system. The experimental lattice parameters and positions and orientations of the molecules in the unit cell are predicted correctly.
Resumo:
We present a method for predicting protein folding class based on global protein chain description and a voting process. Selection of the best descriptors was achieved by a computer-simulated neural network trained on a data base consisting of 83 folding classes. Protein-chain descriptors include overall composition, transition, and distribution of amino acid attributes, such as relative hydrophobicity, predicted secondary structure, and predicted solvent exposure. Cross-validation testing was performed on 15 of the largest classes. The test shows that proteins were assigned to the correct class (correct positive prediction) with an average accuracy of 71.7%, whereas the inverse prediction of proteins as not belonging to a particular class (correct negative prediction) was 90-95% accurate. When tested on 254 structures used in this study, the top two predictions contained the correct class in 91% of the cases.
Resumo:
Progress in homology modeling and protein design has generated considerable interest in methods for predicting side-chain packing in the hydrophobic cores of proteins. Present techniques are not practically useful, however, because they are unable to model protein main-chain flexibility. Parameterization of backbone motions may represent a general and efficient method to incorporate backbone relaxation into such fixed main-chain models. To test this notion, we introduce a method for treating explicitly the backbone motions of alpha-helical bundles based on an algebraic parameterization proposed by Francis Crick in 1953 [Crick, F. H. C. (1953) Acta Crystallogr. 6, 685-689]. Given only the core amino acid sequence, a simple calculation can rapidly reproduce the crystallographic main-chain and core side-chain structures of three coiled coils (one dimer, one trimer, and one tetramer) to within 0.6-A root-mean-square deviations. The speed of the predictive method [approximately 3 min per rotamer choice on a Silicon Graphics (Mountain View, CA) 4D/35 computer] permits it to be used as a design tool.