907 resultados para Prediction of scholastic success.
Resumo:
Operon structure is an important organization feature of bacterial genomes. Many sets of genes occur in the same order on multiple genomes; these conserved gene groupings represent candidate operons. This study describes a computational method to estimate the likelihood that such conserved gene sets form operons. The method was used to analyze 34 bacterial and archaeal genomes, and yielded more than 7600 pairs of genes that are highly likely (P ≥ 0.98) to belong to the same operon. The sensitivity of our method is 30–50% for the Escherichia coli genome. The predicted gene pairs are available from our World Wide Web site http://www.tigr.org/tigr-scripts/operons/operons.cgi.
Resumo:
Single-stranded regions in RNA secondary structure are important for RNA–RNA and RNA–protein interactions. We present a probability profile approach for the prediction of these regions based on a statistical algorithm for sampling RNA secondary structures. For the prediction of phylogenetically-determined single-stranded regions in secondary structures of representative RNA sequences, the probability profile offers substantial improvement over the minimum free energy structure. In designing antisense oligonucleotides, a practical problem is how to select a secondary structure for the target mRNA from the optimal structure(s) and many suboptimal structures with similar free energies. By summarizing the information from a statistical sample of probable secondary structures in a single plot, the probability profile not only presents a solution to this dilemma, but also reveals ‘well-determined’ single-stranded regions through the assignment of probabilities as measures of confidence in predictions. In antisense application to the rabbit β-globin mRNA, a significant correlation between hybridization potential predicted by the probability profile and the degree of inhibition of in vitro translation suggests that the probability profile approach is valuable for the identification of effective antisense target sites. Coupling computational design with DNA–RNA array technique provides a rational, efficient framework for antisense oligonucleotide screening. This framework has the potential for high-throughput applications to functional genomics and drug target validation.
Resumo:
A method for the quantitative estimation of instability with respect to deamidation of the asparaginyl (Asn) residues in proteins is described. The procedure involves the observation of several simple aspects of the three-dimensional environment of each Asn residue in the protein and a calculation that includes these observations, the primary amino acid residue sequence, and the previously reported complete set of sequence-dependent rates of deamidation for Asn pentapeptides. This method is demonstrated and evaluated for 23 proteins in which 31 unstable and 167 stable Asn residues have been reported and for 7 unstable and 63 stable Asn residues that have been reported in 61 human hemoglobin variants. The relative importance of primary structure and three-dimensional structure in Asn deamidation is estimated.
Resumo:
We present rules that allow one to predict the stability of DNA pyrimidine.purine.pyrimidine (Y.R.Y) triple helices on the basis of the sequence. The rules were derived from van't Hoff analysis of 23 oligonucleotide triplexes tested at a variety of pH values. To predict the enthalpy of triplex formation (delta H degrees), a simple nearest-neighbor model was found to be sufficient. However, to accurately predict the free energy of the triplex (delta G degrees), a combination model consisting of five parameters was needed. These parameters were (i) the delta G degrees for helix initiation, (ii) the delta G degrees for adding a T-A.T triple, (iii) the delta G degrees for adding a C(+)-G.C triple, (iv) the penalty for adjacent C bases, and (v) the pH dependence of the C(+)-G.C triple's stability. The fitted parameters are highly consistent with thermodynamic data from the basis set, generally predicting both delta H degrees and delta G degrees to within the experimental error. Examination of the parameters points out several interesting features. The combination model predicts that C(+) -G.C. triples are much more stabilizing than T-A.T triples below pH 7.0 and that the stability of the former increases approximately equal to 1 kcal/mol per pH unit as the pH is decreased. Surprisingly though, the most stable sequence is predicted to be a CT repeat, as adjacent C bases partially cancel the stability of one another. The parameters successfully predict tm values from other laboratories, with some interesting exceptions.
Resumo:
The peptide-binding motif of HLA-A29, the predisposing allele for birdshot retinopathy, was determined after acid-elution of endogenous peptides from purified HLA-A29 molecules. Individual and pooled HPLC fractions were sequenced by Edman degradation. Major anchor residues could be defined as glutamate at the second position of the peptide and as tyrosine at the carboxyl terminus. In vitro binding of polyglycine synthetic peptides to purified HLA-A29 molecules also revealed the need for an auxiliary anchor residue at the third position, preferably phenylalanine. By using this motif, we synthesized six peptides from the retinal soluble antigen, a candidate autoantigen in autoimmune uveoretinitis. Their in vitro binding was tested on HLA-A29 and also on HLA-B44 and HLA-B61, two alleles sharing close peptide-binding motifs. Two peptides derived from the carboxyl-terminal sequence of the human retinal soluble antigen bound efficiently to HLA-A29. This study could contribute to the prediction of T-cell epitopes from retinal autoantigens implicated in birdshot retinopathy.
Resumo:
A new approach of comparing protein structures that does not involve the procedure of superposition is suggested. An invariant system of coordinates for immunoglobulin molecules that is based on the geometrical symmetry inherent to the variable domain light-chain (VL)-heavy-chain (VH) complex is described. The coordinates of the Calpha atoms in 22 immunoglobulin structures are calculated in the invariant system of coordinates. We found that 76 identical positions in this Calpha framework are symmetrical about the twofold axis. Comparison of the identical positions in these molecules allows us to select 96 positions in the light chains and 87 positions in the heavy chains whose Calpha atom coordinates are approximately the same. To check whether the average coordinates of Calpha atoms in these positions complies with the stereochemical requirements, we calculated Calpha-Calpha distances. Seventy-three positions of the light chains and 72 positions of the heavy chains satisfy the Calpha-Calpha distance criterion. The Calpha atoms in these positions are used for constructing the "standard" Calpha framework of VL and VH complexes. The average coordinates of Calpha atoms are presented.
Resumo:
The diffusion equation method of global minimization is applied to compute the crystal structure of S6, with no a priori knowledge about the system. The experimental lattice parameters and positions and orientations of the molecules in the unit cell are predicted correctly.
Resumo:
We present a method for predicting protein folding class based on global protein chain description and a voting process. Selection of the best descriptors was achieved by a computer-simulated neural network trained on a data base consisting of 83 folding classes. Protein-chain descriptors include overall composition, transition, and distribution of amino acid attributes, such as relative hydrophobicity, predicted secondary structure, and predicted solvent exposure. Cross-validation testing was performed on 15 of the largest classes. The test shows that proteins were assigned to the correct class (correct positive prediction) with an average accuracy of 71.7%, whereas the inverse prediction of proteins as not belonging to a particular class (correct negative prediction) was 90-95% accurate. When tested on 254 structures used in this study, the top two predictions contained the correct class in 91% of the cases.
Resumo:
The regions surrounding the catalytic amino acids previously identified in a few "retaining" O-glycosyl hydrolases (EC 3.2.1) have been analyzed by hydrophobic cluster analysis and have been used to define sequence motifs. These motifs have been found in more than 150 glycosyl hydrolase sequences representing at least eight established protein families that act on a large variety of substrates. This allows the localization and the precise role of the catalytic residues (nucleophile and acid catalyst) to be predicted for each of these enzymes, including several lysosomal glycosidases. An identical arrangement of the catalytic nucleophile was also found for S-glycosyl hydrolases (myrosinases; EC 3.2.3.1) for which the acid catalyst is lacking. A (beta/alpha)8 barrel structure has been reported for two of the eight families of proteins that have been grouped. It is suggested that the six other families also share this fold at their catalytic domain. These enzymes illustrate how evolutionary events led to a wide diversification of substrate specificity with a similar disposition of identical catalytic residues onto the same ancestral (beta/alpha)8 barrel structure.
Resumo:
An electronic phase with coexisting magnetic and ferroelectric order is predicted for graphene ribbons with zigzag edges. The electronic structure of the system is described with a mean-field Hubbard model that yields results very similar to those of density functional calculations. Without further approximations, the mean-field theory is recasted in terms of a BCS wave function for electron-hole pairs in the edge bands. The BCS coherence present in each spin channel is related to spin-resolved electric polarization. Although the total electric polarization vanishes, due to an internal phase locking of the BCS state, strong magnetoelectric effects are expected in this system. The formulation naturally accounts for the two gaps in the quasiparticle spectrun, Δ0 and Δ1, and relates them to the intraband and interband self-energies.
Resumo:
This paper describes a module for the prediction of emotions in text chats in Spanish, oriented to its use in specific-domain text-to-speech systems. A general overview of the system is given, and the results of some evaluations carried out with two corpora of real chat messages are described. These results seem to indicate that this system offers a performance similar to other systems described in the literature, for a more complex task than other systems (identification of emotions and emotional intensity in the chat domain).