950 resultados para Predicted genotypic values
Resumo:
The glycolytic enzyme glyceraldehyde-3 -phosphate dehydrogenase (GAPDH) is as an attractive target for the development of novel antitrypanosomatid agents. In the present work, comparative molecular field analysis and comparative molecular similarity index analysis were conducted on a large series of selective inhibitors of trypanosomatid GAPDH. Four statistically significant models were obtained (r(2) > 0.90 and q(2) > 0.70), indicating their predictive ability for untested compounds. The models were then used to predict the potency of an external test set, and the predicted values were in good agreement with the experimental results. Molecular modeling studies provided further insight into the structural basis for selective inhibition of trypanosomatid GAPDH.
Resumo:
Worldwide, tuberculosis (TB) is the leading cause of death among curable infectious diseases. Multidrug-resistant Mycobacterium tuberculosis is an emerging problem of great importance to public health, and there is an urgent need for new anti-TB drugs. In the present work, classical 2D quantitative structure-activity relationships (QSAR) and hologram QSAR (HQSAR) studies were performed on a training set of 91 isoniazid derivatives. Significant statistical models (classical QSAR, q(2) = 0.68 and r(2) = 0.72; HQSAR, q(2) = 0.63 and r(2) = 0.86) were obtained, indicating their consistency for untested compounds. The models were then used to evaluate an external test set containing 24 compounds which were not included in the training set, and the predicted values were in good agreement with the experimental results (HQSAR, r(pred)(2) = 0.87; classical QSAR, r(pred)(2) = 0.75).
Resumo:
Cyclic imides have been widely employed in drug design research due to their multiple pharmacological and biological properties. In the present study, two-dimensional quantitative structure-activity relationship (2D QSAR) studies were conducted on a series of potent analgesic cyclic imides using both classical and hologram QSAR (HQSAR) methods, yielding significant statistical models (classical QSAR, q(2) = 0.80; HQSAR, q(2) = 0.84). The models were then used to evaluate an external data test, and the predicted values were in good agreement with the experimental results, indicating their consistency for untested compounds.
Resumo:
Chagas` disease is a parasitic infection widely distributed throughout Latin America, with devastating consequences in terms of human morbidity and mortality. Cruzain, the major cysteine protease from Trypanosoma cruzi, is an attractive target for antitrypanosomal chemotherapy. In the present work, classical two-dimensional quantitative structure-activity relationships (2D QSAR) and hologram QSAR (HQSAR) studies were performed on a training set of 45 thiosemicarbazone and semicarbazone derivatives as inhibitors of T. cruzi cruzain. Significant statistical models (HQSAR, q2=0.75 and r2=0.96; classical QSAR, q2=0.72 and r2=0.83) were obtained, indicating their consistency for untested compounds. The models were then used to evaluate an external test set containing 10 compounds which were not included in the training set, and the predicted values were in good agreement with the experimental results (HQSAR, [image omitted]=0.95; classical QSAR, [image omitted]=0.91), indicating the existence of complementary between the two ligand-based drug design techniques.
Resumo:
Several protease inhibitors have reached the world market in the last fifteen years, dramatically improving the quality of life and life expectancy of millions of HIV-infected patients. In spite of the tremendous research efforts in this area, resistant HIV-1 variants are constantly decreasing the ability of the drugs to efficiently inhibit the enzyme. As a consequence, inhibitors with novel frameworks are necessary to circumvent resistance to chemotherapy. In the present work, we have created 3D QSAR models for a series of 82 HIV-1 protease inhibitors employing the comparative molecular field analysis (CoMFA) method. Significant correlation coefficients were obtained (q(2) = 0.82 and r(2) = 0.97), indicating the internal consistency of the best model, which was then used to evaluate an external test set containing 17 compounds. The predicted values were in good agreement with the experimental results, showing the robustness of the model and its substantial predictive power for untested compounds. The final QSAR model and the information gathered from the CoMFA contour maps should be useful for the design of novel anti-HIV agents with improved potency.
Resumo:
The shuttle radar topography mission (SRTM), was flow on the space shuttle Endeavour in February 2000, with the objective of acquiring a digital elevation model of all land between 60 degrees north latitude and 56 degrees south latitude, using interferometric synthetic aperture radar (InSAR) techniques. The SRTM data are distributed at horizontal resolution of 1 arc-second (similar to 30m) for areas within the USA and at 3 arc-second (similar to 90m) resolution for the rest of the world. A resolution of 90m can be considered suitable for the small or medium-scale analysis, but it is too coarse for more detailed purposes. One alternative is to interpolate the SRTM data at a finer resolution; it will not increase the level of detail of the original digital elevation model (DEM), but it will lead to a surface where there is the coherence of angular properties (i.e. slope, aspect) between neighbouring pixels, which is an important characteristic when dealing with terrain analysis. This work intents to show how the proper adjustment of variogram and kriging parameters, namely the nugget effect and the maximum distance within which values are used in interpolation, can be set to achieve quality results on resampling SRTM data from 3"" to 1"". We present for a test area in western USA, which includes different adjustment schemes (changes in nugget effect value and in the interpolation radius) and comparisons with the original 1"" model of the area, with the national elevation dataset (NED) DEMs, and with other interpolation methods (splines and inverse distance weighted (IDW)). The basic concepts for using kriging to resample terrain data are: (i) working only with the immediate neighbourhood of the predicted point, due to the high spatial correlation of the topographic surface and omnidirectional behaviour of variogram in short distances; (ii) adding a very small random variation to the coordinates of the points prior to interpolation, to avoid punctual artifacts generated by predicted points with the same location than original data points and; (iii) using a small value of nugget effect, to avoid smoothing that can obliterate terrain features. Drainages derived from the surfaces interpolated by kriging and by splines have a good agreement with streams derived from the 1"" NED, with correct identification of watersheds, even though a few differences occur in the positions of some rivers in flat areas. Although the 1"" surfaces resampled by kriging and splines are very similar, we consider the results produced by kriging as superior, since the spline-interpolated surface still presented some noise and linear artifacts, which were removed by kriging.
Resumo:
Predictors of random effects are usually based on the popular mixed effects (ME) model developed under the assumption that the sample is obtained from a conceptual infinite population; such predictors are employed even when the actual population is finite. Two alternatives that incorporate the finite nature of the population are obtained from the superpopulation model proposed by Scott and Smith (1969. Estimation in multi-stage surveys. J. Amer. Statist. Assoc. 64, 830-840) or from the finite population mixed model recently proposed by Stanek and Singer (2004. Predicting random effects from finite population clustered samples with response error. J. Amer. Statist. Assoc. 99, 1119-1130). Predictors derived under the latter model with the additional assumptions that all variance components are known and that within-cluster variances are equal have smaller mean squared error (MSE) than the competitors based on either the ME or Scott and Smith`s models. As population variances are rarely known, we propose method of moment estimators to obtain empirical predictors and conduct a simulation study to evaluate their performance. The results suggest that the finite population mixed model empirical predictor is more stable than its competitors since, in terms of MSE, it is either the best or the second best and when second best, its performance lies within acceptable limits. When both cluster and unit intra-class correlation coefficients are very high (e.g., 0.95 or more), the performance of the empirical predictors derived under the three models is similar. (c) 2007 Elsevier B.V. All rights reserved.
Resumo:
We describe AMIN (Amidase N-terminal domain), a novel protein domain found specifically in bacterial periplasmic proteins. AMIN domains are widely distributed among peptidoglycan hydrolases and transporter protein families. Based on experimental data, contextual information and phyletic profiles, we suggest that AMIN domains mediate the targeting of periplasmic or extracellular proteins to specific regions of the bacterial envelope.
Resumo:
Flash points (T(FP)) of hydrocarbons are calculated from their flash point numbers, N(FP), with the relationship T(FP) (K) = 23.369N(FP)(2/3) + 20.010N(FP)(1/3) + 31.901 In turn, the N(FP) values can be predicted from experimental boiling point numbers (Y(BP)) and molecular structure with the equation N(FP) = 0.987 Y(BP) + 0.176D + 0.687T + 0.712B - 0.176 where D is the number of olefinic double bonds in the structure, T is the number of triple bonds, and B is the number of aromatic rings. For a data set consisting of 300 diverse hydrocarbons, the average absolute deviation between the literature and predicted flash points was 2.9 K.
Resumo:
Chagas disease is nowadays the most serious parasitic health problem. This disease is caused by Trypanosoma cruzi. The great number of deaths and the insufficient effectiveness of drugs against this parasite have alarmed the scientific community worldwide. In an attempt to overcome this problem, a model for the design and prediction of new antitrypanosomal agents was obtained. This used a mixed approach, containing simple descriptors based on fragments and topological substructural molecular design descriptors. A data set was made up of 188 compounds, 99 of them characterized an antitrypanosomal activity and 88 compounds that belong to other pharmaceutical categories. The model showed sensitivity, specificity and accuracy values above 85%. Quantitative fragmental contributions were also calculated. Then, and to confirm the quality of the model, 15 structures of molecules tested as antitrypanosomal compounds (that we did not include in this study) were predicted, taking into account the information on the abovementioned calculated fragmental contributions. The model showed an accuracy of 100% which means that the ""in silico"" methodology developed by our team is promising for the rational design of new antitrypanosomal drugs. (C) 2009 Wiley Periodicals, Inc. J Comput Chem 31: 882-894. 2010
Can mass dissociation patterns of transition-metal complexes be predicted from electrochemical data?
Resumo:
The Cooks kinetic method has been very convenient to correlate the relative dissociation rates obtained by collision-induced fragmentation experiments with the energies of two related bonds in molecules and complexes in the gas phase. Reliable bond energy data are, however, not always available, particularly for polynuclear transition-metal complexes, such as the triruthenium acetate clusters of the general formula [Ru(3) (mu(3)-O)(mu-CH(3)COO)(6)(py)(2)(L)](+), where L = ring substituted N-heterocyclic ligands. Accordingly, their gas-phase collision-induced tandem mass spectrometry (CID MS/MS) dissociation patterns have been analyzed pursuing a relationship with the more easily accessible redox potentials (E(1/2)) and Lever`s E(L) parameters. In fact, excellent linear correlations of In(1/2A(L)/A(py)), where A(py) and A(L) are the abundance of the fragments retaining the pyridine (py) and L ligand, respectively, with E(1/2) and E(L) were found. This result shows that those electrochemical parameters are correlated with bond energies and can be used in the analysis of the dissociation data. Such modified Cooks method can be used, for example, to determine the electronic effects of substituents on the metal-ligand bonds for a series of transition-metal complexes. Copyright (C) 2008 John Wiley & Sons, Ltd.
Resumo:
Some sesquiterpene lactones (SLs) are the active compounds of a great number of traditionally medicinal plants from the Asteraceae family and possess considerable cytotoxic activity. Several studies in vitro have shown the inhibitory activity against cells derived from human carcinoma of the nasopharynx (KB). Chemical studies showed that the cytotoxic activity is due to the reaction of alpha,beta-unsaturated carbonyl structures of the SLs with thiols, such as cysteine. These studies support the view that SLs inhibit tumour growth by selective alkylation of growth-regulatory biological macromolecules, such as key enzymes, which control cell division, thereby inhibiting a variety of cellular functions, which directs the cells into apoptosis. In this study we investigated a set of 55 different sesquiterpene lactones, represented by 5 skeletons (22 germacranolides, 6 elemanolides, 2 eudesmanolides, 16 guaianolides and nor-derivatives and 9 pseudoguaianolides), in respect to their cytotoxic properties. The experimental results and 3D molecular descriptors were submitted to Kohonen self-organizing map (SOM) to classify (training set) and predict (test set) the cytotoxic activity. From the obtained results, it was concluded that only the geometrical descriptors showed satisfactory values. The Kohonen map obtained after training set using 25 geometrical descriptors shows a very significant match, mainly among the inactive compounds (similar to 84%). Analyzing both groups, the percentage seen is high (83%). The test set shows the highest match, where 89% of the substances had their cytotoxic activity correctly predicted. From these results, important properties for the inhibition potency are discussed for the whole dataset and for subsets of the different structural skeletons. (C) 2008 Elsevier Masson SAS. All rights reserved.
Resumo:
Flash points (T(FP)) of organic compounds are calculated from their flash point numbers, N(FP), with the relationship T(FP) = 23.369N(FP)(2/3) + 20.010N(FP)(1/3) + 31.901. In turn, the N(FP) values can be predicted from boiling point numbers (Y(BP)) and functional group counts with the equation N(FP) = 0.974Y(BP) + Sigma(i)n(i)G(i) + 0.095 where G(i) is a functional group-specific contribution to the value of N(FP) and n(i) is the number of such functional groups in the structure. For a data set consisting of 1000 diverse organic compounds, the average absolute deviation between reported and predicted flash points was less than 2.5 K.
Resumo:
The giant extracellular hemoglobin of Glossoscolex paulistus (HbGp) is constituted by Subunits containing heme groups with molecular masses (M) in the range of 15 to 19 kDa, monomers of 16 kDa (d), and trimers of 51 to 52 kDa (abc) linked by nonheme structures named linkers of 24 to 32 kDa (L). HbGp is homologous to Lumbricus terrestris hemoglobin (HbLt). Several reports propose M of HbLt in the range of 3.6 to 4.4 MDa. Based on subunits M determined by mass spectrometry and assuming HbGp stoichiometry of 12(abcd)(3)L(3) (Vinogradov model) plus 144 heme groups, a Value of M for HbGp oligomer of 3560 kDa can be predicted. This Value is nearly 500 kDa higher than the unique HbGp M Value reported in the literature. In the current work, sedimentation velocity analytical ultracentrifugation (AUC) experiments were performed to obtain M for HbGp in oxy and cyano-met forms. s(20,w)(0), values of 58.1 +/- 0.2 S and 59.6 +/- 0.2 S, respectively, for the two oxidation forms were obtained. The ratio between sedimentation and diffusion coefficients supplied values for M of approximately 3600 100 and 3700 100 kDa for oxy and cyano-met HbGp forms, respectively. An independent determination of the partial specific volume, V(bar), for HbGp was performed based on density measurements, providing a value of 0.764 +/- 0.008, in excellent agreement with the estimates from SEDFIT software. Our results show total consistency between M obtained by AUC and recent partial characterization by mass spectrometry. Therefore, HbGp possesses M very close to that of HbLt, suggesting an oligomeric assembly in agreement with the Vinogradov model. (c) 2008 Elsevier Inc. All rights reserved.
Resumo:
The main purpose of this thesis project is to prediction of symptom severity and cause in data from test battery of the Parkinson’s disease patient, which is based on data mining. The collection of the data is from test battery on a hand in computer. We use the Chi-Square method and check which variables are important and which are not important. Then we apply different data mining techniques on our normalize data and check which technique or method gives good results.The implementation of this thesis is in WEKA. We normalize our data and then apply different methods on this data. The methods which we used are Naïve Bayes, CART and KNN. We draw the Bland Altman and Spearman’s Correlation for checking the final results and prediction of data. The Bland Altman tells how the percentage of our confident level in this data is correct and Spearman’s Correlation tells us our relationship is strong. On the basis of results and analysis we see all three methods give nearly same results. But if we see our CART (J48 Decision Tree) it gives good result of under predicted and over predicted values that’s lies between -2 to +2. The correlation between the Actual and Predicted values is 0,794in CART. Cause gives the better percentage classification result then disability because it can use two classes.