4 results for Multiple Additive Regression Trees (MART)
in Digital Knowledge Repository of Central Drug Research Institute
Abstract:
A combinatorial protocol (CP) is introduced here and interfaced with multiple linear regression (MLR) for variable selection. The efficiency of CP-MLR rests primarily on preventing correlated variables from entering the model development stage. It has been used to analyze the Selwood et al. data set [16], and the models obtained are compared with those reported from the GFA [8] and MUSEUM [9] approaches. For this data set, CP-MLR identified three highly independent models (27, 28 and 31) with Q2 values in the range 0.518-0.632. These models are also divergent and unique. Although the present study shares no models with the GFA [8] and MUSEUM [9] results, several descriptors are common to all these studies, including the present one. A simulation is also carried out on the same data set to explain model formation in CP-MLR. The results demonstrate that the proposed method should be able to offer solutions for data sets with 50 to 60 descriptors in a reasonable time frame. By carefully selecting the inter-parameter correlation cutoff values in CP-MLR, one can identify divergent models and handle data sets larger than the present one without excessive computer time.
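The idea of screening out correlated descriptors before regression can be illustrated with a minimal sketch. This is not the authors' CP-MLR implementation: the function names (`loo_q2`, `cp_mlr_sketch`), the exhaustive enumeration of fixed-size subsets, and the default cutoffs (`max_inter_r`, `min_q2`) are all assumptions chosen for illustration. It only shows how an inter-parameter correlation cutoff can reject a descriptor subset before any model is fitted, and how surviving subsets might be ranked by a leave-one-out Q2.

```python
from itertools import combinations

import numpy as np


def loo_q2(X, y):
    """Leave-one-out cross-validated Q2 of a least-squares fit with intercept."""
    n = len(y)
    A = np.column_stack([np.ones(n), X])
    press = 0.0
    for i in range(n):
        keep = np.arange(n) != i
        # Fit on all compounds except i, then predict compound i.
        coef, *_ = np.linalg.lstsq(A[keep], y[keep], rcond=None)
        press += (y[i] - A[i] @ coef) ** 2
    return 1.0 - press / np.sum((y - y.mean()) ** 2)


def cp_mlr_sketch(X, y, size=2, max_inter_r=0.3, min_q2=0.5):
    """Hypothetical sketch: enumerate descriptor subsets of a fixed size,
    skip any subset containing a pair correlated above max_inter_r,
    and keep the models whose leave-one-out Q2 reaches min_q2."""
    r = np.abs(np.corrcoef(X, rowvar=False))
    models = []
    for subset in combinations(range(X.shape[1]), size):
        # Correlation filter: applied before any regression is run.
        if any(r[i, j] > max_inter_r for i, j in combinations(subset, 2)):
            continue
        q2 = loo_q2(X[:, list(subset)], y)
        if q2 >= min_q2:
            models.append((subset, q2))
    # Best cross-validated model first.
    return sorted(models, key=lambda m: -m[1])
```

Because the filter runs before the regression, tightening `max_inter_r` reduces the number of subsets that are ever fitted, which is one plausible reading of why the choice of cutoff governs both model divergence and computing time.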
Abstract:
The human immunodeficiency virus-1 reverse transcriptase inhibitory activity of 2-(2,6-disubstituted phenyl)-3-(substituted pyrimidin-2-yl)-thiazolidin-4-ones has been analyzed using the combinatorial protocol in multiple linear regression (CP-MLR) with several electronic and molecular surface area features of the compounds obtained from Molecular Operating Environment (MOE) software. The study indicated the role of different charged molecular surface areas in modeling the inhibitory activity of the compounds. The derived models collectively suggested that the compounds should be compact, without bulky substitutions on their peripheries, for better HIV-1 RT inhibitory activity. They also emphasized the necessity of hydrophobicity and compact structural features for activity. The scope of the descriptors identified for these analogues has been verified by extending the dataset with different 2-(disubstituted phenyl)-3-(substituted pyridin-2-yl)-thiazolidin-4-ones. The joint analysis of the extended dataset highlighted the information content of the identified descriptors in modeling the HIV-1 RT inhibitory activity of the compounds.
Abstract:
The antimycobacterial activity of nitro/acetamido alkenol derivatives and chloro/amino alkenol derivatives has been analyzed through the combinatorial protocol in multiple linear regression (CP-MLR) using different topological descriptors obtained from Dragon software. Among the topological descriptor classes considered in the study, the activity is correlated with simple topological descriptors (TOPO) and more complex 2D autocorrelation descriptors (2DAUTO). In model building, descriptors from the other classes, that is, empirical, constitutional, molecular walk counts, modified Burden eigenvalues and Galvez topological charge indices, made secondary contributions in association with the TOPO and/or 2DAUTO classes. The structure-activity correlations obtained with the TOPO descriptors suggest that less branched and saturated structural templates would be better for the activity. For both series of compounds, the activity in 2DAUTO has been correlated with descriptors having mass, volume and/or polarizability as the weighting component. In these two series, however, the regression coefficients of the descriptors have opposite arithmetic signs with respect to one another. Outwardly, the two series of compounds appear very similar, but in terms of activity they belong to different segments of the descriptor-activity profiles. This difference in activity may be mainly due to the spacing difference between the C1 (also C6) substituents and the rest of the functional groups in them.
Abstract:
Two series of closely related antimalarial agents, 7-chloro-4-(3’,5’-disubstitutedanilino)quinolines, have been analyzed using Combinatorial Protocol in Multiple Linear Regression (CP-MLR) for structure-activity relations with more than 450 topological descriptors for each set. The study clearly suggested that the 3’- and 5’-substituents of the anilino moiety map different domains in the activity space. While one domain favors compact structural frames having aromatic, heterocyclic ring(s) substituted with closely spaced F, NO2 and O functional groups, the other prefers structural frames enriched with unsaturation, loops, branches and electronic content, and devoid of a carbonyl function. This study also gives an indication in favour of electron-rich centres in the aniline substituent groups for better antimalarial activity, an observation in line with several previous reports. The models developed and the participating descriptors suggest that the substituent groups of the 4-anilino moiety of the 4-(3’,5’-disubstitutedanilino)quinolines hold scope for further modification in the optimisation of the antimalarial activity.