2 results for Early maps.
in Aston University Research Archive
Abstract:
Multidimensional compound optimization is a new paradigm in the drug discovery process, yielding efficiencies during early stages and reducing attrition in the later stages of drug development. The success of this strategy relies heavily on understanding this multidimensional data and extracting useful information from it. This paper demonstrates how principled visualization algorithms can be used to understand and explore a large data set created in the early stages of drug discovery. The experiments presented are performed on a real-world data set comprising biological activity data and some whole-molecular physicochemical properties. Data visualization is a popular way of presenting complex data in a simpler form. We have applied powerful principled visualization methods, such as generative topographic mapping (GTM) and hierarchical GTM (HGTM), to help the domain experts (screening scientists, chemists, biologists, etc.) understand the data and make meaningful decisions. We also benchmark these principled methods against better-known visualization approaches, namely principal component analysis (PCA), Sammon's mapping, and self-organizing maps (SOMs), to demonstrate their enhanced power to help the user visualize the large multidimensional data sets one has to deal with during the early stages of the drug discovery process. The results reported clearly show that the GTM and HGTM algorithms allow the user to cluster active compounds for different targets and understand them better than the benchmarks. An interactive software tool supporting these visualization algorithms was provided to the domain experts. The tool helps the domain experts explore the projections obtained from the visualization algorithms by providing facilities such as parallel coordinate plots, magnification factors, directional curvatures, and integration with industry-standard software. © 2006 American Chemical Society.
Abstract:
In 1934, Arthur Lindo Patterson showed that a map of interatomic vectors is obtainable from measured X-ray diffraction data without phase information. Such maps were interpretable for simple crystal structures, but proliferation and overlapping of peaks caused confusion as the number of atoms increased. Since the peak height of a vector between two particular atoms is related to the product of their atomic numbers, a complicated structure could effectively be reduced to a simple one by including just a few heavy atoms (of high atomic number), since their interatomic vectors would stand out from the general clutter. Once located, these atoms provide approximate phases for Fourier syntheses that reveal the locations of additional atoms. Surveys of small-molecule structures in the Cambridge Structural Database during the periods 1936-1969, when Patterson methods were commonly used, and 1980-2013, dominated by direct methods, demonstrate large differences in the abundance of certain elements. The moderately heavy elements K, Rb, As and Br are the heaviest elements in the structure more than 3 times as often in the early period as in the recent period. Examples are given of three triumphs of the heavy atom method and two initial failures that had to be overcome. © 2014 Taylor & Francis.