42 results for complex data
in Indian Institute of Science - Bangalore - India
Abstract:
We address the problem of mining targeted association rules over multidimensional market-basket data. Here, each transaction has, in addition to the set of purchased items, ancillary dimension attributes associated with it. Based on these dimensions, transactions can be visualized as distributed over cells of an n-dimensional cube. In this framework, a targeted association rule is of the form {X -> Y} R, where R is a convex region in the cube and X -> Y is a traditional association rule within region R. We first describe the TOARM algorithm, based on classical techniques, for identifying targeted association rules. Then, we discuss the concepts of bottom-up aggregation and cubing, leading to the CellUnion technique. This approach is further extended, using notions of cube-count interleaving and credit-based pruning, to derive the IceCube algorithm. Our experiments demonstrate that IceCube consistently provides the best execution time performance, especially for large and complex data cubes.
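To make the notion of a rule {X -> Y} R concrete, the following is a minimal sketch that evaluates the support and confidence of X -> Y using only the transactions whose cell lies inside a region R. It is not TOARM, CellUnion or IceCube. The data layout (a transaction as a pair of cell coordinates and an item set), the axis-aligned box used as the convex region, and the names `in_region` and `rule_stats` are all assumptions made for illustration.

```python
def in_region(cell, region):
    """True if the cell lies inside the box given as inclusive (lo, hi) ranges."""
    return all(lo <= c <= hi for c, (lo, hi) in zip(cell, region))


def rule_stats(transactions, x, y, region):
    """Return (support, confidence) of X -> Y over transactions inside the region."""
    inside = [items for cell, items in transactions if in_region(cell, region)]
    if not inside:
        return 0.0, 0.0
    n_x = sum(1 for items in inside if x <= items)          # contain X
    n_xy = sum(1 for items in inside if (x | y) <= items)   # contain X and Y
    support = n_xy / len(inside)
    confidence = n_xy / n_x if n_x else 0.0
    return support, confidence


# Hypothetical data: two dimension attributes per transaction.
transactions = [
    ((0, 0), frozenset({"bread", "butter"})),
    ((0, 1), frozenset({"bread"})),
    ((1, 0), frozenset({"bread", "butter", "milk"})),
    ((5, 5), frozenset({"bread"})),
]
x, y = frozenset({"bread"}), frozenset({"butter"})
small_region = [(0, 1), (0, 1)]   # covers the first three transactions
whole_cube = [(0, 5), (0, 5)]     # covers all four
print(rule_stats(transactions, x, y, small_region))
print(rule_stats(transactions, x, y, whole_cube))
```

The same rule scores differently in the two regions, which is why the region is treated as part of the rule.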
Abstract:
Large software systems are developed by composing multiple programs. If the programs manipulate and exchange complex data, such as network packets or files, it is essential to establish that they follow compatible data formats. Most of the complexity of data formats is associated with the headers. In this paper, we address compatibility of programs operating over headers of network packets, files, images, etc. As format specifications are rarely available, we infer the format associated with headers by a program as a set of guarded layouts. In terms of these formats, we define and check compatibility of (a) producer-consumer programs and (b) different versions of producer (or consumer) programs. A compatible producer-consumer pair is free of type mismatches and logical incompatibilities such as the consumer rejecting valid outputs generated by the producer. A backward compatible producer (resp. consumer) is guaranteed to be compatible with consumers (resp. producers) that were compatible with its older version. With our prototype tool, we identified 5 known bugs and 1 potential bug in (a) sender-receiver modules of Linux network drivers of 3 vendors and (b) different versions of a TIFF image library.
Abstract:
Large MIMO systems with tens of antennas in each communication terminal using full-rate non-orthogonal space-time block codes (STBC) from Cyclic Division Algebras (CDA) can achieve the benefits of both transmit diversity and high spectral efficiency. Maximum-likelihood (ML) or near-ML decoding of these large-sized STBCs at low complexities, however, has been a challenge. In this paper, we establish that near-ML decoding of these large STBCs is possible at practically affordable low complexities. We show that the likelihood ascent search (LAS) detector, reported earlier by us for V-BLAST, is able to achieve near-ML uncoded BER performance in decoding a 32x32 STBC from CDA, which employs 32 transmit antennas and sends 32^2 = 1024 complex data symbols in 32 time slots in one STBC matrix (i.e., 32 data symbols sent per channel use). In terms of coded BER, with a 16x16 STBC, rate-3/4 turbo code and 4-QAM (i.e., 24 bps/Hz), the LAS detector performs to within about 4 dB of the theoretical MIMO capacity. Our results further show that, with LAS detection, information lossless (ILL) STBCs perform almost as well as full-diversity ILL (FD-ILL) STBCs. Such low-complexity detectors can potentially enable implementation of high spectral efficiency large MIMO systems that could be considered in wireless standards.
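The symbol counts and the 24 bps/Hz figure quoted in this abstract can be checked with a short calculation. The sketch below assumes the usual accounting for a full-rate n x n STBC (n^2 symbols over n time slots) and multiplies symbols per channel use by bits per symbol and by the code rate. The function names are hypothetical and do not come from the paper.

```python
import math


def symbols_per_channel_use(n_antennas):
    """A full-rate n x n STBC carries n^2 symbols in n time slots."""
    return n_antennas ** 2 / n_antennas


def spectral_efficiency(n_antennas, qam_order, code_rate):
    """Information bits per channel use (bps/Hz) under the assumptions above."""
    bits_per_symbol = math.log2(qam_order)
    return symbols_per_channel_use(n_antennas) * bits_per_symbol * code_rate


print(32 ** 2)                            # 1024 symbols in one 32x32 STBC matrix
print(symbols_per_channel_use(32))        # 32.0 symbols per channel use
print(spectral_efficiency(16, 4, 3 / 4))  # 24.0 bps/Hz
```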
Abstract:
We study lazy structure sharing as a tool for optimizing equivalence testing on complex data types. We investigate a number of strategies for implementing lazy structure sharing and provide upper and lower bounds on their performance (how quickly they effect ideal configurations of our data structure). In most cases when the strategies are applied to a restricted case of the problem, the bounds provide nontrivial improvements over the naive linear-time equivalence-testing strategy that employs no optimization. Only one strategy, however, which employs path compression, seems promising for the most general case of the problem.
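One possible reading of lazy structure sharing with path compression is sketched below. It is not the paper's strategy, and it assumes immutable, finite trees. Whenever two trees are found equal, one root is forwarded to the other, so a later test on the same pair reduces to following forwarding pointers. The `Node` class and the `find` and `equal` functions are hypothetical names.

```python
class Node:
    def __init__(self, label, children=()):
        self.label = label
        self.children = list(children)
        self.forward = None  # set once this node is known equal to another


def find(node):
    """Follow forwarding pointers to the representative, compressing the path."""
    root = node
    while root.forward is not None:
        root = root.forward
    while node.forward is not None:
        # Point the current node straight at the root, then step onwards.
        node.forward, node = root, node.forward
    return root


def equal(a, b):
    """Structural equality that records successful comparisons as sharing."""
    a, b = find(a), find(b)
    if a is b:
        return True  # already shared: no traversal needed
    if a.label != b.label or len(a.children) != len(b.children):
        return False
    if all(equal(x, y) for x, y in zip(a.children, b.children)):
        b.forward = a  # share lazily: b now defers to a
        return True
    return False


t1 = Node("f", [Node("a"), Node("b")])
t2 = Node("f", [Node("a"), Node("b")])
t3 = Node("f", [Node("a"), Node("c")])
print(equal(t1, t2), equal(t1, t3))
print(find(t2) is find(t1))
```

The first comparison of `t1` and `t2` walks both trees; repeating it afterwards stops at the `a is b` check.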
Abstract:
Introduction: Advances in genomics technologies are providing a very large amount of data on genome-wide gene expression profiles, protein molecules and their interactions with other macromolecules and metabolites. Molecular interaction networks provide a useful way to capture this complex data and comprehend it. Networks are beginning to be used in drug discovery, in many steps of the modern discovery pipeline, with large-scale molecular networks being particularly useful for the understanding of the molecular basis of the disease. Areas covered: The authors discuss network approaches used for drug target discovery and lead identification in the drug discovery pipeline. By reconstructing networks of targets, drugs and drug candidates as well as gene expression profiles under normal and disease conditions, the paper illustrates how it is possible to find relationships between different diseases, find biomarkers, explore drug repurposing and study emergence of drug resistance. Furthermore, the authors also look at networks which address particular important aspects such as off-target effects, combination-targets, mechanism of drug action and drug safety. Expert opinion: The network approach represents another paradigm shift in drug discovery science. A network approach provides a fresh perspective of understanding important proteins in the context of their cellular environments, providing a rational basis for deriving useful strategies in drug design. Besides drug target identification and inferring mechanism of action, networks enable us to address new ideas that could prove to be extremely useful for new drug discovery, such as drug repositioning, drug synergy, polypharmacology and personalized medicine.
Abstract:
Analysis of EXAFS data of complex systems containing more than one phase and one type of coordination is discussed. It is shown that a modified treatment of the EXAFS function as well as the amplitude ratio plots provide useful means of obtaining valuable structural information. The systems investigated are: biphasic Ni+NiO mixture, NiAl2O4 with two coordinations for Ni, NiO+NiAl2O4 mixture, CoS+CoO system and Ni dispersed on Al2O3. The results obtained with these systems have been most satisfactory and serve to illustrate the utility and the applicability of the innovations described in this paper.
Abstract:
UVPES studies and ab initio and DFT computations have been done on the benzene...ICl complex; electron spectral data and computed orbital energies show that donor orbitals are stabilized and acceptor orbitals are destabilized due to complexation. Calculations predict an oblique structure for the complex in which the interacting site is a C=C bond center in the donor and iodine atom in the acceptor, in full agreement with earlier experimental reports. BSSE-corrected binding energies closely match the enthalpy of complexation reported, and the NBO analysis clearly reveals the involvement of the pi orbital of benzene and the sigma* orbital of ICl in the complex.
Abstract:
The calcium binding characteristics of antibiotic X-537A (lasalocid-A) in a lipophilic solvent, acetonitrile (CH3CN), have been studied using circular dichroism (CD) spectroscopy. The analysis of the data indicated that in this medium-polarity solvent, X-537A forms predominantly the charged complexes of stoichiometries 2:1 and 1:1, the relative amounts of the two being dependent on [Ca2+]. The conformation of the complexes, arrived at on the basis of the data, seems to indicate a rigid part encompassing Ca2+, liganded to 3 oxygens of the molecule, viz., the carbonyl, the substituted tetrahydrofuran ring and the substituted pyran ring oxygens (apart, possibly, from the liganding provided by nitrogen atoms of the solvent molecules), and a flexible part consisting of the salicylic acid group of the molecule.
Abstract:
Data-flow analysis is an integral part of any aggressive optimizing compiler. We propose a framework for improving the precision of data-flow analysis in the presence of complex control-flow. We initially perform data-flow analysis to determine those control-flow merges which cause the loss in data-flow analysis precision. The control-flow graph of the program is then restructured such that performing data-flow analysis on the resulting restructured graph gives more precise results. The proposed framework is both simple, involving the familiar notion of product automata, and also general, since it is applicable to any forward data-flow analysis. Apart from proving that our restructuring process is correct, we also show that restructuring is effective in that it necessarily leads to more optimization opportunities. Furthermore, the framework handles the trade-off between the increase in data-flow precision and the code size increase inherent in the restructuring. We show that determining an optimal restructuring is NP-hard, and propose and evaluate a greedy strategy. The framework has been implemented in the Scale research compiler, and instantiated for the specific problem of Constant Propagation. On the SPECINT 2000 benchmark suite we observe an average speedup of 4% in the running times over the Wegman-Zadeck conditional constant propagation algorithm and 2% over a purely path profile guided approach.
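The precision loss at a control-flow merge can be seen in a toy constant-propagation example. The sketch below is not the paper's product-automaton framework. It shows only that meeting the facts from two branches discards a constant, while analysing a separate copy of the following statement per incoming path keeps it. The `NAC` marker ("not a constant"), the treatment of a missing variable as `NAC`, and both function names are simplifying assumptions.

```python
NAC = "NAC"  # lattice element meaning "not a constant"


def meet(env_a, env_b):
    """Combine facts at a merge: a variable stays constant only if both agree."""
    merged = {}
    for var in env_a.keys() | env_b.keys():
        a, b = env_a.get(var, NAC), env_b.get(var, NAC)
        merged[var] = a if a == b else NAC
    return merged


def transfer_add_one(env, target, source):
    """Transfer function for the statement `target = source + 1`."""
    out = dict(env)
    value = env.get(source, NAC)
    out[target] = NAC if value == NAC else value + 1
    return out


then_env = {"x": 1}  # facts at the end of the `then` branch
else_env = {"x": 2}  # facts at the end of the `else` branch

# Original graph: both branches merge before `y = x + 1`.
merged = transfer_add_one(meet(then_env, else_env), "y", "x")

# Restructured graph: `y = x + 1` is duplicated, one copy per incoming path.
split_then = transfer_add_one(then_env, "y", "x")
split_else = transfer_add_one(else_env, "y", "x")

print(merged["y"], split_then["y"], split_else["y"])  # NAC 2 3
```

The duplicated statement is also the source of the code-size cost that the abstract weighs against the gain in precision.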
Abstract:
Studies on the conformational and binding characteristics of the ionophoric antibiotic X-537A (lasalocid-A)-calcium ion complexes have been carried out in deuteriated acetonitrile (CD3CN) using proton and carbon-13 nuclear magnetic resonance (1H and 13C n.m.r.) spectroscopy. Detailed analysis of the salt-induced chemical shifts at various X-537A to calcium concentration ratios indicated that X-537A forms charged complexes with calcium with 2 : 1 and 1 : 1 stoicheiometries. The conformational model for the complex based on the n.m.r. data showed that the calcium ion is preferentially bound to one end of the molecule, where it binds to three oxygen atoms, the other end (the salicylic acid part) being relatively free. In the 2 : 1 (sandwich) complex, the calcium ion is sandwiched between two X-537A molecules with three oxygen atoms binding to it from each molecule.
Abstract:
Screen-less oscillation photography is the method of choice for recording three-dimensional X-ray diffraction data for crystals of biological macromolecules. The geometry of an oscillation camera is extremely simple. However, the manner in which the reciprocal lattice is recorded in any experiment is fairly complex. This depends on the Laue symmetry of the reciprocal lattice, the lattice type, the orientation of the crystal on the camera and to a lesser extent on the unit-cell dimensions. Exploring the relative efficiency of collecting X-ray diffraction data for different crystal orientations prior to data collection might reduce the number of films required to record most of the unique data and the consequent amount of time required for processing these films. Here algorithms are presented suitable for this purpose and results are reported for the 11 Laue groups, different lattice types and crystal orientations often employed in data collection.
Abstract:
The preparation and properties of five new dyes derived from nickel(II) ions and aromatic azo derivatives of ethylenebis(β-ketoesters) are reported.
Abstract:
Empirical potential energy calculations have been carried out to determine the preferred conformations of some oligosaccharides having the trimannosidic core structure (Man3GlcNAc2) and which interact with concanavalin A. In the minimum energy conformations for the trimannosidic core the mannose residue on the Man α(1–6) arm comes close to one of the N-acetylglucosamine residues of the core. The addition of N-acetylglucosamine residues to the terminal mannose residues does not alter the preferred conformation of the trimannosidic core although it alters the relative preference of some of the higher energy conformations. The minimum energy conformation broadly agrees with available X-ray data. The presence of a bisecting N-acetylglucosamine residue on the middle mannose does not push the trimannosidic core to any new conformation but it does alter the relative preference for a particular conformation.
Abstract:
The complex crystallizes in the space group P21/c with four formula units in a unit cell of dimensions a = 12.747, b = 7.416, c = 17.894 Å and β = 90.2°. The structure has been solved by the symbolic addition procedure using three-dimensional photographic data and refined to an R value of 0.079 for 2019 observed reflexions. The pyramidal nature of the two hetero nitrogen atoms in the antipyrine molecule is intermediate between that observed in free antipyrine and in some of its metal complexes. The molecule is more polar than that in crystals of free antipyrine but less so compared with that in metal complexes. In the salicylic acid molecule, the hydroxyl group forms an internal hydrogen bond with one of the oxygen atoms in the carboxyl group. The association between the salicylic acid and the antipyrine molecules is achieved through an intermolecular hydrogen bond with the other carboxyl oxygen atom in the salicylic acid molecule as the proton donor and the carbonyl oxygen atom of the antipyrine molecule as the acceptor.