43 results for Points distribution in high dimensional space
in University of Queensland eSpace - Australia
Abstract:
Indexing high-dimensional datasets has attracted extensive attention from many researchers in the last decade. Since R-tree-type index structures are known to suffer from the curse of dimensionality, Pyramid-tree-type index structures, which are based on the B-tree, have been proposed to break it. However, for high-dimensional data, the number of pyramids is often insufficient to discriminate between data points, and the effectiveness of the Pyramid-tree degrades dramatically as dimensionality increases. In this paper, we focus on one particular aspect of the curse of dimensionality: the volume near the surface of a hypercube in a high-dimensional space approaches 100% of the total hypercube volume as the number of dimensions approaches infinity. We propose a new indexing method based on the surface of dimensionality. We prove that the Pyramid-tree technique is a special case of our method. The results of our experiments demonstrate the clear superiority of our novel method.
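The surface effect mentioned in this abstract can be illustrated with a short calculation. The sketch below is not the paper's indexing method; it only assumes a unit hypercube and a hypothetical shell thickness `eps`, and computes the fraction of the volume lying within `eps` of the surface, which is 1 - (1 - 2*eps)^d in d dimensions.

```python
def shell_fraction(dims: int, eps: float = 0.05) -> float:
    """Fraction of a unit hypercube's volume lying within eps of its surface.

    eps is an illustrative shell thickness, not a value from the paper.
    """
    if dims < 1:
        raise ValueError("dims must be at least 1")
    if not 0.0 <= eps <= 0.5:
        raise ValueError("eps must lie in [0, 0.5]")
    # The inner cube whose points are at least eps away from every face
    # has side length 1 - 2*eps, so its volume is (1 - 2*eps) ** dims.
    return 1.0 - (1.0 - 2.0 * eps) ** dims


if __name__ == "__main__":
    for d in (2, 10, 100):
        print(d, shell_fraction(d))
```

For a fixed shell thickness the fraction grows toward 1 as the dimension increases, which is the behaviour the abstract refers to.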
Abstract:
In this paper, we propose a novel high-dimensional index method, the BM+-tree, to support efficient processing of similarity search queries in high-dimensional spaces. The main idea of the proposed index is to improve data partitioning efficiency in a high-dimensional space by using a rotary binary hyperplane, which further partitions a subspace and can also take advantage of the twin node concept used in the M+-tree. Compared with the key dimension concept in the M+-tree, the binary hyperplane is more effective in data filtering. High space utilization is achieved by dynamically performing data reallocation between twin nodes. In addition, a post processing step is used after index building to ensure effective filtration. Experimental results using two types of real data sets illustrate a significantly improved filtering efficiency.
Abstract:
We focus on mixtures of factor analyzers from the perspective of a method for model-based density estimation from high-dimensional data, and hence for the clustering of such data. This approach enables a normal mixture model to be fitted to a sample of n data points of dimension p, where p is large relative to n. The number of free parameters is controlled through the dimension of the latent factor space. By working in this reduced space, it allows a model for each component-covariance matrix with complexity lying between that of the isotropic and full covariance structure models. We shall illustrate the use of mixtures of factor analyzers in a practical example that considers the clustering of cell lines on the basis of gene expressions from microarray experiments. (C) 2002 Elsevier Science B.V. All rights reserved.
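The statement that the latent dimension controls the number of free parameters can be made concrete with a parameter count. The sketch below assumes the standard factor-analytic covariance form, a p x q loading matrix plus a diagonal matrix, with q(q - 1)/2 parameters removed for rotational indeterminacy; the abstract does not spell out this form, so treat it as an assumption. The function name `covariance_params` is hypothetical.

```python
def covariance_params(p: int, q: int) -> dict:
    """Free parameters in one component-covariance matrix of dimension p.

    Assumes Sigma = B B^T + D with B a p x q loading matrix and D diagonal.
    """
    if not 0 <= q <= p:
        raise ValueError("q must satisfy 0 <= q <= p")
    return {
        # sigma^2 * I: a single variance parameter
        "isotropic": 1,
        # p*q loadings + p diagonal terms, less q(q-1)/2 for rotation
        "factor_analyzer": p * q + p - q * (q - 1) // 2,
        # unrestricted symmetric matrix
        "full": p * (p + 1) // 2,
    }


if __name__ == "__main__":
    print(covariance_params(p=1000, q=5))
```

With p = 1000 and q = 5 the factor-analytic count sits between the isotropic and full counts, matching the abstract's description of an intermediate level of complexity.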
Abstract:
Structural similarity among proteins is reflected in the distribution of hydropathicity along the amino acids in the protein sequence. Similarities in the hydropathy distributions are obvious for homologous proteins within a protein family. They also were observed for proteins with related structures, even when sequence similarities were undetectable. Here we present a novel method that employs the hydropathy distribution in proteins for identification of (sub)families in a set of (homologous) proteins. We represent proteins as points in a generalized hydropathy space, represented by vectors of specifically defined features. The features are derived from hydropathy of the individual amino acids. Projection of this space onto principal axes reveals groups of proteins with related hydropathy distributions. The groups identified correspond well to families of structurally and functionally related proteins. We found that this method accurately identifies protein families in a set of proteins, or subfamilies in a set of homologous proteins. Our results show that protein families can be identified by the analysis of hydropathy distribution, without the need for sequence alignment. (C) 2005 Wiley-Liss, Inc.
Abstract:
The notorious "dimensionality curse" is a well-known phenomenon for any multi-dimensional indexes attempting to scale up to high dimensions. One well-known approach to overcome degradation in performance with respect to increasing dimensions is to reduce the dimensionality of the original dataset before constructing the index. However, identifying the correlation among the dimensions and effectively reducing them are challenging tasks. In this paper, we present an adaptive Multi-level Mahalanobis-based Dimensionality Reduction (MMDR) technique for high-dimensional indexing. Our MMDR technique has four notable features compared to existing methods. First, it discovers elliptical clusters for more effective dimensionality reduction by using only the low-dimensional subspaces. Second, data points in the different axis systems are indexed using a single B+-tree. Third, our technique is highly scalable in terms of data size and dimension. Finally, it is also dynamic and adaptive to insertions. An extensive performance study was conducted using both real and synthetic datasets, and the results show that our technique not only achieves higher precision, but also enables queries to be processed efficiently. Copyright Springer-Verlag 2005
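To show why a Mahalanobis distance suits elliptical clusters, the sketch below computes it in two dimensions with a closed-form matrix inverse. It is not the MMDR algorithm; the function name `mahalanobis_2d` and the example covariance are illustrative assumptions.

```python
import math


def mahalanobis_2d(point, mean, cov):
    """Mahalanobis distance of a 2-D point from a cluster centre.

    cov is a 2 x 2 covariance matrix given as nested sequences and is
    assumed to be symmetric positive definite.
    """
    (a, b), (c, d) = cov
    det = a * d - b * c
    if a <= 0 or det <= 0:
        raise ValueError("cov must be positive definite")
    dx = point[0] - mean[0]
    dy = point[1] - mean[1]
    # inverse of [[a, b], [c, d]] is (1/det) * [[d, -b], [-c, a]]
    squared = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return math.sqrt(squared)


if __name__ == "__main__":
    cov = [[9.0, 0.0], [0.0, 1.0]]  # cluster elongated along the x axis
    print(mahalanobis_2d((3.0, 0.0), (0.0, 0.0), cov))
    print(mahalanobis_2d((0.0, 3.0), (0.0, 0.0), cov))
```

Both points are at Euclidean distance 3 from the centre, but the one lying along the cluster's long axis is closer in Mahalanobis terms.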
Abstract:
1. Many species of delphinids co-occur in space and time. However, little is known of their ecological interactions and the underlying mechanisms that mediate their coexistence. 2. Snubfin dolphins Orcaella heinsohni and Indo-Pacific humpback dolphins Sousa chinensis live in sympatry throughout most of their range in Australian waters. I conducted boat-based surveys in Cleveland Bay, north-east Queensland, to collect data on the space and habitat use of both species. Using Geographic Information Systems, kernel methods and Euclidean distances I investigated interspecific differences in their space use patterns, behaviour and habitat preferences. 3. Core areas of use (50% kernel range) for both species were located close to river mouths and modified habitat such as dredged channels and breakwaters close to the Port of Townsville. Foraging and travelling activities were the dominant behavioural activities of snubfin and humpback dolphins within and outside their core areas. 4. Their representative ranges (95% kernel range) overlapped considerably, with shared areas showing strong concordance in the space use by both species. Nevertheless, snubfin dolphins preferred slightly shallower (1-2 m) waters than humpback dolphins (2-5 m). Additionally, shallow areas with seagrass ranked high in the habitat preferences of snubfin dolphins, whereas humpback dolphins favoured dredged channels. 5. Slight differences in habitat preferences appear to be one of the principal factors maintaining the coexistence of snubfin and humpback dolphins. I suggest diet partitioning and interspecific aggression as the major forces determining habitat selection in these sympatric species.
Abstract:
This paper describes U2DE, a finite-volume code that numerically solves the Euler equations. The code was used to perform multi-dimensional simulations of the gradual opening of a primary diaphragm in a shock tube. From the simulations, the speed of the developing shock wave was recorded and compared with other estimates. The ability of U2DE to compute shock speed was confirmed by comparing numerical results with the analytic solution for an ideal shock tube. For high initial pressure ratios across the diaphragm, previous experiments have shown that the measured shock speed can exceed the shock speed predicted by one-dimensional models. The shock speeds computed with the present multi-dimensional simulation were higher than those estimated by previous one-dimensional models and, thus, were closer to the experimental measurements. This indicates that multi-dimensional flow effects were partly responsible for the relatively high shock speeds measured in the experiments.
Abstract:
To investigate changes in the three-dimensional microfilament architecture of vascular smooth muscle cells (SMC) during the process of phenotypic modulation, rabbit aortic SMCs cultured under different conditions and at different time points were either labelled with fluorescein-conjugated probes to cytoskeletal and contractile proteins for observation by confocal laser scanning microscopy, or extracted with Triton X-100 for scanning electron microscopy. Densely seeded SMCs in primary culture, which maintain a contractile phenotype, display prominent linear myofilament bundles (stress fibres) that are present throughout the cytoplasm with alpha-actin filaments predominant in the central part and beta-actin filaments in the periphery of the cell. Intermediate filaments form a meshed network interconnecting the stress fibres and linking directly to the nucleus. Moderately and sparsely seeded SMCs, which modulate toward the synthetic phenotype during the first 5 days of culture, undergo a gradual redistribution of intermediate filaments from the perinuclear region toward the peripheral cytoplasm and a partial disassembly of stress fibres in the central part of the upper cortex of the cytoplasm, with an obvious decrease in alpha-actin and myosin staining. These changes are reversed in moderately seeded SMCs by day 8 of culture when they have reached confluence. The results reveal two changes in microfilament architecture in SMCs as they undergo a change in phenotype: the redistribution of intermediate filaments probably due to an increase in synthetic organelles in the perinuclear area, and the partial disassembly of stress fibres which may reflect a degradation of contractile components.
Abstract:
Background/Aims: Liver clearance models are based on information (or assumptions) on solute distribution kinetics within the microvasculatory system. The aim was to study albumin distribution kinetics in regenerated livers and in livers of normal adult rats. Methods: A novel mathematical model was used to evaluate the distribution space and the transit time dispersion of albumin in livers following regeneration after a two-thirds hepatectomy compared to livers of normal adult rats. Outflow curves of albumin measured after bolus injection in single-pass perfused rat livers were analyzed by correcting for the influence of catheters and fitting a long-tailed function to the data. Results: The curves were well described by the proposed model. The distribution volume and the transit time dispersion of albumin observed in the partial hepatectomy group were not significantly different from those in livers of normal adult rats. Conclusions: These findings suggest that the distribution space and the transit time dispersion of albumin (CV²) are relatively constant irrespective of the presence of rapid and extensive repair. This invariance of CV² implies, as a first approximation, a similar degree of intrasinusoidal mixing. The finding that a sum of two (instead of one) inverse Gaussian densities is an appropriate empirical function to describe the outflow curve of vascular indicators has consequences for an improved prediction of hepatic solute extraction.
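The empirical function named in the conclusions, a sum of two inverse Gaussian densities, can be sketched as follows. The abstract does not give the paper's parameterization, so this sketch assumes each density is described by a mean transit time and a squared coefficient of variation (CV²), combined with a mixing weight. All names and numeric values are illustrative, not results from the study.

```python
import math


def inverse_gaussian(t: float, mean_time: float, cv2: float) -> float:
    """Inverse Gaussian density with mean mean_time and relative dispersion cv2."""
    if t <= 0.0:
        return 0.0
    # Standard form with shape lambda = mean_time / cv2, so variance = mean^2 * cv2.
    scale = math.sqrt(mean_time / (2.0 * math.pi * cv2 * t ** 3))
    return scale * math.exp(-((t - mean_time) ** 2) / (2.0 * cv2 * mean_time * t))


def two_component_outflow(t, weight, mean1, cv2_1, mean2, cv2_2):
    """Weighted sum of two inverse Gaussian densities (weight in [0, 1])."""
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return (weight * inverse_gaussian(t, mean1, cv2_1)
            + (1.0 - weight) * inverse_gaussian(t, mean2, cv2_2))


if __name__ == "__main__":
    # Illustrative parameters only: an early peak plus a slower, broader tail.
    for t in (2.0, 5.0, 10.0, 20.0, 40.0):
        print(t, two_component_outflow(t, 0.7, 10.0, 0.5, 20.0, 1.0))
```

Because each component integrates to one, the weighted sum is itself a density, and its mean is the weighted average of the two component means.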
Abstract:
Three kinds of integrable Kondo problems in one-dimensional extended Hubbard models are studied by means of the boundary graded quantum inverse scattering method. The boundary K matrices depending on the local moments of the impurities are presented as a nontrivial realization of the graded reflection equation algebras acting in a (2s_α + 1)-dimensional impurity Hilbert space. Furthermore, these models are solved using the algebraic Bethe ansatz method, and the Bethe ansatz equations are obtained.
Abstract:
Integrable Kondo impurities in two cases of one-dimensional q-deformed t-J models are studied by means of the boundary Z_2-graded quantum inverse scattering method. The boundary K matrices depending on the local magnetic moments of the impurities are presented as nontrivial realizations of the reflection equation algebras in an impurity Hilbert space. Furthermore, these models are solved by using the algebraic Bethe ansatz method and the Bethe ansatz equations are obtained.
Abstract:
The optimal dosing schedule for melphalan therapy of recurrent malignant melanoma in isolated limb perfusions has been examined using a physiological pharmacokinetic model with data from isolated rat hindlimb perfusions (IRHP). The study included a comparison of melphalan distribution in IRHP under hyperthermia and normothermia conditions. Rat hindlimbs were perfused with Krebs-Henseleit buffer containing 4.7% bovine serum albumin at 37 or 41.5 degrees C at a flow rate of 4 ml/min. Concentrations of melphalan in perfusate and tissues were determined by high performance liquid chromatography with fluorescence detection. The concentration of melphalan in perfusate and tissues was linearly related to the input concentration. The rate and amount of melphalan uptake into the different tissues were higher at 41.5 degrees C than at 37 degrees C. A physiological pharmacokinetic model was validated from the tissue and perfusate time course of melphalan after melphalan perfusion. Application of the model indicated that the amount of melphalan exposure in the muscle, skin and fat in a recirculation system was related to the method of melphalan administration: single bolus > divided bolus > infusion. The peak concentration of melphalan in the perfusate was also related to the method of administration in the same order. Infusing the total dose of melphalan over 20 min during a 60 min perfusion optimized the exposure of tissues to melphalan whilst minimizing the peak perfusate concentration of melphalan. It is suggested that this method of melphalan administration may be preferable to other methods in terms of optimizing the efficacy of melphalan whilst minimizing the limb toxicity associated with its use in isolated limb perfusion.
Abstract:
An isolated rat hindlimb perfusion model carrying xenografts of the human melanoma cell line MM96 was used to study the effects of perfusion conditions on melphalan distribution. Krebs-Henseleit buffer and Hartmann's solution containing 4.7% bovine serum albumin (BSA) or 2.8% dextran 40 were used as perfusates. Melphalan concentrations in perfusate, tumour nodules and normal tissues were measured using high-performance liquid chromatography (HPLC). Increasing the perfusion flow rates (from 4 to 8 ml min^-1) resulted in higher tissue blood flow (determined with Cr-51-labelled microspheres) and melphalan uptake by tumour and normal tissues. The distribution of melphalan within tumour nodules and normal tissues was similar for both Krebs-Henseleit buffer and Hartmann's solution; however, tissue concentrations of melphalan were significantly higher for a perfusate containing 2.8% dextran 40 than for one containing 4.7% BSA. The melphalan concentration in the tumour was one-third of that found in the skin if the perfusate contained 4.7% BSA. In conclusion, this study has shown that a high perfusion flow enhances the delivery of melphalan into implanted tumour nodules and normal tissues, and a perfusate with low melphalan binding (no albumin) is preferred for maximum uptake of drug by the tumour.