9 results for Centralize density-based spatial clustering of applications with noise
in Repositório Científico do Instituto Politécnico de Lisboa - Portugal
Abstract:
Research on the problem of feature selection for clustering continues to develop. This is a challenging task, mainly due to the absence of class labels to guide the search for relevant features. Categorical feature selection for clustering has rarely been addressed in the literature, with most of the proposed approaches having focused on numerical data. In this work, we propose an approach to simultaneously cluster categorical data and select a subset of relevant features. Our approach is based on a modification of a finite mixture model (of multinomial distributions), where a set of latent variables indicates the relevance of each feature. To estimate the model parameters, we implement a variant of the expectation-maximization algorithm that simultaneously selects the subset of relevant features, using a minimum message length criterion. The proposed approach compares favourably with two baseline methods: a filter based on an entropy measure and a wrapper based on mutual information. The results obtained on synthetic data illustrate the ability of the proposed expectation-maximization method to recover ground truth. An application to real data from official statistics shows its usefulness.
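As a rough illustration of the underlying model only, the sketch below runs plain expectation-maximization on a finite mixture of independent categorical (multinomial) features. It is not the method of the abstract: the feature-relevance latent variables and the minimum message length criterion are omitted, and the function name, the smoothing constant, and the random initialisation are assumptions made for this example.

```python
import math
import random


def em_categorical_mixture(data, n_components, n_categories, n_iter=50, seed=0):
    """Plain EM for a mixture of independent categorical features.

    data: list of rows; row[j] is an int in range(n_categories[j]).
    Returns (weights, theta, resp), where theta[k][j][c] estimates
    P(feature j = c | component k) and resp[i][k] is the posterior
    probability that row i belongs to component k.
    """
    rng = random.Random(seed)
    n, d = len(data), len(data[0])
    smooth = 1e-3  # assumed smoothing constant; keeps every probability > 0
    weights = [1.0 / n_components] * n_components
    theta = []
    for _ in range(n_components):
        comp = []
        for j in range(d):
            raw = [rng.random() + 0.5 for _ in range(n_categories[j])]
            total = sum(raw)
            comp.append([r / total for r in raw])
        theta.append(comp)

    resp = []
    for _ in range(n_iter):
        # E-step: posterior responsibilities, computed in log space.
        resp = []
        for row in data:
            logp = [
                math.log(weights[k])
                + sum(math.log(theta[k][j][row[j]]) for j in range(d))
                for k in range(n_components)
            ]
            top = max(logp)
            p = [math.exp(v - top) for v in logp]
            total = sum(p)
            resp.append([v / total for v in p])

        # M-step: re-estimate mixing weights and category probabilities.
        for k in range(n_components):
            nk = sum(r[k] for r in resp)
            weights[k] = (nk + smooth) / (n + n_components * smooth)
            for j in range(d):
                counts = [smooth] * n_categories[j]
                for i, row in enumerate(data):
                    counts[row[j]] += resp[i][k]
                total = sum(counts)
                theta[k][j] = [c / total for c in counts]
    return weights, theta, resp
```

A feature-selection variant would additionally estimate, per feature, how likely it is to follow a component-specific distribution rather than a common one; that step is deliberately left out here.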
Abstract:
We calculate the equilibrium thermodynamic properties, percolation threshold, and cluster distribution functions for a model of associating colloids, which consists of hard spherical particles having on their surfaces three short-ranged attractive sites (sticky spots) of two different types, A and B. The thermodynamic properties are calculated using Wertheim's perturbation theory of associating fluids. This also allows us to find the onset of self-assembly, which can be quantified by the maxima of the specific heat at constant volume. The percolation threshold is derived, under the no-loop assumption, for the correlated bond model: in all cases it is two percolated phases that become identical at a critical point, when one exists. Finally, the cluster size distributions are calculated by mapping the model onto an effective model, characterized by a state-dependent functionality $\bar{f}$ and a unique bonding probability $\bar{p}$. The mapping is based on the asymptotic limit of the cluster distribution functions of the generic model, and the effective parameters are defined through the requirement that the equilibrium cluster distributions of the true and effective models have the same number-averaged and weight-averaged sizes at all densities and temperatures. We also study the model numerically in the case where BB interactions are missing. In this limit, AB bonds either provide branching between A-chains (Y-junctions) if $\epsilon_{AB}/\epsilon_{AA}$ is small, or drive the formation of a hyperbranched polymer if $\epsilon_{AB}/\epsilon_{AA}$ is large. We find that the theoretical predictions describe the numerical data quite accurately, especially in the region where Y-junctions are present. There is fairly good agreement between theoretical and numerical results both for the thermodynamic properties (number of bonds and phase coexistence) and for the connectivity properties of the model (cluster size distributions and percolation locus).
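The number-averaged and weight-averaged cluster sizes used as matching conditions can be computed from any cluster size distribution. The sketch below uses the standard definitions (the ratio of the first to the zeroth moment, and of the second to the first moment); the function name and the example counts are made up for illustration and do not come from the paper.

```python
def cluster_size_averages(counts):
    """Return (number-averaged, weight-averaged) cluster size.

    counts maps a cluster size n to the number of clusters N_n of that size.
    """
    n_clusters = sum(counts.values())                    # sum_n N_n
    n_particles = sum(n * c for n, c in counts.items())  # sum_n n N_n
    second_moment = sum(n * n * c for n, c in counts.items())  # sum_n n^2 N_n
    number_avg = n_particles / n_clusters
    weight_avg = second_moment / n_particles
    return number_avg, weight_avg


# Hypothetical distribution: 4 monomers, 3 dimers, 2 clusters of size 5.
example_counts = {1: 4, 2: 3, 5: 2}
number_avg, weight_avg = cluster_size_averages(example_counts)
```

The weight average is never smaller than the number average, and the two coincide only when all clusters have the same size.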
Abstract:
We use Wertheim's first-order perturbation theory to investigate the phase behaviour and the structure of coexisting fluid phases for a model of patchy particles with dissimilar patches (two patches of type A and $f_B$ patches of type B). A patch of type $\alpha = \{A, B\}$ can bond to a patch of type $\beta = \{A, B\}$ in a volume $\nu_{\alpha\beta}$, thereby decreasing the internal energy by $\epsilon_{\alpha\beta}$. We analyse the range of model parameters where AB bonds, or Y-junctions, are energetically disfavoured ($\epsilon_{AB} < \epsilon_{AA}/2$) but entropically favoured ($\nu_{AB} \gg \nu_{\alpha\alpha}$), and BB bonds, or X-junctions, are energetically favoured ($\epsilon_{BB} > 0$). We show that, for low values of $\epsilon_{BB}/\epsilon_{AA}$, the phase diagram has three different regions: (i) close to the critical temperature a low-density liquid composed of long chains and rich in Y-junctions coexists with a vapour of chains; (ii) at intermediate temperatures there is coexistence between a vapour of short chains and a liquid of very long chains with X- and Y-junctions; (iii) at low temperatures an ideal gas coexists with a high-density liquid with all possible AA and BB bonds formed. It is also shown that in region (i) the liquid binodal is reentrant (its density decreases with decreasing temperature) for the lower values of $\epsilon_{BB}/\epsilon_{AA}$. The existence of these three regions is a consequence of the competition between the formation of X- and Y-junctions: X-junctions are energetically favoured and thus dominate at low temperatures, whereas Y-junctions are entropically favoured and dominate at higher temperatures.
Abstract:
This paper presents a solution to a highly constrained and non-convex economic dispatch (ED) problem using a meta-heuristic technique named Sensing Cloud Optimization (SCO). The proposed meta-heuristic is based on a cloud of particles whose central point represents the objective function value, while the remaining particles act as sensors "to fill" the search space and "guide" the central particle so that it moves in the best direction. To demonstrate its performance, a case study with multi-fuel units and valve-point effects is presented.
Abstract:
The construction industry keeps on demanding huge quantities of natural resources, mainly minerals for mortars and concrete production. The depletion of many quarries and environmental concerns about reducing the dumping of construction and demolition waste in quarries have led to an increase in the procuring and use of recycled aggregates from this type of waste. If they are to be incorporated in concrete and mortars it is essential to know their properties to guarantee the adequate performance of the end products, in both mechanical and durability-related terms. Existing regulated tests were developed for natural aggregates, however, and several problems arise when they are applied to recycled aggregates, especially fine recycled aggregates (FRA). This paper describes the main problems encountered with these tests and proposes an alternative method to determine the density and water absorption of FRA that removes them. The use of sodium hexametaphosphate solutions in the water absorption test has proven to improve its efficiency, minimizing cohesion between particles and helping to release entrained air.
Abstract:
Team sports represent complex systems: players interact continuously during a game, and exhibit intricate patterns of interaction, which can be identified and investigated at both individual and collective levels. We used Voronoi diagrams to identify and investigate the spatial dynamics of players' behavior in Futsal. Using this tool, we examined 19 plays of a sub-phase of a Futsal game played in a reduced area (20 m²) from which we extracted the trajectories of all players. Results obtained from a comparative analysis of players' Voronoi area (dominant region) and nearest teammate distance revealed different patterns of interaction between attackers and defenders, both at the level of individual players and teams. We found that, compared to defenders, larger dominant regions were associated with attackers. Furthermore, these regions were more variable in size among players from the same team but, at the player level, the attackers' dominant regions were more regular than those associated with each of the defenders. These findings support a formal description of the dynamic spatial interaction of the players, at least during the particular sub-phase of Futsal investigated. The adopted approach may be extended to other team behaviors where the actions taken at any instant in time by each of the involved agents are associated with the space they occupy at that particular time.
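The two measures compared in this abstract, a player's Voronoi area (dominant region) and the nearest teammate distance, can be approximated with a few lines of code. The sketch below is an illustration only: it estimates Voronoi areas by assigning grid cells to the closest player rather than building exact Voronoi polygons, and the 5 m by 4 m court (one possible shape with an area of 20 m²), the player coordinates, and the function names are assumptions, not data from the study.

```python
import math


def dominant_region_areas(players, width, height, step=0.05):
    """Approximate each player's Voronoi area inside a width x height court.

    players: list of (x, y) positions in metres. The court is divided into
    small cells and each cell is credited to the nearest player.
    """
    nx = round(width / step)
    ny = round(height / step)
    cell_area = (width / nx) * (height / ny)
    areas = [0.0] * len(players)
    for ix in range(nx):
        x = (ix + 0.5) * width / nx
        for iy in range(ny):
            y = (iy + 0.5) * height / ny
            nearest = min(
                range(len(players)),
                key=lambda i: (players[i][0] - x) ** 2 + (players[i][1] - y) ** 2,
            )
            areas[nearest] += cell_area
    return areas


def nearest_teammate_distance(player, teammates):
    """Euclidean distance from a player to the closest teammate."""
    return min(math.dist(player, mate) for mate in teammates)


# Hypothetical snapshot: two attackers and two defenders on a 5 m x 4 m court.
attackers = [(1.0, 1.0), (1.5, 3.0)]
defenders = [(3.5, 1.5), (4.0, 2.5)]
areas = dominant_region_areas(attackers + defenders, width=5.0, height=4.0)
```

Repeating this for every frame of a trajectory gives the time series of dominant-region sizes whose magnitude and variability can then be compared between attackers and defenders.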
Abstract:
This paper introduces a new unsupervised hyperspectral unmixing method designed for linear but highly mixed hyperspectral data sets, in which the simplex of minimum volume, usually estimated by the purely geometrically based algorithms, is far away from the true simplex associated with the endmembers. The proposed method, an extension of our previous studies, resorts to the statistical framework. The abundance fraction prior is a mixture of Dirichlet densities, thus automatically enforcing the constraints on the abundance fractions imposed by the acquisition process, namely, nonnegativity and sum-to-one. A cyclic minimization algorithm is developed where the following are observed: 1) the number of Dirichlet modes is inferred based on the minimum description length principle; 2) a generalized expectation maximization algorithm is derived to infer the model parameters; and 3) a sequence of augmented Lagrangian-based optimizations is used to compute the signatures of the endmembers. Experiments on simulated and real data are presented to show the effectiveness of the proposed algorithm in unmixing problems beyond the reach of the geometrically based state-of-the-art competitors.
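To see why a Dirichlet mixture prior enforces the abundance constraints automatically, the sketch below draws abundance vectors from a mixture of Dirichlet densities using normalised gamma variates. Every draw is nonnegative and sums to one by construction. The mixture weights and concentration parameters are invented for illustration, and the code shows only the prior, not the unmixing algorithm described in the abstract.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw one vector from Dirichlet(alpha) by normalising gamma variates."""
    gammas = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(gammas)
    return [g / total for g in gammas]


def sample_dirichlet_mixture(mix_weights, alphas, rng):
    """Pick a Dirichlet mode according to mix_weights, then sample from it."""
    mode = rng.choices(range(len(alphas)), weights=mix_weights)[0]
    return sample_dirichlet(alphas[mode], rng)


# Hypothetical two-mode prior over the abundances of three endmembers.
rng = random.Random(0)
mix_weights = [0.7, 0.3]
alphas = [[8.0, 2.0, 2.0], [2.0, 2.0, 8.0]]
abundances = [sample_dirichlet_mixture(mix_weights, alphas, rng) for _ in range(5)]
```

Because the constraints hold for every sample, an estimator built on this prior does not need a separate projection step to keep abundances on the simplex.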
Quality indicators in the education of children with profound intellectual and multiple disabilities
Abstract:
All children, regardless of their needs, should have access to quality education and be included in their families and communities. This statement includes the most vulnerable children, in particular children with intellectual and multiple disabilities. Research on the education of children with intellectual and multiple disabilities has not yet produced sufficient information that can be used to develop quality indicators for evaluating practices and services. Research in this area is limited by ethical constraints, difficulties in determining samples, and methodological challenges, and few studies are able to produce the necessary information. This article aims to discuss factors that contribute to the quality of the engagement of children with intellectual and multiple disabilities in educational activities, based on the authors' experience and on the available published information on this subject. Based on this discussion, a set of indicators is suggested that may help professionals direct their observations toward the quality of the educational provision and toward significant aspects of children's performance when engaged in curricular activities.
Abstract:
In this paper a new method for self-localization of mobile robots, based on a PCA positioning sensor to operate in unstructured environments, is proposed and experimentally validated. The proposed PCA extension is able to perform the eigenvector computation from a set of signals corrupted by missing data. The sensor package considered in this work contains a 2D depth sensor pointed upwards to the ceiling, providing depth images with missing data. The positioning sensor obtained is then integrated in a Linear Parameter Varying mobile robot model to obtain a self-localization system, based on linear Kalman filters, with globally stable position error estimates. A study consisting of adding synthetic random corrupted data to the captured depth images revealed that this extended PCA technique is able to reconstruct the signals with improved accuracy. The self-localization system obtained is assessed in unstructured environments and the methodologies are validated even in the case of varying illumination conditions.