993 resultados para Spatial Clustering
Resumo:
This paper analyses earthquake data in the perspective of dynamical systems and fractional calculus (FC). This new standpoint uses Multidimensional Scaling (MDS) as a powerful clustering and visualization tool. FC extends the concepts of integrals and derivatives to non-integer and complex orders. MDS is a technique that produces spatial or geometric representations of complex objects, such that those objects that are perceived to be similar in some sense are placed on the MDS maps forming clusters. In this study, over three million seismic occurrences, covering the period from January 1, 1904 up to March 14, 2012 are analysed. The events are characterized by their magnitude and spatiotemporal distributions and are divided into fifty groups, according to the Flinn–Engdahl (F–E) seismic regions of Earth. Several correlation indices are proposed to quantify the similarities among regions. MDS maps are proven as an intuitive and useful visual representation of the complex relationships that are present among seismic events, which may not be perceived on traditional geographic maps. Therefore, MDS constitutes a valid alternative to classic visualization tools for understanding the global behaviour of earthquakes.
Resumo:
Research on the problem of feature selection for clustering continues to develop. This is a challenging task, mainly due to the absence of class labels to guide the search for relevant features. Categorical feature selection for clustering has rarely been addressed in the literature, with most of the proposed approaches having focused on numerical data. In this work, we propose an approach to simultaneously cluster categorical data and select a subset of relevant features. Our approach is based on a modification of a finite mixture model (of multinomial distributions), where a set of latent variables indicate the relevance of each feature. To estimate the model parameters, we implement a variant of the expectation-maximization algorithm that simultaneously selects the subset of relevant features, using a minimum message length criterion. The proposed approach compares favourably with two baseline methods: a filter based on an entropy measure and a wrapper based on mutual information. The results obtained on synthetic data illustrate the ability of the proposed expectation-maximization method to recover ground truth. An application to real data, referred to official statistics, shows its usefulness.
Resumo:
Research on cluster analysis for categorical data continues to develop, new clustering algorithms being proposed. However, in this context, the determination of the number of clusters is rarely addressed. We propose a new approach in which clustering and the estimation of the number of clusters is done simultaneously for categorical data. We assume that the data originate from a finite mixture of multinomial distributions and use a minimum message length criterion (MML) to select the number of clusters (Wallace and Bolton, 1986). For this purpose, we implement an EM-type algorithm (Silvestre et al., 2008) based on the (Figueiredo and Jain, 2002) approach. The novelty of the approach rests on the integration of the model estimation and selection of the number of clusters in a single algorithm, rather than selecting this number based on a set of pre-estimated candidate models. The performance of our approach is compared with the use of Bayesian Information Criterion (BIC) (Schwarz, 1978) and Integrated Completed Likelihood (ICL) (Biernacki et al., 2000) using synthetic data. The obtained results illustrate the capacity of the proposed algorithm to attain the true number of cluster while outperforming BIC and ICL since it is faster, which is especially relevant when dealing with large data sets.
Resumo:
In data clustering, the problem of selecting the subset of most relevant features from the data has been an active research topic. Feature selection for clustering is a challenging task due to the absence of class labels for guiding the search for relevant features. Most methods proposed for this goal are focused on numerical data. In this work, we propose an approach for clustering and selecting categorical features simultaneously. We assume that the data originate from a finite mixture of multinomial distributions and implement an integrated expectation-maximization (EM) algorithm that estimates all the parameters of the model and selects the subset of relevant features simultaneously. The results obtained on synthetic data illustrate the performance of the proposed approach. An application to real data, referred to official statistics, shows its usefulness.
Resumo:
Pesticide exposure during brain development could represent an important risk factor for the onset of neurodegenerative diseases. Previous studies investigated the effect of permethrin (PERM) administered at 34 mg/kg, a dose close to the no observable adverse effect level (NOAEL) from post natal day (PND) 6 to PND 21 in rats. Despite the PERM dose did not elicited overt signs of toxicity (i.e. normal body weight gain curve), it was able to induce striatal neurodegeneration (dopamine and Nurr1 reduction, and lipid peroxidation increase). The present study was designed to characterize the cognitive deficits in the current animal model. When during late adulthood PERM treated rats were tested for spatial working memory performances in a T-maze-rewarded alternation task they took longer to choose for the correct arm in comparison to age matched controls. No differences between groups were found in anxiety-like state, locomotor activity, feeding behavior and spatial orientation task. Our findings showing a selective effect of PERM treatment on the T-maze task point to an involvement of frontal cortico-striatal circuitry rather than to a role for the hippocampus. The predominant disturbances concern the dopamine (DA) depletion in the striatum and, the serotonin (5-HT) and noradrenaline (NE) unbalance together with a hypometabolic state in the medial prefrontal cortex area. In the hippocampus, an increase of NE and a decrease of DA were observed in PERM treated rats as compared to controls. The concentration of the most representative marker for pyrethroid exposure (3-phenoxybenzoic acid) measured in the urine of rodents 12 h after the last treatment was 41.50 µ/L and it was completely eliminated after 96 h.
Resumo:
OBJECTIVE: To identify clusters of the major occurrences of leprosy and their associated socioeconomic and demographic factors. METHODS: Cases of leprosy that occurred between 1998 and 2007 in São José do Rio Preto (southeastern Brazil) were geocodified and the incidence rates were calculated by census tract. A socioeconomic classification score was obtained using principal component analysis of socioeconomic variables. Thematic maps to visualize the spatial distribution of the incidence of leprosy with respect to socioeconomic levels and demographic density were constructed using geostatistics. RESULTS: While the incidence rate for the entire city was 10.4 cases per 100,000 inhabitants annually between 1998 and 2007, the incidence rates of individual census tracts were heterogeneous, with values that ranged from 0 to 26.9 cases per 100,000 inhabitants per year. Areas with a high leprosy incidence were associated with lower socioeconomic levels. There were identified clusters of leprosy cases, however there was no association between disease incidence and demographic density. There was a disparity between the places where the majority of ill people lived and the location of healthcare services. CONCLUSIONS: The spatial analysis techniques utilized identified the poorer neighborhoods of the city as the areas with the highest risk for the disease. These data show that health departments must prioritize politico-administrative policies to minimize the effects of social inequality and improve the standards of living, hygiene, and education of the population in order to reduce the incidence of leprosy.
Resumo:
This paper focus on a demand response model analysis in a smart grid context considering a contingency scenario. A fuzzy clustering technique is applied on the developed demand response model and an analysis is performed for the contingency scenario. Model considerations and architecture are described. The demand response developed model aims to support consumers decisions regarding their consumption needs and possible economic benefits.
Resumo:
Renewable energy sources (RES) have unique characteristics that grant them preference in energy and environmental policies. However, considering that the renewable resources are barely controllable and sometimes unpredictable, some challenges are faced when integrating high shares of renewable sources in power systems. In order to mitigate this problem, this paper presents a decision-making methodology regarding renewable investments. The model computes the optimal renewable generation mix from different available technologies (hydro, wind and photovoltaic) that integrates a given share of renewable sources, minimizing residual demand variability, therefore stabilizing the thermal power generation. The model also includes a spatial optimization of wind farms in order to identify the best distribution of wind capacity. This methodology is applied to the Portuguese power system.
Resumo:
This paper presents a spatial econometrics analysis for the number of road accidents with victims in the smallest administrative divisions of Lisbon, considering as a baseline a log-Poisson model for environmental factors. Spatial correlation on data is investigated for data alone and for the residuals of the baseline model without and with spatial-autocorrelated and spatial-lagged terms. In all the cases no spatial autocorrelation was detected.
Resumo:
OBJECTIVE To analyze spatial changes in the risk of AIDS and the relationship between AIDS incidence and socioeconomic variables in the state of Rondonia, Amazon region. METHODS A spatial, population case-control study in Rondonia, Brazil, based on 1,780 cases reported to the Epidemiological Surveillance System and controls based on demographic data from 1987 to 2006. The cases were grouped into five consecutive four-year periods. A generalized additive model was adjusted to the data; the dependent variable was the status of the individuals (case or control), and the independent variables were a bi-dimensional spline of the geographic coordinates and some municipality-level socioeconomic variables. The observed values of the Moran’s I test were compared to a reference distribution of values generated under conditions of spatial randomness. RESULTS AIDS risk shows a marked spatial and temporal pattern. The disease incidence is related to socioeconomic variables at the municipal level in Rondônia, such as urbanization and human capital. The highest incidence rates of AIDS are in municipalities along the BR-364 highway and calculations of the Moran’s I test show positive spatial correlation associated with proximity of the municipality to the highway in the third and fourth periods (p = 0.05). CONCLUSIONS Incidence of the disease is higher in municipalities of greater economic wealth and urbanization, and in those municipalities bisected by Rondônia’s main roads. The rapid development associated with the opening up of once remote regions may be accompanied by an increase in these risks to health.
Resumo:
OBJECTIVE To analyze the spatial distribution of homicide mortality in the state of Bahia, Northeastern Brazil. METHODS Ecological study of the 15 to 39-year old male population in the state of Bahia in the period 1996-2010. Data from the Mortality Information System, relating to homicide (X85-Y09) and population estimates from the Brazilian Institute of Geography and Statistics were used. The existence of spatial correlation, the presence of clusters and critical areas of the event studied were analyzed using Moran’s I Global and Local indices. RESULTS A non-random spatial pattern was observed in the distribution of rates, as was the presence of three clusters, the first in the north health district, the second in the eastern region, and the third cluster included townships in the south and the far south of Bahia. CONCLUSIONS The homicide mortality in the three different critical areas requires further studies that consider the socioeconomic, cultural and environmental characteristics in order to guide specific preventive and interventionist practices.
Resumo:
Biosignals analysis has become widespread, upstaging their typical use in clinical settings. Electrocardiography (ECG) plays a central role in patient monitoring as a diagnosis tool in today's medicine and as an emerging biometric trait. In this paper we adopt a consensus clustering approach for the unsupervised analysis of an ECG-based biometric records. This type of analysis highlights natural groups within the population under investigation, which can be correlated with ground truth information in order to gain more insights about the data. Preliminary results are promising, for meaningful clusters are extracted from the population under analysis. © 2014 EURASIP.
Resumo:
OBJECTIVE To analyze temporal trends and distribution patterns of unsafe abortion in Brazil. METHODS Ecological study based on records of hospital admissions of women due to abortion in Brazil between 1996 and 2012, obtained from the Hospital Information System of the Ministry of Health. We estimated the number of unsafe abortions stratified by place of residence, using indirect estimate techniques. The following indicators were calculated: ratio of unsafe abortions/100 live births and rate of unsafe abortion/1,000 women of childbearing age. We analyzed temporal trends through polynomial regression and spatial distribution using municipalities as the unit of analysis. RESULTS In the study period, a total of 4,007,327 hospital admissions due to abortions were recorded in Brazil. We estimated a total of 16,905,911 unsafe abortions in the country, with an annual mean of 994,465 abortions (mean unsafe abortion rate: 17.0 abortions/1,000 women of childbearing age; ratio of unsafe abortions: 33.2/100 live births). Unsafe abortion presented a declining trend at national level (R2: 94.0%, p < 0.001), with unequal patterns between regions. There was a significant reduction of unsafe abortion in the Northeast (R2: 93.0%, p < 0.001), Southeast (R2: 92.0%, p < 0.001) and Central-West regions (R2: 64.0%, p < 0.001), whereas the North (R2: 39.0%, p = 0.030) presented an increase, and the South (R2: 22.0%, p = 0.340) remained stable. Spatial analysis identified the presence of clusters of municipalities with high values for unsafe abortion, located mainly in states of the North, Northeast and Southeast Regions. CONCLUSIONS Unsafe abortion remains a public health problem in Brazil, with marked regional differences, mainly concentrated in the socioeconomically disadvantaged regions of the country. Qualification of attention to women’s health, especially to reproductive aspects and attention to pre- and post-abortion processes, are necessary and urgent strategies to be implemented in the country.
Resumo:
OBJECTIVE To analyze the temporal evolution of maternal mortality and its spatial distribution.METHODS Ecological study with a sample made up of 845 maternal deaths in women between 10 and 49 years, registered from 1999 to 2008 in the state of Rio Grande do Sul, Southern Brazil. Data were obtained from Information System on Mortality of Ministry of Health. The maternal mortality ratio and the specific maternal mortality ratio were calculated from records, and analyzed by the Poisson regression model. In the spatial distribution, three maps of the state were built with the rates in the geographical macro-regions, in 1999, 2003, and 2008.RESULTS There was an increase of 2.0% in the period of ten years (95%CI 1.00;1.04; p = 0.01), with no significant change in the magnitude of the maternal mortality ratio. The Serra macro-region presented the highest maternal mortality ratio (1.15, 95%CI 1.08;1.21; p < 0.001). Most deaths in Rio Grande do Sul were of white women over 40 years, with a lower level of education. The time of delivery/abortion and postpartum are times of increased maternal risk, with a greater negative impact of direct causes such as hypertension and bleeding.CONCLUSIONS The lack of improvement in maternal mortality ratio indicates that public policies had no impact on women’s reproductive and maternal health. It is needed to qualify the attention to women’s health, especially in the prenatal period, seeking to identify and prevent risk factors, as a strategy of reducing maternal death.
Resumo:
OBJECTIVE To analyze the methodology used for assessing the spatial distribution of specialized cardiac care units. METHODS A modeling and simulation method was adopted for the practical application of cardiac care service in the state of Santa Catarina, Southern Brazil, using the p-median model. As the state is divided into 21 health care regions, a methodology which suggests an arrangement of eight intermediate cardiac care units was analyzed, comparing the results obtained using data from 1996 and 2012. RESULTS Results obtained using data from 2012 indicated significant changes in the state, particularly in relation to the increased population density in the coastal regions. The current study provided a satisfactory response, indicated by the homogeneity of the results regarding the location of the intermediate cardiac care units and their respective regional administrations, thereby decreasing the average distance traveled by users to health care units, located in higher population density areas. The validity of the model was corroborated through the analysis of the allocation of the median vertices proposed in 1996 and 2012. CONCLUSIONS The current spatial distribution of specialized cardiac care units is more homogeneous and reflects the demographic changes that have occurred in the state over the last 17 years. The comparison between the two simulations and the current configuration showed the validity of the proposed model as an aid in decision making for system expansion.