873 resultados para agglomerative clustering
Resumo:
An American cutaneous leishmaniasis outbreak, with cases clustering during 1993 in Tartagal city, Salta, was reported. The outbreak involved 102 individuals, 43.1% of them with multiple ulcers. Age (mean: 33 years old) and sex distribution of cases (74.5% males), as well as working activity (70 forest-related), support the hypothesis of classical forest transmission leishmaniasis, despite the fact that the place of permanent residence was in periurban Tartagal. Moreover, during July, sandflies were only collected from one of the 'deforestation areas'. Lutzomyia intermedia was the single species of the 491 phlebotomines captured, reinforcing the vector incrimination of this species. Most infections must have been acquired during the fall (April to June), a pattern consistent with previous sandfly population dynamics data. Based on the epidemiological and entomological results, it was advised not to do any vector-targeted periurban control measures during July. Further studies should be done to assess if the high rate of multiple lesions was due to parasite factors or to infective vector density factors.
Resumo:
The aim of this study was to investigate the presence of the Hepatitis G Virus on a population of blood donors from São Paulo, Brazil and to evaluate its association to sociodemographic variables. Two RT-PCR systems targeting the putative 5'NCR and NS3 regions were employed and the former has shown a higher sensitivity. The observed prevalence of HGV-RNA on 545 blood donors was 9.7% (CI 95% 7.4;12.5). Statistical analysis depicted an association with race/ethnicity, black and mulatto donors being more frequently infected; and also with years of education, less educated donors presenting higher prevalences. No association was observed with other sociodemographic parameters as age, gender, place of birth and of residence. DNA sequencing of nine randomly chosen isolates demonstrated the presence of genotypes 1, 2 and 3 among our population but clustering of these Brazilian isolates was not detected upon phylogenetic analysis.
Resumo:
This paper analyses forest fires in the perspective of dynamical systems. Forest fires exhibit complex correlations in size, space and time, revealing features often present in complex systems, such as the absence of a characteristic length-scale, or the emergence of long range correlations and persistent memory. This study addresses a public domain forest fires catalogue, containing information of events for Portugal, during the period from 1980 up to 2012. The data is analysed in an annual basis, modelling the occurrences as sequences of Dirac impulses with amplitude proportional to the burnt area. First, we consider mutual information to correlate annual patterns. We use visualization trees, generated by hierarchical clustering algorithms, in order to compare and to extract relationships among the data. Second, we adopt the Multidimensional Scaling (MDS) visualization tool. MDS generates maps where each object corresponds to a point. Objects that are perceived to be similar to each other are placed on the map forming clusters. The results are analysed in order to extract relationships among the data and to identify forest fire patterns.
Resumo:
In this paper we analyze the behavior of tornado time-series in the U.S. from the perspective of dynamical systems. A tornado is a violently rotating column of air extending from a cumulonimbus cloud down to the ground. Such phenomena reveal features that are well described by power law functions and unveil characteristics found in systems with long range memory effects. Tornado time series are viewed as the output of a complex system and are interpreted as a manifestation of its dynamics. Tornadoes are modeled as sequences of Dirac impulses with amplitude proportional to the events size. First, a collection of time series involving 64 years is analyzed in the frequency domain by means of the Fourier transform. The amplitude spectra are approximated by power law functions and their parameters are read as an underlying signature of the system dynamics. Second, it is adopted the concept of circular time and the collective behavior of tornadoes analyzed. Clustering techniques are then adopted to identify and visualize the emerging patterns.
Resumo:
In this paper, we apply multidimensional scaling (MDS) and parametric similarity indices (PSI) in the analysis of complex systems (CS). Each CS is viewed as a dynamical system, exhibiting an output time-series to be interpreted as a manifestation of its behavior. We start by adopting a sliding window to sample the original data into several consecutive time periods. Second, we define a given PSI for tracking pieces of data. We then compare the windows for different values of the parameter, and we generate the corresponding MDS maps of ‘points’. Third, we use Procrustes analysis to linearly transform the MDS charts for maximum superposition and to build a global MDS map of “shapes”. This final plot captures the time evolution of the phenomena and is sensitive to the PSI adopted. The generalized correlation, the Minkowski distance and four entropy-based indices are tested. The proposed approach is applied to the Dow Jones Industrial Average stock market index and the Europe Brent Spot Price FOB time-series.
Resumo:
This paper studies forest fires from the perspective of dynamical systems. Burnt area, precipitation and atmospheric temperatures are interpreted as state variables of a complex system and the correlations between them are investigated by means of different mathematical tools. First, we use mutual information to reveal potential relationships in the data. Second, we adopt the state space portrait to characterize the system’s behavior. Third, we compare the annual state space curves and we apply clustering and visualization tools to unveil long-range patterns. We use forest fire data for Portugal, covering the years 1980–2003. The territory is divided into two regions (North and South), characterized by different climates and vegetation. The adopted methodology represents a new viewpoint in the context of forest fires, shedding light on a complex phenomenon that needs to be better understood in order to mitigate its devastating consequences, at both economical and environmental levels.
Resumo:
This paper studies the statistical distributions of worldwide earthquakes from year 1963 up to year 2012. A Cartesian grid, dividing Earth into geographic regions, is considered. Entropy and the Jensen–Shannon divergence are used to analyze and compare real-world data. Hierarchical clustering and multi-dimensional scaling techniques are adopted for data visualization. Entropy-based indices have the advantage of leading to a single parameter expressing the relationships between the seismic data. Classical and generalized (fractional) entropy and Jensen–Shannon divergence are tested. The generalized measures lead to a clear identification of patterns embedded in the data and contribute to better understand earthquake distributions.
Resumo:
During June 1997-June 1999 rotavirus infection was screened in infants aged up to 2 years and hospitalised with acute diarrhoea in São Luís, Northeastern Brazil. Altogether, 128 stool samples were collected from diarrhoeic patients and additional 122 faecal specimens from age- and- temporal matched inpatients without diarrhoea were obtained; rotavirus positivity rates for these groups were 32.0% (41/128) and 9.8% (12/122), respectively (p < 0.001). Both electropherotyping and serotyping could be performed in 42 (79.2%) of the 53 rotavirus-positive stool samples. Long and short electropherotypes were detected at similar rates - 38.1% and 40.5% of specimens, respectively. Overall, a G serotype could be assigned for 35 (83.3%) of specimens, the majority of them (66.7%) bearing G1-serotype specificity. Taking both electropherotypes and serotypes together, G1 rotavirus strains displaying long and short RNA patterns accounted for 30.9% and 19.0% of tested specimens, respectively; all G2 strains had short electropherotype. Rotavirus gastroenteritis was detected year-round and, in 1998, the incidence rates tended to be higher during the second semester than in the first semester: 45.2% and 26.1% (p = 0.13), respectively. Rotavirus infections peaked at the second semester of life with frequencies of 30.1% and 13.5% for diarrhoeic children and controls, respectively. While the six rotavirus strains bearing G2-type specificity were circulating throughout the whole study period, G1 serotypes (n = 27) emerged as from June 1998 onwards, 20 (74.1%) of which clustering in 1998. These data underscore the importance of rotaviruses in the aetiology of severe infantile gastroenteritis in Northeastern Brazil and sustain the concept that a future vaccine should confer protection against more than one serotype.
Resumo:
Every year forest fires consume large areas, being a major concern in many countries like Australia, United States and Mediterranean Basin European Countries (e.g., Portugal, Spain, Italy and Greece). Understanding patterns of such events, in terms of size and spatiotemporal distributions, may help to take measures beforehand in view of possible hazards and decide strategies of fire prevention, detection and suppression. Traditional statistical tools have been used to study forest fires. Nevertheless, those tools might not be able to capture the main features of fires complex dynamics and to model fire behaviour [1]. Forest fires size-frequency distributions unveil long range correlations and long memory characteristics, which are typical of fractional order systems [2]. Those complex correlations are characterized by self-similarity and absence of characteristic length-scale, meaning that forest fires exhibit power-law (PL) behaviour. Forest fires have also been proved to exhibit time-clustering phenomena, with timescales of the order of few days [3]. In this paper, we study forest fires in the perspective of dynamical systems and fractional calculus (FC). Public domain forest fires catalogues, containing data of events occurred in Portugal, in the period 1980 up to 2011, are considered. The data is analysed in an annual basis, modelling the occurrences as sequences of Dirac impulses. The frequency spectra of such signals are determined using Fourier transforms, and approximated through PL trendlines. The PL parameters are then used to unveil the fractional-order dynamics characteristics of the data. To complement the analysis, correlation indices are used to compare and find possible relationships among the data. It is shown that the used approach can be useful to expose hidden patterns not captured by traditional tools.
Resumo:
A competitividade no fabrico de componentes para a indústria automóvel é um factor-chave para o sucesso de qualquer empresa que queira permanecer neste sector de actividade. Atendendo a que o custo de mão-de-obra tem tendência a subir, e que a qualidade é muito mais difícil de assegurar quando os processos assentam essencialmente em produção manual, a automatização ganha cada vez maior relevo, permitindo uma maior produtividade e repetibilidade, assegurando simultaneamente níveis de qualidade superiores, o que contribui também para um incremento da produtividade ainda mais acentuado. Em Portugal, muitas empresas que trabalham para o sector automóvel já apostam fortemente na automatização de processos, e até na robotização. Esta é a única via para melhorar a competitividade e conseguir concorrer com países onde a mão-de-obra é bastante mais económica, ou com outros onde a automação está fortemente instalada. Este trabalho centrou-se na optimização de um equipamento destinado ao fabrico semiautomático de estruturas de assentamento dos estofos para automóveis. O equipamento original estava já fortemente automatizado, mas necessitava ainda de algumas operações manuais, as quais se resumiam a pouco mais do que transferência e agrupamento de subconjuntos. O trabalho teve que ter em conta todas as limitações impostas pelos sistemas já existentes, e ser realizável com o custo mais económico possível. Depois de vários estudos e propostas, o projecto foi implementado.
Resumo:
Plasmodium falciparum resistant strain development has encouraged the search for new antimalarial drugs. Febrifugine is a natural substance with high activity against P. falciparum presenting strong emetic property and liver toxicity, which prevent it from being used as a clinical drug. The search for analogues that could have a better clinical performance is a current topic. We aim to investigate the theoretical electronic structure by means of febrifugine derivative family semi-empirical molecular orbital calculations, seeking the electronic indexes that could help the design of new efficient derivatives. The theoretical results show there is a clustering in well-defined ranges of several electronic indexes of the most selective molecules. The model proposed for achieving high selectivity was tested with success.
Resumo:
Further improvements in demand response programs implementation are needed in order to take full advantage of this resource, namely for the participation in energy and reserve market products, requiring adequate aggregation and remuneration of small size resources. The present paper focuses on SPIDER, a demand response simulation that has been improved in order to simulate demand response, including realistic power system simulation. For illustration of the simulator’s capabilities, the present paper is proposes a methodology focusing on the aggregation of consumers and generators, providing adequate tolls for the demand response program’s adoption by evolved players. The methodology proposed in the present paper focuses on a Virtual Power Player that manages and aggregates the available demand response and distributed generation resources in order to satisfy the required electrical energy demand and reserve. The aggregation of resources is addressed by the use of clustering algorithms, and operation costs for the VPP are minimized. The presented case study is based on a set of 32 consumers and 66 distributed generation units, running on 180 distinct operation scenarios.
Resumo:
In this paper we study several natural and man-made complex phenomena in the perspective of dynamical systems. For each class of phenomena, the system outputs are time-series records obtained in identical conditions. The time-series are viewed as manifestations of the system behavior and are processed for analyzing the system dynamics. First, we use the Fourier transform to process the data and we approximate the amplitude spectra by means of power law functions. We interpret the power law parameters as a phenomenological signature of the system dynamics. Second, we adopt the techniques of non-hierarchical clustering and multidimensional scaling to visualize hidden relationships between the complex phenomena. Third, we propose a vector field based analogy to interpret the patterns unveiled by the PL parameters.
Resumo:
The last 40 years of the world economy are analyzed by means of computer visualization methods. Multidimensional scaling and the hierarchical clustering tree techniques are used. The current Western downturn in favor of Asian partners may still be reversed in the coming decades.
Resumo:
Atmospheric temperatures characterize Earth as a slow dynamics spatiotemporal system, revealing long-memory and complex behavior. Temperature time series of 54 worldwide geographic locations are considered as representative of the Earth weather dynamics. These data are then interpreted as the time evolution of a set of state space variables describing a complex system. The data are analyzed by means of multidimensional scaling (MDS), and the fractional state space portrait (fSSP). A centennial perspective covering the period from 1910 to 2012 allows MDS to identify similarities among different Earth’s locations. The multivariate mutual information is proposed to determine the “optimal” order of the time derivative for the fSSP representation. The fSSP emerges as a valuable alternative for visualizing system dynamics.