925 results for Cluster analysis.
Abstract:
A space-time analysis of American visceral leishmaniasis (AVL) in humans in the city of Bauru, São Paulo State, Brazil, was carried out based on 239 cases diagnosed between June 2003 and October 2008. Spatial analysis of the disease showed that cases occurred mainly in the city's urban areas. Annual AVL incidence rates were calculated, showing that the highest rate occurred in 2006 (19.55/100,000 inhabitants). This finding was confirmed by the time series analysis, which also showed an upward trend over the period analyzed. The present study allows us to conclude that the disease was clustered in the southwest of the city in 2006, suggesting that this area may require special attention with regard to control and prevention measures.
Abstract:
Rock magnetic, biochemical and inorganic records of the sediment cores PG1351 and Lz1024 from Lake El’gygytgyn, Chukotka peninsula, Far East Russian Arctic, were subjected to a hierarchical agglomerative cluster analysis in order to refine and extend the pattern of climate modes as defined by Melles et al. (2007). Cluster analysis of the data obtained from both cores yielded similar results, differentiating clearly between the four climate modes warm, peak warm, cold and dry, and cold and moist. In addition, two transitional phases were identified, representing the early stages of a cold phase and slightly colder conditions during a warm phase. The statistical approach can thus be used to resolve gradual changes in the sedimentary units as an indicator of available oxygen in the hypolimnion in greater detail. Based upon cluster analyses on core Lz1024, the published succession of climate modes in core PG1351, covering the last 250 ka, was modified and extended back to 350 ka. Comparison to the marine oxygen isotope (δ18O) stack LR04 (Lisiecki and Raymo, 2005) and the summer insolation at 67.5° N, with the extended Lake El’gygytgyn parameter records of magnetic susceptibility (κLF), total organic carbon content (TOC) and the chemical index of alteration (CIA; Minyuk et al., 2007), revealed that all stages back to marine isotope stage (MIS) 10 and most of the substages are clearly reflected in the pattern derived from the cluster analysis.
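As an illustration of the hierarchical agglomerative procedure named in this abstract, the following is a minimal sketch only. The abstract does not state the linkage rule or distance measure used, so average linkage and Euclidean distance are assumptions here, and the sample values are hypothetical rather than data from the cores.

```python
from itertools import combinations
from math import dist

def agglomerative(points, n_clusters):
    """Average-linkage agglomerative clustering; returns lists of point indices."""
    # Start with every observation in its own cluster.
    clusters = [[i] for i in range(len(points))]

    def linkage(a, b):
        # Average pairwise Euclidean distance between members of two clusters.
        return sum(dist(points[i], points[j]) for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Find the closest pair of clusters (x < y) and merge y into x.
        x, y = min(combinations(range(len(clusters)), 2),
                   key=lambda p: linkage(clusters[p[0]], clusters[p[1]]))
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [sorted(c) for c in clusters]

# Hypothetical standardized samples (rows) measured on two proxies (columns).
samples = [(0.1, 0.2), (0.0, 0.1), (2.0, 2.1), (2.2, 1.9), (5.0, 0.1), (5.1, 0.0)]
groups = agglomerative(samples, 3)
```

Stopping at a fixed number of clusters is one simple choice; cutting the merge tree at a distance threshold would be an equally plausible alternative.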
Abstract:
Background: Despite almost 40 years of research into the etiology of Kawasaki Syndrome (KS), there is little research published on spatial and temporal clustering of KS cases. Previous analysis has found significant spatial and temporal clustering of cases; therefore, cluster analyses were performed to substantiate these findings and provide insight into incident KS cases discharged from a pediatric tertiary care hospital. Identifying clusters from a single institution would allow for prospective analysis of risk factors and potential exposures for further insight into KS etiology. Methods: A retrospective study was carried out to examine the epidemiology and distribution of patients presenting to Texas Children’s Hospital in Houston, Texas, with a diagnosis of Acute Febrile Mucocutaneous Lymph Node Syndrome (MCLS) upon discharge from January 1, 2005 to December 31, 2009. Spatial, temporal, and space-time cluster analyses were performed using the Bernoulli model with case and control event data. Results: 397 of 102,761 total patients admitted to Texas Children’s Hospital had a principal or secondary diagnosis of Acute Febrile MCLS upon discharge over the 5-year period. Demographic data for KS cases remained consistent with known disease epidemiology. Spatial, temporal, and space-time analyses of clustering using the Bernoulli model demonstrated no statistically significant clusters. Discussion: Despite previous findings of spatial-temporal clustering of KS cases, there were no significant clusters of KS cases discharged from a single institution. This indicates the need for an expanded approach to conducting spatial-temporal cluster analysis and KS surveillance, given the limitations of evaluating data from a single institution.
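To make the Bernoulli model with case and control data more concrete, here is a hedged sketch of the log likelihood ratio that a Kulldorff-style scan statistic evaluates for a single candidate window. The abstract does not name the software or the exact statistic, so this form is an assumption, and the function name and the window counts are hypothetical; only the totals (397 cases among 102,761 patients) come from the abstract.

```python
from math import log

def bernoulli_llr(c, n, C, N):
    """Log likelihood ratio for one candidate window under a Bernoulli model.

    c, n: cases and total events (cases + controls) inside the window.
    C, N: cases and total events in the whole study region.
    """
    def xlogy(x, y):
        # Convention: 0 * log(0) = 0.
        return 0.0 if x == 0 else x * log(y)

    if n == 0 or n == N:
        return 0.0  # the window is empty or covers everything
    p_in, p_out = c / n, (C - c) / (N - n)
    if p_in <= p_out:
        return 0.0  # only windows with an excess of cases are of interest
    p_all = C / N
    # Alternative hypothesis: separate case probabilities inside and outside.
    alt = (xlogy(c, p_in) + xlogy(n - c, 1 - p_in)
           + xlogy(C - c, p_out) + xlogy((N - n) - (C - c), 1 - p_out))
    # Null hypothesis: one case probability everywhere.
    null = xlogy(C, p_all) + xlogy(N - C, 1 - p_all)
    return alt - null

# Hypothetical window holding 30 of the 397 cases among 4,000 of the 102,761 patients.
llr = bernoulli_llr(30, 4000, 397, 102761)
```

In scan-statistic practice the largest ratio over all windows is usually compared with values obtained from random relabelings of cases and controls; that significance step is omitted here.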
Abstract:
Multivariate analyses of latest Pliocene through Holocene benthic foraminifera from 61 samples from Deep-Sea Drilling Project (DSDP) Site 214, eastern Indian Ocean, were carried out. The 46 highest-ranked species were used in R-mode factor analysis, which enabled the identification of three environmentally significant assemblages at Site 214. Assemblage 1 is characterized by Uvigerina hispido-costata, Osangularia culter, Gavelinopsis lobatulus, Cibicides wuellerstorfi and Karreriella baccata as principal species. This assemblage is inferred to reflect a high-energy, well-oxygenated and probably low-organic carbon deep-sea environment at Site 214. Assemblage 2 is defined principally by Globocassidulina pacifica and U. proboscidea and is considered to indicate an organic carbon-rich environment which resulted from high surface productivity irrespective of dissolved oxygen content. Assemblage 3 is marked by Oridorsalis umbonatus, Textularia lythostrota, Hoeglundina elegans, Pyrgo murrhina, and Pullenia quinqueloba as principal species. This assemblage is inferred to indicate a low-organic carbon environment with high pore water oxygen concentration leading to better preservation of deep-sea sediments.
Abstract:
A system of cluster analysis for genome-wide expression data from DNA microarray hybridization is described that uses standard statistical algorithms to arrange genes according to similarity in pattern of gene expression. The output is displayed graphically, conveying the clustering and the underlying expression data simultaneously in a form intuitive for biologists. We have found in the budding yeast Saccharomyces cerevisiae that clustering gene expression data groups together efficiently genes of known similar function, and we find a similar tendency in human data. Thus patterns seen in genome-wide expression experiments can be interpreted as indications of the status of cellular processes. Also, coexpression of genes of known function with poorly characterized or novel genes may provide a simple means of gaining leads to the functions of many genes for which information is not available currently.
Abstract:
Varicella-zoster virus open reading frame 10 (ORF10) protein, the homolog of the herpes simplex virus protein VP16, can transactivate immediate-early promoters from both viruses. A protein sequence comparison procedure termed hydrophobic cluster analysis was used to identify a motif centered at Phe-28, near the amino terminus of ORF10, that strongly resembles the sequence of the activating domain surrounding Phe-442 of VP16. With a series of GAL4-ORF10 fusion proteins, we mapped the ORF10 transcriptional-activation domain to the amino-terminal region (aa 5-79). Extensive mutagenesis of Phe-28 in GAL4-ORF10 fusion proteins demonstrated the importance of an aromatic or bulky hydrophobic amino acid at this position, as shown previously for Phe-442 of VP16. Transactivation by the native ORF10 protein was abolished when Phe-28 was replaced by Ala. Similar amino-terminal domains were identified in the VP16 homologs of other alphaherpesviruses. Hydrophobic cluster analysis correctly predicted activation domains of ORF10 and VP16 that share critical characteristics of a distinctive subclass of acidic activation domains.
Abstract:
Optimal currency area theory suggests that business cycle co-movement is a sufficient condition for monetary union, particularly if there are low levels of labour mobility between potential members of the monetary union. Previous studies of co-movement of business cycle variables (mainly authored by Artis and Zhang in the late 1990s) found that there was a core of member states in the EU that could be grouped together as having similar business cycle co-movements, but these studies always used Germany as the country against which to compare. In this study, the analysis of Artis and Zhang is extended and updated by correlating against both German and euro area macroeconomic aggregates and by using more recent techniques in cluster analysis, namely model-based clustering techniques.
Abstract:
Cluster analysis via a finite mixture model approach is considered. With this approach to clustering, the data can be partitioned into a specified number of clusters g by first fitting a mixture model with g components. An outright clustering of the data is then obtained by assigning an observation to the component to which it has the highest estimated posterior probability of belonging; that is, the ith cluster consists of those observations assigned to the ith component (i = 1,..., g). The focus is on the use of mixtures of normal components for the cluster analysis of data that can be regarded as being continuous. But attention is also given to the case of mixed data, where the observations consist of both continuous and discrete variables.
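The assignment rule described in this abstract (fit a g-component mixture, then allocate each observation to the component with the highest estimated posterior probability) can be sketched as follows. This is a minimal univariate illustration fitted by a basic EM loop, not the authors' implementation; the function names, starting values, and data are hypothetical.

```python
from math import exp, pi, sqrt

def normal_pdf(x, mean, var):
    return exp(-(x - mean) ** 2 / (2 * var)) / sqrt(2 * pi * var)

def posteriors(data, weights, means, variances):
    """Posterior probability of each component for each observation."""
    result = []
    for x in data:
        joint = [w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
        total = sum(joint)
        result.append([j / total for j in joint])
    return result

def mixture_cluster(data, init_means, n_iter=50):
    """Fit a univariate normal mixture by EM, then cluster by highest posterior."""
    g = len(init_means)
    means = list(init_means)
    variances = [1.0] * g          # assumed starting variances
    weights = [1.0 / g] * g        # assumed equal starting proportions
    for _ in range(n_iter):
        post = posteriors(data, weights, means, variances)   # E-step
        for i in range(g):                                    # M-step
            n_i = sum(p[i] for p in post)
            weights[i] = n_i / len(data)
            means[i] = sum(p[i] * x for p, x in zip(post, data)) / n_i
            var = sum(p[i] * (x - means[i]) ** 2 for p, x in zip(post, data)) / n_i
            variances[i] = max(var, 1e-6)  # floor to keep the estimate nonsingular
    post = posteriors(data, weights, means, variances)
    # Outright clustering: pick the component with the highest posterior.
    labels = [max(range(g), key=lambda i: p[i]) for p in post]
    return labels, means

# Hypothetical continuous data with two well-separated groups.
data = [1.0, 1.2, 0.8, 1.1, 5.0, 5.2, 4.9, 5.1]
labels, means = mixture_cluster(data, init_means=[0.0, 6.0])
```

The variance floor is a pragmatic guard against degenerate components in this toy setting; it is not something the abstract prescribes.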
Abstract:
Normal mixture models are often used to cluster continuous data. However, conventional approaches for fitting these models will have problems in producing nonsingular estimates of the component-covariance matrices when the dimension of the observations is large relative to the number of observations. In this case, methods such as principal components analysis (PCA) and the mixture of factor analyzers model can be adopted to avoid these estimation problems. We examine these approaches applied to the Cabernet wine data set of Ashenfelter (1999), considering the clustering of both the wines and the judges, and comparing our results with another analysis. The mixture of factor analyzers model proves particularly effective in clustering the wines, accurately classifying many of the wines by location.
Abstract:
This paper considers a model-based approach to the clustering of tissue samples of a very large number of genes from microarray experiments. It is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. Frequently in practice, there are also clinical data available on those cases on which the tissue samples have been obtained. Here we investigate how to use the clinical data in conjunction with the microarray gene expression data to cluster the tissue samples. We propose two mixture model-based approaches in which the number of components in the mixture model corresponds to the number of clusters to be imposed on the tissue samples. One approach specifies the components of the mixture model to be the conditional distributions of the microarray data given the clinical data with the mixing proportions also conditioned on the latter data. Another takes the components of the mixture model to represent the joint distributions of the clinical and microarray data. The approaches are demonstrated on some breast cancer data, as studied recently in van't Veer et al. (2002).