88 resultados para Cluster Analysis of Variables
em University of Queensland eSpace - Australia
Resumo:
The purpose of this study was to investigate the relationship between self-awareness, emotional distress, motivation, and outcome in adults with severe traumatic brain injury. A sample of 55 patients were selected from 120 consecutive patients with severe traumatic brain injury admitted to the rehabilitation unit of a large metropolitan public hospital. Subjects received multidisciplinary inpatient rehabilitation and different types of outpatient rehabilitation and community-based services according to availability and need, Measures used in the cluster analysis were the Patient Competency Rating Scale, Self-Awareness of Deficits Interview, Head Injury Behavior Scale, Change Assessment Questionnaire, the Beck Depression Inventory, and Beck Anxiety Inventory; outcome measures were the Disability Rating Scale, Community Integration Questionnaire, and Sickness Impact Profile. A three-cluster solution was selected, with groups labeled as high self-awareness (n = 23), low self-awareness (n = 23), and good recovery (n = 8). The high self-awareness cluster had significantly higher levels of self-awareness, motivation, and emotional distress than the low self-awareness cluster but did not differ significantly in outcome. Self-awareness after brain injury is associated with greater motivation to change behavior and higher levels of depression and anxiety; however, it was not clear that this heightened motivation actually led to any improvement in outcome. Rehabilitation timing and approach may need to be tailored to match the individual's level of self-awareness, motivation, and emotional distress.
Resumo:
Normal mixture models are often used to cluster continuous data. However, conventional approaches for fitting these models will have problems in producing nonsingular estimates of the component-covariance matrices when the dimension of the observations is large relative to the number of observations. In this case, methods such as principal components analysis (PCA) and the mixture of factor analyzers model can be adopted to avoid these estimation problems. We examine these approaches applied to the Cabernet wine data set of Ashenfelter (1999), considering the clustering of both the wines and the judges, and comparing our results with another analysis. The mixture of factor analyzers model proves particularly effective in clustering the wines, accurately classifying many of the wines by location.
Resumo:
This paper considers a model-based approach to the clustering of tissue samples of a very large number of genes from microarray experiments. It is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. Frequently in practice, there are also clinical data available on those cases on which the tissue samples have been obtained. Here we investigate how to use the clinical data in conjunction with the microarray gene expression data to cluster the tissue samples. We propose two mixture model-based approaches in which the number of components in the mixture model corresponds to the number of clusters to be imposed on the tissue samples. One approach specifies the components of the mixture model to be the conditional distributions of the microarray data given the clinical data with the mixing proportions also conditioned on the latter data. Another takes the components of the mixture model to represent the joint distributions of the clinical and microarray data. The approaches are demonstrated on some breast cancer data, as studied recently in van't Veer et al. (2002).
Resumo:
We describe a network module detection approach which combines a rapid and robust clustering algorithm with an objective measure of the coherence of the modules identified. The approach is applied to the network of genetic regulatory interactions surrounding the tumor suppressor gene p53. This algorithm identifies ten clusters in the p53 network, which are visually coherent and biologically plausible.
Resumo:
This paper describes the application of a new technique, rough clustering, to the problem of market segmentation. Rough clustering produces different solutions to k-means analysis because of the possibility of multiple cluster membership of objects. Traditional clustering methods generate extensional descriptions of groups, that show which objects are members of each cluster. Clustering techniques based on rough sets theory generate intensional descriptions, which outline the main characteristics of each cluster. In this study, a rough cluster analysis was conducted on a sample of 437 responses from a larger study of the relationship between shopping orientation (the general predisposition of consumers toward the act of shopping) and intention to purchase products via the Internet. The cluster analysis was based on five measures of shopping orientation: enjoyment, personalization, convenience, loyalty, and price. The rough clusters obtained provide interpretations of different shopping orientations present in the data without the restriction of attempting to fit each object into only one segment. Such descriptions can be an aid to marketers attempting to identify potential segments of consumers.
Resumo:
Cluster analysis via a finite mixture model approach is considered. With this approach to clustering, the data can be partitioned into a specified number of clusters g by first fitting a mixture model with g components. An outright clustering of the data is then obtained by assigning an observation to the component to which it has the highest estimated posterior probability of belonging; that is, the ith cluster consists of those observations assigned to the ith component (i = 1,..., g). The focus is on the use of mixtures of normal components for the cluster analysis of data that can be regarded as being continuous. But attention is also given to the case of mixed data, where the observations consist of both continuous and discrete variables.
Resumo:
Finite mixture models are being increasingly used to model the distributions of a wide variety of random phenomena. While normal mixture models are often used to cluster data sets of continuous multivariate data, a more robust clustering can be obtained by considering the t mixture model-based approach. Mixtures of factor analyzers enable model-based density estimation to be undertaken for high-dimensional data where the number of observations n is very large relative to their dimension p. As the approach using the multivariate normal family of distributions is sensitive to outliers, it is more robust to adopt the multivariate t family for the component error and factor distributions. The computational aspects associated with robustness and high dimensionality in these approaches to cluster analysis are discussed and illustrated.
Resumo:
Power system small signal stability analysis aims to explore different small signal stability conditions and controls, namely: (1) exploring the power system security domains and boundaries in the space of power system parameters of interest, including load flow feasibility, saddle node and Hopf bifurcation ones; (2) finding the maximum and minimum damping conditions; and (3) determining control actions to provide and increase small signal stability. These problems are presented in this paper as different modifications of a general optimization to a minimum/maximum, depending on the initial guesses of variables and numerical methods used. In the considered problems, all the extreme points are of interest. Additionally, there are difficulties with finding the derivatives of the objective functions with respect to parameters. Numerical computations of derivatives in traditional optimization procedures are time consuming. In this paper, we propose a new black-box genetic optimization technique for comprehensive small signal stability analysis, which can effectively cope with highly nonlinear objective functions with multiple minima and maxima, and derivatives that can not be expressed analytically. The optimization result can then be used to provide such important information such as system optimal control decision making, assessment of the maximum network's transmission capacity, etc. (C) 1998 Elsevier Science S.A. All rights reserved.
Resumo:
Regression analyses of a long series of light-trap catches at Narrabri, Australia, were used to describe the seasonal dynamics of Helicoverpa armigera (Hubner). The size of the second generation was significantly related to the size of the first generation, to winter rainfall, which had a positive effect, and to spring rainfall which had a negative effect. These variables accounted for up to 96% of the variation in size of the second generation from year to year. Rainfall and crop hosts were also important for the size of the third generation. The area and tonnage of many potential host crops were significantly correlated with winter rain. When winter rain was omitted from the analysis, the sizes of both the second and third generations could be expressed as a function of the size of the previous generation and of the areas planted to lucerne, sorghum and maize. Lucerne and maize always had positive coefficients and sorghum a negative one. We extended our analysis to catches of H. punctigera (Wallengren), which declines in abundance after the second generation. Winter rain had a positive effect on the sizes of the second and third generations, and rain in spring or early summer had a negative effect. Only the area grown to lucerne had a positive effect on abundance. Forecasts of pest levels from a few months to a few weeks in advance are discussed, along with the improved understanding of the seasonal dynamics of both species and the significance of crops in the management of insecticide resistance for H. armigera.
Resumo:
Transposon mutagenesis and complementation studies previously identified a gene (xabB) for a large (526 kDa) polyketide-peptide synthase required for biosynthesis of albicidin antibiotics and phytotoxins in the sugarcane leaf scald pathogen Xanthomonas albilineans. A cistron immediately downstream from xabB encodes a polypeptide of 343 aa containing three conserved motifs characteristic of a family of S-adenosyl-L-methionine (SAM)-dependent O-methyltransferases. Insertional mutagenesis and complementation indicate that the product of this cistron (designated xabC) is essential for albicidin production, and that there is no other required downstream cistron. The xab promoter region is bidirectional, and insertional mutagenesis of the first open reading frame (ORF) in the divergent gene also blocks albicidin biosynthesis. This divergent ORF (designated thp) encodes a protein of 239 aa displaying high similarity to several IS21-like transposition helper proteins. The thp cistron is not located in a recognizable transposon, and is probably a remnant from a past transposition event that may have contributed to the development of the albicidin biosynthetic gene cluster. Failure of 'in trans' complementation of rhp indicates that a downstream cistron transcribed with thp is required for albicidin biosynthesis. (C) 2000 Elsevier Science B.V. All rights reserved.
Resumo:
The importance of overweight as a risk factor for coronary heart disease (CHD) remains unsettled. We estimated the relative risk (RR) for CHD associated with underweight (body mass index, BMI < 20 kg/m2), overweight (25 – 30 kg/m2) and obesity (= 30 kg/m2), compared with normal weight (20 – 25 kg/m2) in a random effects meta-analysis of 30 prospective studies, including 389,239 healthy, predominantly Caucasian persons. We also explored sources of heterogeneity between studies and examined effects of systematic adjustment for confounding and intermediary variables. Pooled age-, sex- and smoking-adjusted RRs (95% confidence interval) for overweight, obesity and underweight compared with normal weight were 1.33 (1.24 – 1.43), 1.69 (1.44 – 1.99) and 1.01 (0.85 – 1.20), respectively. Stratified analyses showed that pooled RRs for BMI were higher for studies with longer follow-up (= vs. < 15 years) and younger populations (< vs. = 60 years). Additional adjustment for blood pressure, cholesterol levels and physical activity decreased the RR per 5 BMI units from 1.28 (1.21 – 1.34) to 1.16 (1.11 – 1.21). We conclude that overweight and obesity are associated with a substantially increased CHD risk in Caucasians, whereas underweight is not. Prevention and reduction of overweight and obesity, therefore, remain of importance for preventing CHD.
Resumo:
Catalogues the demographic changes in Bangladesh during the period 1975-2000 and examines how they relate to key socio-economic attributes. Trends are examined in population growth, growth of the working age population, women’s workforce participation, age-dependency ratio, female-male ratio, longevity, fertility, mortality and mean age at first marriage. Bangladesh has made significant breakthroughs in all these areas, a feat not matched by most other South Asian countries, but comparable with the South-East Asia region as whole. The study isolates factors contributing to the changes in each attribute. It assesses the correlation between Bangladesh’s demographic changes and selected socio-economic indicators namely, its per capita GDP, female labour force participation, per capita public health expenditure and educational achievements by both men and women. All five socio-economic variables display statistically significant correlation, in varying degrees, with measures of the demographic changes. Per capita GDP is probably the most significant determinant of demographic changes in Bangladesh. The study observes that men’s education reinforces women’s education and with increased workforce participation contributed to reduced fertility. The study suggests that the role of family planning programs in curbing population growth in Bangladesh maybe overestimated.
Diversity and commonality in national identities: an exploratory analysis of cross-national patterns
Resumo:
Issues of boundary maintenance are implicit in all studies of national identity. By definition, national communities consist of those who are included but surrounded (literally or metaphorically) by those who are excluded. Most extant research on national identity explores criteria for national membership largely in terms of official or public definitions described, for example, in citizenship and immigration laws or in texts of popular culture. We know much less about how ordinary people in various nations reason about these issues. An analysis of cross-national (N = 23) survey data from the 1995 International Social Science Program reveals a core pattern in most of the countries studied. Respondents were asked how important various criteria were in being 'truly' a member of a particular nation. Exploratory factor analysis shows that these items cluster in terms of two underlying dimensions. Ascriptive/objectivist criteria relating to birth, religion and residence can be distinguished from civic/voluntarist criteria relating to subjective feelings of membership and belief in core institutions. In most nations the ascriptive/objectivist dimension of national identity was more prominent than the subjective civic/voluntarist dimension. Taken overall, these findings suggest an unanticipated homogeneity in the ways that citizens around the world think about national identity. To the extent that these dimensions also mirror the well-known distinction between ethnic and civic national identification, they suggest that the former remains robust despite globalization, mass migration and cultural pluralism. Throughout the world official definitions of national identification have tended to shift towards a civic model. Yet citizens remain remarkably traditional in outlook. A task for future research is to investigate the macrosociological forces that produce both commonality and difference in the core patterns we have identified.
Resumo:
Existing procedures for the generation of polymorphic DNA markers are not optimal for insect studies in which the organisms are often tiny and background molecular Information is often non-existent. We have used a new high throughput DNA marker generation protocol called randomly amplified DNA fingerprints (RAF) to analyse the genetic variability In three separate strains of the stored grain pest, Rhyzopertha dominica. This protocol is quick, robust and reliable even though it requires minimal sample preparation, minute amounts of DNA and no prior molecular analysis of the organism. Arbitrarily selected oligonucleotide primers routinely produced similar to 50 scoreable polymorphic DNA markers, between individuals of three Independent field isolates of R. dominica. Multivariate cluster analysis using forty-nine arbitrarily selected polymorphisms generated from a single primer reliably separated individuals into three clades corresponding to their geographical origin. The resulting clades were quite distinct, with an average genetic difference of 37.5 +/- 6.0% between clades and of 21.0 +/- 7.1% between individuals within clades. As a prelude to future gene mapping efforts, we have also assessed the performance of RAF under conditions commonly used in gene mapping. In this analysis, fingerprints from pooled DNA samples accurately and reproducibly reflected RAF profiles obtained from Individual DNA samples that had been combined to create the bulked samples.
Resumo:
A laboratory scale sequencing batch reactor (SBR) operating for enhanced biological phosphorus removal (EBPR) and fed with a mixture of volatile fatty acids (VFAs) showed stable and efficient EBPR capacity over a four-year-period. Phosphorus (P), poly-beta-hydroxyalkanoate (PHA) and glycogen cycling consistent with classical anaerobic/aerobic EBPR were demonstrated with the order of anaerobic VFA uptake being propionate, acetate then butyrate. The SBR was operated without pH control and 63.67+/-13.86 mg P l(-1) was released anaerobically. The P% of the sludge fluctuated between 6% and 10% over the operating period (average of 8.04+/-1.31%). Four main morphological types of floc-forming bacteria were observed in the sludge during one year of in-tensive microscopic observation. Two of them were mainly responsible for anaerobic/aerobic P and PHA transformations. Fluorescence in situ hybridization (FISH) and post-FISH chemical staining for intracellular polyphosphate and PHA were used to determine that 'Candidatus Accumulibacter phosphatis' was the most abundant polyphosphate accumulating organism (PAO), forming large clusters of coccobacilli (1.0-1.5 mum) and comprising 53% of the sludge bacteria. Also by these methods, large coccobacillus-shaped gammaproteobacteria (2.5-3.5 mum) from a recently described novel cluster were glycogen-accumulating organisms (GAOs) comprising 13% of the bacteria. Tetrad-forming organisms (TFOs) consistent with the 'G bacterium' morphotype were alphaproteobacteria , but not Amaricoccus spp., and comprised 25% of all bacteria. According to chemical staining, TFOs were occasionally able to store PHA anaerobically and utilize it aerobically.