27 results for HIERARCHICAL CLUSTER ANALYSIS
Abstract:
Seven years of multi-environment yield trials of navy bean (Phaseolus vulgaris L.) grown in Queensland were examined. As is common with plant breeding evaluation trials, test entries and locations varied between years. Grain yield data were analysed for each year using cluster and ordination analyses (pattern analyses). These methods facilitate descriptions of genotype performance across environments and of the discrimination among genotypes provided by the environments. The observed trends for genotypic yield performance across environments were partly consistent with agronomic and disease reactions at specific environments and also partly explainable by breeding and selection history. In some cases, similarities in discrimination among environments were related to geographic proximity, in others to management practices, and in others similarities occurred between geographically widely separated environments which differed in management practices. One location was identified as having atypical line discrimination. The analysis indicated that the number of test locations was below requirements for adequate representation of line x environment interaction. The pattern analysis methods used were an effective aid in describing the patterns in data for each year and illustrated the variations in adaptive patterns from year to year. The study has implications for assessing the number and location of test sites for plant breeding multi-environment trials, and for the understanding of genetic traits contributing to line x environment interactions.
Abstract:
The concentrations of major, minor and trace metals were measured in water samples collected from five shallow Antarctic lakes (Carezza, Edmonson Point (No 14 and 15a), Inexpressible Island and Tarn Flat) found in Terra Nova Bay (northern Victoria Land, Antarctica) during the Italian Expeditions of 1993-2001. The total concentrations of a large suite of elements (Al, As, Ba, Ca, Cd, Ce, Co, Cr, Cs, Cu, Fe, Ga, Gd, K, La, Li, Mg, Mn, Mo, Na, Nd, Ni, Pb, Pr, Rb, Sc, Si, Sr, Ta, Ti, U, V, Y, W, Zn and Zr) were determined using spectroscopic techniques (ICP-AES, GF-AAS and ICP-MS). The results are similar to those obtained for the freshwater lakes of the Larsemann Hills, East Antarctica, and for the McMurdo Dry Valleys. Principal Component Analysis (PCA) and Cluster Analysis (CA) were performed to identify groups of samples with similar characteristics and to find correlations between the variables. The variability observed within the water samples is closely connected to the sea spray input; hence, it is primarily a consequence of geographical and meteorological factors, such as distance from the ocean and time of year. The trace element levels, in particular those of heavy metals, are very low, suggesting an origin from natural sources rather than from anthropogenic contamination.
Abstract:
A marker database was compiled for isolates of the potato and tomato late blight pathogen, Phytophthora infestans, originating from 41 locations which include 31 countries plus 10 regions within Mexico. Presently, the database contains information on 1,776 isolates for one or more of the following markers: restriction fragment length polymorphism (RFLP) fingerprint consisting of 23 bands; mating type; dilocus allozyme genotype; mitochondrial DNA haplotype; sensitivity to the fungicide metalaxyl; and virulence. In the database, 305 entries have unique RFLP fingerprints and 258 entries have unique multilocus genotypes based on RFLP fingerprint, dilocus allozyme genotype, and mating type. A nomenclature is described for naming multilocus genotypes based on the International Organization for Standardization (ISO) two-letter country code and a unique number. Forty-two previously published multilocus genotypes are represented in the database with references to publications. As a result of compilation of the database, seven new genotypes were identified and named. Cluster analysis of genotypes from clonally propagated populations worldwide generally confirmed a previously published classification of old and new genotypes. Genotypes from geographically distant countries were frequently clustered, and several old and new genotypes were found in two or more distant countries. The cluster analysis also demonstrated that A2 genotypes from Argentina differed from all others. The database is available via the Internet, and thus can serve as a resource for Phytophthora workers worldwide.
Abstract:
Normal mixture models are being increasingly used to model the distributions of a wide variety of random phenomena and to cluster sets of continuous multivariate data. However, for a set of data containing a group or groups of observations with longer than normal tails or atypical observations, the use of normal components may unduly affect the fit of the mixture model. In this paper, we consider a more robust approach by modelling the data by a mixture of t distributions. The use of the ECM algorithm to fit this t mixture model is described and examples of its use are given in the context of clustering multivariate data in the presence of atypical observations in the form of background noise.
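The robustness claimed here comes from the way a t component downweights observations far from its centre. A minimal sketch, assuming the standard EM weight for a t distribution, u = (nu + p) / (nu + delta), restricted to the univariate case p = 1. The function name and the example values are illustrative and are not taken from the paper.

```python
def t_weight(y, mu, sigma2, nu):
    """E-step weight of one observation under a univariate t component.

    Assumption: standard t-mixture EM weight with p = 1 dimension.
    y: observation, mu: component location, sigma2: scale (variance-like),
    nu: degrees of freedom.
    """
    delta = (y - mu) ** 2 / sigma2  # squared Mahalanobis distance for p = 1
    return (nu + 1.0) / (nu + delta)


# A point at the centre versus a point ten scale units away (nu = 4).
central = t_weight(0.0, 0.0, 1.0, 4.0)
outlier = t_weight(10.0, 0.0, 1.0, 4.0)
```

Atypical observations receive weights close to zero and so contribute little to the updated location and scale, whereas as nu grows the weight tends to 1 for every point and the normal mixture is recovered.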
Abstract:
1. Cluster analysis of reference sites with similar biota is the initial step in creating River Invertebrate Prediction and Classification System (RIVPACS) and similar river bioassessment models such as Australian River Assessment System (AUSRIVAS). This paper describes and tests an alternative prediction method, Assessment by Nearest Neighbour Analysis (ANNA), based on the same philosophy as RIVPACS and AUSRIVAS but without the grouping step that some people view as artificial. 2. The steps in creating ANNA models are: (i) weighting the predictor variables using a multivariate approach analogous to principal axis correlations, (ii) calculating the weighted Euclidean distance from a test site to the reference sites based on the environmental predictors, (iii) predicting the faunal composition based on the nearest reference sites and (iv) calculating an observed/expected (O/E) ratio analogous to RIVPACS/AUSRIVAS. 3. The paper compares AUSRIVAS and ANNA models on 17 datasets representing a variety of habitats and seasons. First, it examines each model's regressions for Observed versus Expected number of taxa, including the r², intercept and slope. Second, the two models' assessments of 79 test sites in New Zealand are compared. Third, the models are compared on test and presumed reference sites along a known trace metal gradient. Fourth, ANNA models are evaluated for Western Australia, a geographically distinct region of Australia. The comparisons demonstrate that ANNA and AUSRIVAS are generally equivalent in performance, although ANNA turns out to be potentially more robust for the O versus E regressions and is potentially more accurate on the trace metal gradient sites. 4. The ANNA method is recommended for use in bioassessment of rivers, at least for corroborating the results of the well established AUSRIVAS- and RIVPACS-type models, if not to replace them.
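Steps (ii) to (iv) of the ANNA procedure can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the predictor weights from step (i) are taken as given, the k nearest reference sites are averaged without distance weighting, and taxa are counted only when their predicted probability reaches a threshold of 0.5. All function names, parameters and data are hypothetical.

```python
import math


def weighted_distance(a, b, weights):
    """Weighted Euclidean distance between two sites' environmental predictors."""
    return math.sqrt(sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b)))


def anna_expected(test_env, ref_env, ref_taxa, weights, k=3):
    """Predict taxon occurrence probabilities from the k nearest reference sites.

    Assumption: a plain (unweighted) mean over the k neighbours.
    ref_taxa[i][t] is 1 if taxon t was found at reference site i, else 0.
    """
    order = sorted(
        range(len(ref_env)),
        key=lambda i: weighted_distance(test_env, ref_env[i], weights),
    )
    nearest = order[:k]
    n_taxa = len(ref_taxa[0])
    probs = [sum(ref_taxa[i][t] for i in nearest) / len(nearest) for t in range(n_taxa)]
    return probs, nearest


def observed_over_expected(observed, probs, threshold=0.5):
    """O/E ratio over taxa whose predicted probability is at least `threshold`."""
    kept = [t for t, p in enumerate(probs) if p >= threshold]
    expected = sum(probs[t] for t in kept)
    seen = sum(observed[t] for t in kept)
    return seen / expected if expected else float("nan")


# Hypothetical data: four reference sites, two predictors, three taxa.
ref_env = [[0, 0], [1, 0], [0, 1], [10, 10]]
ref_taxa = [[1, 1, 0], [1, 0, 0], [1, 1, 0], [0, 0, 1]]
probs, nearest = anna_expected([0, 0], ref_env, ref_taxa, weights=[1, 1], k=3)
oe = observed_over_expected([1, 0, 0], probs)
```

Because each test site gets its own set of neighbours, no grouping of reference sites is required, which is the point of contrast with RIVPACS and AUSRIVAS drawn in the abstract.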
Abstract:
The relative abundance and topographical distribution of retinal cone photoreceptors were measured in 19 bird species to identify possible correlations between photoreceptor complement and visual ecology. In contrast to previous studies, all five types of cone photoreceptor were distinguished, using bright field and epifluorescent light microscopy, in four retinal quadrants. Land birds tended to show either posterior dorsal to anterior ventral or anterior dorsal to posterior ventral gradients in cone photoreceptor distribution, fundus coloration and oil droplet pigmentation across the retina. Marine birds tended to show dorsal to ventral gradients instead. Statistical analyses showed that the proportions of the different cone types varied significantly across the retinae of all species investigated. Cluster analysis was performed on the data to identify groups or clusters of species on the basis of their oil droplet complement. Using the absolute percentages of each oil droplet type in each quadrant for the analysis produced clusters that tended to reflect phylogenetic relatedness between species rather than similarities in their visual ecology. Repeating the analysis after subtracting the mean percentage of a given oil droplet type across the whole retina (the 'eye mean') from the percentage of that oil droplet type in each quadrant, i.e. to give a measure of the variation about the mean, resulted in clusters that reflected diet, feeding behaviour and habitat to a greater extent than phylogeny.
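The 'eye mean' adjustment applied before the second cluster analysis can be illustrated with a short sketch. It assumes that the whole-retina mean for a droplet type is approximated by the unweighted mean of its four quadrant percentages; the function name, droplet label and numbers are hypothetical.

```python
def centre_on_eye_mean(quadrant_pct):
    """Subtract each droplet type's eye mean from its quadrant percentages.

    quadrant_pct maps a droplet type to its percentages in the four quadrants.
    Assumption: the eye mean is the unweighted mean of the quadrant values.
    """
    centred = {}
    for droplet, values in quadrant_pct.items():
        eye_mean = sum(values) / len(values)
        centred[droplet] = [v - eye_mean for v in values]
    return centred


# Hypothetical species with one droplet type rising from 10% to 40% across quadrants.
centred = centre_on_eye_mean({"R": [10.0, 20.0, 30.0, 40.0]})
```

After centring, two species with different overall droplet proportions but the same gradient across the retina yield identical vectors, which is consistent with the clusters shifting from phylogeny towards visual ecology.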
Abstract:
Outcome after traumatic brain injury (TBI) is characterized by a high degree of variability which has often been difficult to capture in traditional outcome studies. The purpose of this study was to describe patterns of community integration 2-5 years after TBI. Participants were 208 patients admitted to a Brain Injury Rehabilitation Unit between 1991 and 1995 in Brisbane, Australia. The design comprised retrospective data collection and questionnaire follow-up by mail. Mean follow-up was 3.5 years. Demographic, injury severity and functional status variables were retrieved from hospital records. Community integration was assessed using the Community Integration Questionnaire (CIQ), and vocational status was measured by a self-administered questionnaire. Data were analysed using cluster analysis, which divided the data into meaningful subsets. Based on the CIQ subscale scores of home, social and productive integration, a three cluster solution was selected, with groups labelled as working (n = 78), balanced (n = 46) and poorly integrated (n = 84). Although 38% of the sample returned to a high level of productive activity and 22% achieved a balanced lifestyle, overall community integration was poor for the remainder. This poorly integrated group had more severe injury characterized by longer periods of acute care and post-traumatic amnesia (PTA) and greater functional disability on discharge. These findings have implications for service delivery prior to and during the process of reintegration after brain injury.
Abstract:
Motivation: This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic, used in conjunction with a threshold on the size of a cluster, allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so mixtures of factor analyzers are used to reduce effectively the dimension of the feature space of genes. Results: The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes can be selected that reveal interesting clusterings of the tissues that are consistent either with the external classification of the tissues or with background and biological knowledge of these sets.
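The gene-selection step described under Motivation can be pictured as a simple filter. The sketch below is not EMMIX-GENE code: it assumes that each gene already has a likelihood ratio statistic and the size of the smaller cluster from a two-component fit, and that a gene is kept when both values meet their thresholds. The names, threshold values and data are hypothetical.

```python
def select_genes(gene_stats, lrt_threshold, min_cluster_size):
    """Keep genes that pass both thresholds, ordered by decreasing statistic.

    gene_stats maps a gene name to (likelihood_ratio_statistic, smallest_cluster_size).
    Assumption: a gene is relevant only if it passes both thresholds.
    """
    kept = [
        (lrt, name)
        for name, (lrt, size) in gene_stats.items()
        if lrt > lrt_threshold and size >= min_cluster_size
    ]
    return [name for lrt, name in sorted(kept, reverse=True)]


# Hypothetical statistics for four genes.
stats = {"g1": (12.0, 10), "g2": (3.0, 20), "g3": (15.0, 2), "g4": (20.0, 8)}
selected = select_genes(stats, lrt_threshold=8.0, min_cluster_size=5)
```

The cluster-size condition guards against genes whose large statistic is driven by a handful of outlying tissues rather than by a genuine two-group structure.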
Abstract:
The vascular and bryophyte floras of subantarctic Heard Island were classified using cluster analysis into six vegetation communities: Open Cushion Carpet, Mossy Feldmark, Wet Mixed Herbfield, Coastal Biotic Vegetation, Saltspray Vegetation, and Closed Cushion Carpet. Multidimensional scaling indicated that the vegetation communities were not well delineated but were continua. Discriminant analysis and a classification tree identified altitude, wind, peat depth, bryophyte cover and extent of bare ground, and particle size as discriminating variables. The combination of small area, glaciation, and harsh climate has resulted in reduced vegetation variety in comparison to those subantarctic islands north of the Antarctic Polar Front Zone. Some of the functional groups and vegetation communities found on warmer subantarctic islands are not present on Heard Island, notably ferns and sedges and fernbrakes and extensive mires, respectively.
Abstract:
The authors discern the community structure of the postindustrial city, with reference to Australia. They focus empirically on three major types of Australian urban center: urban regions, metropolitan areas that are not part of urban regions, and other major cities. These three account for almost three-quarters of the Australian population. The authors draw on a conceptualization formulated by Marcuse and van Kempen to guide the analysis, with a combination of cluster analysis and discriminant analysis being applied to aggregate (essentially census) data to identify the communities. Nine major Australian urban communities are identified: four are affluent, four are disadvantaged, and one is a working-class community. The communities found, however, differed greatly from those cited in the Marcuse and van Kempen schema.
Abstract:
This study identifies and explores a new country of origin (COO) cue, “owned by….” The importance of three extrinsic cues “owned by …,” “made in …” and price was examined using conjoint analysis. Data were collected from a sample of 268 undergraduate students familiar with color televisions. Segments were formed using cluster analysis and analyzed using multiple discriminant analysis. “Owned by …” was found to be important and distinct from the “made in …” cue. Segments based on the two COO cues were identified using importance weights and individual utilities. When segments were formed using individual utilities the individual difference construct, economic nationalism, provided discriminatory power while consumer ethnocentrism did not, supporting the hypothesis that economic nationalism and consumer ethnocentrism differ. Practitioners can now use “owned by …” knowing that it forms an important and distinct marketing tool. Limitations and future research are discussed.