930 results for HIERARCHICAL CLUSTER ANALYSIS
Abstract:
This study tested the hypothesis that social engagement (SE) with peers is a fundamental aspect of social competence during early childhood. Relations between SE and a set of previously validated social competence indicators, as well as additional variables derived from observation and sociometric interviews, were assessed using both variable-centered and person-centered approaches (N = 1453, 696 girls) in 4 samples (3 U.S.A., 1 Portuguese). Directly observed SE was positively associated with broad-band measures of socially competent behavior, peer acceptance, being a target of peers' attention, and also with broad-band personality dimensions. Using individual Q-items significantly associated with SE in 3 of our 4 samples, a hierarchical cluster analysis yielded a 5-cluster solution that grouped cases efficiently. Tests on relations between cluster membership and the set of social competence and other variables revealed significant main effects of cluster membership in the full sample and within each individual sample separately. With the exception of tests for peer negative preference, children in the lowest SE cluster also had significantly lower overall social competence and personality functioning scores than did children in higher SE clusters.
Abstract:
Supplementary material available at: http://dx.doi.org/10.1037/dev0000142.supp
Abstract:
Purpose: To develop a high-performance liquid chromatography (HPLC) fingerprint method for the quality control and origin discrimination of Gastrodiae rhizoma. Methods: Twelve batches of G. rhizoma collected from Sichuan, Guizhou and Shanxi provinces in China were used to establish the fingerprint. The gastrodin peak was taken as the reference peak, and all samples were separated on an Agilent C18 column (250 mm × 4.6 mm, 5 μm) at a column temperature of 25 °C. The mobile phase was acetonitrile/0.8 % aqueous phosphate solution (gradient elution) at a flow rate of 1 mL/min. The detection wavelength was 270 nm. The method was validated as per the guidelines of the Chinese Pharmacopoeia. Results: The chromatograms of the samples showed 11 common peaks, of which no. 4 was identified as gastrodin. Data for the samples were analyzed statistically using similarity analysis and hierarchical cluster analysis (HCA). The similarity indices between the reference chromatogram and the sample chromatograms were all > 0.80. The similarity indices of G. rhizoma from Guizhou, Shanxi and Sichuan were 0.854 - 0.885, 0.915 - 0.930 and 0.820 - 0.848, respectively. The samples could be divided into three clusters at a rescaled distance of 7.5: S1 - S4 as cluster 1, S5 - S8 as cluster 2, and the others as cluster 3. Conclusion: The findings indicate that HPLC fingerprinting technology is appropriate for quality control and origin discrimination of G. rhizoma.
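The abstract does not say how its similarity index is computed. As a minimal sketch, assuming the index is the cosine of the angle between peak-area vectors and that the reference chromatogram is the mean profile of all batches (both are assumptions, and the function names and peak areas below are hypothetical), the calculation could look like this:

```python
import math

def mean_profile(profiles):
    """Average the peak areas position by position to build a reference fingerprint."""
    n = len(profiles)
    return [sum(column) / n for column in zip(*profiles)]

def cosine_similarity(a, b):
    """Cosine of the angle between two peak-area vectors (1.0 = same shape)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hypothetical peak areas for three batches over four common peaks.
batches = [
    [10.0, 40.0, 5.0, 20.0],
    [12.0, 38.0, 6.0, 22.0],
    [30.0, 10.0, 25.0, 5.0],
]
reference = mean_profile(batches)
similarities = [cosine_similarity(batch, reference) for batch in batches]
```

A matrix of `1 - similarity` values between batches could then serve as the distance input to a hierarchical clustering step.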
Abstract:
Invasive insects that successfully establish in introduced areas can significantly alter natural communities. These pests require specific establishment criteria (e.g. host suitability) that, when known, can help quantify potential damage to infested areas. Emerald ash borer (Agrilus planipennis [Coleoptera: Buprestidae]) is an invasive phloem-feeding pest that is responsible for the death of millions of ash trees (Fraxinus spp. L.). Over 200 surviving ash trees were previously identified in the Huron-Clinton Metroparks in southeast Michigan. Trees were assessed over a four-year period, and a hierarchical cluster analysis was performed on dieback, vigor, and presence of signs and symptoms in order to place trees into one of three tolerance groups. The clustering of trees with different responses to emerald ash borer attack suggests that there are different tolerance levels in North American ash trees in southeastern Michigan; these groups were designated as apparently tolerant, not tolerant and intermediate tolerance. Adult landing rates and evidence of adult emergence were significantly lower in the apparently tolerant group than in the not tolerant group, but larval survival from eggs placed on trees did not differ between tolerance groups. It therefore appears that apparently tolerant trees survive because they are less attractive to adult beetles, which results in fewer eggs being laid on them. Trees in the apparently tolerant group retained higher vigor over the four years of the study. North American ash may survive the emerald ash borer epidemic due to natural variation and inherent resistance, regardless of the lack of co-evolutionary history with emerald ash borer.
Abstract:
The quality of virgin olive oil, and the assignment of a product to a specific commercial category (extra virgin, virgin, lampante), can be determined by organoleptic evaluation using the Panel test. The test is carried out by a group of expert tasters (the panel), led by a panel leader, and aims to identify and quantify the main sensory attributes (positive and negative) and, on the basis of the results, to establish the commercial category of the product. The aim of this thesis was to evaluate the applicability of an instrumental method that, through a rapid screening approach, could support sensory analysis by discriminating the analysed oils according to their quality (commercial category). To this end, a set of 42 virgin olive oils from Spain and Croatia, classified into the three commercial categories on the basis of the Panel test, was analysed using a waveguide, which made it possible to examine the waveforms of both gain and phase in the frequency range 1.6-2.7 GHz. The results show that, for several frequency intervals, the gain spectra appear to be influenced by the commercial category. In addition, principal component analysis (PCA) carried out on this spectral information allowed, in general terms, discrimination between extra virgin, virgin and lampante oils. Finally, the subsequent hierarchical cluster analysis identified distinct clusters for the lampante samples and for the extra virgin and virgin samples.
Abstract:
The taxonomy of the N(2)-fixing bacteria belonging to the genus Bradyrhizobium is still poorly refined, mainly due to conflicting results obtained from the analysis of phenotypic and genotypic properties. This paper presents an application of a method aimed at identifying possible new clusters within a Brazilian collection of 119 Bradyrhizobium strains showing phenotypic characteristics of B. japonicum and B. elkanii. Cluster stability was studied as a function of the number of restriction enzymes used in the RFLP-PCR analysis of three ribosomal regions, with three restriction enzymes per region. The proposed method uses clustering algorithms with distances calculated by average-linkage clustering. The stability analysis is carried out by introducing perturbations through sub-sampling techniques. The method was effective in grouping the species B. japonicum and B. elkanii. Furthermore, two new clusters were clearly defined, indicating possible new species, along with sub-clusters within each detected cluster. (C) 2008 Elsevier B.V. All rights reserved.
Abstract:
The aim of this work is to evaluate the capabilities and limitations of chemometric methods and other mathematical treatments applied to spectroscopic data, and more specifically to paint samples. Spectroscopic data are distinctive in that they are multivariate (a few thousand variables) and highly correlated. Statistical methods are used to study and discriminate samples. A collection of 34 red paint samples was measured by infrared and Raman spectroscopy. Data pretreatment and variable selection demonstrated that the use of Standard Normal Variate (SNV), together with removal of the noisy variables by selecting the wavenumber ranges 650-1830 cm−1 and 2730-3600 cm−1, provided the optimal results for infrared analysis. Principal component analysis (PCA) and hierarchical cluster analysis (HCA) were then used as exploratory techniques to provide evidence of structure in the data, to find clusters, or to detect outliers. With the FTIR spectra, the principal components (PCs) correspond to binder types and the presence/absence of calcium carbonate; 83% of the total variance is explained by the first four PCs. As for the Raman spectra, we observe six different clusters corresponding to the different pigment compositions when plotting the first two PCs, which account for 37% and 20% of the total variance, respectively. In conclusion, chemometrics provides a valuable tool for the forensic analysis of paints: it supports objective decision-making, reduces possible classification errors, and improves efficiency by giving robust results with time-saving data treatments.
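The pretreatment described above (wavenumber selection plus SNV) can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function names and the toy spectrum are hypothetical, the order "select, then scale" is an assumption, and SNV is taken here in its usual form, where each spectrum is centred on its own mean and divided by its own sample standard deviation.

```python
from statistics import mean, stdev

# Wavenumber windows (cm^-1) quoted in the abstract for the infrared data.
WINDOWS = [(650, 1830), (2730, 3600)]

def select_windows(wavenumbers, intensities, windows=WINDOWS):
    """Keep only the intensities whose wavenumber falls inside one of the windows."""
    return [
        y
        for wn, y in zip(wavenumbers, intensities)
        if any(lo <= wn <= hi for lo, hi in windows)
    ]

def snv(spectrum):
    """Standard Normal Variate: centre and scale one spectrum by its own statistics."""
    m = mean(spectrum)
    s = stdev(spectrum)  # sample standard deviation (n - 1), an assumed convention
    if s == 0:
        raise ValueError("SNV is undefined for a flat spectrum")
    return [(y - m) / s for y in spectrum]

# Hypothetical toy spectrum: six wavenumbers, three of them outside the windows.
wavenumbers = [600, 700, 1800, 2000, 2800, 3700]
intensities = [9.0, 1.0, 2.0, 9.0, 4.0, 9.0]
kept = select_windows(wavenumbers, intensities)
scaled = snv(kept)
```

Because SNV works spectrum by spectrum, it removes additive offsets and multiplicative scaling differences between samples before PCA or HCA is applied.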
Abstract:
MOTIVATION: Analysis of millions of pyro-sequences is currently playing a crucial role in the advance of environmental microbiology. Taxonomy-independent, i.e. unsupervised, clustering of these sequences is essential for the definition of Operational Taxonomic Units. For this application, reproducibility and robustness should be the most sought after qualities, but have thus far largely been overlooked. RESULTS: More than 1 million hyper-variable internal transcribed spacer 1 (ITS1) sequences of fungal origin have been analyzed. The ITS1 sequences were first properly extracted from 454 reads using generalized profiles. Then, otupipe, cd-hit-454, ESPRIT-Tree and DBC454, a new algorithm presented here, were used to analyze the sequences. A numerical assay was developed to measure the reproducibility and robustness of these algorithms. DBC454 was the most robust, closely followed by ESPRIT-Tree. DBC454 features density-based hierarchical clustering, which complements the other methods by providing insights into the structure of the data. AVAILABILITY: An executable is freely available for non-commercial users at ftp://ftp.vital-it.ch/tools/dbc454. It is designed to run under MPI on a cluster of 64-bit Linux machines running Red Hat 4.x, or on a multi-core OSX system. CONTACT: dbc454@vital-it.ch or nicolas.guex@isb-sib.ch.
Abstract:
This paper examines a dataset derived from observational tracking in order to analyze where and how middle-class working families spend time at home. We use an ethnographic approach to study the everyday lives of Italian dual-income middle-class families, with the aim of quantitatively analyzing the use of home spaces and the types of activities of family members on weekday afternoons and evenings. The different analyses (multiple correspondence analysis, agglomerative hierarchical clustering, discriminant analysis) show how particular spaces, and the activities in those spaces, are dominated by certain family members. We suggest that a combination of qualitative and quantitative methodologies is a useful tool for exploring the everyday lives of families in detail and for understanding how family members use domestic spaces. In particular, we consider the use of quantitative analyses to examine ethnographic data to be relevant, especially in connection with methodological reflexivity among researchers.
Abstract:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Abstract:
Improvements in the analysis of microarray images are critical for accurately quantifying gene expression levels. The acquisition of accurate spot intensities directly influences the results and interpretation of statistical analyses. This dissertation discusses the implementation of a novel approach to the analysis of cDNA microarray images. We use a stellar photometric model, the Moffat function, to quantify microarray spots from nylon microarray images. The inherent flexibility of the Moffat shape model makes it ideal for quantifying microarray spots. We apply our novel approach to a Wilms' tumor microarray study and compare our results with a fixed-circle segmentation approach for spot quantification. Our results suggest that different spot feature extraction methods can have an impact on the ability of statistical methods to identify differentially expressed genes. We also used the Moffat function to simulate a series of microarray images under various experimental conditions. These simulations were used to validate the performance of various statistical methods for identifying differentially expressed genes. Our simulation results indicate that tests taking into account the dependency between mean spot intensity and variance estimation, such as the smoothened t-test, can better identify differentially expressed genes, especially when the number of replicates and mean fold change are low. The analysis of the simulations also showed that overall, a rank sum test (Mann-Whitney) performed well at identifying differentially expressed genes. Previous work has suggested the strengths of nonparametric approaches for identifying differentially expressed genes. We also show that multivariate approaches, such as hierarchical and k-means cluster analysis along with principal components analysis, are only effective at classifying samples when replicate numbers and mean fold change are high. 
Finally, we show how our stellar shape model approach can be extended to the analysis of 2D-gel images by adapting the Moffat function to take into account the elliptical nature of spots in such images. Our results indicate that stellar shape models offer a previously unexplored approach for the quantification of 2D-gel spots.
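For readers unfamiliar with the Moffat function, the sketch below evaluates its standard circular form, I(r) = A [1 + (r/alpha)^2]^(-beta), and one possible elliptical generalisation with separate widths along two rotated axes. The dissertation's own parameterisation is not given in the abstract, so the elliptical form, the parameter names and the default values here are assumptions made for illustration only.

```python
import math

def moffat(x, y, x0=0.0, y0=0.0, amplitude=1.0, alpha=2.0, beta=2.5):
    """Circular Moffat profile: amplitude * (1 + r^2 / alpha^2) ** (-beta)."""
    r2 = (x - x0) ** 2 + (y - y0) ** 2
    return amplitude * (1.0 + r2 / alpha ** 2) ** (-beta)

def moffat_fwhm(alpha, beta):
    """Full width at half maximum of the circular profile."""
    return 2.0 * alpha * math.sqrt(2.0 ** (1.0 / beta) - 1.0)

def elliptical_moffat(x, y, x0=0.0, y0=0.0, amplitude=1.0,
                      alpha_x=3.0, alpha_y=1.5, theta=0.0, beta=2.5):
    """Assumed elliptical variant: separate widths along axes rotated by theta (radians)."""
    dx, dy = x - x0, y - y0
    u = dx * math.cos(theta) + dy * math.sin(theta)   # coordinate along the major axis
    v = -dx * math.sin(theta) + dy * math.cos(theta)  # coordinate along the minor axis
    q = (u / alpha_x) ** 2 + (v / alpha_y) ** 2
    return amplitude * (1.0 + q) ** (-beta)

# Intensity at the spot centre and at half the FWHM from the centre.
peak = moffat(0.0, 0.0)
half = moffat(moffat_fwhm(2.0, 2.5) / 2.0, 0.0)
```

The exponent `beta` controls how heavy the wings of the profile are, which is the flexibility the abstract refers to; fitting these parameters to each spot is outside the scope of this sketch.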
Abstract:
Web document cluster analysis plays an important role in information retrieval by organizing large amounts of documents into a small number of meaningful clusters. Traditional web document clustering is based on the Vector Space Model (VSM), which takes into account only two-level (document and term) knowledge granularity and ignores the bridging paragraph granularity. This two-level granularity may lead to unsatisfactory clustering results with "false correlation". To deal with this problem, a Hierarchical Representation Model with Multi-granularity (HRMM), which consists of a five-layer representation of data and a two-phase clustering process, is proposed based on granular computing and article structure theory. To deal with the zero-valued similarity problem resulting from the sparse term-paragraph matrix, an ontology-based strategy and a tolerance-rough-set-based strategy are introduced into HRMM. By using granular computing, structural knowledge hidden in documents can be captured more efficiently and effectively in HRMM, and thus web document clusters of higher quality can be generated. Extensive experiments show that HRMM, HRMM with the tolerance-rough-set strategy, and HRMM with ontology all significantly outperform VSM and a representative non-VSM-based algorithm, WFP, in terms of the F-Score.
Abstract:
Biological experiments often produce enormous amounts of data, which are usually analyzed by data clustering. Cluster analysis refers to statistical methods that are used to assign data with similar properties into several smaller, more meaningful groups. Two commonly used techniques are introduced in the following section: principal component analysis (PCA) and hierarchical clustering. PCA calculates the variance between variables and groups them into a few uncorrelated groups, or principal components (PCs), that are orthogonal to each other. Hierarchical clustering is carried out by separating the data into many clusters and then merging similar clusters together. Here, we use an example of human leukocyte antigen (HLA) supertype classification to demonstrate the usage of the two methods. Two programs, Generating Optimal Linear Partial Least Square Estimations (GOLPE) and Sybyl, are used for PCA and hierarchical clustering, respectively. However, the reader should bear in mind that the methods have been incorporated into other software as well, such as SIMCA, statistiXL, and R.
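The merge-based procedure described above (start with many clusters, repeatedly merge the most similar pair) can be illustrated with a small standard-library sketch. It is not the Sybyl implementation mentioned in the abstract: the average-linkage criterion, the Euclidean distance, the function name and the toy points are all assumptions chosen for illustration.

```python
import math
from itertools import combinations

def average_linkage(points, n_clusters):
    """Agglomerative clustering: start with one cluster per point and merge
    the closest pair (by mean pairwise distance) until n_clusters remain."""
    clusters = [[i] for i in range(len(points))]

    def linkage(c1, c2):
        # Average Euclidean distance over all cross-cluster pairs of points.
        total = sum(math.dist(points[a], points[b]) for a in c1 for b in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > n_clusters:
        # combinations() yields i < j, so deleting index j keeps index i valid.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda pair: linkage(clusters[pair[0]], clusters[pair[1]]),
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters

# Hypothetical 2-D data: two tight pairs and one isolated point.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.0), (10.0, 0.0)]
groups = average_linkage(points, n_clusters=3)
```

This naive version recomputes every linkage at each step and is only suitable for small inputs; recording the order of merges and their distances would give the information needed to draw a dendrogram.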
Abstract:
In data mining, efforts have focused on finding methods for efficient and effective cluster analysis in large databases. Active research themes include the scalability of clustering methods, the effectiveness of methods for clustering complex shapes and types of data, high-dimensional clustering techniques, and methods for clustering mixed numerical and categorical data in large databases. One of the most accurate approaches, based on dynamic modeling of cluster similarity, is called Chameleon. In this paper we present a modified hierarchical clustering algorithm that uses the main idea of Chameleon; the effectiveness of the suggested approach is demonstrated by experimental results.