42 resultados para discriminant analysis and cluster analysis
em Consorci de Serveis Universitaris de Catalunya (CSUC), Spain
Resumo:
The main aim of this study was to replicate and extend previous results on subtypes of adolescents with substance use disorders (SUD), according to their Minnesota Multiphasic Personality Inventory for adolescents (MMPI-A) profiles. Sixty patients with SUD and psychiatric comorbidity (41.7% male, mean age = 15.9 years old) completed the MMPI-A, the Teen Addiction Severity Index (T-ASI), the Child Behaviour Checklist (CBCL), and were interviewed in order to determine DSMIV diagnoses and level of substance use. Mean MMPI-A personality profile showed moderate peaks in Psychopathic Deviate, Depression and Hysteria scales. Hierarchical cluster analysis revealed four profiles (acting-out, 35% of the sample; disorganized-conflictive, 15%; normative-impulsive, 15%; and deceptive-concealed, 35%). External correlates were found between cluster 1, CBCL externalizing symptoms at a clinical level and conduct disorders, and between cluster 2 and mixed CBCL internalized/externalized symptoms at a clinical level. Discriminant analysis showed that Depression, Psychopathic Deviate and Psychasthenia MMPI-A scales correctly classified 90% of the patients into the clusters obtained.
Resumo:
The main aim of this study was to replicate and extend previous results on subtypes of adolescents with substance use disorders (SUD), according to their Minnesota Multiphasic Personality Inventory for adolescents (MMPI-A) profiles. Sixty patients with SUD and psychiatric comorbidity (41.7% male, mean age = 15.9 years old) completed the MMPI-A, the Teen Addiction Severity Index (T-ASI), the Child Behaviour Checklist (CBCL), and were interviewed in order to determine DSMIV diagnoses and level of substance use. Mean MMPI-A personality profile showed moderate peaks in Psychopathic Deviate, Depression and Hysteria scales. Hierarchical cluster analysis revealed four profiles (acting-out, 35% of the sample; disorganized-conflictive, 15%; normative-impulsive, 15%; and deceptive-concealed, 35%). External correlates were found between cluster 1, CBCL externalizing symptoms at a clinical level and conduct disorders, and between cluster 2 and mixed CBCL internalized/externalized symptoms at a clinical level. Discriminant analysis showed that Depression, Psychopathic Deviate and Psychasthenia MMPI-A scales correctly classified 90% of the patients into the clusters obtained.
Resumo:
A general scheme for devising efficient cluster dynamics proposed in a previous paper [Phys. Rev. Lett. 72, 1541 (1994)] is extensively discussed. In particular, the strong connection among equilibrium properties of clusters and dynamic properties as the correlation time for magnetization is emphasized. The general scheme is applied to a number of frustrated spin models and the results discussed.
Resumo:
Peer-reviewed
Resumo:
The Spanish savings banks attracted quite a considerable amount of interest within the scientific arena, especially subsequent to the disappearance of the regulatory constraints during the second decade of the 1980s. Nonetheless, a lack of research identified with respect to mainstream paths given by strategic groups, and the analysis of the total factor productivity. Therefore, on the basis of the resource-based view of the firm and cluster analysis, we make use of changes in structure and performance ratios in order to identify the strategic groups extant in the sector. We attain a threeways division, which we link with different input-output specifications defining strategic paths. Consequently, on the basis of these three dissimilar approaches we compute and decompose a Hicks-Moorsteen total factor productivity index. Obtained results put forward an interesting interpretation under a multi-strategic approach, together with the setbacks of employing cluster analysis within a complex strategic environment. Moreover, we also propose an ex-post method of analysing the outcomes of the decomposed total factor productivity index that could be merged with non-traditional techniques of forming strategic groups, such as cognitive approaches.
Resumo:
Compositional data naturally arises from the scientific analysis of the chemicalcomposition of archaeological material such as ceramic and glass artefacts. Data of thistype can be explored using a variety of techniques, from standard multivariate methodssuch as principal components analysis and cluster analysis, to methods based upon theuse of log-ratios. The general aim is to identify groups of chemically similar artefactsthat could potentially be used to answer questions of provenance.This paper will demonstrate work in progress on the development of a documentedlibrary of methods, implemented using the statistical package R, for the analysis ofcompositional data. R is an open source package that makes available very powerfulstatistical facilities at no cost. We aim to show how, with the aid of statistical softwaresuch as R, traditional exploratory multivariate analysis can easily be used alongside, orin combination with, specialist techniques of compositional data analysis.The library has been developed from a core of basic R functionality, together withpurpose-written routines arising from our own research (for example that reported atCoDaWork'03). In addition, we have included other appropriate publicly availabletechniques and libraries that have been implemented in R by other authors. Availablefunctions range from standard multivariate techniques through to various approaches tolog-ratio analysis and zero replacement. We also discuss and demonstrate a smallselection of relatively new techniques that have hitherto been little-used inarchaeometric applications involving compositional data. The application of the libraryto the analysis of data arising in archaeometry will be demonstrated; results fromdifferent analyses will be compared; and the utility of the various methods discussed
Resumo:
Artifacts are present in most of the electroencephalography (EEG) recordings, making it difficult to interpret or analyze the data. In this paper a cleaning procedure based on a multivariate extension of empirical mode decomposition is used to improve the quality of the data. This is achieved by applying the cleaning method to raw EEG data. Then, a synchrony measure is applied on the raw and the clean data in order to compare the improvement of the classification rate. Two classifiers are used, linear discriminant analysis and neural networks. For both cases, the classification rate is improved about 20%.
Resumo:
A new analytical method was developed to non-destructively determine pH and degree of polymerisation (DP) of cellulose in fibres in 19th 20th century painting canvases, and to identify the fibre type: cotton, linen, hemp, ramie or jute. The method is based on NIR spectroscopy and multivariate data analysis, while for calibration and validation a reference collection of 199 historical canvas samples was used. The reference collection was analysed destructively using microscopy and chemical analytical methods. Partial least squares regression was used to build quantitative methods to determine pH and DP, and linear discriminant analysis was used to determine the fibre type. To interpret the obtained chemical information, an expert assessment panel developed a categorisation system to discriminate between canvases that may not be fit to withstand excessive mechanical stress, e.g. transportation. The limiting DP for this category was found to be 600. With the new method and categorisation system, canvases of 12 Dalí paintings from the Fundació Gala-Salvador Dalí (Figueres, Spain) were non-destructively analysed for pH, DP and fibre type, and their fitness determined, which informs conservation recommendations. The study demonstrates that collection-wide canvas condition surveys can be performed efficiently and non-destructively, which could significantly improve collection management.
Resumo:
Biplots are graphical displays of data matrices based on the decomposition of a matrix as the product of two matrices. Elements of these two matrices are used as coordinates for the rows and columns of the data matrix, with an interpretation of the joint presentation that relies on the properties of the scalar product. Because the decomposition is not unique, there are several alternative ways to scale the row and column points of the biplot, which can cause confusion amongst users, especially when software packages are not united in their approach to this issue. We propose a new scaling of the solution, called the standard biplot, which applies equally well to a wide variety of analyses such as correspondence analysis, principal component analysis, log-ratio analysis and the graphical results of a discriminant analysis/MANOVA, in fact to any method based on the singular-value decomposition. The standard biplot also handles data matrices with widely different levels of inherent variance. Two concepts taken from correspondence analysis are important to this idea: the weighting of row and column points, and the contributions made by the points to the solution. In the standard biplot one set of points, usually the rows of the data matrix, optimally represent the positions of the cases or sample units, which are weighted and usually standardized in some way unless the matrix contains values that are comparable in their raw form. The other set of points, usually the columns, is represented in accordance with their contributions to the low-dimensional solution. As for any biplot, the projections of the row points onto vectors defined by the column points approximate the centred and (optionally) standardized data. The method is illustrated with several examples to demonstrate how the standard biplot copes in different situations to give a joint map which needs only one common scale on the principal axes, thus avoiding the problem of enlarging or contracting the scale of one set of points to make the biplot readable. The proposal also solves the problem in correspondence analysis of low-frequency categories that are located on the periphery of the map, giving the false impression that they are important.
Resumo:
In order to interpret the biplot it is necessary to know which points usually variables are the ones that are important contributors to the solution, and this information is available separately as part of the biplot s numerical results. We propose a new scaling of the display, called the contribution biplot, which incorporates this diagnostic directly into the graphical display, showing visually the important contributors and thus facilitating the biplot interpretation and often simplifying the graphical representation considerably. The contribution biplot can be applied to a wide variety of analyses such as correspondence analysis, principal component analysis, log-ratio analysis and the graphical results of a discriminant analysis/MANOVA, in fact to any method based on the singular-value decomposition. In the contribution biplot one set of points, usually the rows of the data matrix, optimally represent the spatial positions of the cases or sample units, according to some distance measure that usually incorporates some form of standardization unless all data are comparable in scale. The other set of points, usually the columns, is represented by vectors that are related to their contributions to the low-dimensional solution. A fringe benefit is that usually only one common scale for row and column points is needed on the principal axes, thus avoiding the problem of enlarging or contracting the scale of one set of points to make the biplot legible. Furthermore, this version of the biplot also solves the problem in correspondence analysis of low-frequency categories that are located on the periphery of the map, giving the false impression that they are important, when they are in fact contributing minimally to the solution.
Resumo:
We study particle dispersion advected by a synthetic turbulent flow from a Lagrangian perspective and focus on the two-particle and cluster dispersion by the flow. It has been recently reported that Richardson¿s law for the two-particle dispersion can stem from different dispersion mechanisms, and can be dominated by either diffusive or ballistic events. The nature of the Richardson dispersion depends on the parameters of our flow and is discussed in terms of the values of a persistence parameter expressing the relative importance of the two above-mentioned mechanisms. We support this analysis by studying the distribution of interparticle distances, the relative velocity correlation functions, as well as the relative trajectories.
Resumo:
The concepts of void and cluster for an arbitrary point distribution in a domain D are defined and characterized by some parameters such as volume, density, number of points belonging to them, shape, etc. After assigning a weight to each void and clusterwhich is a function of its characteristicsthe concept of distance between two point configurations S1 and S2 in D is introduced, both with and without the help of a lattice in the domain D. This defines a topology for the point distributions in D, which is different for the different characterizations of the voids and clusters.
Resumo:
This paper aims at reconsidering some analytical measures to best encapsulate the interlanguage, in writing, of young beginner learners of English as a foreign language in the light of previous and work-in-progress research conducted within the BAF project, and in particular, whether clause and sentence length should be best viewed as a fluency or syntactic complexity measusre or as part of a different construct. In the light of a factor analysis (Navés, forthcoming) and multivariate and correlation studies (Navés et al. 2003, Navés, 2006, Torres et al. 2006) it becomes clear that the relationship between different analytical measures is also dependent on learner¿s cognitive maturity (age) and proficiency (amount of instruction). Finally, clause and sentence length should not be viewed as either a fluency or sytactic complexity measure but as part of a different construct. It is concluded that further research using regression analysis and cluster analysis is neeed in order to identify and validate the constructs of the writing components and their measurements.
Resumo:
We study particle dispersion advected by a synthetic turbulent flow from a Lagrangian perspective and focus on the two-particle and cluster dispersion by the flow. It has been recently reported that Richardson¿s law for the two-particle dispersion can stem from different dispersion mechanisms, and can be dominated by either diffusive or ballistic events. The nature of the Richardson dispersion depends on the parameters of our flow and is discussed in terms of the values of a persistence parameter expressing the relative importance of the two above-mentioned mechanisms. We support this analysis by studying the distribution of interparticle distances, the relative velocity correlation functions, as well as the relative trajectories.
Resumo:
Cova del Gegant is located near the city of Sitges (Barcelona, Spain). The cave is a small karst system which contains Upper Pleistocene archaeological and paleontological material (DauRa et al., 2005). The site was first excavated in 1954 and then in 1972 and 1974- (Viñas, 1972; Viñas & Villalta, 1975) and in 1985 and 1989 (maRtínez et al., 1985; moRa, 1988; maRtínez et al., 1990). Finally, in 2007, Grup de Recerca del Quaternari has restarted the archaeological research at Cova del Gegant (DauRa, 2008; DauRa et al., 2010). A human mandible was recovered during the first field season in 1954 and was recently published by DauRa et al. (2005). In the present study, we describe a new human tooth (left I2) that appeared, like the mandible, in a revision of the faunal material recovered from the site in 1974-1975. The specimen preserves the entire crown and the cervical two thirds of the root (Figure 1). The lack of the root apex makes it difficult to determine if the tooth was fully developed at the time of death. However, CT analysis reveals a pulp cavity that could be still open, suggesting root formation was incomplete. The specimen shows only slight dental wear corresponding to stage 2 of Molnar (1971 en Hillson, 1996). Morphologically, the crown shows slight shovelling and a lingual tubercle and appears similar to Neandertal incisors. Standard crown measurements (buccolingual diameter=7.7 mm; mesiodistal diameter= 7.3 mm) (Figure 2) suggest a fairly large tooth, particularly in the BL dimension, again resembling Neandertals in this regard. Discriminant analysis classified the Gegant incisor as Neandertal with a 99.8% posterior probability (Table 2). Association of this tooth with the previously described mandible is considered unlikely given the different ages at death estimated for each. Thus, there appear to be two individuals preserved in the sediments of the Gegant cave, one adult and one subadult (around 8-10 years old).