933 results for Stepwise Discriminant Analysis





Statistics are regularly used in forensic investigations to compare items of trace evidence or to apply the exclusionary principle (Morgan and Bull, 2007). Trace evidence is routinely the result of particle size, chemical or modal analyses and as such constitutes compositional data. The issue is that compositional data, including percentages, parts per million etc., carry only relative information. This may be problematic where a comparison of percentages and other constrained (closed) data is deemed a statistically valid and appropriate way to present trace evidence in a court of law. The constant sum problem has been recognised since the seminal works of Pearson (1896) and Chayes (1960), and log-ratio techniques have been introduced to address it (Aitchison, 1986; Pawlowsky-Glahn and Egozcue, 2001; Pawlowsky-Glahn and Buccianti, 2011; Tolosana-Delgado and van den Boogaart, 2013). Nevertheless, the fact that a constant sum destroys the potential independence of variances and covariances required for correlation and regression analysis and for empirical multivariate methods (principal component analysis, cluster analysis, discriminant analysis, canonical correlation) is all too often not acknowledged in the statistical treatment of trace evidence. Yet the need for a robust treatment of forensic trace evidence analyses is obvious. This research examines the issues and potential pitfalls for forensic investigators if the constant sum constraint is ignored in the analysis and presentation of forensic trace evidence. Forensic case studies involving particle size and mineral analyses as trace evidence are used to demonstrate a compositional data approach based on a centred log-ratio (clr) transformation and multivariate statistical analyses.
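As an illustration of the centred log-ratio (clr) transformation named in the abstract above, the following is a minimal sketch and not the authors' code. The function name `clr` and the example percentages are assumptions made purely for illustration. It shows that the transform depends only on ratios between parts, so the same samples expressed as percentages or as fractions give identical clr coordinates, and that each transformed row sums to zero.

```python
import numpy as np

def clr(compositions):
    """Centred log-ratio transform of strictly positive compositions (one per row)."""
    x = np.asarray(compositions, dtype=float)
    if np.any(x <= 0):
        # Zeros need separate treatment (e.g. replacement) before a log-ratio transform.
        raise ValueError("clr requires strictly positive parts")
    log_x = np.log(x)
    # Subtracting the row mean of the logs equals dividing by the geometric mean.
    return log_x - log_x.mean(axis=1, keepdims=True)

# Hypothetical three-part compositions in percent (illustrative values only).
percent = np.array([[70.0, 20.0, 10.0],
                    [50.0, 30.0, 20.0]])

clr_percent = clr(percent)
clr_fraction = clr(percent / 100.0)  # same samples expressed as fractions
```

Multivariate methods such as principal component analysis or discriminant analysis would then be applied to the clr coordinates rather than to the raw percentages.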





Master's dissertation, Qualidade em Análises, Faculdade de Ciências e Tecnologia, Univ. do Algarve, 2013





Multiple regression analysis is a statistical technique that makes it possible to predict a dependent variable from more than one independent variable and to determine which independent variables are influential. In this study, multiple regression analysis is applied to experimental data to predict the room mean velocity and to determine the parameters that most influence the velocity. More than 120 experiments for four different heat source locations were carried out in a test chamber with a high-level wall-mounted air supply terminal at air change rates of 3-6 ach. The influence of environmental parameters such as supply air momentum, room heat load, Archimedes number and local temperature ratio was examined by two methods: a simple regression analysis incorporated into scatter matrix plots, and multiple stepwise regression analysis. It is concluded that, when a heat source is located along the jet centre line, the supply momentum mainly influences the room mean velocity regardless of the plume strength. However, when the heat source is located outside the jet region, the local temperature ratio (the inverse of the local heat removal effectiveness) is a major influencing parameter.
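The stepwise idea mentioned in the abstract above can be sketched as a forward selection loop. This is a simplified illustration under stated assumptions, not the procedure used in the study: predictors are entered by their gain in R² (statistical packages usually apply F-to-enter or p-value rules instead), the names `r_squared`, `forward_stepwise` and `min_gain` are hypothetical, and the data are synthetic.

```python
import numpy as np

def r_squared(X, y):
    """R^2 of an ordinary least-squares fit with an intercept."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    return 1.0 - (resid @ resid) / np.sum((y - y.mean()) ** 2)

def forward_stepwise(X, y, min_gain=0.01):
    """Add, one at a time, the predictor that raises R^2 most; stop when the gain is small."""
    selected, best_r2 = [], 0.0
    remaining = list(range(X.shape[1]))
    while remaining:
        scores = {j: r_squared(X[:, selected + [j]], y) for j in remaining}
        j_best = max(scores, key=scores.get)
        if scores[j_best] - best_r2 < min_gain:
            break  # no remaining predictor improves the fit enough
        selected.append(j_best)
        remaining.remove(j_best)
        best_r2 = scores[j_best]
    return selected, best_r2

# Synthetic data (assumption): only predictors 0 and 2 drive the response.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = 3.0 * X[:, 0] - 2.0 * X[:, 2] + rng.normal(scale=0.1, size=200)

selected, r2 = forward_stepwise(X, y)
```

The order in which predictors enter gives a rough ranking of their influence, which is how a stepwise fit can point to the most influential parameters.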





The species related to Vriesea paraibica (Bromeliaceae, Tillandsioideae) have controversial taxonomic limits. For several decades, this group has been identified in herbarium collections as V. × morreniana, an artificial hybrid that does not grow in natural habitats. The aim of this study was to assess the morphological variation in the V. paraibica complex through morphometric analyses of natural populations. Two sets of analyses were performed: the first involved six natural populations (G1) and the second was carried out on taxa that emerged from the first analysis, but using material from herbarium collections (G2). Univariate ANOVA was used, as well as discriminant analysis of 16 morphometric variables in G1 and 18 in G2. The results of the analyses of the two groups were similar and led to the selection of diagnostic traits of four species. Lengths of the lower and median floral bracts were significant for the separation of red and yellow floral bracts. Vriesea paraibica and V. interrogatoria have red bracts; these two species are differentiated by the widths of the lower and median portions of the inflorescence and by scape length. These structures are larger in the former and smaller in the latter. Of the species with yellow floral bracts, V. eltoniana is distinguished by longer leaf blades and scapes and V. flava is characterized by its shorter sepal lengths. © 2009 The Linnean Society of London, Botanical Journal of the Linnean Society, 2009, 159, 163-181.





This work presents a novel approach to increase the recognition power of Multiscale Fractal Dimension (MFD) techniques when applied to image classification. The proposal uses Functional Data Analysis (FDA) to improve the precision of the MFD technique, yielding a more representative descriptor vector that recognizes and characterizes objects in an image more precisely. FDA is applied to signatures extracted with the Bouligand-Minkowski MFD technique in order to generate a descriptor vector from them. To evaluate the improvement obtained, an experiment was carried out using two datasets of objects: a dataset of character shapes (26 characters of the Latin alphabet) carrying different levels of controlled noise, and a dataset of fish image contours. The FDA method was compared with the well-known Fourier and wavelet descriptors in order to verify its performance. The descriptor vectors were submitted to the Linear Discriminant Analysis (LDA) classification method, and the rates of correct classification were compared among the descriptor methods. The results demonstrate that FDA outperforms the methods from the literature (Fourier and wavelets) in processing the information extracted from the MFD signature. The proposed method can therefore be considered an interesting choice for pattern recognition and image classification using fractal analysis.





A new method for the characterization and analysis of aggregate particles in asphaltic mixtures is reported. By relying on multiscale representation of the particles, curvature estimation, and discriminant analysis for optimal separation of the categories of mixtures, a particularly effective and comprehensive methodology is obtained. The potential of the methodology is illustrated with respect to three important types of particles used in asphaltic mixtures, namely basalt, gabbro, and gravel. The results show that gravel particles are markedly distinct from the other two types of particles, with the gabbro category showing intermediate geometrical properties. The importance of each considered measurement in the discrimination between the three categories of particles was also quantified in terms of the adopted discriminant analysis.





A new objective fabric pilling grading method based on wavelet texture analysis was developed. The new method created a complex texture feature vector based on the wavelet detail coefficients from all decomposition levels and horizontal, vertical and diagonal orientations, permitting a much richer and more complete representation of pilling texture in the image to be used as a basis for classification. Standard multi-factor classification techniques of principal components analysis and discriminant analysis were then used to classify the pilling samples into five pilling degrees. The preliminary investigation of the method was performed using standard pilling image sets of knitted, woven and non-woven fabrics. The results showed that this method could successfully evaluate the pilling intensity of knitted, woven and non-woven fabrics by selecting the suitable wavelet and associated analysis scale.





The superior characteristics of high photon flux and diffraction-limited spatial resolution achieved by synchrotron-FTIR microspectroscopy allowed molecular characterization of individual live thraustochytrids. Principal component analysis revealed distinct separation of the single live cell spectra into their corresponding strains, comprised of new Australasian thraustochytrids (AMCQS5-5 and S7) and standard cultures (AH-2 and S31). Unsupervised hierarchical cluster analysis (UHCA) indicated close similarities between S7 and AH-7 strains, with AMCQS5-5 being distinctly different. UHCA correlation conformed well to the fatty acid profiles, indicating the type of fatty acids as a critical factor in chemotaxonomic discrimination of these thraustochytrids and also revealing the distinctively high polyunsaturated fatty acid content as key identity of AMCQS5-5. Partial least squares discriminant analysis using cross-validation approach between two replicate datasets was demonstrated to be a powerful classification method leading to models of high robustness and 100% predictive accuracy for strain identification. The results emphasized the exceptional S-FTIR capability to perform real-time in vivo measurement of single live cells directly within their original medium, providing unique information on cell variability among the population of each isolate and evidence of spontaneous lipid peroxidation that could lead to deeper understanding of lipid production and oxidation in thraustochytrids for single-cell oil development.





This study determines for the first time Na, K, Ca, Mg, Fe, Cu, Zn, Mn, Sr, Li and Rb contents in wines from the archipelagos of Madeira and Azores (Portugal). The greater part of the mean content for the different parameters fell within the ranges described in the literature, except for sodium whose higher content may be due to the effect of marine spray. ANOVA was used to establish the metals with significant differences in mean content between the wines from both archipelagos, between table and liquor wines of Madeira, and between wines of Pico and Terceira Islands from the Azores archipelago. Principal component analysis shows differences in the wines according to the wine-making process and/or the equipment employed. Stepwise linear discriminant analysis achieves a good classification and validation of wines according to the archipelago of origin, and the island in the case of Azores wines.
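A common formulation of stepwise discriminant analysis enters variables one at a time according to Wilks' lambda, the ratio of the within-group to the total scatter determinant. The sketch below illustrates that general idea only; it is not the procedure or software used in the wine study, the names `wilks_lambda`, `stepwise_select` and `max_partial` are hypothetical, the stopping rule uses a fixed partial-lambda threshold instead of the usual F-to-enter test, and the data are synthetic.

```python
import numpy as np

def wilks_lambda(X, labels):
    """Wilks' lambda = det(W) / det(T) for the given columns (smaller = better separation)."""
    centred = X - X.mean(axis=0)
    T = centred.T @ centred                      # total scatter matrix
    W = np.zeros_like(T)
    for g in np.unique(labels):
        d = X[labels == g] - X[labels == g].mean(axis=0)
        W += d.T @ d                             # pooled within-group scatter
    return np.linalg.det(W) / np.linalg.det(T)

def stepwise_select(X, labels, max_partial=0.8):
    """Forward selection: enter the variable giving the lowest lambda while it still helps."""
    selected, current = [], 1.0
    remaining = list(range(X.shape[1]))
    while remaining:
        scores = {j: wilks_lambda(X[:, selected + [j]], labels) for j in remaining}
        j_best = min(scores, key=scores.get)
        # Partial lambda: how much the candidate reduces lambda given the variables already in.
        if scores[j_best] / current > max_partial:
            break
        selected.append(j_best)
        remaining.remove(j_best)
        current = scores[j_best]
    return selected, current

# Synthetic data (assumption): three groups; variables 1 and 3 carry group information.
rng = np.random.default_rng(0)
labels = np.repeat([0, 1, 2], 50)
X = rng.normal(size=(150, 4))
X[:, 1] += 3.0 * labels
X[:, 3] += 3.0 * (labels == 1)

selected, final_lambda = stepwise_select(X, labels)
```

The selected variables would then be passed to a linear discriminant classifier, whose performance is typically checked by cross-validation.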





In this study the effect of the cultivar on the volatile profile of five different banana varieties was evaluated and determined by dynamic headspace solid-phase microextraction (dHS-SPME) combined with one-dimensional gas chromatography–mass spectrometry (1D-GC–qMS). This approach allowed a volatile metabolite profile to be defined for each banana variety, which can be used as a pertinent criterion of differentiation. The investigated banana varieties (Dwarf Cavendish, Prata, Maçã, Ouro and Platano) have certified botanical origin and belong to the Musaceae family, the most common genomic group cultivated in Madeira Island (Portugal). The influence of dHS-SPME experimental factors, namely fibre coating, extraction time and extraction temperature, on the equilibrium headspace analysis was investigated and optimised using a univariate optimisation design. A total of 68 volatile organic metabolites (VOMs) were tentatively identified and used to profile the volatile composition of the different banana cultivars, thus emphasising the sensitivity and applicability of SPME for establishing the volatile metabolomic pattern of plant secondary metabolites. Ethyl esters were found to comprise the largest chemical class, accounting for 80.9%, 86.5%, 51.2%, 90.1% and 6.1% of the total peak area of the Dwarf Cavendish, Prata, Ouro, Maçã and Platano volatile fractions, respectively. Gas chromatographic peak areas were submitted to multivariate statistical analysis (principal component and stepwise linear discriminant analysis) in order to visualise clusters within samples and to detect the volatile metabolites able to differentiate banana cultivars. The application of the multivariate analysis to the VOMs data set resulted in predictive abilities of 90% as evaluated by the cross-validation procedure.





Isotopic analysis has proven to be an extremely important tool in the traceability process. However, there are disagreements over the statistical analysis of the results, since the data are dependent and derive from several chemical elements, such as carbon, hydrogen, oxygen, nitrogen and sulphur (CHON'S). In order to establish the appropriate analysis for traceability data in poultry obtained by the stable isotope technique, and to assess the need for joint analysis of the variables, carbon-13 and nitrogen-15 data from eggs (albumen + yolk) of laying hens and from breast muscle of broiler chickens were used. These data were submitted to univariate statistical analysis (ANOVA, complemented by Tukey's test) and multivariate statistical analysis (MANOVA and discriminant analysis). The data were analysed in Minitab 16 software, and the results, which are consistent with theory, confirm the need for multivariate analysis. They also show that discriminant analysis resolves the doubts raised by the results of the other analysis methods compared in this study.





This paper reports the use of the chromatographic profiles of volatiles to determine disease markers in plants, in this case leaves of Eucalyptus globulus contaminated by the necrotroph fungus Teratosphaeria nubilosa. The volatile fraction was isolated by headspace solid phase microextraction (HS-SPME) and analyzed by comprehensive two-dimensional gas chromatography-fast quadrupole mass spectrometry (GC×GC-qMS). To correlate the metabolic profile described by the chromatograms with the presence of the infection, unfolded-partial least squares discriminant analysis (U-PLS-DA) with orthogonal signal correction (OSC) was employed. The proposed method was verified to be independent of factors such as the age of the harvested plants. Manipulation of the resulting mathematical model also produced graphic representations similar to real chromatograms, which allowed the tentative identification of more than 40 compounds potentially useful as disease biomarkers for this plant/pathogen pair. The proposed methodology can be considered highly reliable, since the diagnosis is based on the whole chromatographic profile rather than on the detection of a single analyte. © 2013 Elsevier B.V.





Concentrations of 39 organic compounds were determined in three fractions (head, heart and tail) obtained from the pot still distillation of fermented sugarcane juice. The results were evaluated using analysis of variance (ANOVA), Tukey's test, principal component analysis (PCA), hierarchical cluster analysis (HCA) and linear discriminant analysis (LDA). According to PCA and HCA, the experimental data lead to the formation of three clusters. The head fractions give rise to a more defined group. The heart and tail fractions showed some overlap, consistent with their acid composition. The predictive abilities of the LDA model for classifying the three fractions were 90.5% in calibration and 100% in validation. This model recognized as heart fractions twelve of the thirteen commercial cachacas (92.3%) with good sensory characteristics, thus showing potential for guiding the process of cuts.





Abstract. Background: Prostate cancer is a leading cause of death in the male population; therefore, a comprehensive study of the genes and the molecular networks involved in the tumoral prostate process is necessary. In order to understand the biological process behind potential biomarkers, we have analyzed a set of 57 cDNA microarrays containing ~25,000 genes. Results: Principal Component Analysis (PCA) combined with Maximum-entropy Linear Discriminant Analysis (MLDA) was applied in order to identify the genes with the most discriminative information between normal and tumoral prostatic tissues. Data analysis was carried out using three different approaches, namely: (i) differences in gene expression levels between normal and tumoral conditions from a univariate point of view; (ii) in a multivariate fashion using MLDA; and (iii) with a dependence network approach. Our results show that malignant transformation in the prostatic tissue is more related to functional connectivity changes in the dependence networks than to differential gene expression. The MYLK, KLK2, KLK3, HAN11, LTF, CSRP1 and TGM4 genes presented significant changes in their functional connectivity between normal and tumoral conditions and were also classified as the top seven most informative genes for the prostate cancer genesis process by our discriminant analysis. Moreover, among the identified genes we found classically known biomarkers and genes closely related to tumoral prostate, such as KLK3 and KLK2, as well as several other potential ones. Conclusion: We have demonstrated that changes in functional connectivity may be implicit in the biological process which renders some genes more informative for discriminating between normal and tumoral conditions. Using the proposed method, namely MLDA, to analyze the multivariate characteristics of genes, it was possible to capture the changes in dependence networks that are related to cell transformation.





Dahl salt-sensitive (DS) and salt-resistant (DR) inbred rat strains represent a well-established animal model for cardiovascular research. Upon prolonged administration of a high-salt diet, DS rats develop systemic hypertension and, as a consequence, left ventricular hypertrophy followed by heart failure. The aim of this work was to explore whether this animal model is suitable for identifying biomarkers that characterize defined stages of cardiac pathophysiological conditions. The work had to be performed in two stages: in the first, proteomic differences attributable to the two separate rat lines (DS and DR) had to be established, and in the second, the development of heart failure due to feeding the rats a high-salt diet had to be monitored. This work describes the results of the first stage, namely the protein expression profiles of left ventricular tissues of DS and DR rats kept on a low-salt diet. Substantial quantitative and qualitative expression differences between the two strains of Dahl rats were detected in heart tissue. Using Principal Component Analysis, Linear Discriminant Analysis and other statistical means, we have established sets of differentially expressed proteins that are candidates for further molecular analysis of the heart failure mechanisms.