970 resultados para Multivariate data
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
In recent years, studies based on isoenzymatic patterns of geographic variation have revealed that what is usually called the Africanized honey bee does not constitute a single population. Instead, several local populations exist with various degrees of admixture with European honey bees. In this paper, we evaluated new data on morphometric patterns of Africanized honey bees collected at 42 localities in Brazil, using univariate and multivariate (canonical) trend surface and spatial autocorrelation analyses. The clinal patterns of variation found for genetically independent characters (wing size characters and some wing venation angles) are concordant with previous studies of malate dehydrogenase (MDH) allelic frequencies and support the hypothesis that larger honey bees in southern and southeastern Brazil originated by racial admixture in the initial phases of African honey bee colonization. Geographic variation patterns of Africanized honey bee populations reflect a demic diffusion process in which European genes were gradually lost because of the higher fitness of the African gene pool in Neotropical environmental conditions.
Resumo:
In the context of Bayesian statistical analysis, elicitation is the process of formulating a prior density f(.) about one or more uncertain quantities to represent a person's knowledge and beliefs. Several different methods of eliciting prior distributions for one unknown parameter have been proposed. However, there are relatively few methods for specifying a multivariate prior distribution and most are just applicable to specific classes of problems and/or based on restrictive conditions, such as independence of variables. Besides, many of these procedures require the elicitation of variances and correlations, and sometimes elicitation of hyperparameters which are difficult for experts to specify in practice. Garthwaite et al. (2005) discuss the different methods proposed in the literature and the difficulties of eliciting multivariate prior distributions. We describe a flexible method of eliciting multivariate prior distributions applicable to a wide class of practical problems. Our approach does not assume a parametric form for the unknown prior density f(.), instead we use nonparametric Bayesian inference, modelling f(.) by a Gaussian process prior distribution. The expert is then asked to specify certain summaries of his/her distribution, such as the mean, mode, marginal quantiles and a small number of joint probabilities. The analyst receives that information, treating it as a data set D with which to update his/her prior beliefs to obtain the posterior distribution for f(.). Theoretical properties of joint and marginal priors are derived and numerical illustrations to demonstrate our approach are given. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
The correspondence between morphometric and isozymic geographic variation patterns of Africanized honey bees in Brazil was analyzed. Morphometric data consisted of mean vectors of 19 wing traits measured in 42 local populations distributed throughout the country. Isozymic data refer to allelic frequencies of malate dehydrogenase (MDH), and were obtained from Lobo and Krieger. The two data sets were analyzed through canonical trend surface, principal components and spatial autocorrelation analyses, and showed north-south dines, demonstrating that Africanized honey bees in southern and southeastern Brazil are more similar to European honey bees than those found in northern and northeastern regions. Also, the morphometric variation is within the limits established by the racial admixture model, considering the expected values of Africanized honey bee fore wing length (WL) in southern and northeastern regions of Brazil, estimated by combining average values of WL in the three main subspecies involved in the Africanization process (Apis mellifera scutellata, A. m. ligustica and A. m. mellifera) with racial admixture coefficients.
Resumo:
A methodology to define favorable areas in petroleum and mineral exploration is applied, which consists in weighting the exploratory variables, in order to characterize their importance as exploration guides. The exploration data are spatially integrated in the selected area to establish the association between variables and deposits, and the relationships among distribution, topology, and indicator pattern of all variables. Two methods of statistical analysis were compared. The first one is the Weights of Evidence Modeling, a conditional probability approach (Agterberg, 1989a), and the second one is the Principal Components Analysis (Pan, 1993). In the conditional method, the favorability estimation is based on the probability of deposit and variable joint occurrence, with the weights being defined as natural logarithms of likelihood ratios. In the multivariate analysis, the cells which contain deposits are selected as control cells and the weights are determined by eigendecomposition, being represented by the coefficients of the eigenvector related to the system's largest eigenvalue. The two techniques of weighting and complementary procedures were tested on two case studies: 1. Recôncavo Basin, Northeast Brazil (for Petroleum) and 2. Itaiacoca Formation of Ribeira Belt, Southeast Brazil (for Pb-Zn Mississippi Valley Type deposits). The applied methodology proved to be easy to use and of great assistance to predict the favorability in large areas, particularly in the initial phase of exploration programs. © 1998 International Association for Mathematical Geology.
Resumo:
Rain acidity may be ascribed to emissions from power station stacks, as well as emissions from other industry, biomass burning, maritime influences, agricultural influences, etc. Rain quality data are available for 30 sites in the South African interior, some from as early as 1985 for up to 14 rainfall seasons, while others only have relatively short records. The article examines trends over time in the raw and volume weighted concentrations of the parameters measured, separately for each of the sites for which sufficient data are available. The main thrust, however, is to examine the inter-relationship structure between the concentrations within each rain event (unweighted data), separately for each site, and to examine whether these inter-relationships have changed over time. The rain events at individual sites can be characterized by approximately eight combinations of rainfall parameters (or rain composition signatures), and these are common to all sites. Some sites will have more events from one signature than another, but there appear to be no signatures unique to a single site. Analysis via factor and cluster analysis, with a correspondence analysis of the results, also aid interpretation of the patterns. This spatio-temporal analysis, performed by pooling all rain event data, irrespective of site or time period, results in nine combinations of rainfall parameters being sufficient to characterize the rain events. The sites and rainfall seasons show patterns in these combinations of parameters, with some combinations appearing more frequently during certain rainfall seasons. In particular, the presence of the combination of low acetate and formate with high magnesium appears to be increasing in the later rainfall seasons, as does this combination together with calcium, sodium, chloride, potassium and fluoride. As expected, sites close together exhibit similar signatures. Copyright © 2002 John Wiley & Sons, Ltd.
Resumo:
Physiological potential characterization of seed lots is usually performed by germination and vigor tests; however, the choice of a single test does not reflect such potential, once each test assesses seeds of differentiated mode. Multivariate techniques allow understanding structural dependence contained in each variable, as well as characterize groups of seed lots according to specific standards. The study aimed at evaluating variability among soybean seed lots and discriminate these lots by multivariate exploratory techniques as function of seed vigor. Experiment was performed with 20 soybean seed lots (10 lots cv. BRS Valiosa RR and 10 lots cv. M-SOY 7908 RR). Seed physiological potential was assessed by testing for: germination (standard, and under different water availability); vigor (accelerated aging and electrical conductivity); and field seedling emergence. Cluster analysis of seed lots, as well as Principal Component Analysis was performed using data obtained on all tests. Multivariate techniques allowed stratifying seed lots into two distinct groups. Principal Component Analysis showed that values obtained for variables: field seedling emergence, accelerated aging, and germination under different water availability were linked to BRS Valiosa RR; while to variables germination and electrical conductivity, were linked to M-SOY 7908 RR.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
In this paper we describe how morphological castes can be distinguished using multivariate statistical methods combined with jackknife estimators of the allometric coefficients. Data from the polymorphic ant, Camponotus rufipes, produced two distinct patterns of allometric variation, and thus two morphological castes. Morphometric analysis distinguished different allometric patterns within the two castes, with overall variability being greater in the major workers. Caste-specific scaling variabilities were associated with the relative importance of first principal component. The static multivariate allometric coefficients for each of 10 measured characters were different between castes, but their relative magnitudes within castes were similar. Multivariate statistical analysis of worker polymorphism in ants is a more complete descriptor of shape variation than, and provides statistical and conceptual advantages over, the standard bivariate techniques commonly used.
Resumo:
Background Several researchers seek methods for the selection of homogeneous groups of animals in experimental studies, a fact justified because homogeneity is an indispensable prerequisite for casualization of treatments. The lack of robust methods that comply with statistical and biological principles is the reason why researchers use empirical or subjective methods, influencing their results. Objective To develop a multivariate statistical model for the selection of a homogeneous group of animals for experimental research and to elaborate a computational package to use it. Methods The set of echocardiographic data of 115 male Wistar rats with supravalvular aortic stenosis (AoS) was used as an example of model development. Initially, the data were standardized, and became dimensionless. Then, the variance matrix of the set was submitted to principal components analysis (PCA), aiming at reducing the parametric space and at retaining the relevant variability. That technique established a new Cartesian system into which the animals were allocated, and finally the confidence region (ellipsoid) was built for the profile of the animals’ homogeneous responses. The animals located inside the ellipsoid were considered as belonging to the homogeneous batch; those outside the ellipsoid were considered spurious. Results The PCA established eight descriptive axes that represented the accumulated variance of the data set in 88.71%. The allocation of the animals in the new system and the construction of the confidence region revealed six spurious animals as compared to the homogeneous batch of 109 animals. Conclusion The biometric criterion presented proved to be effective, because it considers the animal as a whole, analyzing jointly all parameters measured, in addition to having a small discard rate.
Resumo:
Concentrations of 39 organic compounds were determined in three fractions (head, heart and tail) obtained from the pot still distillation of fermented sugarcane juice. The results were evaluated using analysis of variance (ANOVA), Tukey's test, principal component analysis (PCA), hierarchical cluster analysis (HCA) and linear discriminant analysis (LDA). According to PCA and HCA, the experimental data lead to the formation of three clusters. The head fractions give rise to a more defined group. The heart and tail fractions showed some overlap consistent with its acid composition. The predictive ability of calibration and validation of the model generated by LDA for the three fractions classification were 90.5 and 100%, respectively. This model recognized as the heart twelve of the thirteen commercial cachacas (92.3%) with good sensory characteristics, thus showing potential for guiding the process of cuts.