Digital Library

940 results for correlation coefficient analysis

A cophenetic correlation coefficient for Tocher's method

Abstract:

The objective of this work was to propose a way of using Tocher's clustering method to obtain a matrix analogous to the cophenetic matrix of hierarchical methods, which would allow a cophenetic correlation to be calculated. To illustrate the construction of the proposed cophenetic matrix, we used two dissimilarity matrices between 17 garlic cultivars, based on six morphological characters: one obtained with the generalized squared Mahalanobis distance and the other with the Euclidean distance. In essence, the proposal is to build the cophenetic matrix from the average distances within and between clusters after the clustering has been performed. A function in the R language was proposed to compute the cophenetic matrix for Tocher's method. The empirical distribution of this correlation coefficient was briefly studied. For both dissimilarity measures, the cophenetic correlations obtained for Tocher's method were higher than those obtained with the hierarchical methods (Ward's algorithm and average linkage, UPGMA). Clusterings produced by agglomerative hierarchical methods and by Tocher's method can therefore be compared using a common criterion: the correlation between the matrices of original and cophenetic distances.
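The idea of building a cophenetic-style matrix from average within-cluster and between-cluster distances can be sketched as follows. This is a minimal illustration in Python, not the R function proposed by the authors. It assumes that the clustering (by Tocher's method or any other non-hierarchical method) has already been done and is supplied as a list of labels. The function names and the 4-item dissimilarity matrix are hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def cophenetic_from_clusters(dist, labels):
    """Replace each pairwise distance by the mean distance of its cluster pair.

    Pairs in the same cluster get the mean within-cluster distance;
    pairs in different clusters get the mean distance between the two clusters.
    """
    n = len(labels)
    sums, counts = {}, {}
    for i, j in combinations(range(n), 2):
        key = tuple(sorted((labels[i], labels[j])))
        sums[key] = sums.get(key, 0.0) + dist[i][j]
        counts[key] = counts.get(key, 0) + 1
    coph = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        key = tuple(sorted((labels[i], labels[j])))
        coph[i][j] = coph[j][i] = sums[key] / counts[key]
    return coph


def cophenetic_correlation(dist, labels):
    """Correlate original and cophenetic distances over all item pairs."""
    coph = cophenetic_from_clusters(dist, labels)
    pairs = list(combinations(range(len(labels)), 2))
    return pearson([dist[i][j] for i, j in pairs],
                   [coph[i][j] for i, j in pairs])


# Hypothetical dissimilarity matrix with two clusters: {0, 1} and {2, 3}.
dist = [
    [0.0, 1.0, 4.0, 5.0],
    [1.0, 0.0, 5.0, 6.0],
    [4.0, 5.0, 0.0, 2.0],
    [5.0, 6.0, 2.0, 0.0],
]
labels = [0, 0, 1, 1]
coph = cophenetic_from_clusters(dist, labels)
r = cophenetic_correlation(dist, labels)
print(round(r, 3))
```

Because the result is an ordinary correlation between two distance matrices, it can be compared directly with the cophenetic correlation of a dendrogram built from the same dissimilarities.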

The electron antineutrino angular correlation coefficient a in free neutron decay: Testing the standard model with the aSPECT-spectrometer

Abstract:

The beta decay of free neutrons is a strongly over-determined process in the Standard Model (SM) of particle physics and is described by a multitude of observables. Some of these observables, such as the correlation coefficients of the particles involved, are sensitive to physics beyond the SM. The aSPECT spectrometer was designed to measure the shape of the proton energy spectrum precisely and to extract from it the electron antineutrino angular correlation coefficient "a". A first test period (2005/2006) provided the proof of principle, but uncontrollable background conditions in the spectrometer made it impossible to extract a reliable value for the coefficient "a" (publication: Baessler et al., 2008, Europhys. Journ. A, 38, p. 17-26). A second measurement cycle (2007/2008) aimed to improve on the relative accuracy of previous experiments (Stratowa et al. (1978), Byrne et al. (2002)), da/a = 5%. The analysis of the data taken in that cycle, which I performed, is the focus of this doctoral thesis. Background studies are a central point. The systematic impact of background on a was reduced to da/a(syst.) = 0.61%. The statistical accuracy of the analyzed measurements is da/a(stat.) = 1.4%. In addition, the saturation effects of the detector electronics that were initially observed were investigated. These turned out not to be correctable to a sufficient level. A practical idea for avoiding the saturation effects is discussed in the last chapter.

A Bayesian approach to estimating the intraclass correlation coefficient

Abstract:

A Bayesian approach to estimating the intraclass correlation coefficient was used for this research project. The background of the intraclass correlation coefficient, a summary of its standard estimators, and a review of basic Bayesian terminology and methodology were presented. The conditional posterior density of the intraclass correlation coefficient was then derived and estimation procedures related to this derivation were shown in detail. Three examples of applications of the conditional posterior density to specific data sets were also included. Two sets of simulation experiments were performed to compare the mean and mode of the conditional posterior density of the intraclass correlation coefficient to more traditional estimators. Non-Bayesian methods of estimation used were: the methods of analysis of variance and maximum likelihood for balanced data; and the methods of MIVQUE (Minimum Variance Quadratic Unbiased Estimation) and maximum likelihood for unbalanced data. The overall conclusion of this research project was that Bayesian estimates of the intraclass correlation coefficient can be appropriate, useful and practical alternatives to traditional methods of estimation.

Statnote 32: the partial correlation coefficient

Abstract:

Previous statnotes described the application of correlation and regression methods to the analysis of two variables (X, Y). The most important statistic used to measure the degree of correlation between two variables is Pearson’s ‘product moment correlation coefficient’ (‘r’). The correlation between two variables may, however, be due to their common relation to other variables. Investigators using correlation studies therefore need to be alert to the possibility of spurious correlation, and ‘partial correlation’ is one way of taking it into account. This statnote applies the methods of partial correlation to three scenarios: first, a fairly obvious example of a spurious correlation resulting from the ‘size effect’, involving the relationship between the number of general practitioners (GP) and the number of deaths of patients in a town; second, the relationship between the abundance of the nitrogen-fixing bacterium Azotobacter in soil and three soil variables; and finally, a more complex scenario, first introduced in Statnote 24, involving the relationship between the growth of lichens in the field and climate.
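To make the "size effect" concrete, the first-order partial correlation between X and Y controlling for a third variable Z can be computed from the three pairwise Pearson coefficients. The sketch below uses the standard textbook formula. The three input correlations are invented for illustration and are not taken from the statnote.

```python
from math import sqrt


def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of X and Y, controlling for Z."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical values: number of GPs (X) and number of deaths (Y) both
# correlate strongly with town population (Z).
r_xy, r_xz, r_yz = 0.80, 0.90, 0.85
r_partial = partial_correlation(r_xy, r_xz, r_yz)
print(round(r_partial, 3))
```

With these made-up inputs, a raw correlation of 0.80 shrinks to roughly 0.15 once population size is held constant, which is the pattern a spurious correlation would be expected to show.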

A correlation sensitivity analysis of non-life underwriting risk in solvency capital requirement estimation

Abstract:

This paper analyses the impact of using different correlation assumptions between lines of business when estimating the risk-based capital reserve, the Solvency Capital Requirement (SCR), under Solvency II regulations. A case study is presented and the SCR is calculated according to the Standard Model approach. Alternatively, the requirement is then calculated using an Internal Model based on a Monte Carlo simulation of the net underwriting result at a one-year horizon, with copulas being used to model the dependence between lines of business. To address the impact of these model assumptions on the SCR we conduct a sensitivity analysis. We examine changes in the correlation matrix between lines of business and address the choice of copulas. Drawing on aggregate historical data from the Spanish non-life insurance market between 2000 and 2009, we conclude that modifications of the correlation and dependence assumptions have a significant impact on SCR estimation.
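A sensitivity analysis of this kind can be sketched with a square-root aggregation of stand-alone capital charges under a correlation matrix. The abstract does not state the aggregation formula, so that form is an assumption here, as are the three stand-alone charges and the uniform correlation values. The copula-based internal model is not reproduced.

```python
from math import sqrt


def aggregate_scr(standalone, corr):
    """Aggregate stand-alone charges as sqrt(sum_ij corr_ij * s_i * s_j)."""
    n = len(standalone)
    total = sum(corr[i][j] * standalone[i] * standalone[j]
                for i in range(n) for j in range(n))
    return sqrt(total)


def uniform_corr(n, rho):
    """Correlation matrix with the same off-diagonal value rho everywhere."""
    return [[1.0 if i == j else rho for j in range(n)] for i in range(n)]


# Hypothetical stand-alone capital charges for three lines of business.
standalone = [100.0, 80.0, 60.0]

# Vary the assumed correlation and record the aggregated requirement.
results = {rho: aggregate_scr(standalone, uniform_corr(3, rho))
           for rho in (0.0, 0.25, 0.5, 1.0)}
for rho, scr in results.items():
    print(f"rho={rho:.2f}  SCR={scr:.2f}")
```

In this toy setting the aggregated figure rises from the independent case to the simple sum of the charges as the correlation approaches 1, so the size of the diversification benefit depends directly on the correlation assumption.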

Partial correlation: Network analysis

Abstract:

The final year project gave us an opportunity to work on a topic that had attracted us while majoring in economics: statistics and its application to the analysis of economic data, i.e. econometrics. Moreover, the combination of econometrics and computer science is a very hot topic nowadays, given the boom in information technologies over recent decades and the consequent exponential increase in the amount of data collected and stored every day. Data analysts able to deal with Big Data and to extract useful results from it are in high demand and, as we understand it, their work, although sometimes ethically controversial, is a clear source of added value for both private corporations and the public sector. For these reasons, the core of this project is the study of a statistical instrument suited to the analysis of large datasets and directly related to computer science: Partial Correlation Networks. The structure of the project follows from our objectives. First, the characteristics of the instrument are explained, from the basic ideas up to the features of the underlying model, with the final goal of presenting the SPACE model as a tool for estimating interconnections between elements in large data sets. Next, an illustrated simulation is performed to show the power and efficiency of the model. Finally, the model is put into practice by analyzing a relatively large set of real-world data, with the objective of assessing whether the proposed statistical instrument is valid and useful when applied to a real multivariate time series.
In short, our main goals are to present the model and to evaluate whether Partial Correlation Network Analysis is an effective, useful instrument that yields valuable results from Big Data. The findings of this project suggest that the Partial Correlation Estimation by Joint Sparse Regression Models approach presented by Peng et al. (2009) works well under the assumption of sparsity. Moreover, partial correlation networks are shown to be a valid tool for representing cross-sectional interconnections between elements in large data sets. The scope of this project is limited, however, and some sections would have benefited from deeper analysis: intertemporal connections between elements, the choice of the tuning parameter lambda, and a deeper analysis of the results in the real-data application are aspects in which this project could be extended. To sum up, the analyzed statistical tool proved to be a very useful instrument for finding the relationships that connect the elements of a large data set. Partial correlation networks allow the owner of such a data set to observe and analyze linkages that might otherwise have been overlooked.
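As a small illustration of what a partial correlation network is, the sketch below derives partial correlations from the inverse of a covariance (or correlation) matrix and keeps an edge wherever the absolute partial correlation exceeds a threshold. This is the plain matrix-inversion route, not the SPACE estimator of Peng et al. (2009), which uses sparse regression and is intended for settings where direct inversion is impractical. The 3-variable correlation matrix and the threshold are hypothetical.

```python
from math import sqrt


def invert(matrix):
    """Invert a small square matrix by Gauss-Jordan elimination."""
    n = len(matrix)
    aug = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def partial_correlations(cov):
    """Partial correlation of each pair given all the other variables."""
    prec = invert(cov)
    n = len(prec)
    return [[1.0 if i == j else -prec[i][j] / sqrt(prec[i][i] * prec[j][j])
             for j in range(n)] for i in range(n)]


def network_edges(pcor, threshold):
    """Pairs whose absolute partial correlation exceeds the threshold."""
    n = len(pcor)
    return [(i, j) for i in range(n) for j in range(i + 1, n)
            if abs(pcor[i][j]) > threshold]


# Hypothetical chain 0 - 1 - 2: variables 0 and 2 are correlated (0.36)
# only through variable 1.
corr = [
    [1.00, 0.60, 0.36],
    [0.60, 1.00, 0.60],
    [0.36, 0.60, 1.00],
]
pcor = partial_correlations(corr)
edges = network_edges(pcor, threshold=0.1)
print(edges)
```

In this example the marginal correlation between variables 0 and 2 disappears once variable 1 is conditioned on, so the network keeps only the two direct links.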

Sparse Correlation Kernel Analysis and Reconstruction

Abstract:

This paper presents a new paradigm for signal reconstruction and superresolution, Correlation Kernel Analysis (CKA), that is based on the selection of a sparse set of bases from a large dictionary of class-specific basis functions. The basis functions that we use are the correlation functions of the class of signals we are analyzing. To choose the appropriate features from this large dictionary, we use Support Vector Machine (SVM) regression and compare this to traditional Principal Component Analysis (PCA) for the tasks of signal reconstruction, superresolution, and compression. The testbed we use in this paper is a set of images of pedestrians. This paper also presents results of experiments in which we use a dictionary of multiscale basis functions and then use Basis Pursuit De-Noising to obtain a sparse, multiscale approximation of a signal. The results are analyzed and we conclude that 1) when used with a sparse representation technique, the correlation function is an effective kernel for image reconstruction and superresolution, 2) for image compression, PCA and SVM have different tradeoffs, depending on the particular metric that is used to evaluate the results, 3) in sparse representation techniques, L_1 is not a good proxy for the true measure of sparsity, L_0, and 4) the L_epsilon norm may be a better error metric for image reconstruction and compression than the L_2 norm, though the exact psychophysical metric should take into account high-order structure in images.

High-precision measurements of the co-polar correlation coefficient: non-Gaussian errors and retrieval of the dispersion parameter µ in rainfall

Abstract:

The co-polar correlation coefficient (ρhv) has many applications, including hydrometeor classification, ground clutter and melting layer identification, interpretation of ice microphysics and the retrieval of rain drop size distributions (DSDs). However, we currently lack the quantitative error estimates that are necessary if these applications are to be fully exploited. Previous error estimates of ρhv rely on knowledge of the unknown "true" ρhv and implicitly assume a Gaussian probability distribution function of ρhv samples. We show that frequency distributions of ρhv estimates are in fact highly negatively skewed. A new variable, L = -log10(1 - ρhv), is defined, which does have Gaussian error statistics and a standard deviation that depends only on the number of independent radar pulses. This is verified using observations of spherical drizzle drops, allowing, for the first time, the construction of rigorous confidence intervals in estimates of ρhv. In addition, we demonstrate how the imperfect co-location of the horizontal and vertical polarisation sample volumes may be accounted for. The possibility of using L to estimate the dispersion parameter (µ) in the gamma drop size distribution is investigated. We find that including drop oscillations is essential for this application; otherwise there could be biases in retrieved µ of up to ~8. Preliminary results in rainfall are presented. In a convective rain case study, our estimates show µ to be substantially larger than 0 (an exponential DSD). In this particular rain event, rain rate would be overestimated by up to 50% if a simple exponential DSD were assumed.
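The transformation L = -log10(1 - ρhv) and its use for confidence intervals can be sketched as follows. The standard deviation of L is passed in as a plain argument because the abstract says only that it depends on the number of independent pulses and gives no formula. The value 0.1 used below is an arbitrary placeholder, and the function names are hypothetical.

```python
from math import log10


def rho_to_L(rho_hv):
    """Transform the co-polar correlation coefficient to L = -log10(1 - rho_hv)."""
    if not 0.0 <= rho_hv < 1.0:
        raise ValueError("rho_hv must lie in [0, 1)")
    return -log10(1.0 - rho_hv)


def L_to_rho(L):
    """Inverse transform: rho_hv = 1 - 10**(-L)."""
    return 1.0 - 10.0 ** (-L)


def rho_interval(rho_hv, sigma_L, z=1.96):
    """Symmetric Gaussian interval in L, mapped back to rho_hv."""
    L = rho_to_L(rho_hv)
    return L_to_rho(L - z * sigma_L), L_to_rho(L + z * sigma_L)


# sigma_L = 0.1 is a placeholder, not a value from the paper.
low, high = rho_interval(0.99, sigma_L=0.1)
print(round(low, 4), round(high, 4))
```

An interval that is symmetric in L maps back to an interval in ρhv with a longer lower tail, consistent with the negative skew described in the abstract.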

Comparative Analysis of 2-Flap Designs for Extraction of Mandibular Third Molar

Abstract:

Objective: The objective of the study was to analyze two flap designs for surgical extraction of the third molar, evaluating the periodontal status of the lower second molar. Study Design: Forty-five lower third molars were extracted from 24 patients. In 23 teeth, a vertical incision to the mandibular ramus was used (technique A), whereas 22 teeth were treated with the classic L-shaped flap (technique B), with controls at 60 and 90 days postoperatively. Results: Pearson correlation coefficient analysis showed a significant correlation only between the immediate preoperative probing depth variables from techniques A and B in the studied surfaces. Statistical significance was noted for the preoperative (vestibular) and postoperative day 60 (distovestibular and vestibular) measurements. In contrast, Student's t-test showed no statistical difference in probing depths between preoperative and postoperative values, and no statistically significant difference attributable to the type of incision alone. Conclusions: Technique A allowed a less traumatic surgery, guaranteeing a more comfortable postoperative period.

Statistical Properties of the Integrative Correlation Coefficient: a Measure of Cross-study Gene Reproducibility

(Table 2) Similarity Indexes (Pearson's correlation coefficient) of the silicoflagellate assemblages of IODP Exp302

Abstract:

The silicoflagellate and ebridian assemblages in early middle Eocene Arctic cores obtained by IODP Expedition 302 (ACEX) were studied in order to decipher the paleoceanography of the upper water column. The assemblages in Lithologic Unit 2 (49.7-45.1 Ma), one of the biosiliceous intervals, were usually endemic as compared to the assemblages that occurred outside of the Arctic Ocean. The presence of these endemic assemblages is probably due to a unique environmental setting, controlled by the degree of mixing between the low-salinity Arctic waters and relatively high salinity waters supplied from outside the Arctic Ocean, such as the Atlantic and possibly the Western Siberian Sea. Using the basin-to-basin fractionation model, the early middle Eocene Arctic Ocean corresponds to an estuarine circulation type, which includes the modern-day Black Sea. The abundant down-core occurrence of ebridians strongly suggests the past presence of low-salinity waters, and may indicate that low oxygen concentrations prevailed in the euphotic layer, on the basis of the ecology of the modern ebridian Hermesinum adriaticum.

Statnote 33: the intra-class correlation coefficient

Abstract:

The intra-class correlation coefficient (ICC or ri) is a method of measuring correlation when the data are paired, and should therefore be used when experimental units are organised into groups. A useful analogy is with the choice between the unpaired and the paired ‘t’ test for comparing the means of two groups. In studies of reproducibility, there may actually be little difference between the ICC and Pearson’s ‘r’ for ‘true’ repeated measurements. If, however, there is a systematic change between the measurements made on the first occasion and those made on the second, then the ICC will be significantly less than ‘r’, and less confidence would be placed in the reproducibility of the results.
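The effect of a systematic change on the ICC can be seen in a small numerical sketch. It uses the one-way random-effects ICC computed from ANOVA mean squares, which is only one of several ICC variants, so the form used in the statnote may differ. The measurement values are invented.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def icc_oneway(first, second):
    """One-way ICC for paired data: (MSB - MSW) / (MSB + (k - 1) * MSW), k = 2."""
    n, k = len(first), 2
    subject_means = [(a + b) / k for a, b in zip(first, second)]
    grand = sum(subject_means) / n
    # Between-subject and within-subject mean squares.
    msb = k * sum((m - grand) ** 2 for m in subject_means) / (n - 1)
    msw = sum((a - m) ** 2 + (b - m) ** 2
              for a, b, m in zip(first, second, subject_means)) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)


# Hypothetical repeated measurements: the second occasion reads 4 units higher.
first = [10.0, 12.0, 14.0, 16.0, 18.0]
second = [v + 4.0 for v in first]

r = pearson(first, second)
icc = icc_oneway(first, second)
print(round(r, 3), round(icc, 3))
```

Pearson's r is unaffected by the constant shift and stays at 1, while the ICC falls to about 0.43 because the shift is counted as within-subject disagreement.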

Diversity and path coefficient analysis of Southern African maize hybrids

Abstract:

Detailed knowledge of genetic diversity among germplasm is important for hybrid maize (Zea mays L.) breeding. The objective of the study was to determine genetic diversity in widely grown hybrids in Southern Africa, and to compare the effectiveness of phenotypic analysis models for determining genetic distances between hybrids. Fifty hybrids were evaluated at one site with two replicates in a randomized complete block design. Phenotypic and genotypic data were analyzed using SAS and PowerMarker, respectively. There was significant (p < 0.01) variation and diversity among hybrid brands but little within brand clusters. Polymorphic Information Content (PIC) ranged from 0.07 to 0.38 with an average of 0.34, and genetic distance ranged from 0.08 to 0.50 with an average of 0.43. SAH23 and SAH21 (0.48) and SAH33 and SAH3 (0.47) were the most distantly related hybrids. Both single nucleotide polymorphism (SNP) markers and phenotypic data models were effective for discriminating genotypes according to genetic distance. SNP markers revealed nine clusters of hybrids. The 12-trait phenotypic analysis model revealed eight clusters at 85%, while the five-trait model revealed six clusters. Path analysis revealed significant direct and indirect effects of secondary traits on yield. Plant height and ear height were negatively correlated with grain yield, meaning that shorter hybrids gave higher yields. Ear weight, days to anthesis, and number of ears had the highest positive direct effects on yield. These traits can provide a good selection index for high-yielding maize hybrids. The results confirmed that the diversity of hybrids is small within brands and that phenotypic trait models are effective for discriminating hybrids.

Topography of the hilar arteries and veins in the kidneys of Pekingese dogs (Canis familiaris, L. 1758)

Abstract:

The arteries and veins at the renal hilus were studied in 30 pairs of kidneys from Pekingese dogs. On the right and left sides, the arterial branches number between 6 and 16 and between 5 and 13, and the venous roots between 1 and 7 and between 1 and 10; the arterial vessels are exclusively peripheral in 100% and 93.3% of cases, and the venous roots are central; the arterial vessels are equal in number in 13.3% of cases, and the venous roots in 46.6%; the cranial quadrants are more densely populated. Student's t-test is not significant at the 5% level with respect to sex or kidney (right and left). Pearson's linear correlation coefficient between the number of arteries and veins is positive in females for both kidneys, but no correlation exists under the same conditions in males.
