18 results for Genetic Variance-covariance Matrix





We propose a new method for estimating the covariance matrix of a multivariate time series of financial returns. The method is based on estimating sample covariances from overlapping windows of observations, which are then appropriately weighted to obtain the final covariance estimate. We extend the idea of (model) covariance averaging offered in the covariance shrinkage approach by means of greater ease of use, flexibility and robustness in averaging information over different data segments. The suggested approach does not suffer from the curse of dimensionality and can be used without problems of either approximation or any demand for numerical optimization.
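The windowed averaging described in this abstract can be illustrated with a minimal sketch. The abstract does not state the weighting scheme, so the function names and the `window`, `step` and `weights` parameters below are assumptions made for illustration, with equal weights as the default; this is not the authors' estimator.

```python
def sample_cov(rows):
    """Unbiased sample covariance matrix of a list of equal-length rows."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    return [[sum((r[i] - means[i]) * (r[j] - means[j]) for r in rows) / (n - 1)
             for j in range(p)] for i in range(p)]


def averaged_cov(returns, window, step=1, weights=None):
    """Weighted average of sample covariances over overlapping windows.

    `window`, `step` and `weights` are hypothetical parameters; equal
    weights are assumed when none are supplied.
    """
    starts = range(0, len(returns) - window + 1, step)
    covs = [sample_cov(returns[s:s + window]) for s in starts]
    if not covs:
        raise ValueError("window is longer than the series")
    if weights is None:
        weights = [1.0] * len(covs)
    if len(weights) != len(covs):
        raise ValueError("need exactly one weight per window")
    total = sum(weights)
    p = len(covs[0])
    # Normalise by the weight total so the result is a weighted mean.
    return [[sum(w * c[i][j] for w, c in zip(weights, covs)) / total
             for j in range(p)] for i in range(p)]


if __name__ == "__main__":
    toy_returns = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0]]
    print(averaged_cov(toy_returns, window=3))
```

Each window only needs a sample covariance, so no numerical optimization is involved; how the weights should actually be chosen is left to the paper.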





The main hallmark of diabetic nephropathy is elevation in urinary albumin excretion. We performed a genome-wide linkage scan in 63 extended families with multiple members with type II diabetes. Urinary albumin excretion, measured as the albumin-to-creatinine ratio (ACR), was determined in 426 diabetic and 431 nondiabetic relatives who were genotyped for 383 markers. The data were analyzed using variance components linkage analysis. Heritability (h2) of ACR was significant in diabetic (h2=0.23, P=0.0007), and nondiabetic (h2=0.39, P=0.0001) relatives. There was no significant difference in genetic variance of ACR between diabetic and nondiabetic relatives (P=0.16), and the genetic correlation (rG=0.64) for ACR between these two groups was not different from 1 (P=0.12). These results suggested that similar genes contribute to variation in ACR in diabetic and nondiabetic relatives. This hypothesis was supported further by the linkage results.





This paper analyses multivariate statistical techniques for identifying and isolating abnormal process behaviour. These techniques include contribution charts and variable reconstructions that relate to the application of principal component analysis (PCA). The analysis shows, first, that contribution charts produce variable contributions that are linearly dependent and may lead to an incorrect diagnosis if the number of principal components retained is close to the number of recorded process variables. Second, it shows that variable reconstruction affects the geometry of the PCA decomposition. The paper further introduces an improved variable reconstruction method for identifying multiple sensor and process faults and for isolating their influence upon the recorded process variables. It is shown that this method can accommodate the effect of reconstruction, i.e. changes in the covariance matrix of the sensor readings, and can correctly redefine the PCA-based monitoring statistics and their confidence limits. (c) 2006 Elsevier Ltd. All rights reserved.





We draw an explicit connection between the statistical properties of an entangled two-mode continuous variable (CV) resource and the amount of entanglement that can be dynamically transferred to a pair of noninteracting two-level systems. More specifically, we rigorously reformulate the entanglement-transfer process by making use of the covariance matrix formalism. When the resource state is Gaussian, our method makes the approach to the transfer of quantum correlations much more flexible than in previously considered schemes and allows the straightforward inclusion of the effects of noise affecting the CV system. Moreover, the proposed method reveals that the use of de-Gaussified two-mode states is almost never advantageous for transferring entanglement with respect to the full Gaussian picture, even though the entanglement in the non-Gaussian resource can be much larger than in its Gaussian counterpart. We can thus conclude that the entanglement-transfer map overthrows the





This paper investigates the center selection of multi-output radial basis function (RBF) networks, and a multi-output fast recursive algorithm (MFRA) is proposed. This method can not only reveal the significance of each candidate center based on the reduction in the trace of the error covariance matrix, but also can estimate the network weights simultaneously using a back substitution approach. The main contribution is that the center selection procedure and the weight estimation are performed within a well-defined regression context, leading to a significantly reduced computational complexity. The efficiency of the algorithm is confirmed by a computational complexity analysis, and simulation results demonstrate its effectiveness. (C) 2010 Elsevier B.V. All rights reserved.





This paper discusses the monitoring of complex nonlinear and time-varying processes. Kernel principal component analysis (KPCA) has gained significant attention as a monitoring tool for nonlinear systems in recent years but relies on a fixed model that cannot be employed for time-varying systems. The contribution of this article is the development of a numerically efficient and memory-saving moving window KPCA (MWKPCA) monitoring approach. The proposed technique incorporates an up- and downdating procedure to (i) adapt the data mean and covariance matrix in the feature space and (ii) approximate the eigenvalues and eigenvectors of the Gram matrix. The article shows that the proposed MWKPCA algorithm has a computational complexity of O(N^2), whilst batch techniques, e.g. the Lanczos method, are of O(N^3). Including the adaptation of the number of retained components and an l-step ahead application of the MWKPCA monitoring model, the paper finally demonstrates the utility of the proposed technique using a simulated nonlinear time-varying system and recorded data from an industrial distillation column.
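The up- and downdating idea can be illustrated in a much simpler setting than the paper's. The sketch below works in the original input space rather than a kernel feature space, and keeps running sums so that adding the newest sample and dropping the oldest costs O(p^2) for p variables instead of a full recomputation. The class name `MovingWindowStats` and its methods are hypothetical, and this is a swapped-in simplification, not the MWKPCA algorithm.

```python
from collections import deque


class MovingWindowStats:
    """Mean and covariance over a sliding window, kept via running sums."""

    def __init__(self, window, dim):
        self.window = window
        self.dim = dim
        self.samples = deque()
        self.s1 = [0.0] * dim                         # sum of samples
        self.s2 = [[0.0] * dim for _ in range(dim)]   # sum of outer products

    def _apply(self, x, sign):
        # sign=+1 updates with a new sample, sign=-1 downdates an old one.
        for i in range(self.dim):
            self.s1[i] += sign * x[i]
            for j in range(self.dim):
                self.s2[i][j] += sign * x[i] * x[j]

    def push(self, x):
        self.samples.append(list(x))
        self._apply(x, 1.0)
        if len(self.samples) > self.window:
            self._apply(self.samples.popleft(), -1.0)

    def mean(self):
        n = len(self.samples)
        return [s / n for s in self.s1]

    def cov(self):
        n = len(self.samples)
        if n < 2:
            raise ValueError("need at least two samples")
        m = self.mean()
        return [[(self.s2[i][j] - n * m[i] * m[j]) / (n - 1)
                 for j in range(self.dim)] for i in range(self.dim)]


if __name__ == "__main__":
    stats = MovingWindowStats(window=3, dim=2)
    for row in [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0]]:
        stats.push(row)
    print(stats.mean(), stats.cov())
```

Running sums are easy to follow but can lose precision on long, poorly scaled series; a production implementation would likely prefer a numerically safer recursive update.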





Multiple-input-multiple-output (MIMO) radar schemes whereby the transmit array is partitioned into subarrays have recently been proposed in the literature to combine advantages of phased array and MIMO radar technology. In this work, we utilize this architecture to significantly simplify a transmit procedure in which the covariance matrix across the MIMO radar array is optimized to improve the Cramer-Rao bound (CRB) on target parameter estimation. The MIMO effective array for regular subarrayed transmit apertures is studied, and necessary conditions to obtain a filled effective aperture are presented, which is important for maintaining nonambiguous, low sidelobe beampatterns. The performance of the subarrayed transmit approach is evaluated in terms of the CRB on target parameter estimation, and the optimisation of the beamformer applied to the subarrays to minimize the CRB is considered. The subarrayed transmit scheme is found to have a CRB which is suboptimal to the full diversity transmission, as expected, but is solvable in a small fraction of the time using an iterative beamspace algorithm developed here.





Cereal grains are the dominant source of cadmium in the human diet, with rice being to the fore. Here we explore the effect of geographic, genetic, and processing (milling) factors on rice grain cadmium and rice consumption rates that lead to dietary variance in cadmium intake. From a survey of 12 countries on four continents, cadmium levels in rice grain were the highest in Bangladesh and Sri Lanka, with both these countries also having high per capita rice intakes. For Bangladesh and Sri Lanka, there was high weekly intake of cadmium from rice, leading to intakes deemed unsafe by international and national regulators. While genetic variance, and to a lesser extent milling, provide strategies for reducing cadmium in rice, caution has to be used, as there is environmental regulation as well as genetic regulation of cadmium accumulation within rice grains. For countries that import rice, grain cadmium can be controlled by where that rice is sourced, but for countries with subsistence rice economies that have high levels of cadmium in rice grain, agronomic and breeding strategies are required to lower grain cadmium.





High-dimensional gene expression data provide a rich source of information because they capture the expression level of genes in dynamic states that reflect the biological functioning of a cell. For this reason, such data are suitable to reveal systems related properties inside a cell, e.g., in order to elucidate molecular mechanisms of complex diseases like breast or prostate cancer. However, this is not only strongly dependent on the sample size and the correlation structure of a data set, but also on the statistical hypotheses tested. Many different approaches have been developed over the years to analyze gene expression data to (I) identify changes in single genes, (II) identify changes in gene sets or pathways, and (III) identify changes in the correlation structure in pathways. In this paper, we review statistical methods for all three types of approaches, including subtypes, in the context of cancer data and provide links to software implementations and tools and address also the general problem of multiple hypotheses testing. Further, we provide recommendations for the selection of such analysis methods.





This paper explores the performance of sliding-window based training, termed semi-batch, using a multilayer perceptron (MLP) neural network in the presence of correlated data. Sliding-window training is a form of higher-order instantaneous learning strategy that does not need a covariance matrix, usually employed for modeling and tracking purposes. The sliding-window framework is implemented to combine the robustness of offline learning algorithms with the ability to track online the underlying process of a function. This paper combines sliding-window training with recent advances in conjugate gradient direction and with data store management techniques, e.g. a simple distance measure, angle evaluation and the novel prediction error test. The simulation results show that the best convergence performance is gained by using store management techniques. © 2012 Springer-Verlag.





The recent bankruptcy filing by deCODE, a company with an exceptional pedigree in associating genetic variance with disease onset, highlights the commercial risks of translational research. Indeed, deCODE's approach was similar to that adopted by academic researchers who seek to connect genetics and disease. We argue here that neither a purely corporate nor a purely academic model is entirely appropriate for such research. Instead, we suggest that the private sector undertake the high-throughput elements of translational research, while the public sector and governments assume the role of providing long-term funding to develop gifted scientists with the confidence to attempt to use genetic data as a stepping stone to a truly mechanistic understanding of complex disease.





A geostatistical version of the classical Fisher rule (linear discriminant analysis) is presented. This method is applicable when a large dataset of multivariate observations is available within a domain split into several known subdomains, and it assumes that the variograms (or covariance functions) are comparable between subdomains, which differ only in the mean values of the available variables. The method consists of finding the eigen-decomposition of the matrix W^{-1}B, where W is the matrix of sills of all direct- and cross-variograms, and B is the covariance matrix of the vectors of weighted means within each subdomain, obtained by generalized least squares. The method is used to map peat blanket occurrence in Northern Ireland, with data from the Tellus survey, which requires a minimal change to the general recipe: to use compositionally-compliant variogram tools and models, and work with log-ratio transformed data.





The complexity of modern geochemical data sets is increasing in several aspects (number of available samples, number of elements measured, number of matrices analysed, geological-environmental variability covered, etc), hence it is becoming increasingly necessary to apply statistical methods to elucidate their structure. This paper presents an exploratory analysis of one such complex data set, the Tellus geochemical soil survey of Northern Ireland (NI). This exploratory analysis is based on one of the most fundamental exploratory tools, principal component analysis (PCA) and its graphical representation as a biplot, albeit in several variations: the set of elements included (only major oxides vs. all observed elements), the prior transformation applied to the data (none, a standardization or a logratio transformation) and the way the covariance matrix between components is estimated (classical estimation vs. robust estimation). Results show that a log-ratio PCA (robust or classical) of all available elements is the most powerful exploratory setting, providing the following insights: the first two processes controlling the whole geochemical variation in NI soils are peat coverage and a contrast between “mafic” and “felsic” background lithologies; peat covered areas are detected as outliers by a robust analysis, and can be then filtered out if required for further modelling; and peat coverage intensity can be quantified with the %Br in the subcomposition (Br, Rb, Ni).
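The abstract does not say which log-ratio transformation was applied. As an illustration only, the sketch below assumes the centred log-ratio (clr), a common choice for compositional data before PCA; the function name `clr` is hypothetical and the example values are made up.

```python
import math


def clr(composition):
    """Centred log-ratio transform of one strictly positive composition.

    Each part is replaced by its log minus the mean log of all parts,
    so the transformed values sum to zero.
    """
    if any(x <= 0 for x in composition):
        raise ValueError("all parts must be strictly positive")
    logs = [math.log(x) for x in composition]
    centre = sum(logs) / len(logs)
    return [v - centre for v in logs]


if __name__ == "__main__":
    # The same sample expressed in two different units gives the same result.
    print(clr([1.0, 2.0, 3.0]))
    print(clr([10.0, 20.0, 30.0]))
```

Because the transform is unchanged when every part is multiplied by the same constant, it depends only on the ratios between elements; a covariance matrix (classical or robust) computed on the transformed rows would then feed the PCA.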





We discuss complementarity relations in a bipartite continuous variable system. Building up from the work done on discrete d-dimensional systems, we prove that for symmetric two-mode states, quantum complementarity relations can be put in a simple relation with the elements of the variance matrix. When this condition is not satisfied, such a connection becomes non-trivial. Our investigation is the first step towards an operative characterization of the complementarity in a scenario that has not been investigated so far.





Heart-of-palm (Euterpe edulis Mart.) is a wild palm with a wide distribution throughout the Atlantic Rainforest. Populations of E. edulis represent important renewable natural resources but are currently under threat from predatory exploitation. Furthermore, because the species is indigenous to the Atlantic Rainforest, which is located in the most economically developed and populated region of Brazil, social and economic pressures have devastated heart-of-palm forests. In order to estimate the partitioning of genetic variation of endangered E. edulis populations, 429 AFLP markers were used to analyse 150 plants representing 11 populations across the species' distribution range. Analysis of the genetic structure of populations carried out using analysis of molecular variance (AMOVA) revealed moderate genetic variation within populations (57.4%). Genetic differentiation between populations (F-ST = 0.426) was positively correlated with geographical distance. These results could be explained by the historical fragmentation of the Atlantic coastal region, together with the life cycle and mating system. The data obtained in this work should have important implications for conservation and future breeding programmes of E. edulis.