918 resultados para Multivariate Linkage Analysis


Relevância:

30.00% 30.00%

Publicador:

Resumo:

To improve the mechanics properties of polyurethane materials at a high or low temperature, a hydroxy compound N-100 of HDI was synthesized, The structure analysis and characterization were made by NMR (H-1, C-13, H-1-H-1 COSY, C-13-H-1 COSY), In addition, quantitative description of the network was made on the basis of some ideal assumptions, 1D and 2D NMR can differentiate four sorts of carbonyl groups and establish the connections of all carbon and hydrogen atoms of mixed structures that originated from five different substitutions, Besides, the alkene and isocyanate, urea, biuret and trimerized isocyanuric groups were also detected, Therefore, the structure of N-100 was suggested be a polyisocyanate with complicated network which contained nitrogen atom as cross-linkage, isocyanate and alkene as end groups, The consistence of calculated values with tested values of isocyanate content, mean function degree and mean molecular weight demonstrated the correct of structure characterization and the validity of network description.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper, the new topological indices A(x1)-A(x3) suggested in our laboratory and molecular connectivity indices have been applied to multivariate analysis in structure-property studies. The topological indices of twenty asymmetrical phosphono bisazo derivatives of chromotropic acid have been calculated. The structure-property relationships between colour reagents and their colour reactions with ytterbium have been studied by A(x1)-A(x3) indices and molecular connectivity indices with satisfactory results. Multiple regression analysis and neural networks were employed simultaneously in this study.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Multivariate classification methods were used to evaluate data on the concentrations of eight metals in human senile lenses measured by atomic absorption spectrometry. Principal components analysis and hierarchical clustering separated senile cataract lenses, nuclei from cataract lenses, and normal lenses into three classes on the basis of the eight elements. Stepwise discriminant analysis was applied to give discriminant functions with five selected variables. Results provided by the linear learning machine method were also satisfactory; the k-nearest neighbour method was less useful.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Amplified fragment length polymorphisms (AFLPs) were used for genome mapping in the Pacific Oyster Crassostrea gigas Thunberg. Seventeen selected primer combinations produced 1106 peaks, of which 384 (34.7%) were polymorphic in a backcross family. Among the polymorphic markers, 349 were segregating through either the female or the male parent. Chi-square analysis indicated that 255 (73.1%) of the markers segregated in a Mendelian ratio, and 94 (26.9%) showed significant (P < 0.05) segregation distortion. Separate genetic linkage maps were constructed for the female and male parents. The female framework map consisted of 119 markers in 11 linkage groups, spanning 1030.7 cM, with an average interval of 9.5 cM per marker. The male map contained 96 markers in 10 linkage groups, covering 758.4 cM, with 8.8 cM per marker. The estimated genome length of the Pacific oyster was 1258 cM for the female and 933 cM for the male, and the observed coverage was 82.0% for the female map and 81.3% for the male map. Most distorted markers were deficient for homozygotes and closely linked to each other on the genetic map, suggesting the presence of major recessive deleterious genes in the Pacific oyster.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Nonlinear multivariate statistical techniques on fast computers offer the potential to capture more of the dynamics of the high dimensional, noisy systems underlying financial markets than traditional models, while making fewer restrictive assumptions. This thesis presents a collection of practical techniques to address important estimation and confidence issues for Radial Basis Function networks arising from such a data driven approach, including efficient methods for parameter estimation and pruning, a pointwise prediction error estimator, and a methodology for controlling the "data mining'' problem. Novel applications in the finance area are described, including customized, adaptive option pricing and stock price prediction.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

For two multinormal populations with equal covariance matrices the likelihood ratio discriminant function, an alternative allocation rule to the sample linear discriminant function when n1 ≠ n2 ,is studied analytically. With the assumption of a known covariance matrix its distribution is derived and the expectation of its actual and apparent error rates evaluated and compared with those of the sample linear discriminant function. This comparison indicates that the likelihood ratio allocation rule is robust to unequal sample sizes. The quadratic discriminant function is studied, its distribution reviewed and evaluation of its probabilities of misclassification discussed. For known covariance matrices the distribution of the sample quadratic discriminant function is derived. When the known covariance matrices are proportional exact expressions for the expectation of its actual and apparent error rates are obtained and evaluated. The effectiveness of the sample linear discriminant function for this case is also considered. Estimation of true log-odds for two multinormal populations with equal or unequal covariance matrices is studied. The estimative, Bayesian predictive and a kernel method are compared by evaluating their biases and mean square errors. Some algebraic expressions for these quantities are derived. With equal covariance matrices the predictive method is preferable. Where it derives this superiority is investigated by considering its performance for various levels of fixed true log-odds. It is also shown that the predictive method is sensitive to n1 ≠ n2. For unequal but proportional covariance matrices the unbiased estimative method is preferred. Product Normal kernel density estimates are used to give a kernel estimator of true log-odds. The effect of correlation in the variables with product kernels is considered. With equal covariance matrices the kernel and parametric estimators are compared by simulation. For moderately correlated variables and large dimension sizes the product kernel method is a good estimator of true log-odds.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: We conducted a survival analysis of all the confirmed cases of Adult Tuberculosis (TB) patients treated in Cork-City, Ireland. The aim of this study was to estimate Survival time (ST), including median time of survival and to assess the association and impact of covariates (TB risk factors) to event status and ST. The outcome of the survival analysis is reported in this paper. Methods: We used a retrospective cohort study research design to review data of 647 bacteriologically confirmed TB patients from the medical record of two teaching hospitals. Mean age 49 years (Range 18–112). We collected information on potential risk factors of all confirmed cases of TB treated between 2008–2012. For the survival analysis, the outcome of interest was ‘treatment failure’ or ‘death’ (whichever came first). A univariate descriptive statistics analysis was conducted using a non- parametric procedure, Kaplan -Meier (KM) method to estimate overall survival (OS), while the Cox proportional hazard model was used for the multivariate analysis to determine possible association of predictor variables and to obtain adjusted hazard ratio. P value was set at <0.05, log likelihood ratio test at >0.10. Data were analysed using SPSS version 15.0. Results: There was no significant difference in the survival curves of male and female patients. (Log rank statistic = 0.194, df = 1, p = 0.66) and among different age group (Log rank statistic = 1.337, df = 3, p = 0.72). The mean overall survival (OS) was 209 days (95%CI: 92–346) while the median was 51 days (95% CI: 35.7–66). The mean ST for women was 385 days (95%CI: 76.6–694) and for men was 69 days (95%CI: 48.8–88.5). Multivariate Cox regression showed that patient who had history of drug misuse had 2.2 times hazard than those who do not have drug misuse. Smokers and alcohol drinkers had hazard of 1.8 while patients born in country of high endemicity (BICHE) had hazard of 6.3 and HIV co-infection hazard was 1.2. Conclusion: There was no significant difference in survival curves of male and female and among age group. Women had a higher ST compared to men. But men had a higher hazard rate compared to women. Anti-TNF, immunosuppressive medication and diabetes were found to be associated with longer ST, while alcohol, smoking, RICHE, BICHE was associated with shorter ST.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Tumor microenvironmental stresses, such as hypoxia and lactic acidosis, play important roles in tumor progression. Although gene signatures reflecting the influence of these stresses are powerful approaches to link expression with phenotypes, they do not fully reflect the complexity of human cancers. Here, we describe the use of latent factor models to further dissect the stress gene signatures in a breast cancer expression dataset. The genes in these latent factors are coordinately expressed in tumors and depict distinct, interacting components of the biological processes. The genes in several latent factors are highly enriched in chromosomal locations. When these factors are analyzed in independent datasets with gene expression and array CGH data, the expression values of these factors are highly correlated with copy number alterations (CNAs) of the corresponding BAC clones in both the cell lines and tumors. Therefore, variation in the expression of these pathway-associated factors is at least partially caused by variation in gene dosage and CNAs among breast cancers. We have also found the expression of two latent factors without any chromosomal enrichment is highly associated with 12q CNA, likely an instance of "trans"-variations in which CNA leads to the variations in gene expression outside of the CNA region. In addition, we have found that factor 26 (1q CNA) is negatively correlated with HIF-1alpha protein and hypoxia pathways in breast tumors and cell lines. This agrees with, and for the first time links, known good prognosis associated with both a low hypoxia signature and the presence of CNA in this region. Taken together, these results suggest the possibility that tumor segmental aneuploidy makes significant contributions to variation in the lactic acidosis/hypoxia gene signatures in human cancers and demonstrate that latent factor analysis is a powerful means to uncover such a linkage.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Complex diseases will have multiple functional sites, and it will be invaluable to understand the cross-locus interaction in terms of linkage disequilibrium (LD) between those sites (epistasis) in addition to the haplotype-LD effects. We investigated the statistical properties of a class of matrix-based statistics to assess this epistasis. These statistical methods include two LD contrast tests (Zaykin et al., 2006) and partial least squares regression (Wang et al., 2008). To estimate Type 1 error rates and power, we simulated multiple two-variant disease models using the SIMLA software package. SIMLA allows for the joint action of up to two disease genes in the simulated data with all possible multiplicative interaction effects between them. Our goal was to detect an interaction between multiple disease-causing variants by means of their linkage disequilibrium (LD) patterns with other markers. We measured the effects of marginal disease effect size, haplotype LD, disease prevalence and minor allele frequency have on cross-locus interaction (epistasis). In the setting of strong allele effects and strong interaction, the correlation between the two disease genes was weak (r=0.2). In a complex system with multiple correlations (both marginal and interaction), it was difficult to determine the source of a significant result. Despite these complications, the partial least squares and modified LD contrast methods maintained adequate power to detect the epistatic effects; however, for many of the analyses we often could not separate interaction from a strong marginal effect. While we did not exhaust the entire parameter space of possible models, we do provide guidance on the effects that population parameters have on cross-locus interaction.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We discuss a general approach to dynamic sparsity modeling in multivariate time series analysis. Time-varying parameters are linked to latent processes that are thresholded to induce zero values adaptively, providing natural mechanisms for dynamic variable inclusion/selection. We discuss Bayesian model specification, analysis and prediction in dynamic regressions, time-varying vector autoregressions, and multivariate volatility models using latent thresholding. Application to a topical macroeconomic time series problem illustrates some of the benefits of the approach in terms of statistical and economic interpretations as well as improved predictions. Supplementary materials for this article are available online. © 2013 Copyright Taylor and Francis Group, LLC.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We examined how marine plankton interaction networks, as inferred by multivariate autoregressive (MAR) analysis of time-series, differ based on data collected at a fixed sampling location (L4 station in the Western English Channel) and four similar time-series prepared by averaging Continuous Plankton Recorder (CPR) datapoints in the region surrounding the fixed station. None of the plankton community structures suggested by the MAR models generated from the CPR datasets were well correlated with the MAR model for L4, but of the four CPR models, the one most closely resembling the L4 model was that for the CPR region nearest to L4. We infer that observation error and spatial variation in plankton community dynamics influenced the model performance for the CPR datasets. A modified MAR framework in which observation error and spatial variation are explicitly incorporated could allow the analysis to better handle the diverse time-series data collected in marine environments.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The relationship between biodiversity and stability of marine benthic assemblages was investigated using existing data sets (n = 28) covering various spatial (m-km) and temporal (1973-2006) scales in different benthic habitats (emergent rock, rock pools and sedimentary habitats) through meta-analyses. Assemblage stability was estimated by measuring temporal variances of species richness, total abundance (density or % cover) and community species composition and abundance structure (using multivariate analyses). Positive relationships between temporal variability in species number and richness were generally observed at both quadrat (<1 m2) and site (100 m2) scales, while no relationships were observed by multivariate analyses. Positive relationships were also observed at the scale of site between temporal variability in species number and variability in community structure with evenness estimates. This implies that the relationship between species richness or evenness and species richness variability is slightly positive and depends on the scale of observation, suggesting that biodiversity per se is important for the stability of ecosystems. Changes within community assemblages in terms of structure are, however, generally independent of biodiversity, suggesting no effect of diversity, but the potential impact of individual species, and/or environmental factors. Except for sedimentary and rock pool habitats, no relationship was observed between temporal variation of the aggregated variable of total abundances and diversity at either scale. Overall our results emphasise that relationships depend on scale of measurements, type of habitats and the marine systems (North Atlantic and Mediterranean) considered.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper presents a statistical-based fault diagnosis scheme for application to internal combustion engines. The scheme relies on an identified model that describes the relationships between a set of recorded engine variables using principal component analysis (PCA). Since combustion cycles are complex in nature and produce nonlinear relationships between the recorded engine variables, the paper proposes the use of nonlinear PCA (NLPCA). The paper further justifies the use of NLPCA by comparing the model accuracy of the NLPCA model with that of a linear PCA model. A new nonlinear variable reconstruction algorithm and bivariate scatter plots are proposed for fault isolation, following the application of NLPCA. The proposed technique allows the diagnosis of different fault types under steady-state operating conditions. More precisely, nonlinear variable reconstruction can remove the fault signature from the recorded engine data, which allows the identification and isolation of the root cause of abnormal engine behaviour. The paper shows that this can lead to (i) an enhanced identification of potential root causes of abnormal events and (ii) the masking of faulty sensor readings. The effectiveness of the enhanced NLPCA based monitoring scheme is illustrated by its application to a sensor fault and a process fault. The sensor fault relates to a drift in the fuel flow reading, whilst the process fault relates to a partial blockage of the intercooler. These faults are introduced to a Volkswagen TDI 1.9 Litre diesel engine mounted on an experimental engine test bench facility.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper points out a serious flaw in dynamic multivariate statistical process control (MSPC). The principal component analysis of a linear time series model that is employed to capture auto- and cross-correlation in recorded data may produce a considerable number of variables to be analysed. To give a dynamic representation of the data (based on variable correlation) and circumvent the production of a large time-series structure, a linear state space model is used here instead. The paper demonstrates that incorporating a state space model, the number of variables to be analysed dynamically can be considerably reduced, compared to conventional dynamic MSPC techniques.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Cooperatives have a long historical experience in the Spanish economy and have demonstrated their ability to compete against traditional firms in the market. To maintain this capability, while taking advantage of the competitive advantages associated with their idiosyncrasies as social economy enterprises, they should take into consideration that the economy is increasingly globalized and increasingly knowledge-based, especially with regards to technological content. As a consequence, the innovative capacity appears to be a key aspect in order to be able to challenge competitors. This article characterizes the innovative behavior of cooperatives in the region of Castile and Leon and analyses the internal and external factors affecting their innovative performance, based on data from a survey of 581 cooperatives. The results of the empirical analysis, which is performed by multivariate binary logistic regression on various types of innovation, lead us to identify the size of the organizations, the existence of planning, the R & D activities and the human capital as the main determining factors.