957 resultados para Multivariate analysis of variance
Resumo:
Meta-analysis of genome-wide association studies (GWASs) has led to the discoveries of many common variants associated with complex human diseases. There is a growing recognition that identifying "causal" rare variants also requires large-scale meta-analysis. The fact that association tests with rare variants are performed at the gene level rather than at the variant level poses unprecedented challenges in the meta-analysis. First, different studies may adopt different gene-level tests, so the results are not compatible. Second, gene-level tests require multivariate statistics (i.e., components of the test statistic and their covariance matrix), which are difficult to obtain. To overcome these challenges, we propose to perform gene-level tests for rare variants by combining the results of single-variant analysis (i.e., p values of association tests and effect estimates) from participating studies. This simple strategy is possible because of an insight that multivariate statistics can be recovered from single-variant statistics, together with the correlation matrix of the single-variant test statistics, which can be estimated from one of the participating studies or from a publicly available database. We show both theoretically and numerically that the proposed meta-analysis approach provides accurate control of the type I error and is as powerful as joint analysis of individual participant data. This approach accommodates any disease phenotype and any study design and produces all commonly used gene-level tests. An application to the GWAS summary results of the Genetic Investigation of ANthropometric Traits (GIANT) consortium reveals rare and low-frequency variants associated with human height. The relevant software is freely available.
Resumo:
The human brainstem is a densely packed, complex but highly organised structure. It not only serves as a conduit for long projecting axons conveying motor and sensory information, but also is the location of multiple primary nuclei that control or modulate a vast array of functions, including homeostasis, consciousness, locomotion, and reflexive and emotive behaviours. Despite its importance, both in understanding normal brain function as well as neurodegenerative processes, it remains a sparsely studied structure in the neuroimaging literature. In part, this is due to the difficulties in imaging the internal architecture of the brainstem in vivo in a reliable and repeatable fashion. A modified multivariate mixture of Gaussians (mmMoG) was applied to the problem of multichannel tissue segmentation. By using quantitative magnetisation transfer and proton density maps acquired at 3 T with 0.8 mm isotropic resolution, tissue probability maps for four distinct tissue classes within the human brainstem were created. These were compared against an ex vivo fixated human brain, imaged at 0.5 mm, with excellent anatomical correspondence. These probability maps were used within SPM8 to create accurate individual subject segmentations, which were then used for further quantitative analysis. As an example, brainstem asymmetries were assessed across 34 right-handed individuals using voxel based morphometry (VBM) and tensor based morphometry (TBM), demonstrating highly significant differences within localised regions that corresponded to motor and vocalisation networks. This method may have important implications for future research into MRI biomarkers of pre-clinical neurodegenerative diseases such as Parkinson's disease.
Resumo:
Structural equation models are widely used in economic, socialand behavioral studies to analyze linear interrelationships amongvariables, some of which may be unobservable or subject to measurementerror. Alternative estimation methods that exploit different distributionalassumptions are now available. The present paper deals with issues ofasymptotic statistical inferences, such as the evaluation of standarderrors of estimates and chi--square goodness--of--fit statistics,in the general context of mean and covariance structures. The emphasisis on drawing correct statistical inferences regardless of thedistribution of the data and the method of estimation employed. A(distribution--free) consistent estimate of $\Gamma$, the matrix ofasymptotic variances of the vector of sample second--order moments,will be used to compute robust standard errors and a robust chi--squaregoodness--of--fit squares. Simple modifications of the usual estimateof $\Gamma$ will also permit correct inferences in the case of multi--stage complex samples. We will also discuss the conditions under which,regardless of the distribution of the data, one can rely on the usual(non--robust) inferential statistics. Finally, a multivariate regressionmodel with errors--in--variables will be used to illustrate, by meansof simulated data, various theoretical aspects of the paper.
Resumo:
PURPOSE: This study was performed to determine the impact of perfusion and diffusion magnetic resonance imaging (MRI) sequences on patients during treatment of newly diagnosed glioblastoma. Special emphasis has been given to these imaging technologies as tools to potentially anticipate disease progression, as progression-free survival is frequently used as a surrogate endpoint. METHODS AND MATERIALS: Forty-one patients from a phase II temolozomide clinical trial were included. During follow-up, images were integrated 21 to 28 days after radiochemotherapy and every 2 months thereafter. Assessment of scans included measurement of size of lesion on T1 contrast-enhanced, T2, diffusion, and perfusion images, as well as mass effect. Classical criteria on tumor size variation and clinical parameters were used to set disease progression date. RESULTS: A total of 311 MRI examinations were reviewed. At disease progression (32 patients), a multivariate Cox regression determined 2 significant survival parameters: T1 largest diameter (p < 0.02) and T2 size variation (p < 0.05), whereas perfusion and diffusion were not significant. CONCLUSION: Perfusion and diffusion techniques cannot be used to anticipate tumor progression. Decision making at disease progression is critical, and classical T1 and T2 imaging remain the gold standard. Specifically, a T1 contrast enhancement over 3 cm in largest diameter together with an increased T2 hypersignal is a marker of inferior prognosis.
Resumo:
BACKGROUND: Circulating 25-hydroxyvitamin D [25(OH)D] concentration is inversely associated with peripheral arterial disease and hypertension. Vascular remodeling may play a role in this association, however, data relating vitamin D level to specific remodeling biomarkers among ESRD patients is sparse. We tested whether 25(OH)D concentration is associated with markers of vascular remodeling and inflammation in African American ESRD patients.METHODS: We conducted a cross-sectional study among ESRD patients receiving maintenance hemodialysis within Emory University-affiliated outpatient hemodialysis units. Demographic, clinical and dialysis treatment data were collected via direct patient interview and review of patients records at the time of enrollment, and each patient gave blood samples. Associations between 25(OH)D and biomarker concentrations were estimated in univariate analyses using Pearson's correlation coefficients and in multivariate analyses using linear regression models. 25(OH) D concentration was entered in multivariate linear regression models as a continuous variable and binary variable (<15 ng/ml and =15 ng/ml). Adjusted estimate concentrations of biomarkers were compared between 25(OH) D groups using analysis of variance (ANOVA). Finally, results were stratified by vascular access type.RESULTS: Among 91 patients, mean (standard deviation) 25(OH)D concentration was 18.8 (9.6) ng/ml, and was low (<15 ng/ml) in 43% of patients. In univariate analyses, low 25(OH) D was associated with lower serum calcium, higher serum phosphorus, and higher LDL concentrations. 25(OH) D concentration was inversely correlated with MMP-9 concentration (r = -0.29, p = 0.004). In multivariate analyses, MMP-9 concentration remained negatively associated with 25(OH) D concentration (P = 0.03) and anti-inflammatory IL-10 concentration positively correlated with 25(OH) D concentration (P = 0.04).CONCLUSIONS: Plasma MMP-9 and circulating 25(OH) D concentrations are significantly and inversely associated among ESRD patients. This finding may suggest a potential mechanism by which low circulating 25(OH) D functions as a cardiovascular risk factor.
Resumo:
The emergence of host-races within aphids may constitute an obstacle to pest management by means of plant resistance. There are examples of host-races within cereals aphids, but their occurrence in Rose Grain Aphid, Metopolophium dirhodum (Walker, 1849), has not been reported yet. In this work, RAPD markers were used to assess effects of the hosts and geographic distance on the genetic diversity of M. dirhodum lineages. Twenty-three clones were collected on oats and wheat in twelve localitites of southern Brazil. From twenty-seven primers tested, only four primers showed polymorphisms. Fourteen different genotypes were revealed by cluster analysis. Five genotypes were collected only on wheat; seven only on oats and two were collected in both hosts. Genetic and geographical distances among all clonal lineages were not correlated. Analysis of molecular variance showed that some molecular markers are not randomly distributed among clonal lineages collected on oats and on wheat. These results suggest the existence of host-races within M. dirhodum, which should be further investigated using a combination of ecological and genetic data.
Resumo:
We consider two fundamental properties in the analysis of two-way tables of positive data: the principle of distributional equivalence, one of the cornerstones of correspondence analysis of contingency tables, and the principle of subcompositional coherence, which forms the basis of compositional data analysis. For an analysis to be subcompositionally coherent, it suffices to analyse the ratios of the data values. The usual approach to dimension reduction in compositional data analysis is to perform principal component analysis on the logarithms of ratios, but this method does not obey the principle of distributional equivalence. We show that by introducing weights for the rows and columns, the method achieves this desirable property. This weighted log-ratio analysis is theoretically equivalent to spectral mapping , a multivariate method developed almost 30 years ago for displaying ratio-scale data from biological activity spectra. The close relationship between spectral mapping and correspondence analysis is also explained, as well as their connection with association modelling. The weighted log-ratio methodology is applied here to frequency data in linguistics and to chemical compositional data in archaeology.
Resumo:
In the analysis of multivariate categorical data, typically the analysis of questionnaire data, it is often advantageous, for substantive and technical reasons, to analyse a subset of response categories. In multiple correspondence analysis, where each category is coded as a column of an indicator matrix or row and column of Burt matrix, it is not correct to simply analyse the corresponding submatrix of data, since the whole geometric structure is different for the submatrix . A simple modification of the correspondence analysis algorithm allows the overall geometric structure of the complete data set to be retained while calculating the solution for the selected subset of points. This strategy is useful for analysing patterns of response amongst any subset of categories and relating these patterns to demographic factors, especially for studying patterns of particular responses such as missing and neutral responses. The methodology is illustrated using data from the International Social Survey Program on Family and Changing Gender Roles in 1994.
Resumo:
We consider the joint visualization of two matrices which have common rowsand columns, for example multivariate data observed at two time pointsor split accord-ing to a dichotomous variable. Methods of interest includeprincipal components analysis for interval-scaled data, or correspondenceanalysis for frequency data or ratio-scaled variables on commensuratescales. A simple result in matrix algebra shows that by setting up thematrices in a particular block format, matrix sum and difference componentscan be visualized. The case when we have more than two matrices is alsodiscussed and the methodology is applied to data from the InternationalSocial Survey Program.
Resumo:
AIM: The specific natural history of superficial soft tissue sarcomas (S-STS) has been rarely considered. We describe the clinical characteristics of a large series of S-STS (N=367) from the French Sarcoma Group (GSF-GETO) database and analyse the prognostic factors affecting outcome. METHODS: We performed univariate and multivariate analyses for overall survival (OS), metastasis-free survival (MFS) and local recurrence-free survival (LRFS). RESULTS: The median age was 59 years. Fifty-eight percent patients were female. Tumour locations were as follows: extremities, 55%; trunk wall, 35.4%; head and neck, 8% and unknown, 1.6%. Median tumour size was 3.0 cm. The most frequent tumour types were unclassified sarcoma (24.3%) and leiomyosarcoma (22.3%). Thirty-three percent of cases were grade 3. Median follow-up was 6.18 years. The 5-year OS, MFS and LRFS rates were 80.9%, 80.7% and 74.7%, respectively. Multivariate analysis retained histological type and wide resection for predicting LRFS and histological type and grade as prognostic factors of MFS. The factors influencing OS were age, histological type, grade and wide resection. STS with early invasion into but not through the underlying fascia had a significantly poorer MFS than with strict S-STS. CONCLUSION: S-STS represent a separate category characterised by a better outcome. Adequate surgery, i.e. wide resection, is essential in the management of S-STS.
Resumo:
PURPOSE: The prognostic impact of complete response (CR) achievement in multiple myeloma (MM) has been shown mostly in the context of autologous stem-cell transplantation. Other levels of response have been defined because, even with high-dose therapy, CR is a relatively rare event. The purpose of this study was to analyze the prognostic impact of very good partial response (VGPR) in patients treated with high-dose therapy. PATIENTS AND METHODS: All patients were included in the Intergroupe Francophone du Myelome 99-02 and 99-04 trials and treated with vincristine, doxorubicin, and dexamethasone (VAD) induction therapy followed by double autologous stem-cell transplantation (ASCT). Best post-ASCT response assessment was available for 802 patients. RESULTS: With a median follow-up of 67 months, median event-free survival (EFS) and 5-year EFS were 42 months and 34%, respectively, for 405 patients who achieved at least VGPR after ASCT versus 32 months and 26% in 288 patients who achieved only partial remission (P = .005). Five-year overall survival (OS) was significantly superior in patients achieving at least VGPR (74% v 61% P = .0017). In multivariate analysis, achievement of less than VGPR was an independent factor predicting shorter EFS and OS. Response to VAD had no impact on EFS and OS. The impact of VGPR achievement on EFS and OS was significant in patients with International Staging System stages 2 to 3 and for patients with poor-risk cytogenetics t(4;14) or del(17p). CONCLUSION: In the context of ASCT, achievement of at least VGPR is a simple prognostic factor that has importance in intermediate and high-risk MM and can be informative in more patients than CR.
Resumo:
The spatial variability of strongly weathered soils under sugarcane and soybean/wheat rotation was quantitatively assessed on 33 fields in two regions in São Paulo State, Brazil: Araras (15 fields with sugarcane) and Assis (11 fields with sugarcane and seven fields with soybean/wheat rotation). Statistical methods used were: nested analysis of variance (for 11 fields), semivariance analysis and analysis of variance within and between fields. Spatial levels from 50 m to several km were analyzed. Results are discussed with reference to a previously published study carried out in the surroundings of Passo Fundo (RS). Similar variability patterns were found for clay content, organic C content and cation exchange capacity. The fields studied are quite homogeneous with respect to these relatively stable soil characteristics. Spatial variability of other characteristics (resin extractable P, pH, base- and Al-saturation and also soil colour), varies with region and, or land use management. Soil management for sugarcane seems to have induced modifications to greater depths than for soybean/wheat rotation. Surface layers of soils under soybean/wheat present relatively little variation, apparently as a result of very intensive soil management. The major part of within-field variation occurs at short distances (< 50 m) in all study areas. Hence, little extra information would be gained by increasing sampling density from, say, 1/km² to 1/50 m². For many purposes, the soils in the study regions can be mapped with the same observation density, but residual variance will not be the same in all areas. Bulk sampling may help to reveal spatial patterns between 50 and 1.000 m.
Resumo:
M. Santos, G. Gold, E. Kövari, F. R. Herrmann, P. R. Hof, C. Bouras and P. Giannakopoulos (2010) Neuropathology and Applied Neurobiology36, 661-672 Neuropathological analysis of lacunes and microvascular lesions in late-onset depression Aims: Previous neuropathological studies documented that small vascular and microvascular pathology is associated with cognitive decline. More recently, we showed that thalamic and basal ganglia lacunes are associated with post-stroke depression and may affect emotional regulation. The present study examines whether this is also the case for late-onset depression. Methods: We performed a detailed analysis of small macrovascular and microvascular pathology in the post mortem brains of 38 patients with late-onset major depression (LOD) and 29 healthy elderly controls. A clinical diagnosis of LOD was established while the subjects were alive using the DSM-IV criteria. Additionally, we retrospectively reviewed all charts for the presence of clinical criteria of vascular depression. Neuropathological evaluation included bilateral semi-quantitative assessment of lacunes, deep white matter and periventricular demyelination, cortical microinfarcts and both focal and diffuse gliosis. The association between vascular burden and LOD was investigated using Fisher's exact test and univariate and multivariate logistic regression models. Results: Neither the existence of lacunes nor the presence of microvascular ischaemic lesions was related to occurrence of LOD. Similarly, there was no relationship between vascular lesion scores and LOD. This was also the case within the subgroup of LOD patients fulfilling the clinical criteria for vascular depression. Conclusions: Our results challenge the vascular depression hypothesis by showing that neither deep white matter nor periventricular demyelination is associated with LOD. In conjunction with our previous observations in stroke patients, they also imply that the impact of lacunes on mood may be significant solely in the presence of acute brain compromise.
Resumo:
BACKGROUND: Finding genes that are differentially expressed between conditions is an integral part of understanding the molecular basis of phenotypic variation. In the past decades, DNA microarrays have been used extensively to quantify the abundance of mRNA corresponding to different genes, and more recently high-throughput sequencing of cDNA (RNA-seq) has emerged as a powerful competitor. As the cost of sequencing decreases, it is conceivable that the use of RNA-seq for differential expression analysis will increase rapidly. To exploit the possibilities and address the challenges posed by this relatively new type of data, a number of software packages have been developed especially for differential expression analysis of RNA-seq data. RESULTS: We conducted an extensive comparison of eleven methods for differential expression analysis of RNA-seq data. All methods are freely available within the R framework and take as input a matrix of counts, i.e. the number of reads mapping to each genomic feature of interest in each of a number of samples. We evaluate the methods based on both simulated data and real RNA-seq data. CONCLUSIONS: Very small sample sizes, which are still common in RNA-seq experiments, impose problems for all evaluated methods and any results obtained under such conditions should be interpreted with caution. For larger sample sizes, the methods combining a variance-stabilizing transformation with the 'limma' method for differential expression analysis perform well under many different conditions, as does the nonparametric SAMseq method.
Resumo:
OBJECTIVE: To evaluate the power of various parameters of the vestibulo-ocular reflex (VOR) in detecting unilateral peripheral vestibular dysfunction and in characterizing certain inner ear pathologies. STUDY DESIGN: Prospective study of consecutive ambulatory patients presenting with acute onset of peripheral vertigo and spontaneous nystagmus. SETTING: Tertiary referral center. PATIENTS: Seventy-four patients (40 females, 34 males) and 22 normal subjects (11 females, 11 males) were included in the study. Patients were classified in three main diagnoses: vestibular neuritis: 40; viral labyrinthitis: 22; Meniere's disease: 12. METHODS: The VOR function was evaluated by standard caloric and impulse rotary tests (velocity step). A mathematical model of vestibular function was used to characterize the VOR response to rotational stimulation. The diagnostic value of the different VOR parameters was assessed by uni- and multivariable logistic regression. RESULTS: In univariable analysis, caloric asymmetry emerged as the most powerful VOR parameter in identifying unilateral vestibular deficit, with a boundary limit set at 20%. In multivariable analysis, the combination of caloric asymmetry and rotational time constant asymmetry significantly improved the discriminatory power over caloric alone (p<0.0001) and produced a detection score with a correct classification of 92.4%. In discriminating labyrinthine diseases, different combinations of the VOR parameters were obtained for each diagnosis (p<0.003) supporting that the VOR characteristics differ between the three inner ear disorders. However, the clinical usefulness of these characteristics in separating the pathologies was limited. CONCLUSION: We propose a powerful logistic model combining the indices of caloric and time constant asymmetries to detect a peripheral vestibular loss, with an accuracy of 92.4%. Based on vestibular data only, the discrimination between the different inner ear diseases is statistically possible, which supports different pathophysiologic changes in labyrinthine pathologies.