13 resultados para Statistical Power
em University of Queensland eSpace - Australia
Resumo:
Deficiencies in DNA repair have been hypothesized to increase cancer risk and excess cancer incidence is a feature of inherited diseases caused by defects in DNA damage recognition and repair. We investigated, using a case-control design, whether the double-strand break repair gene polymorphisms RAD51 5' untranslated region -135 G > C, XRCC2 R188H G > A, and XRCC3 T241M C > T were associated with risk of breast or ovarian cancer in Australian women. Sample sets included 1,456 breast cancer cases and 793 age-matched controls ages under 60 years of age, 549 incident ovarian cancer cases, and 335 controls of similar age distribution. For the total sample and the subsample of Caucasian women, there were no significant differences in genotype distribution between breast cancer cases and controls or between ovarian cancer cases and combined control groups. The crude odds ratios (OR) and 95% confidence intervals (95% CI) associated with the RAD51 GC/CC genotype frequency was OR, 1.10; 95% CI, 0.80-1.41 for breast cancer and OR, 1.22; 95% CI, 0.92-1.62 for ovarian cancer. Similarly, there were no increased risks associated with the XRCC2 GA/AA genotype (OR, 0.98; 95% CI, 0.76-1.26 for breast cancer and OR, 0.93; 95% CI, 0.69-1.25 for ovarian cancer) or the XRCC3 CT/TT genotype (OR, 0.92; 95% Cl, 0.77-1.10 for breast cancer and OR, 0.87; 95% CI, 0.71-1.08 for ovarian cancer). Results were little changed after adjustment for age and other measured risk factors. Although there was little statistical power to detect modest increases in risk for the homozygote variant genotypes, particularly for the rare RAD51 and XRCC2 variants, the data suggest that none of these variants play a major role in the etiology of breast or ovarian cancer.
Resumo:
Testing for simultaneous vicariance across comparative phylogeographic data sets is a notoriously difficult problem hindered by mutational variance, the coalescent variance, and variability across pairs of sister taxa in parameters that affect genetic divergence. We simulate vicariance to characterize the behaviour of several commonly used summary statistics across a range of divergence times, and to characterize this behaviour in comparative phylogeographic datasets having multiple taxon-pairs. We found Tajima's D to be relatively uncorrelated with other summary statistics across divergence times, and using simple hypothesis testing of simultaneous vicariance given variable population sizes, we counter-intuitively found that the variance across taxon pairs in Nei and Li's net nucleotide divergence (pi(net)), a common measure of population divergence, is often inferior to using the variance in Tajima's D across taxon pairs as a test statistic to distinguish ancient simultaneous vicariance from variable vicariance histories. The opposite and more intuitive pattern is found for testing more recent simultaneous vicariance, and overall we found that depending on the timing of vicariance, one of these two test statistics can achieve high statistical power for rejecting simultaneous vicariance, given a reasonable number of intron loci (> 5 loci, 400 bp) and a range of conditions. These results suggest that components of these two composite summary statistics should be used in future simulation-based methods which can simultaneously use a pool of summary statistics to test comparative the phylogeographic hypotheses we consider here.
Resumo:
We often need to estimate the size of wild populations to determine the appropriate management action, for example, to set a harvest quota. Monitoring is usually planned under the assumption that it must be carried out at fixed intervals in time, typically annually, before the harvest quota is set. However, monitoring can be very expensive, and we should weigh the cost of monitoring against the improvement that it makes in decision making. A less costly alternative to monitoring annually is to predict the population size using a population model and information from previous surveys. In this paper, the problem of monitoring frequency is posed within a decision-theory framework. We discover that a monitoring regime that varies according to the state of the system call outperform fixed-interval monitoring This idea is illustrated using data for a red kangaroo (Macropits rufus) population in South Australia. Whether or not one should monitor in a given year is dependent on the estimated population density in the previous year, the uncertainty in that population estimate, and past rainfall. We discover that monitoring is-important when a model-based prediction of population density is very uncertain. This may occur if monitoring has not taken place for several years, or if rainfall has been above average. Monitoring is also important when prior information suggests that the population is near a critical threshold in population abundance. However, monitoring is less important when the optimal management action would not be altered by new information.
Resumo:
Objective: To devise more-effective physical activity interventions, the mediating mechanisms yielding behavioral change need to be identified. The Baron-Kenny method is most commonly used. but has low statistical power and May not identify mechanisms of behavioral change in small-to-medium size Studies. More powerful statistical tests are available, Study Design and Setting: Inactive adults (N = 52) were randomized to either a print or a print-plus-telephone intervention. Walking and exercise-related social support Were assessed at baseline, after file intervention, and 4 weeks later. The Baron-Kenny and three alternative methods of mediational analysis (Freedman-Schatzkin; MacKinnon et al.: bootstrap method) were used to examine the effects of social support on initial behavior change and maintenance. Results: A significant mediational effect of social support on initial behavior change was indicated by the MacKinnon et al., bootstrap. and. marginally. Freedman-Schatzkin methods, but not by the Baron-Kenny method. No significant mediational effecl of social support on maintenance of walking was found. Conclusions: Methodologically rigorous intervention studies to identify mediators of change in physical activity are costly and labor intensive, and may not be feasible with large samples. The Use of statistically powerful tests of mediational effects in small-scale studies can inform the development of more effective interventions. (C) 2006 Elsevier Inc. All rights reserved.
Resumo:
Effective detection of population trend is crucial for managing threatened species. Little theory exists, however, to assist managers in choosing the most cost-effective monitoring techniques for diagnosing trend. We present a framework for determining the optimal monitoring strategy by simulating a manager collecting data on a declining species, the Chestnut-rumped Hylacola (Hylacola pyrrhopygia parkeri), to determine whether the species should be listed under the IUCN (World Conservation Union) Red List. We compared the efficiencies of two strategies for detecting trend, abundance, and presence-absence surveys, underfinancial constraints. One might expect the abundance surveys to be superior under all circumstances because more information is collected at each site. Nevertheless, the presence-absence data can be collected at more sites because the surveyor is not obliged to spend a fixed amount of time at each site. The optimal strategy for monitoring was very dependent on the budget available. Under some circumstances, presence-absence surveys outperformed abundance surveys for diagnosing the IUCN Red List categories cost-effectively. Abundance surveys were best if the species was expected to be recorded more than 16 times/year; otherwise, presence-absence surveys were best. The relationship between the strategies we investigated is likely to be relevant for many comparisons of presence-absence or abundance data. Managers of any cryptic or low-density species who hope to maximize their success of estimating trend should find an application for our results.
Resumo:
This study has three main objectives. First, it develops a generalization of the commonly used EKS method to multilateral price comparisons. It is shown that the EKS system can be generalized so that weights can be attached to each of the link comparisons used in the EKS computations. These weights can account for differing levels of reliability of the underlying binary comparisons. Second, various reliability measures and corresponding weighting schemes are presented and their merits discussed. Finally, these new methods are applied to an international data set of manufacturing prices from the ICOP project. Although theoretically superior, it appears that the empirical impact of the weighted EKS method is generally small compared to the unweighted EKS. It is also found that this impact is larger when it is applied at lower levels of aggregation. Finally, the importance of using sector specific PPPs in assessing relative levels of manufacturing productivity is indicated.
Resumo:
Genetic assignment methods use genotype likelihoods to draw inference about where individuals were or were not born, potentially allowing direct, real-time estimates of dispersal. We used simulated data sets to test the power and accuracy of Monte Carlo resampling methods in generating statistical thresholds for identifying F-0 immigrants in populations with ongoing gene flow, and hence for providing direct, real-time estimates of migration rates. The identification of accurate critical values required that resampling methods preserved the linkage disequilibrium deriving from recent generations of immigrants and reflected the sampling variance present in the data set being analysed. A novel Monte Carlo resampling method taking into account these aspects was proposed and its efficiency was evaluated. Power and error were relatively insensitive to the frequency assumed for missing alleles. Power to identify F-0 immigrants was improved by using large sample size (up to about 50 individuals) and by sampling all populations from which migrants may have originated. A combination of plotting genotype likelihoods and calculating mean genotype likelihood ratios (D-LR) appeared to be an effective way to predict whether F-0 immigrants could be identified for a particular pair of populations using a given set of markers.
Resumo:
The distributions of eyes-closed resting electroencephalography (EEG) power spectra and their residuals were described and compared using classically averaged and adaptively aligned averaged spectra. Four minutes of eyes-closed resting EEG was available from 69 participants. Spectra were calculated with 0.5-Hz resolution and were analyzed at this level. It was shown that power in the individual 0.5 Hz frequency bins can be considered normally distributed when as few as three or four 2-second epochs of EEG are used in the average. A similar result holds for the residuals. Power at the peak Alpha frequency has quite different statistical behaviour to power at other frequencies and it is considered that power at peak Alpha represents a relatively individuated process that is best measured through aligned averaging. Previous analyses of contrasts in upper and lower alpha bands may be explained in terms of the variability or distribution of the peak Alpha frequency itself.
Resumo:
Background The identification and characterization of genes that influence the risk of common, complex multifactorial disease primarily through interactions with other genes and environmental factors remains a statistical and computational challenge in genetic epidemiology. We have previously introduced a genetic programming optimized neural network (GPNN) as a method for optimizing the architecture of a neural network to improve the identification of gene combinations associated with disease risk. The goal of this study was to evaluate the power of GPNN for identifying high-order gene-gene interactions. We were also interested in applying GPNN to a real data analysis in Parkinson's disease. Results We show that GPNN has high power to detect even relatively small genetic effects (2–3% heritability) in simulated data models involving two and three locus interactions. The limits of detection were reached under conditions with very small heritability (
Resumo:
Background: The identification and characterization of genes that influence the risk of common, complex multifactorial disease primarily through interactions with other genes and environmental factors remains a statistical and computational challenge in genetic epidemiology. We have previously introduced a genetic programming optimized neural network (GPNN) as a method for optimizing the architecture of a neural network to improve the identification of gene combinations associated with disease risk. The goal of this study was to evaluate the power of GPNN for identifying high-order gene-gene interactions. We were also interested in applying GPNN to a real data analysis in Parkinson's disease. Results: We show that GPNN has high power to detect even relatively small genetic effects (2-3% heritability) in simulated data models involving two and three locus interactions. The limits of detection were reached under conditions with very small heritability (
Resumo:
We consider the problems of computing the power and exponential moments EXs and EetX of square Gaussian random matrices X=A+BWC for positive integer s and real t, where W is a standard normal random vector and A, B, C are appropriately dimensioned constant matrices. We solve the problems by a matrix product scalarization technique and interpret the solutions in system-theoretic terms. The results of the paper are applicable to Bayesian prediction in multivariate autoregressive time series and mean-reverting diffusion processes.