945 resultados para Number Crunch


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Estimation of the number of mixture components (k) is an unsolved problem. Available methods for estimation of k include bootstrapping the likelihood ratio test statistics and optimizing a variety of validity functionals such as AIC, BIC/MDL, and ICOMP. We investigate the minimization of distance between fitted mixture model and the true density as a method for estimating k. The distances considered are Kullback-Leibler (KL) and “L sub 2”. We estimate these distances using cross validation. A reliable estimate of k is obtained by voting of B estimates of k corresponding to B cross validation estimates of distance. This estimation methods with KL distance is very similar to Monte Carlo cross validated likelihood methods discussed by Smyth (2000). With focus on univariate normal mixtures, we present simulation studies that compare the cross validated distance method with AIC, BIC/MDL, and ICOMP. We also apply the cross validation estimate of distance approach along with AIC, BIC/MDL and ICOMP approach, to data from an osteoporosis drug trial in order to find groups that differentially respond to treatment.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Submicroscopic changes in chromosomal DNA copy number dosage are common and have been implicated in many heritable diseases and cancers. Recent high-throughput technologies have a resolution that permits the detection of segmental changes in DNA copy number that span thousands of basepairs across the genome. Genome-wide association studies (GWAS) may simultaneously screen for copy number-phenotype and SNP-phenotype associations as part of the analytic strategy. However, genome-wide array analyses are particularly susceptible to batch effects as the logistics of preparing DNA and processing thousands of arrays often involves multiple laboratories and technicians, or changes over calendar time to the reagents and laboratory equipment. Failure to adjust for batch effects can lead to incorrect inference and requires inefficient post-hoc quality control procedures that exclude regions that are associated with batch. Our work extends previous model-based approaches for copy number estimation by explicitly modeling batch effects and using shrinkage to improve locus-specific estimates of copy number uncertainty. Key features of this approach include the use of diallelic genotype calls from experimental data to estimate batch- and locus-specific parameters of background and signal without the requirement of training data. We illustrate these ideas using a study of bipolar disease and a study of chromosome 21 trisomy. The former has batch effects that dominate much of the observed variation in quantile-normalized intensities, while the latter illustrates the robustness of our approach to datasets where as many as 25% of the samples have altered copy number. Locus-specific estimates of copy number can be plotted on the copy-number scale to investigate mosaicism and guide the choice of appropriate downstream approaches for smoothing the copy number as a function of physical position. The software is open source and implemented in the R package CRLMM available at Bioconductor (http:www.bioconductor.org).

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Simulation-based assessment is a popular and frequently necessary approach to evaluation of statistical procedures. Sometimes overlooked is the ability to take advantage of underlying mathematical relations and we focus on this aspect. We show how to take advantage of large-sample theory when conducting a simulation using the analysis of genomic data as a motivating example. The approach uses convergence results to provide an approximation to smaller-sample results, results that are available only by simulation. We consider evaluating and comparing a variety of ranking-based methods for identifying the most highly associated SNPs in a genome-wide association study, derive integral equation representations of the pre-posterior distribution of percentiles produced by three ranking methods, and provide examples comparing performance. These results are of interest in their own right and set the framework for a more extensive set of comparisons.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Amplifications and deletions of chromosomal DNA, as well as copy-neutral loss of heterozygosity have been associated with diseases processes. High-throughput single nucleotide polymorphism (SNP) arrays are useful for making genome-wide estimates of copy number and genotype calls. Because neighboring SNPs in high throughput SNP arrays are likely to have dependent copy number and genotype due to the underlying haplotype structure and linkage disequilibrium, hidden Markov models (HMM) may be useful for improving genotype calls and copy number estimates that do not incorporate information from nearby SNPs. We improve previous approaches that utilize a HMM framework for inference in high throughput SNP arrays by integrating copy number, genotype calls, and the corresponding confidence scores when available. Using simulated data, we demonstrate how confidence scores control smoothing in a probabilistic framework. Software for fitting HMMs to SNP array data is available in the R package ICE.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Genotyping platforms such as Affymetrix can be used to assess genotype-phenotype as well as copy number-phenotype associations at millions of markers. While genotyping algorithms are largely concordant when assessed on HapMap samples, tools to assess copy number changes are more variable and often discordant. One explanation for the discordance is that copy number estimates are susceptible to systematic differences between groups of samples that were processed at different times or by different labs. Analysis algorithms that do not adjust for batch effects are prone to spurious measures of association. The R package crlmm implements a multilevel model that adjusts for batch effects and provides allele-specific estimates of copy number. This paper illustrates a workflow for the estimation of allele-specific copy number, develops markerand study-level summaries of batch effects, and demonstrates how the marker-level estimates can be integrated with complimentary Bioconductor software for inferring regions of copy number gain or loss. All analyses are performed in the statistical environment R. A compendium for reproducing the analysis is available from the author’s website (http://www.biostat.jhsph.edu/~rscharpf/crlmmCompendium/index.html).

Relevância:

20.00% 20.00%

Publicador:

Resumo:

PURPOSE: To identify groups of early breast cancer patients with substantial risk (10-year risk > 20%) for locoregional failure (LRF) who might benefit from postmastectomy radiotherapy (RT). PATIENTS AND METHODS: Prognostic factors for LRF were evaluated among 6,660 patients (2,588 node-negative patients, 4,072 node-positive patients) in International Breast Cancer Study Group Trials I to IX treated with chemotherapy and/or endocrine therapy, and observed for a median of 14 years. In total, 1,251 LRFs were detected. All patients were treated with mastectomy without RT. RESULTS: No group with 10-year LRF risk exceeding 20% was found among patients with node-negative disease. Among patients with node-positive breast cancer, increasing numbers of uninvolved nodes were significantly associated with decreased risk of LRF, even after adjustment for other prognostic factors. The highest quartile of uninvolved nodes was compared with the lowest quartile. Among premenopausal patients, LRF risk was decreased by 35% (P = .0010); among postmenopausal patients, LRF risk was decreased by 46% (P < .0001). The 10-year cumulative incidence of LRF was 20% among patients with one to three involved lymph nodes and fewer than 10 uninvolved nodes. Age younger than 40 years and vessel invasion were also associated significantly with increased risk. Among patients with node-positive disease, overall survival was significantly greater in those with higher numbers of uninvolved nodes examined (P < .0001). CONCLUSION: Patients with one to three involved nodes and a low number of uninvolved nodes, vessel invasion, or young age have an increased risk of LRF and may be candidates for a similar treatment as those with at least four lymph node metastases.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Voluntary exercise (VE) has a beneficial influence on the heart and mean lifespan. The present study evaluates structural adaptations of cardiomyocytes and their mitochondria due to VE by new, unbiased stereological methods. Female, 7-9-week-old mice were randomly assigned to a control (CG, n = 7) or VE group (EG, n = 7). EG animals were housed in cages with free access to a running wheel and had a mean running distance of 6.7 (1.8) km per day. After 4 weeks, the hearts of all mice were processed for light and electron microscopy. We estimated the number and volume of cardiomyocytes by the disector method and the number and volume of mitochondria by estimation of the Euler number. In comparison to CG, VE did not have an effect on the myocardial volume of the left ventricle (CG: 93 (10), EG: 103 (17) (mm(3))), the number of cardiomyocytes (CG: 2.81 (0.27), EG: 2.82 (0.43) (x10(6))) and their number-weighted mean volume. However, the composition of the cardiomyocytes changed due to VE. The total volume of mitochondria (CG: 21.8 (4.9), EG: 32.2 (4.3) (mm(3)), P < 0.01) and the total number (CG: 3.76 (0.44), EG: 7.02 (1.13) (x10(10)), P < 0.001) were significantly higher in EG than in CG. The mean number-weighted mitochondrial volume was smaller in EG than in CG (P < 0.05). In summary, VE does not alter ventricular volume nor cardiomyocyte volume or number but the oxidative capacity of cardiomyocytes by an increased mitochondrial number and total volume in the left ventricle. These structural changes may participate in the beneficial effects of VE.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Mothers should adjust the size of propagules to the selective forces to which these offspring will be exposed. Usually, a larger propagule size is favored when young are exposed to high mortality risk or conspecific competition. Here we test 2 predictions on how egg size should vary with these selective agents. When offspring are cared for by parents and/or alloparents, protection may reduce the predation risk to young, which may allow mothers to invest less per single offspring. In the cooperatively breeding cichlid Neolamprologus pulcher, brood care helpers protect group offspring and reduce the latters' mortality rate. Therefore, females are expected to reduce their investment per egg when more helpers are present. In a first experiment, we tested this prediction by manipulating the helper number. In N. pulcher, helpers compete for dispersal opportunities with similar-sized individuals of neighboring groups. If the expected future competition pressure on young is high, females should increase their investment per offspring to give them a head start. In a second experiment, we tested whether females produce larger eggs when perceived neighbor density is high. Females indeed reduced egg size with increasing helper number. However, we did not detect an effect of local density on egg size, although females took longer to produce the next clutch when local density was high. We argue that females can use the energy saved by adjusting egg size to reduced predation risk to enhance future reproductive output. Adaptive adjustment of offspring size to helper number may be an important, as yet unrecognized, strategy of cooperative breeders.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this report we will investigate the effect of negative energy density in a classic Friedmann cosmology. Although never measured and possibly unphysical, the evolution of a Universe containing a significant cosmological abundance of any of a number of hypothetical stable negative energy components is explored. These negative energy (Ω < 0) forms include negative phantom energy (w<-1), negative cosmological constant (w=-1), negative domain walls (w=-2/3), negative cosmic strings (w= -1/3), negative mass (w=0), negative radiation (w=1/3), and negative ultra-light (w > 1/3). Assuming that such universe components generate pressures as perfect fluids, the attractive or repulsive nature of each negative energy component is reviewed. The Friedmann equations can only be balanced when negative energies are coupled to a greater magnitude of positive energy or positive curvature, and minimal cases of both of these are reviewed. The future and fate of such universes in terms of curvature, temperature, acceleration, and energy density are reviewed including endings categorized as a Big Crunch, Big Void, or Big Rip and further qualified as "Warped", "Curved", or "Flat", "Hot" versus "Cold", "Accelerating" versus" Decelerating" versus "Coasting". A universe that ends by contracting to zero energy density is termed a Big Poof. Which contracting universes ``bounce" in expansion and which expanding universes ``turnover" into contraction are also reviewed. The name by which the ending of the Universe is mentioned is our own nomenclature.