55 resultados para variable sample size
Resumo:
High-dimensional gene expression data provide a rich source of information because they capture the expression level of genes in dynamic states that reflect the biological functioning of a cell. For this reason, such data are suitable to reveal systems related properties inside a cell, e.g., in order to elucidate molecular mechanisms of complex diseases like breast or prostate cancer. However, this is not only strongly dependent on the sample size and the correlation structure of a data set, but also on the statistical hypotheses tested. Many different approaches have been developed over the years to analyze gene expression data to (I) identify changes in single genes, (II) identify changes in gene sets or pathways, and (III) identify changes in the correlation structure in pathways. In this paper, we review statistical methods for all three types of approaches, including subtypes, in the context of cancer data and provide links to software implementations and tools and address also the general problem of multiple hypotheses testing. Further, we provide recommendations for the selection of such analysis methods.
Resumo:
Purpose: In randomised clinical trials (RCTs) the selection of appropriate outcomes is crucial to the assessment of whether one intervention is better than another. The purpose of this review is to identify different clinical outcomes reported in glaucoma trials.
Methods We conducted a systematic review of glaucoma RCTs. A sample or selection of glaucoma trials were included bounded by a time frame (between 2006 and March 2012). Only studies in English language were considered. All clinical measured and reported outcomes were included. The possible variations of clinical outcomes were defined prior to data analysis. Information on reported clinical outcomes was tabulated and analysed using descriptive statistics. Other data recorded included type of intervention and glaucoma, duration of the study, defined primary outcomes, and outcomes used for sample size calculation, if nominated.
Results The search strategy identified 4323 potentially relevant abstracts. There were 315 publications retrieved, of which 233 RCTs were included. A total of 967 clinical measures were reported. There were large variations in the definitions used to describe different outcomes and their measures. Intraocular pressure was the most commonly reported outcome (used in 201 RCTs, 86%) with a total of 422 measures (44%). Safety outcomes were commonly reported in 145 RCTs (62%) whereas visual field outcomes were used in 38 RCTs (16%).
Conclusions There is a large variation in the reporting of clinical outcomes in glaucoma RCTs. This lack of standardisation may impair the ability to evaluate the evidence of glaucoma interventions.
Resumo:
Model selection between competing models is a key consideration in the discovery of prognostic multigene signatures. The use of appropriate statistical performance measures as well as verification of biological significance of the signatures is imperative to maximise the chance of external validation of the generated signatures. Current approaches in time-to-event studies often use only a single measure of performance in model selection, such as logrank test p-values, or dichotomise the follow-up times at some phase of the study to facilitate signature discovery. In this study we improve the prognostic signature discovery process through the application of the multivariate partial Cox model combined with the concordance index, hazard ratio of predictions, independence from available clinical covariates and biological enrichment as measures of signature performance. The proposed framework was applied to discover prognostic multigene signatures from early breast cancer data. The partial Cox model combined with the multiple performance measures were used in both guiding the selection of the optimal panel of prognostic genes and prediction of risk within cross validation without dichotomising the follow-up times at any stage. The signatures were successfully externally cross validated in independent breast cancer datasets, yielding a hazard ratio of 2.55 [1.44, 4.51] for the top ranking signature.
Resumo:
Pregnancy and the postpartum period is a time of increased vulnerability for retention of excess body fat in women. Breastfeeding (BF) has been shown to have many health benefits for both mother and baby; however, its role in postpartum weight management is unclear. Our aim was to systematically review and critically appraise the literature published to date in relation to the impact of BF on postpartum weight change, weight retention and maternal body composition. Electronic literature searches were carried out using MEDLINE, EMBASE, PubMed, Web of Science, BIOSIS, CINAHL and British Nursing Index. The search covered publications up to 12 June 2012 and included observational studies (prospective and retrospective) carried out in BF mothers (either exclusively or as a subgroup), who were 2 years postpartum and with a body mass index (BMI) >18.5 kg m(-2), with an outcome measure of change in weight (including weight retention) and/or body composition. Thirty-seven prospective studies and eight retrospective studies were identified that met the selection criteria; studies were stratified according to study design and outcome measure. Overall, studies were heterogeneous, particularly in relation to sample size, measurement time points and in the classification of BF and postpartum weight change. The majority of studies reported little or no association between BF and weight change (n=27, 63%) or change in body composition (n=16, 89%), although this seemed to depend on the measurement time points and BF intensity. However, of the five studies that were considered to be of high methodological quality, four studies demonstrated a positive association between BF and weight change. This systematic review highlights the difficulties of examining the association between BF and weight management in observational research. Although the available evidence challenges the widely held belief that BF promotes weight loss, more robust studies are needed to reliably assess the impact of BF on postpartum weight management.International Journal of Obesity advance online publication, 20 August 2013; doi:10.1038/ijo.2013.132.
Resumo:
This paper reports data on self-esteem collected during a large-scale randomized trial of paired reading and Duolog Maths with primary school children (The Fife Peer-Learning Project). Self-report measures were analysed to assess the effect of a range of organisational variables on measured self-esteem. The presentation reports findings from a sample of reading-only schools in the first year of the project. Statistically significant gains in self-esteem were found across a range of conditions. Patterns of scores pointed to differences between same-age and cross-age conditions, and highlighted the influence of role played (tutor v tutee). In addition to the findings summarized here, it is hoped that preliminary findings from year two of the project - with an increased sample size - will also be shared. © 2009.
Resumo:
Objective: To assess the seasonality of cardiovascular risk factors (CVRF) in a large set of population-based studies.
Methods: Cross-sectional data from 24 population-based studies from 15 countries, with a total sample size of 237 979 subjects. CVRFs included Body Mass Index (BMI) and waist circumference; systolic (SBP) and diastolic (DBP) blood pressure; total, high (HDL) and low (LDL) density lipoprotein cholesterol; triglycerides and glucose levels. Within each study, all data were adjusted for age, gender and current smoking. For blood pressure, lipids and glucose levels, further adjustments on BMI and drug treatment were performed.
Results: In the Northern and Southern Hemispheres, CVRFs levels tended to be higher in winter and lower in summer months. These patterns were observed for most studies. In the Northern Hemisphere, the estimated seasonal variations were 0.26 kg/m2 for BMI, 0.6 cm for waist circumference, 2.9 mm Hg for SBP, 1.4 mm Hg for DBP, 0.02 mmol/L for triglycerides, 0.10 mmol/L for total cholesterol, 0.01 mmol/L for HDL cholesterol, 0.11 mmol/L for LDL cholesterol, and 0.07 mmol/L for glycaemia. Similar results were obtained when the analysis was restricted to studies collecting fasting blood samples. Similar seasonal variations were found for most CVRFs in the Southern Hemisphere, with the exception of waist circumference, HDL, and LDL cholesterol.
Conclusions: CVRFs show a seasonal pattern characterised by higher levels in winter, and lower levels in summer. This pattern could contribute to the seasonality of CV mortality.
Resumo:
Objective: We explored whether readers can understand key messages without having to read the full review, and if there were differences in understanding between various types of summary.
Design: A randomised experiment of review summaries which compared understanding of a key outcome.
Participants: Members of university staff (n = 36).
Setting: Universities on the island of Ireland.
Method: The Cochrane Review chosen examines the health impacts of the use of electric fans during heat waves. Participants were asked their expectation of the effect these would have on mortality. They were then randomly assigned a summary of the review (i.e. abstract, plain language summary, podcast or podcast transcription) and asked to spend a short time reading/listening to the summary. After this they were again asked about the effects of electric fans on mortality and to indicate if they would want to read the full Review.
Main outcome measure: Correct identification of a key review outcome.
Results: Just over half (53%) of the participants identified its key message on mortality after engaging with their summary. The figures were 33% for the abstract group, 50% for both the plain language and transcript groups and 78% for the podcast group.
Conclusions: The differences between the groups were not statistically significant but suggest that the audio summary might improve knowledge transfer compared to written summaries. These findings should be explored further using a larger sample size and with other reviews.
Resumo:
Objectives
A P-value <0.05 is one metric used to evaluate the results of a randomized controlled trial (RCT). We wondered how often statistically significant results in RCTs may be lost with small changes in the numbers of outcomes.
Study Design and Setting
A review of RCTs in high-impact medical journals that reported a statistically significant result for at least one dichotomous or time-to-event outcome in the abstract. In the group with the smallest number of events, we changed the status of patients without an event to an event until the P-value exceeded 0.05. We labeled this number the Fragility Index; smaller numbers indicated a more fragile result.
Results
The 399 eligible trials had a median sample size of 682 patients (range: 15-112,604) and a median of 112 events (range: 8-5,142); 53% reported a P-value <0.01. The median Fragility Index was 8 (range: 0-109); 25% had a Fragility Index of 3 or less. In 53% of trials, the Fragility Index was less than the number of patients lost to follow-up.
Conclusion
The statistically significant results of many RCTs hinge on small numbers of events. The Fragility Index complements the P-value and helps identify less robust results.
Resumo:
Milling of plant and soil material in plastic tubes, such as microcentrifuge tubes, over-estimates carbon (C) and under-estimates nitrogen (N) concentrations due to the introduction of polypropylene into milled samples, as identified using Fourier-transform infra-red spectroscopy.
This study compares C and N concentrations of roots and soil milled in microcentrifuge tubes versus stainless steel containers, demonstrating that a longer milling time, greater milling intensity, smaller sample size and inclusion of abrasive sample material all increase polypropylene contamination from plastic tubes leading to overestimation of C concentrations by up to 8 % (0.08 g g(-1)).
Erroneous estimations of C and N, and other analytes, must be assumed after milling in plastic tubes and milling methods should be adapted to minimise such error.
Resumo:
Leptospirosis is a globally important zoonotic infection caused by spirochaetes of the genus Leptospira. It is transmitted to humans by direct contact with infected animals or indirectly via contaminated water. It is mainly a problem of the resource-poor developing countries of the tropical and sub-tropical regions of the world but outbreaks due to an increase in travel and recreational activities have been reported in developed and more industrialized areas of the world. Current methods of diagnosis are costly, time-consuming and require the use of specialized laboratory equipment and personnel. The purpose of this paper is to report the validation of the 'Leptorapide®' test (Linnodee Ltd, Northern Ireland) for the diagnosis of human leptospirosis. It is a simple one-step latex agglutination assay performed using equal volumes of serum sample and antigen-bound latex beads. Evidence of leptospiral antibodies is determined within minutes. Agglutination is scored on a scale of 1-5 and the results interpreted using a score card provided with the kit. Validation has been performed with a large sample size obtained from individuals originating from various parts of the world including Brazil and India. The test has shown sensitivity and specificity values of 97·1% and 94·0%, respectively, relative to the microscopic agglutination test. The results demonstrate that Leptorapide offers a cost-effective and accurate alternative to the more historical methods of antibody detection.
Resumo:
This work presents two new score functions based on the Bayesian Dirichlet equivalent uniform (BDeu) score for learning Bayesian network structures. They consider the sensitivity of BDeu to varying parameters of the Dirichlet prior. The scores take on the most adversary and the most beneficial priors among those within a contamination set around the symmetric one. We build these scores in such way that they are decomposable and can be computed efficiently. Because of that, they can be integrated into any state-of-the-art structure learning method that explores the space of directed acyclic graphs and allows decomposable scores. Empirical results suggest that our scores outperform the standard BDeu score in terms of the likelihood of unseen data and in terms of edge discovery with respect to the true network, at least when the training sample size is small. We discuss the relation between these new scores and the accuracy of inferred models. Moreover, our new criteria can be used to identify the amount of data after which learning is saturated, that is, additional data are of little help to improve the resulting model.
Resumo:
Retrospective clinical datasets are often characterized by a relatively small sample size and many missing data. In this case, a common way for handling the missingness consists in discarding from the analysis patients with missing covariates, further reducing the sample size. Alternatively, if the mechanism that generated the missing allows, incomplete data can be imputed on the basis of the observed data, avoiding the reduction of the sample size and allowing methods to deal with complete data later on. Moreover, methodologies for data imputation might depend on the particular purpose and might achieve better results by considering specific characteristics of the domain. The problem of missing data treatment is studied in the context of survival tree analysis for the estimation of a prognostic patient stratification. Survival tree methods usually address this problem by using surrogate splits, that is, splitting rules that use other variables yielding similar results to the original ones. Instead, our methodology consists in modeling the dependencies among the clinical variables with a Bayesian network, which is then used to perform data imputation, thus allowing the survival tree to be applied on the completed dataset. The Bayesian network is directly learned from the incomplete data using a structural expectation–maximization (EM) procedure in which the maximization step is performed with an exact anytime method, so that the only source of approximation is due to the EM formulation itself. On both simulated and real data, our proposed methodology usually outperformed several existing methods for data imputation and the imputation so obtained improved the stratification estimated by the survival tree (especially with respect to using surrogate splits).
Resumo:
We present Hubble Space Telescope (HST) rest-frame ultraviolet imaging of the host galaxies of 16 hydrogen-poor superluminous supernovae (SLSNe), including 11 events from the Pan-STARRS Medium Deep Survey. Taking advantage of the superb angular resolution of HST, we characterize the galaxies' morphological properties, sizes, and star formation rate (SFR) densities. We determine the supernova (SN) locations within the host galaxies through precise astrometric matching and measure physical and host-normalized offsets as well as the SN positions within the cumulative distribution of UV light pixel brightness. We find that the host galaxies of H-poor SLSNe are irregular, compact dwarf galaxies, with a median half-light radius of just 0.9 kpc. The UV-derived SFR densities are high ([Sigma(SFR)] similar or equal to 0.1M(circle dot) yr(-1) kpc(-1)), suggesting that SLSNe form in overdense environments. Their locations trace the UV light of their host galaxies, with a distribution intermediate between that of long-duration gamma-ray bursts (LGRBs; which are strongly clustered on the brightest regions of their hosts) and a uniform distribution (characteristic of normal core-collapse SNe), though cannot be statistically distinguished from either with the current sample size. Taken together, this strengthens the picture that SLSN progenitors require different conditions than those of ordinary core-collapse SNe to form and that they explode in broadly similar galaxies as do LGRBs. If the tendency for SLSNe to be less clustered on the brightest regions than are LGRBs is confirmed by a larger sample, this would indicate a different, potentially lower-mass progenitor for SLSNe than LRGBs.
Resumo:
In the past decade, several rapidly evolving transients have been discovered whose timescales and luminosities are not easily explained by traditional supernovae (SNe) models. The sample size of these objects has remained small due, at least in part, to the challenges of detecting short timescale transients with traditional survey cadences. Here we present the results from a search within the Pan-STARRS1 Medium Deep Survey (PS1-MDS) for rapidly evolving and luminous transients. We identify 10 new transients with a time above half-maximum (t1/2) of less than 12 days and -16.5 > M > -20 mag. This increases the number of known events in this region of SN phase space by roughly a factor of three. The median redshift of the PS1-MDS sample is z = 0.275 and they all exploded in star-forming galaxies. In general, the transients possess faster rise than decline timescale and blue colors at maximum light (gP1-rP1 ≲ -0.2). Best-fit blackbodies reveal photospheric temperatures/radii that expand/cool with time and explosion spectra taken near maximum light are dominated by a blue continuum, consistent with a hot, optically thick, ejecta. We find it difficult to reconcile the short timescale, high peak luminosity (L > 1043erg s-1), and lack of UV line blanketing observed in many of these transients with an explosion powered mainly by the radioactive decay of 56Ni. Rather, we find that many are consistent with either (1) cooling envelope emission from the explosion of a star with a low-mass extended envelope that ejected very little (<0.03 M) radioactive material, or (2) a shock breakout within a dense, optically thick, wind surrounding the progenitor star. After calculating the detection efficiency for objects with rapid timescales in the PS1-MDS we find a volumetric rate of 4800-8000 events yr-1Gpc-3(4%-7% of the core-collapse SN rate at z = 0.2).
Resumo:
Introduction: Many cancer patients experience sleeping difficulties which can persist several years after the completion of cancer treatment. Previous research suggests that acupuncture, and variants of acupuncture (acupressure, auricular therapy) may be effective treatment options for sleep disturbance. However, current evidence is limited for cancer patients.
Methods: Feasibility study with 3 arms. Seven cancer patients with insomnia randomised to receive either auricular therapy (attaching semen vaccariae seeds to ear acupoints) (n=4), self-acupressure (n=1) or no treatment (n=2). Participants assigned to receive auricular therapy or self-acupressure stimulated the acupoints each night an hour before retiring to bed. The duration of participant involvement was 5 weeks. Subjective sleep quality was measured at baseline and post-treatment using the Pittsburgh Sleep Quality Index (PSQI). The impact of treatment on concerns of importance to the participants themselves was measured using the Measure Yourself Concerns and Wellbeing (MYCaW). Each participant also completed a treatment log book.
Results: All participants completed their treatment. All auricular therapy and self-acupressure participants recorded clinically significant improvements in global PSQI scores. In the auricular therapy arm mean global PSQI reduced from 12.5 at baseline to 8 following completion of treatment. In the self-acupressure arm PSQI reduced from 15 to 11. While in the no treatment arm the mean PSQI score was 14.5 at both baseline and follow up.
Conclusions: Despite the limited sample size, both auricular therapy and self-acupressure may represent potentially effective treatments for cancer patients with insomnia. The positive findings suggest further research is warranted into both treatment modalities.