7 resultados para SAMPLE-SIZE


Relevância:

60.00% 60.00%

Publicador:

Resumo:

With Tweet volumes reaching 500 million a day, sampling is inevitable for any application using Twitter data. Realizing this, data providers such as Twitter, Gnip and Boardreader license sampled data streams priced in accordance with the sample size. Big Data applications working with sampled data would be interested in working with a large enough sample that is representative of the universal dataset. Previous work focusing on the representativeness issue has considered ensuring the global occurrence rates of key terms, be reliably estimated from the sample. Present technology allows sample size estimation in accordance with probabilistic bounds on occurrence rates for the case of uniform random sampling. In this paper, we consider the problem of further improving sample size estimates by leveraging stratification in Twitter data. We analyze our estimates through an extensive study using simulations and real-world data, establishing the superiority of our method over uniform random sampling. Our work provides the technical know-how for data providers to expand their portfolio to include stratified sampled datasets, whereas applications are benefited by being able to monitor more topics/events at the same data and computing cost.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Diabetic kidney disease (DKD) is a devastating diabetes complication, with known heritability not fully revealed by previous genetics studies. We performed the largest genome-wide association study of type 1 DKD to date, in a 13-cohort consortium of 15,590 individuals of European ancestry genotyped on the Illumina HumanCoreExome Beadchip, which allows exploration of coding variation in addition to genomic markers.

As prior work has shown that different characterizations of the DKD phenotype highlight distinct genetic associations, we investigated a spectrum of DKD definitions based on proteinuria and renal function criteria. Controls were DKD-free after a minimum of 15 years diabetes duration; cases had diabetes for at least 10 years prior to DKD diagnosis. We also performed a quantitative trait analysis of estimated glomerular filtration rate in all participants.

Our top finding was a missense mutation in COL4A3, rs55703767 (Asp326Tyr); the minor allele is common in Europeans (20%) and East Asians (13%) but not Africans (2%). This SNP had a genome-wide significant association with traditionally defined DKD (macroalbuminuria or end-stage renal disease [ESRD], (OR= 0.79, P=1.9×10-9), and a suggestive association with macroalbuminuria (OR= 0.79, P=1.6×10-6) and ESRD (OR= 0.79, P=4.5×10-5) individually. Though its PolyPhen score is 0.3 (benign), this SNP has been implicated as a splice site disruptor.

The COL4A3 gene encodes the alpha 3 subunit of Type IV collagen, the major structural component of basement membranes. Pathogenic mutations in COL4A3 have been identified in thin basement membrane nephropathy, familial focal segmental glomerulosclerosis, and Alport syndrome. A proxy (r2=0.6) for rs55703767 had no significant associations in the CKDGen consortium, suggesting its pathogenicity occurs solely in the setting of hyperglycemia.

By significantly increasing sample size we have discovered a novel locus underlying DKD risk, paving the way for better understanding of pathology, prevention, and treatment.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background
Prostate cancer is one of the most common male cancers worldwide. Active Surveillance (AS) has been developed to allow men with lower risk disease to postpone or avoid the adverse side effects associated with curative treatments until the disease progresses. Despite the medical benefits of AS, it is reported that living with untreated cancer can create a significant emotional burden for patients.

Methods/design
The aim of this study is to gain insight into the experiences of men eligible to undergo AS for favourable-risk PCa.

This study has a mixed-methods sequential explanatory design consisting of two phases: quantitative followed by qualitative. Phase 1 has a multiple point, prospective, longitudinal exploratory design. Ninety men diagnosed with favourable-risk prostate cancer will be assessed immediately post-diagnosis (baseline) and followed over a period of 12 months, in intervals of 3 month. Ninety age-matched men with no cancer diagnosis will also be recruited using peer nomination and followed up in the same 3 month intervals. Following completion of Phase 1, 10–15 AS participants who have reported both the best and worst psychological functioning will be invited to participate in semi-structured qualitative interviews. Phase 2 will facilitate further exploration of the quantitative results and obtain a richer understanding of participants’ personal interpretations of their illness and psychological wellbeing.

Discussion
To our knowledge, this is the first study to utilise early baseline measures; include a healthy comparison group; calculate sample size through power calculations; and use a mixed methods approach to gain a deeper more holistic insight into the experiences of men diagnosed with favourable-risk prostate cancer.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In this updated analysis of the EXPERT-C trial we show that, in magnetic resonance imaging-defined, high-risk, locally advanced rectal cancer, adding cetuximab to a treatment strategy with neoadjuvant CAPOX followed by chemoradiotherapy, surgery, and adjuvant CAPOX is not associated with a statistically significant improvement in progression-free survival (PFS) and overall survival (OS) in both KRAS/BRAF wild-type and unselected patients. In a retrospective biomarker analysis, TP53 was not prognostic but emerged as an independent predictive biomarker for cetuximab benefit. After a median follow-up of 65.0 months, TP53 wild-type patients (n = 69) who received cetuximab had a statistically significant better PFS (89.3% vs 65.0% at 5 years; hazard ratio [HR] = 0.23; 95% confidence interval [CI] = 0.07 to 0.78; two-sided P = .02 by Cox regression) and OS (92.7% vs 67.5% at 5 years; HR = 0.16; 95% CI = 0.04 to 0.70; two-sided P = .02 by Cox regression) than TP53 wild-type patients who were treated in the control arm. An interaction between TP53 status and cetuximab effect was found (P <.05) and remained statistically significant after adjusting for statistically significant prognostic factors and KRAS.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Background: RAS mutations predict resistance to anti-epidermal growthfactor receptor (EGFR) monoclonal antibodies in metastatic colorectal cancer. We analysed RAS mutations in 30 non-metastatic rectal cancer patients treated with or without cetuximab within the 31 EXPERT-C trial.

Methods: Ninety of 149 patients with tumours available for analysis were KRAS/BRAF wild-type, and randomly assigned to capecitabine plus oxaliplatin (CAPOX) followed by chemoradiotherapy, surgery and adjuvant CAPOX or the same regimen plus cetuximab (CAPOX-C). Of these, four had a mutation of NRAS exon 3, and 84 were retrospectively analysed for additional KRAS (exon 4) and NRAS (exons 2/4) mutations by using bi-directional Sanger sequencing. The effect of cetuximab on study end-points in the RAS wild-type population was analysed.

Results: Eleven (13%) of 84 patients initially classified as KRAS/BRAF wild-type were found to have a mutation in KRAS exon 4 (11%) or NRAS exons 2/4 (2%). Overall, 78/149 (52%) assessable patients were RAS wild-type (CAPOX, n = 40; CAPOX-C, n = 38). In this population, after a median follow-up of 63.8 months, in line with the initial analysis, the addition of cetuximab was associated with numerically higher, but not statistically significant, rates of complete response (15.8% versus 7.5%, p = 0.31), 5-year progression-free survival (75.5% versus 67.5%, hazard ratio (HR) 0.61, p = 0.25) and 5-year overall survival (83.8% versus 70%, HR 0.54, p = 0.20).

Conclusions: RAS mutations beyond KRAS exon 2 and 3 were identified in 17% of locally advanced rectal cancer patients. Given the small sample size, no definitive conclusions on the effect of additional RAS mutations on cetuximab treatment in this setting can be drawn and further investigation of RAS in larger studies is warranted.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The efficacy of tyrosine kinase (TK) inhibitors on non-cycling acute myeloid leukaemia (AML) cells, previously shown to have potent tumourigenic potential, is unknown. This pilot study describes the first attempt to characterize non-cycling cells from a small series of human FMS-like tyrosine kinase 3 (FLT3) mutation positive samples. CD34+ AML cells from patients with FLT3 mutation positive AML were cultured on murine stroma. In expansion cultures, non-cycling cells were found to retain CD34+ expression in contrast to dividing cells. Leukaemic gene rearrangements could be detected in non-cycling cells, indicating their leukaemic origin. Significantly, the FLT3-internal tandem duplication (ITD) mutation was found in the non-cycling fraction of four out of five cases. Exposure to the FLT3-directed inhibitor TKI258 clearly inhibited the growth of AML CD34+ cells in short-term cultures and colony-forming unit assays. Crucially, non-cycling cells were not eradicated, with the exception of one case, which exhibited exquisite sensitivity to the compound. Moreover, in longer-term cultures, TKI258-treated non-cycling cells showed no growth impairment compared to treatment-naive non-cycling cells. These findings suggest that non-cycling cells in AML may constitute a disease reservoir that is resistant to TK inhibition. Further studies with a larger sample size and other inhibitors are warranted.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Epidemiologically related traits may share genetic risk factors, and pleiotropic analysis could identify individual loci associated with these traits. Because of their shared epidemiological associations, we conducted pleiotropic analysis of genome-wide association studies of lung cancer (12 160 lung cancer case patients and 16 838 control subjects) and cardiovascular disease risk factors (blood lipids from 188 577 subjects, type 2 diabetes from 148 821 subjects, body mass index from 123 865 subjects, and smoking phenotypes from 74 053 subjects). We found that 6p22.1 (rs6904596, ZNF184) was associated with both lung cancer (P = 5.50x10(-6)) and blood triglycerides (P = 1.39x10(-5)). We replicated the association in 6097 lung cancer case patients and 204 657 control subjects (P = 2.40 × 10(-4)) and in 71 113 subjects with triglycerides data (P = .01). rs6904596 reached genome-wide significance in lung cancer meta-analysis (odds ratio = 1.15, 95% confidence interval = 1.10 to 1.21 ,: Pcombined = 5.20x10(-9)). The large sample size provided by the lipid GWAS data and the shared genetic risk factors between the two traits contributed to the uncovering of a hitherto unidentified genetic locus for lung cancer.