881 resultados para intrinsically multivariate prediction
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Patterns of attack for collected species of phorids are predicted using multivariate morphometrics of female Pseudacteon species and worker size distributions of parasitized fire ants, Solenopsis saevissima. The model assumes that there is a direct correlation between phorid size and the size range of the worker ant attacked, and presumes that worker sizes are a resource that is divided by sympatric phorid species to minimize joint parasitism. These results suggest that the community of sympatric Pseudacteon species on only one host species coexists by restricting the size of workers attacked, and secondarily by differing diel patterns of ovipositional activity. When we compared relative abundance of species of Pseudacteon with the size distribution of foragers of S. saevissima, our observed distribution did not differ significantly from our predicted relative abundance of females of Pseudacteon. The activity of Pseudacteon may be a factor determining forager size distributions.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
The sera of a retrospective cohort (n = 41) composed of children with well characterized cow's milk allergy collected from multiple visits were analyzed using a protein microarray system measuring four classes of immunoglobulins. The frequency of the visits, age and gender distribution reflected real situation faced by the clinicians at a pediatric reference center for food allergy in 530 Paulo, Brazil. The profiling array results have shown that total IgG and IgA share similar specificity whilst IgM and in particular IgE are distantly related. The correlation of specificity of IgE and IgA is variable amongst the patients and this relationship cannot be used to predict atopy or the onset of tolerance to milk. The array profiling technique has corroborated the clinical selection criteria for this cohort albeit it clearly suggested that 4 out of the 41 patients might have allergies other than milk origin. There was also a good correlation between the array data and ImmunoCAP results, casein in particular. By using qualitative and quantitative multivariate analysis routines it was possible to produce validated statistical models to predict with reasonable accuracy the onset of tolerance to milk proteins. If expanded to larger study groups, the array profiling in combination with the multivariate techniques show potential to improve the prognostic of milk allergic patients. (C) 2012 Elsevier B.V. All rights reserved.
Resumo:
Current methods for quality control of sugar cane are performed in extracted juice using several methodologies, often requiring appreciable time and chemicals (eventually toxic), making the methods not green and expensive. The present study proposes the use of X-ray spectrometry together with chemometric methods as an innovative and alternative technique for determining sugar cane quality parameters, specifically sucrose concentration, POL, and fiber content. Measurements in stem, leaf, and juice were performed, and those applied directly in stem provided the best results. Prediction models for sugar cane stem determinations with a single 60 s irradiation using portable X-ray fluorescence equipment allows estimating the % sucrose, % fiber, and POL simultaneously. Average relative deviations in the prediction step of around 8% are acceptable if considering that field measurements were done. These results may indicate the best period to cut a particular crop as well as for evaluating the quality of sugar cane for the sugar and alcohol industries.
Resumo:
[EN] Objective: To explore the role of Major Vault Protein (MVP) in oral cavity squamous cell carcinoma patients. Subjects and Methods: 131 consecutive patients suffering from oral cavity squamous cell carcinoma were included in the study. In the whole series, the mean follow-up for survivors was 123.11 ± 40.36 months. Patients in tumour stages I and II were referred to surgery; patients in stage III-IV to postoperative radiotherapy (mean dose = 62.13 ± 7.74 Gy in 1.8–2 Gy/fraction). MVP expression was studied by immunohistochemistry in paraffin-embedded tumour tissue. Results: MVP expression was positive in 112 patients (85.5%) and no relation was found with clinic pathological variables. MVP overexpression (those tumours with moderate or strong expression of the protein) was related to insulin-like growth factor receptor-1 (IGF-1R) expression (P = 0.014). Tumour stage of the disease was the most important prognostic factor related to survival. Tumours overexpressing MVP and IGF-1R were strongly related to poor disease-free survival (P = 0.008, Exp(B) = 2.730, CI95% (1.302-5.724)) and cause-specific survival (P = 0.014, Exp(B) = 2.570, CI95% (1.215-5.437)) in patients achieving tumour stages III-IV, in multivariate analysis. Conclusions: MVP and IGF-1R expression were related in oral squamous cell carcinoma and conferred reduced long-term survival in patients suffering from advanced stages of the disease.
Resumo:
The literature on the erosive potential of drinks and other products is summarised, and aspects of the conduct of screening tests as well as possible correlations of the erosive potential with various solution parameters are discussed. The solution parameters that have been suggested as important include pH, acid concentration (with respect to buffer capacity and concentration of undissociated acid), degree of saturation, calcium and phosphate concentrations, and inhibitors of erosion. Based on the available data, it is concluded that the dominant factor in erosion is pH. The effect of buffer capacity seems to be pH dependent. The degree of saturation probably has a non-linear relationship with erosion. While calcium at elevated concentrations is known to reduce erosion effectively, it is not known whether it is important at naturally occurring concentrations. Fluoride at naturally occurring concentrations is inversely correlated with erosive potential, but phosphate is probably not. Natural plant gums, notably pectin, do not inhibit erosion, so they are unlikely to interfere with the prediction of erosive potential. The non-linearity of some solution factors and interactions with pH need to be taken into account when developing multivariate models for predicting the erosive potential of different solutions. Finally, the erosive potential of solutions towards enamel and dentine might differ.
Resumo:
Brain tumor is one of the most aggressive types of cancer in humans, with an estimated median survival time of 12 months and only 4% of the patients surviving more than 5 years after disease diagnosis. Until recently, brain tumor prognosis has been based only on clinical information such as tumor grade and patient age, but there are reports indicating that molecular profiling of gliomas can reveal subgroups of patients with distinct survival rates. We hypothesize that coupling molecular profiling of brain tumors with clinical information might improve predictions of patient survival time and, consequently, better guide future treatment decisions. In order to evaluate this hypothesis, the general goal of this research is to build models for survival prediction of glioma patients using DNA molecular profiles (U133 Affymetrix gene expression microarrays) along with clinical information. First, a predictive Random Forest model is built for binary outcomes (i.e. short vs. long-term survival) and a small subset of genes whose expression values can be used to predict survival time is selected. Following, a new statistical methodology is developed for predicting time-to-death outcomes using Bayesian ensemble trees. Due to a large heterogeneity observed within prognostic classes obtained by the Random Forest model, prediction can be improved by relating time-to-death with gene expression profile directly. We propose a Bayesian ensemble model for survival prediction which is appropriate for high-dimensional data such as gene expression data. Our approach is based on the ensemble "sum-of-trees" model which is flexible to incorporate additive and interaction effects between genes. We specify a fully Bayesian hierarchical approach and illustrate our methodology for the CPH, Weibull, and AFT survival models. We overcome the lack of conjugacy using a latent variable formulation to model the covariate effects which decreases computation time for model fitting. Also, our proposed models provides a model-free way to select important predictive prognostic markers based on controlling false discovery rates. We compare the performance of our methods with baseline reference survival methods and apply our methodology to an unpublished data set of brain tumor survival times and gene expression data, selecting genes potentially related to the development of the disease under study. A closing discussion compares results obtained by Random Forest and Bayesian ensemble methods under the biological/clinical perspectives and highlights the statistical advantages and disadvantages of the new methodology in the context of DNA microarray data analysis.
Resumo:
BACKGROUND Clinical prognostic groupings for localised prostate cancers are imprecise, with 30-50% of patients recurring after image-guided radiotherapy or radical prostatectomy. We aimed to test combined genomic and microenvironmental indices in prostate cancer to improve risk stratification and complement clinical prognostic factors. METHODS We used DNA-based indices alone or in combination with intra-prostatic hypoxia measurements to develop four prognostic indices in 126 low-risk to intermediate-risk patients (Toronto cohort) who will receive image-guided radiotherapy. We validated these indices in two independent cohorts of 154 (Memorial Sloan Kettering Cancer Center cohort [MSKCC] cohort) and 117 (Cambridge cohort) radical prostatectomy specimens from low-risk to high-risk patients. We applied unsupervised and supervised machine learning techniques to the copy-number profiles of 126 pre-image-guided radiotherapy diagnostic biopsies to develop prognostic signatures. Our primary endpoint was the development of a set of prognostic measures capable of stratifying patients for risk of biochemical relapse 5 years after primary treatment. FINDINGS Biochemical relapse was associated with indices of tumour hypoxia, genomic instability, and genomic subtypes based on multivariate analyses. We identified four genomic subtypes for prostate cancer, which had different 5-year biochemical relapse-free survival. Genomic instability is prognostic for relapse in both image-guided radiotherapy (multivariate analysis hazard ratio [HR] 4·5 [95% CI 2·1-9·8]; p=0·00013; area under the receiver operator curve [AUC] 0·70 [95% CI 0·65-0·76]) and radical prostatectomy (4·0 [1·6-9·7]; p=0·0024; AUC 0·57 [0·52-0·61]) patients with prostate cancer, and its effect is magnified by intratumoral hypoxia (3·8 [1·2-12]; p=0·019; AUC 0·67 [0·61-0·73]). A novel 100-loci DNA signature accurately classified treatment outcome in the MSKCC low-risk to intermediate-risk cohort (multivariate analysis HR 6·1 [95% CI 2·0-19]; p=0·0015; AUC 0·74 [95% CI 0·65-0·83]). In the independent MSKCC and Cambridge cohorts, this signature identified low-risk to high-risk patients who were most likely to fail treatment within 18 months (combined cohorts multivariate analysis HR 2·9 [95% CI 1·4-6·0]; p=0·0039; AUC 0·68 [95% CI 0·63-0·73]), and was better at predicting biochemical relapse than 23 previously published RNA signatures. INTERPRETATION This is the first study of cancer outcome to integrate DNA-based and microenvironment-based failure indices to predict patient outcome. Patients exhibiting these aggressive features after biopsy should be entered into treatment intensification trials. FUNDING Movember Foundation, Prostate Cancer Canada, Ontario Institute for Cancer Research, Canadian Institute for Health Research, NIHR Cambridge Biomedical Research Centre, The University of Cambridge, Cancer Research UK, Cambridge Cancer Charity, Prostate Cancer UK, Hutchison Whampoa Limited, Terry Fox Research Institute, Princess Margaret Cancer Centre Foundation, PMH-Radiation Medicine Program Academic Enrichment Fund, Motorcycle Ride for Dad (Durham), Canadian Cancer Society.
Resumo:
OBJECTIVE Reliable tools to predict long-term outcome among patients with well compensated advanced liver disease due to chronic HCV infection are lacking. DESIGN Risk scores for mortality and for cirrhosis-related complications were constructed with Cox regression analysis in a derivation cohort and evaluated in a validation cohort, both including patients with chronic HCV infection and advanced fibrosis. RESULTS In the derivation cohort, 100/405 patients died during a median 8.1 (IQR 5.7-11.1) years of follow-up. Multivariate Cox analyses showed age (HR=1.06, 95% CI 1.04 to 1.09, p<0.001), male sex (HR=1.91, 95% CI 1.10 to 3.29, p=0.021), platelet count (HR=0.91, 95% CI 0.87 to 0.95, p<0.001) and log10 aspartate aminotransferase/alanine aminotransferase ratio (HR=1.30, 95% CI 1.12 to 1.51, p=0.001) were independently associated with mortality (C statistic=0.78, 95% CI 0.72 to 0.83). In the validation cohort, 58/296 patients with cirrhosis died during a median of 6.6 (IQR 4.4-9.0) years. Among patients with estimated 5-year mortality risks <5%, 5-10% and >10%, the observed 5-year mortality rates in the derivation cohort and validation cohort were 0.9% (95% CI 0.0 to 2.7) and 2.6% (95% CI 0.0 to 6.1), 8.1% (95% CI 1.8 to 14.4) and 8.0% (95% CI 1.3 to 14.7), 21.8% (95% CI 13.2 to 30.4) and 20.9% (95% CI 13.6 to 28.1), respectively (C statistic in validation cohort = 0.76, 95% CI 0.69 to 0.83). The risk score for cirrhosis-related complications also incorporated HCV genotype (C statistic = 0.80, 95% CI 0.76 to 0.83 in the derivation cohort; and 0.74, 95% CI 0.68 to 0.79 in the validation cohort). CONCLUSIONS Prognosis of patients with chronic HCV infection and compensated advanced liver disease can be accurately assessed with risk scores including readily available objective clinical parameters.
Resumo:
BACKGROUND Prostate cancer (PCa) is a very heterogeneous disease with respect to clinical outcome. This study explored differential DNA methylation in a priori selected genes to diagnose PCa and predict clinical failure (CF) in high-risk patients. METHODS A quantitative multiplex, methylation-specific PCR assay was developed to assess promoter methylation of the APC, CCND2, GSTP1, PTGS2 and RARB genes in formalin-fixed, paraffin-embedded tissue samples from 42 patients with benign prostatic hyperplasia and radical prostatectomy specimens of patients with high-risk PCa, encompassing training and validation cohorts of 147 and 71 patients, respectively. Log-rank tests, univariate and multivariate Cox models were used to investigate the prognostic value of the DNA methylation. RESULTS Hypermethylation of APC, CCND2, GSTP1, PTGS2 and RARB was highly cancer-specific. However, only GSTP1 methylation was significantly associated with CF in both independent high-risk PCa cohorts. Importantly, trichotomization into low, moderate and high GSTP1 methylation level subgroups was highly predictive for CF. Patients with either a low or high GSTP1 methylation level, as compared to the moderate methylation groups, were at a higher risk for CF in both the training (Hazard ratio [HR], 3.65; 95% CI, 1.65 to 8.07) and validation sets (HR, 4.27; 95% CI, 1.03 to 17.72) as well as in the combined cohort (HR, 2.74; 95% CI, 1.42 to 5.27) in multivariate analysis. CONCLUSIONS Classification of primary high-risk tumors into three subtypes based on DNA methylation can be combined with clinico-pathological parameters for a more informative risk-stratification of these PCa patients.
Resumo:
Strategies are compared for the development of a linear regression model with stochastic (multivariate normal) regressor variables and the subsequent assessment of its predictive ability. Bias and mean squared error of four estimators of predictive performance are evaluated in simulated samples of 32 population correlation matrices. Models including all of the available predictors are compared with those obtained using selected subsets. The subset selection procedures investigated include two stopping rules, C$\sb{\rm p}$ and S$\sb{\rm p}$, each combined with an 'all possible subsets' or 'forward selection' of variables. The estimators of performance utilized include parametric (MSEP$\sb{\rm m}$) and non-parametric (PRESS) assessments in the entire sample, and two data splitting estimates restricted to a random or balanced (Snee's DUPLEX) 'validation' half sample. The simulations were performed as a designed experiment, with population correlation matrices representing a broad range of data structures.^ The techniques examined for subset selection do not generally result in improved predictions relative to the full model. Approaches using 'forward selection' result in slightly smaller prediction errors and less biased estimators of predictive accuracy than 'all possible subsets' approaches but no differences are detected between the performances of C$\sb{\rm p}$ and S$\sb{\rm p}$. In every case, prediction errors of models obtained by subset selection in either of the half splits exceed those obtained using all predictors and the entire sample.^ Only the random split estimator is conditionally (on $\\beta$) unbiased, however MSEP$\sb{\rm m}$ is unbiased on average and PRESS is nearly so in unselected (fixed form) models. When subset selection techniques are used, MSEP$\sb{\rm m}$ and PRESS always underestimate prediction errors, by as much as 27 percent (on average) in small samples. Despite their bias, the mean squared errors (MSE) of these estimators are at least 30 percent less than that of the unbiased random split estimator. The DUPLEX split estimator suffers from large MSE as well as bias, and seems of little value within the context of stochastic regressor variables.^ To maximize predictive accuracy while retaining a reliable estimate of that accuracy, it is recommended that the entire sample be used for model development, and a leave-one-out statistic (e.g. PRESS) be used for assessment. ^
Resumo:
Sediment samples and hydrographic conditions were studied at 28 stations around Iceland. At these sites, Conductivity-Temperature-Depth (CTD) casts were conducted to collect hydrographic data and multicorer casts were conductd to collect data on sediment characteristics including grain size distribution, carbon and nitrogen concentration, and chloroplastic pigment concentration. A total of 14 environmental predictors were used to model sediment characteristics around Iceland on regional geographic space. For these, two approaches were used: Multivariate Adaptation Regression Splines (MARS) and randomForest regression models. RandomForest outperformed MARS in predicting grain size distribution. MARS models had a greater tendency to over- and underpredict sediment values in areas outside the environmental envelope defined by the training dataset. We provide first GIS layers on sediment characteristics around Iceland, that can be used as predictors in future models. Although models performed well, more samples, especially from the shelf areas, will be needed to improve the models in future.
Resumo:
Thesis (Master's)--University of Washington, 2016-06