38 resultados para Dirichlet Regression compositional model.
em Université de Lausanne, Switzerland
Resumo:
Uncertainty quantification of petroleum reservoir models is one of the present challenges, which is usually approached with a wide range of geostatistical tools linked with statistical optimisation or/and inference algorithms. Recent advances in machine learning offer a novel approach to model spatial distribution of petrophysical properties in complex reservoirs alternative to geostatistics. The approach is based of semisupervised learning, which handles both ?labelled? observed data and ?unlabelled? data, which have no measured value but describe prior knowledge and other relevant data in forms of manifolds in the input space where the modelled property is continuous. Proposed semi-supervised Support Vector Regression (SVR) model has demonstrated its capability to represent realistic geological features and describe stochastic variability and non-uniqueness of spatial properties. On the other hand, it is able to capture and preserve key spatial dependencies such as connectivity of high permeability geo-bodies, which is often difficult in contemporary petroleum reservoir studies. Semi-supervised SVR as a data driven algorithm is designed to integrate various kind of conditioning information and learn dependences from it. The semi-supervised SVR model is able to balance signal/noise levels and control the prior belief in available data. In this work, stochastic semi-supervised SVR geomodel is integrated into Bayesian framework to quantify uncertainty of reservoir production with multiple models fitted to past dynamic observations (production history). Multiple history matched models are obtained using stochastic sampling and/or MCMC-based inference algorithms, which evaluate posterior probability distribution. Uncertainty of the model is described by posterior probability of the model parameters that represent key geological properties: spatial correlation size, continuity strength, smoothness/variability of spatial property distribution. The developed approach is illustrated with a fluvial reservoir case. The resulting probabilistic production forecasts are described by uncertainty envelopes. The paper compares the performance of the models with different combinations of unknown parameters and discusses sensitivity issues.
Resumo:
Uncertainty quantification of petroleum reservoir models is one of the present challenges, which is usually approached with a wide range of geostatistical tools linked with statistical optimisation or/and inference algorithms. The paper considers a data driven approach in modelling uncertainty in spatial predictions. Proposed semi-supervised Support Vector Regression (SVR) model has demonstrated its capability to represent realistic features and describe stochastic variability and non-uniqueness of spatial properties. It is able to capture and preserve key spatial dependencies such as connectivity, which is often difficult to achieve with two-point geostatistical models. Semi-supervised SVR is designed to integrate various kinds of conditioning data and learn dependences from them. A stochastic semi-supervised SVR model is integrated into a Bayesian framework to quantify uncertainty with multiple models fitted to dynamic observations. The developed approach is illustrated with a reservoir case study. The resulting probabilistic production forecasts are described by uncertainty envelopes.
Resumo:
Fanconi anemia (FA) is a genetically heterogeneous chromosome instability syndrome associated with congenital abnormalities, bone marrow failure, and cancer predisposition. Eight FA proteins form a nuclear core complex, which promotes tolerance of DNA lesions in S phase, but the underlying mechanisms are still elusive. We reported recently that the FA core complex protein FANCM can translocate Holliday junctions. Here we show that FANCM promotes reversal of model replication forks via concerted displacement and annealing of the nascent and parental DNA strands. Fork reversal by FANCM also occurs when the lagging strand template is partially single-stranded and bound by RPA. The combined fork reversal and branch migration activities of FANCM lead to extensive regression of model replication forks. These observations provide evidence that FANCM can remodel replication fork structures and suggest a mechanism by which FANCM could promote DNA damage tolerance in S phase
Resumo:
The present study constitutes an investigation of tobacco consumption, related attitudes and individual differences in smoking or non-smoking behaviors in a sample of adolescents of different ages in the French-speaking part of Switzerland. We investigated three school-age groups (7th-grade, 9th-grade, and the second-year of high school) for differences in attitude and social and cognitive dimensions. We present both descriptive and inferential statistics. On an inferential level, we present a binary logistic regression-based model predicting risk of smoking. The resulting model most importantly suggests a strong relationship between smoking and alcohol consumption (both regular and sporadic). We interpret this result in terms of both the impact of the actual campaigns and the cognitive processes associated with adolescence.
Resumo:
BACKGROUND: Adrenal insufficiency is a rare and potentially lethal disease if untreated. Several clinical signs and biological markers are associated with glucocorticoid failure but the importance of these factors for diagnosing adrenal insufficiency is not known. In this study, we aimed to assess the prevalence of and the factors associated with adrenal insufficiency among patients admitted to an acute internal medicine ward. METHODS: Retrospective, case-control study including all patients with high-dose (250 μg) ACTH-stimulation tests for suspected adrenal insufficiency performed between 2008 and 2010 in an acute internal medicine ward (n = 281). Cortisol values <550 nmol/l upon ACTH-stimulation test were considered diagnostic for adrenal insufficiency. Area under the ROC curve (AROC), sensitivity, specificity, negative and positive predictive values for adrenal insufficiency were assessed for thirteen symptoms, signs and biological variables. RESULTS: 32 patients (11.4%) presented adrenal insufficiency; the others served as controls. Among all clinical and biological parameters studied, history of glucocorticoid withdrawal was the only independent factor significantly associated with patients with adrenal insufficiency (Odds Ratio: 6.71, 95% CI: 3.08 -14.62). Using a logistic regression, a model with four significant and independent variable was obtained, regrouping history of glucocorticoid withdrawal (OR 7.38, 95% CI [3.18 ; 17.11], p-value <0.001), nausea (OR 3.37, 95% CI [1.03 ; 11.00], p-value 0.044), eosinophilia (OR 17.6, 95% CI [1.02; 302.3], p-value 0.048) and hyperkalemia (OR 2.41, 95% CI [0.87; 6.69], p-value 0.092). The AROC (95% CI) was 0.75 (0.70; 0.80) for this model, with 6.3 (0.8 - 20.8) for sensitivity and 99.2 (97.1 - 99.9) for specificity. CONCLUSIONS: 11.4% of patients with suspected adrenal insufficient admitted to acute medical ward actually do present with adrenal insufficiency, defined by an abnormal response to high-dose (250 μg) ACTH-stimulation test. A history of glucocorticoid withdrawal was the strongest factor predicting the potential adrenal failure. The combination of a history of glucocorticoid withdrawal, nausea, eosinophilia and hyperkaliemia might be of interest to suspect adrenal insufficiency.
Resumo:
Aims: To describe the drinking patterns and their baseline predictive factors during a 12-month period after an initial evaluation for alcohol treatment. Methods CONTROL is a single-center, prospective, observational study evaluating consecutive alcohol-dependent patients. Using a curve clustering methodology based on a polynomial regression mixture model, we identified three clusters of patients with dominant alcohol use patterns described as mostly abstainers, mostly moderate drinkers and mostly heavy drinkers. Multinomial logistic regression analysis was used to identify baseline factors (socio-demographic, alcohol dependence consequences and related factors) predictive of belonging to each drinking cluster. ResultsThe sample included 143 alcohol-dependent adults (63.6% males), mean age 44.6 ± 11.8 years. The clustering method identified 47 (32.9%) mostly abstainers, 56 (39.2%) mostly moderate drinkers and 40 (28.0%) mostly heavy drinkers. Multivariate analyses indicated that mild or severe depression at baseline predicted belonging to the mostly moderate drinkers cluster during follow-up (relative risk ratio (RRR) 2.42, CI [1.02-5.73, P = 0.045] P = 0.045), while living alone (RRR 2.78, CI [1.03-7.50], P = 0.044) and reporting more alcohol-related consequences (RRR 1.03, CI [1.01-1.05], P = 0.004) predicted belonging to the mostly heavy drinkers cluster during follow-up. Conclusion In this sample, the drinking patterns of alcohol-dependent patients were predicted by baseline factors, i.e. depression, living alone or alcohol-related consequences and findings that may inform clinicians about the likely drinking patterns of their alcohol-dependent patient over the year following the initial evaluation for alcohol treatment.
Resumo:
ABSTRACT: BACKGROUND: The relationship between body mass index (BMI) and socioeconomic status (SES) tends to change over time and across populations. In this study, we examined, separately in men and women, whether the association between BMI and SES changed over successive birth cohorts in the Seychelles (Indian Ocean, African region). METHODS: We used data from all participants in three surveys conducted in 1989, 1994 and 2004 in independent random samples of the population aged 25-64 years in the Seychelles (N= 3'403). We used linear regression to model mean BMI according to age, cohort, SES and smoking status, allowing for a quadratic term for age to account for a curvilinear relation between BMI and age and interactions between SES and age and between SES and cohorts to test whether the relation between SES and BMI changed across subsequent cohorts. All analyses were performed separately in men and women. RESULTS: BMI increased with age in all birth cohorts. BMI was lower in men of low SES than high SES but was higher in women of low SES than high SES. In all SES categories, BMI increased over successive cohorts (1.24 kg/m2 in men and 1.51 kg/m2 for a 10-year increase in birth cohorts, p <0.001). The difference in BMI between men or women of high vs. low SES did not change significantly across successive cohorts (the interaction between SES and year of birth of cohort was statistically not significant). Smoking was associated with lower BMI in men and women (respectively -1.55 kg/m2 and 2.46 kg/m2, p <0.001). CONCLUSIONS: Although large differences exist between men and women, social patterning of BMI did not change significantly over successive cohorts in this population of a middle-income country in the African region.
Resumo:
The predictive potential of six selected factors was assessed in 72 patients with primary myelodysplastic syndrome using univariate and multivariate logistic regression analysis of survival at 18 months. Factors were age (above median of 69 years), dysplastic features in the three myeloid bone marrow cell lineages, presence of chromosome defects, all metaphases abnormal, double or complex chromosome defects (C23), and a Bournemouth score of 2, 3, or 4 (B234). In the multivariate approach, B234 and C23 proved to be significantly associated with a reduction in the survival probability. The similarity of the regression coefficients associated with these two factors means that they have about the same weight. Consequently, the model was simplified by counting the number of factors (0, 1, or 2) present in each patient, thus generating a scoring system called the Lausanne-Bournemouth score (LB score). The LB score combines the well-recognized and easy-to-use Bournemouth score (B score) with the chromosome defect complexity, C23 constituting an additional indicator of patient outcome. The predicted risk of death within 18 months calculated from the model is as follows: 7.1% (confidence interval: 1.7-24.8) for patients with an LB score of 0, 60.1% (44.7-73.8) for an LB score of 1, and 96.8% (84.5-99.4) for an LB score of 2. The scoring system presented here has several interesting features. The LB score may improve the predictive value of the B score, as it is able to recognize two prognostic groups in the intermediate risk category of patients with B scores of 2 or 3. It has also the ability to identify two distinct prognostic subclasses among RAEB and possibly CMML patients. In addition to its above-described usefulness in the prognostic evaluation, the LB score may bring new insights into the understanding of evolution patterns in MDS. We used the combination of the B score and chromosome complexity to define four classes which may be considered four possible states of myelodysplasia and which describe two distinct evolutional pathways.
Resumo:
Neuroblastoma (NB) is a neural crest-derived childhood tumor characterized by a remarkable phenotypic diversity, ranging from spontaneous regression to fatal metastatic disease. Although the cancer stem cell (CSC) model provides a trail to characterize the cells responsible for tumor onset, the NB tumor-initiating cell (TIC) has not been identified. In this study, the relevance of the CSC model in NB was investigated by taking advantage of typical functional stem cell characteristics. A predictive association was established between self-renewal, as assessed by serial sphere formation, and clinical aggressiveness in primary tumors. Moreover, cell subsets gradually selected during serial sphere culture harbored increased in vivo tumorigenicity, only highlighted in an orthotopic microenvironment. A microarray time course analysis of serial spheres passages from metastatic cells allowed us to specifically "profile" the NB stem cell-like phenotype and to identify CD133, ABC transporter, and WNT and NOTCH genes as spheres markers. On the basis of combined sphere markers expression, at least two distinct tumorigenic cell subpopulations were identified, also shown to preexist in primary NB. However, sphere markers-mediated cell sorting of parental tumor failed to recapitulate the TIC phenotype in the orthotopic model, highlighting the complexity of the CSC model. Our data support the NB stem-like cells as a dynamic and heterogeneous cell population strongly dependent on microenvironmental signals and add novel candidate genes as potential therapeutic targets in the control of high-risk NB.
Resumo:
The role of land cover change as a significant component of global change has become increasingly recognized in recent decades. Large databases measuring land cover change, and the data which can potentially be used to explain the observed changes, are also becoming more commonly available. When developing statistical models to investigate observed changes, it is important to be aware that the chosen sampling strategy and modelling techniques can influence results. We present a comparison of three sampling strategies and two forms of grouped logistic regression models (multinomial and ordinal) in the investigation of patterns of successional change after agricultural land abandonment in Switzerland. Results indicated that both ordinal and nominal transitional change occurs in the landscape and that the use of different sampling regimes and modelling techniques as investigative tools yield different results. Synthesis and applications. Our multimodel inference identified successfully a set of consistently selected indicators of land cover change, which can be used to predict further change, including annual average temperature, the number of already overgrown neighbouring areas of land and distance to historically destructive avalanche sites. This allows for more reliable decision making and planning with respect to landscape management. Although both model approaches gave similar results, ordinal regression yielded more parsimonious models that identified the important predictors of land cover change more efficiently. Thus, this approach is favourable where land cover change pattern can be interpreted as an ordinal process. Otherwise, multinomial logistic regression is a viable alternative.
Resumo:
Excessive exposure to solar ultraviolet (UV) is the main cause of skin cancer. Specific prevention should be further developed to target overexposed or highly vulnerable populations. A better characterisation of anatomical UV exposure patterns is however needed for specific prevention. To develop a regression model for predicting the UV exposure ratio (ER, ratio between the anatomical dose and the corresponding ground level dose) for each body site without requiring individual measurements. A 3D numeric model (SimUVEx) was used to compute ER for various body sites and postures. A multiple fractional polynomial regression analysis was performed to identify predictors of ER. The regression model used simulation data and its performance was tested on an independent data set. Two input variables were sufficient to explain ER: the cosine of the maximal daily solar zenith angle and the fraction of the sky visible from the body site. The regression model was in good agreement with the simulated data ER (R(2)=0.988). Relative errors up to +20% and -10% were found in daily doses predictions, whereas an average relative error of only 2.4% (-0.03% to 5.4%) was found in yearly dose predictions. The regression model predicts accurately ER and UV doses on the basis of readily available data such as global UV erythemal irradiance measured at ground surface stations or inferred from satellite information. It renders the development of exposure data on a wide temporal and geographical scale possible and opens broad perspectives for epidemiological studies and skin cancer prevention.
Resumo:
An online algorithm for determining respiratory mechanics in patients using non-invasive ventilation (NIV) in pressure support mode was developed and embedded in a ventilator system. Based on multiple linear regression (MLR) of respiratory data, the algorithm was tested on a patient bench model under conditions with and without leak and simulating a variety of mechanics. Bland-Altman analysis indicates reliable measures of compliance across the clinical range of interest (± 11-18% limits of agreement). Resistance measures showed large quantitative errors (30-50%), however, it was still possible to qualitatively distinguish between normal and obstructive resistances. This outcome provides clinically significant information for ventilator titration and patient management.
Resumo:
Background Individual signs and symptoms are of limited value for the diagnosis of influenza. Objective To develop a decision tree for the diagnosis of influenza based on a classification and regression tree (CART) analysis. Methods Data from two previous similar cohort studies were assembled into a single dataset. The data were randomly divided into a development set (70%) and a validation set (30%). We used CART analysis to develop three models that maximize the number of patients who do not require diagnostic testing prior to treatment decisions. The validation set was used to evaluate overfitting of the model to the training set. Results Model 1 has seven terminal nodes based on temperature, the onset of symptoms and the presence of chills, cough and myalgia. Model 2 was a simpler tree with only two splits based on temperature and the presence of chills. Model 3 was developed with temperature as a dichotomous variable (≥38°C) and had only two splits based on the presence of fever and myalgia. The area under the receiver operating characteristic curves (AUROCC) for the development and validation sets, respectively, were 0.82 and 0.80 for Model 1, 0.75 and 0.76 for Model 2 and 0.76 and 0.77 for Model 3. Model 2 classified 67% of patients in the validation group into a high- or low-risk group compared with only 38% for Model 1 and 54% for Model 3. Conclusions A simple decision tree (Model 2) classified two-thirds of patients as low or high risk and had an AUROCC of 0.76. After further validation in an independent population, this CART model could support clinical decision making regarding influenza, with low-risk patients requiring no further evaluation for influenza and high-risk patients being candidates for empiric symptomatic or drug therapy.
Resumo:
This prospective study applies an extended Information-Motivation-Behavioural Skills (IMB) model to establish predictors of HIV-protection behaviour among HIV-positive men who have sex with men (MSM) during sex with casual partners. Data have been collected from anonymous, self-administered questionnaires and analysed by using descriptive and backward elimination regression analyses. In a sample of 165 HIV-positive MSM, 82 participants between the ages of 23 and 78 (M=46.4, SD=9.0) had sex with casual partners during the three-month period under investigation. About 62% (n=51) have always used a condom when having sex with casual partners. From the original IMB model, only subjective norm predicted condom use. More important predictors that increased condom use were low consumption of psychotropics, high satisfaction with sexuality, numerous changes in sexual behaviour after diagnosis, low social support from friends, alcohol use before sex and habitualised condom use with casual partner(s). The explanatory power of the calculated regression model was 49% (p<0.001). The study reveals the importance of personal and social resources and of routines for condom use, and provides information for the research-based conceptualisation of prevention offers addressing especially people living with HIV ("positive prevention").
Resumo:
This study on determinants of sexual protection behavior among HIV-positive gay men used the empirically tested information-motivation-behavioral skills (IMB) model. HIV-specific variables were added to the model to determine factors decisive for condom use with steady and casual partners. Data were collected using an anonymous, standardized self-administered questionnaire. Study participants were recruited at HIV outpatient clinics associated with the Eurosupport Study Group and the Swiss HIV Cohort Study. To identify factors associated with condom use, backward elimination regression analyses were performed. Overall, 838 HIV-infected gay men from 14 European countries were included in this analysis. About 53% of them reported at least one sexual contact with a steady partner; 62.5% had sex with a casual partner during the last 6 months. Forty-three percent always used condoms with steady partners and 44% with casual partners. High self-efficacy and subjective norms in favor of condom-use were associated with increased condom use with casual and steady partners, whereas feeling depressed was associated with decreased condom use with casual partners. Condoms were used less often with HIV-positive partners. Self-efficacy as an important behavioral skill to perform protection behavior was influenced by lower perceived vulnerability, higher subjective norms, and more positive safer sex attitudes. The IMB-model constructs appeared to be valid; however, not all the model predictors could be determined as hypothesized. Besides the original IMB constructs, HIV-specific variables, including sexual partners' serostatus and mental health, explained condom use. Such factors should be considered in clinical interventions to promote "positive prevention."