965 resultados para REGRESSION APPROACH
Resumo:
Uncertainty quantification of petroleum reservoir models is one of the present challenges, which is usually approached with a wide range of geostatistical tools linked with statistical optimisation or/and inference algorithms. The paper considers a data driven approach in modelling uncertainty in spatial predictions. Proposed semi-supervised Support Vector Regression (SVR) model has demonstrated its capability to represent realistic features and describe stochastic variability and non-uniqueness of spatial properties. It is able to capture and preserve key spatial dependencies such as connectivity, which is often difficult to achieve with two-point geostatistical models. Semi-supervised SVR is designed to integrate various kinds of conditioning data and learn dependences from them. A stochastic semi-supervised SVR model is integrated into a Bayesian framework to quantify uncertainty with multiple models fitted to dynamic observations. The developed approach is illustrated with a reservoir case study. The resulting probabilistic production forecasts are described by uncertainty envelopes.
Resumo:
The aim of this paper is twofold. First, we study the determinants of economic growth among a wide set of potential variables for the Spanish provinces (NUTS3). Among others, we include various types of private, public and human capital in the group of growth factors. Also,we analyse whether Spanish provinces have converged in economic terms in recent decades. Thesecond objective is to obtain cross-section and panel data parameter estimates that are robustto model speci¯cation. For this purpose, we use a Bayesian Model Averaging (BMA) approach.Bayesian methodology constructs parameter estimates as a weighted average of linear regression estimates for every possible combination of included variables. The weight of each regression estimate is given by the posterior probability of each model.
Resumo:
The aim of this paper is twofold. First, we study the determinants of economic growth among a wide set of potential variables for the Spanish provinces (NUTS3). Among others, we include various types of private, public and human capital in the group of growth factors. Also,we analyse whether Spanish provinces have converged in economic terms in recent decades. Thesecond objective is to obtain cross-section and panel data parameter estimates that are robustto model speci¯cation. For this purpose, we use a Bayesian Model Averaging (BMA) approach.Bayesian methodology constructs parameter estimates as a weighted average of linear regression estimates for every possible combination of included variables. The weight of each regression estimate is given by the posterior probability of each model.
Resumo:
Fluvial deposits are a challenge for modelling flow in sub-surface reservoirs. Connectivity and continuity of permeable bodies have a major impact on fluid flow in porous media. Contemporary object-based and multipoint statistics methods face a problem of robust representation of connected structures. An alternative approach to model petrophysical properties is based on machine learning algorithm ? Support Vector Regression (SVR). Semi-supervised SVR is able to establish spatial connectivity taking into account the prior knowledge on natural similarities. SVR as a learning algorithm is robust to noise and captures dependencies from all available data. Semi-supervised SVR applied to a synthetic fluvial reservoir demonstrated robust results, which are well matched to the flow performance
Resumo:
BACKGROUND: Whole pelvis intensity modulated radiotherapy (IMRT) is increasingly being used to treat cervical cancer aiming to reduce side effects. Encouraged by this, some groups have proposed the use of simultaneous integrated boost (SIB) to target the tumor, either to get a higher tumoricidal effect or to replace brachytherapy. Nevertheless, physiological organ movement and rapid tumor regression throughout treatment might substantially reduce any benefit of this approach. PURPOSE: To evaluate the clinical target volume - simultaneous integrated boost (CTV-SIB) regression and motion during chemo-radiotherapy (CRT) for cervical cancer, and to monitor treatment progress dosimetrically and volumetrically to ensure treatment goals are met. METHODS AND MATERIALS: Ten patients treated with standard doses of CRT and brachytherapy were retrospectively re-planned using a helical Tomotherapy - SIB technique for the hypothetical scenario of this feasibility study. Target and organs at risk (OAR) were contoured on deformable fused planning-computed tomography and megavoltage computed tomography images. The CTV-SIB volume regression was determined. The center of mass (CM) was used to evaluate the degree of motion. The Dice's similarity coefficient (DSC) was used to assess the spatial overlap of CTV-SIBs between scans. A cumulative dose-volume histogram modeled estimated delivered doses. RESULTS: The CTV-SIB relative reduction was between 31 and 70%. The mean maximum CM change was 12.5, 9, and 3 mm in the superior-inferior, antero-posterior, and right-left dimensions, respectively. The CTV-SIB-DSC approached 1 in the first week of treatment, indicating almost perfect overlap. CTV-SIB-DSC regressed linearly during therapy, and by the end of treatment was 0.5, indicating 50% discordance. Two patients received less than 95% of the prescribed dose. Much higher doses to the OAR were observed. A multiple regression analysis showed a significant interaction between CTV-SIB reduction and OAR dose increase. CONCLUSIONS: The CTV-SIB had important regression and motion during CRT, receiving lower therapeutic doses than expected. The OAR had unpredictable shifts and received higher doses. The use of SIB without frequent adaptation of the treatment plan exposes cervical cancer patients to an unpredictable risk of under-dosing the target and/or overdosing adjacent critical structures. In that scenario, brachytherapy continues to be the gold standard approach.
Resumo:
Estimating the time since discharge of a spent cartridge or a firearm can be useful in criminal situa-tions involving firearms. The analysis of volatile gunshot residue remaining after shooting using solid-phase microextraction (SPME) followed by gas chromatography (GC) was proposed to meet this objective. However, current interpretative models suffer from several conceptual drawbacks which render them inadequate to assess the evidential value of a given measurement. This paper aims to fill this gap by proposing a logical approach based on the assessment of likelihood ratios. A probabilistic model was thus developed and applied to a hypothetical scenario where alternative hy-potheses about the discharge time of a spent cartridge found on a crime scene were forwarded. In order to estimate the parameters required to implement this solution, a non-linear regression model was proposed and applied to real published data. The proposed approach proved to be a valuable method for interpreting aging-related data.
Resumo:
Many of the most interesting questions ecologists ask lead to analyses of spatial data. Yet, perhaps confused by the large number of statistical models and fitting methods available, many ecologists seem to believe this is best left to specialists. Here, we describe the issues that need consideration when analysing spatial data and illustrate these using simulation studies. Our comparative analysis involves using methods including generalized least squares, spatial filters, wavelet revised models, conditional autoregressive models and generalized additive mixed models to estimate regression coefficients from synthetic but realistic data sets, including some which violate standard regression assumptions. We assess the performance of each method using two measures and using statistical error rates for model selection. Methods that performed well included generalized least squares family of models and a Bayesian implementation of the conditional auto-regressive model. Ordinary least squares also performed adequately in the absence of model selection, but had poorly controlled Type I error rates and so did not show the improvements in performance under model selection when using the above methods. Removing large-scale spatial trends in the response led to poor performance. These are empirical results; hence extrapolation of these findings to other situations should be performed cautiously. Nevertheless, our simulation-based approach provides much stronger evidence for comparative analysis than assessments based on single or small numbers of data sets, and should be considered a necessary foundation for statements of this type in future.
Resumo:
Background: Elevated levels of g-glutamyl transferase (GGT) have been associated with subsequent risk of elevated blood pressure (BP), hypertension and diabetes. However, the causality of these relationships has not been addressed. Mendelian randomization refers to the random allocation of alleles at the time of gamete formation. Such allocation is expected to be independent of any behavioural and environmental factors (known or unknown), allowing the analysis of largely unconfounded risk associations that are not due to reverse causation. Methods: We performed a cross-sectional analysis among 4361 participants to the population based CoLaus study. Associations of sex-specific GGT quartiles with systolic BP, diastolic BP and insulin levels were assessed using multivariable linear regression analyses. The rs2017869 GGT1 variant, which explained 1.6% of the variance in GGT levels, was used as an instrument to perform a Mendelian randomization analysis. Results: Median age of the study population was 53 years. After age and sex adjustment, GGT quartiles were strongly associated with systolic and diastolic BP (all p for linear trend <0.0001). After multivariable adjustment, these relationships were significantly attenuated, but remained significant for systolic (b(95%CI)¼1.30 (0.32;2.03), p¼0.007) and diastolic BP (b (95%CI)¼0.57 (0.02;1.13), p¼0.04). Using Mendelian randomization, we observed no positive association of GGT with either systolic BP (b (95%CI)¼-5.68 (-11.51-0.16), p¼0.06) or diastolic BP (b (95%CI)¼ -2.24 (-5.98;1.49) p¼0.24). The association of GGT with insulin was also attenuated after multivariable adjustment. Nevertheless, a strong linear trend persisted in the fully adjusted model (b (95%CI)¼0.07 (0.04;0.09), p<0.0001). Using Mendelian randomization, we observed a similar positive association of GGT with insulin (b (95%CI)¼0.19 (0.01-0.37), p¼0.04). Conclusion: In this study, we found evidence for a direct causal relationship between GGT and insulin, suggesting that oxidative stress may be causally implicated in the pathogenesis of type 2 diabetes mellitus.
Resumo:
Background: MLPA method is a potentially useful semi-quantitative method to detect copy number alterations in targeted regions. In this paper, we propose a method for the normalization procedure based on a non-linear mixed-model, as well as a new approach for determining the statistical significance of altered probes based on linear mixed-model. This method establishes a threshold by using different tolerance intervals that accommodates the specific random error variability observed in each test sample.Results: Through simulation studies we have shown that our proposed method outperforms two existing methods that are based on simple threshold rules or iterative regression. We have illustrated the method using a controlled MLPA assay in which targeted regions are variable in copy number in individuals suffering from different disorders such as Prader-Willi, DiGeorge or Autism showing the best performace.Conclusion: Using the proposed mixed-model, we are able to determine thresholds to decide whether a region is altered. These threholds are specific for each individual, incorporating experimental variability, resulting in improved sensitivity and specificity as the examples with real data have revealed.
Resumo:
This paper suggests a method for obtaining efficiency bounds in models containing either only infinite-dimensional parameters or both finite- and infinite-dimensional parameters (semiparametric models). The method is based on a theory of random linear functionals applied to the gradient of the log-likelihood functional and is illustrated by computing the lower bound for Cox's regression model
Resumo:
The purpose of this thesis is to study factors that explain the bilateral fiber trade flows. This is done by analyzing bilateral trade flows during 1990-2006. It will be studied also, whether there are differences between fiber types. This thesis uses a gravity model approach to study the trade flows. Gravity model is mostly used to study the aggregate data between trading countries. In this thesis the gravity model is applied to single fibers. This model is then applied to panel data set. Results from the regression show clearly that there are benefits in studying different fibers in separate. The effects differ considerably from each other. Furthermore, this thesis speaks for the existence of Linder’s effect in certain fiber types.
Resumo:
The analysis of price asymmetries in the gasoline market is one of the most studied in the energy economics literature. Nevertheless, the great variability of results makes it very difficult to extract conclusive results on the existence or not of asymmetries. This paper shows through a meta-analysis approach how the industry segment analysed, the quality and quantity of data, the estimator and the model used may explain this heterogeneity of results.
Resumo:
Several studies have suggested a bilingual advantage in executive functions, presumably due to bilinguals' massive practice with language switching that requires executive resources, but the results are still somewhat controversial. Previous studies are also plagued by the inherent limitations of a natural groups design where the participant groups are bound to differ in many ways in addition to the variable used to classify them. In an attempt to introduce a complementary analysis approach, we employed multiple regression to study whether the performance of 30- to 75-year-old FinnishSwedish bilinguals (N = 38) on tasks measuring different executive functions (inhibition, updating, and set shifting) could be predicted by the frequency of language switches in everyday life (as measured by a language switching questionnaire), L2 age of acquisition, or by the self-estimated degree of use of both languages in everyday life. Most consistent effects were found for the set shifting task where a higher rate of everyday language switches was related to a smaller mixing cost in errors. Mixing cost is thought to reflect top-down management of competing task sets, thus resembling the bilingual situation where decisions of which language to use has to be made in each conversation. These findings provide additional support to the idea that some executive functions in bilinguals are affected by a lifelong experience in language switching and, perhaps even more importantly, suggest a complementary approach to the study of this issue.
Resumo:
Two vegetable wastes, cork bark and grape stalks, were investigated for the removal of methylene blue from aqueous solution. The effects of contact time, dye concentration, pH, and temperature on sorption were studied relative to adsorption on a commercially-activated carbon. The highest adsorption yield was obtained within the pH range 5 to 10 for grape stalks and 7 to 10 for cork bark. The sorption kinetics of dye onto activated carbon and grape stalks was very fast. Kinetics data were fitted to the pseudo-first and second order kinetic equations, and the values of the pseudo-second-order initial rate constants were found to be 1.69 mg g-1 min-1 for activated carbon, 2.24 mg g-1 min-1 for grape stalks, and 0.90 mg g-1 min-1 for cork bark. Langmuir maximum sorption capacities for activated carbon, grape stalks, and cork bark for methylene blue estimated by the Orthogonal Distance Regression method (ODR) were 157.5 mg g-1, 105.6 mg g-1, and 30.52 mg g-1, respectively. FTIR spectra indicated that carboxylic groups and lignin play a significant role in the sorption of methylene blue. Electrostatic forces, n-p interactions, cation-p, and p-p stacking interactions contribute to methylene blue sorption onto grape stalks and cork bark. Grape stalks can be considered an efficient biosorbent and as a viable alternative to activated carbon and ion-exchange resins for the removal of methylene blue
Resumo:
Extended Hildebrand Solubility Approach (EHSA) was successfully applied to evaluate the solubility of Indomethacin in 1,4-dioxane + water mixtures at 298.15 K. An acceptable correlation-performance of EHSA was found by using a regular polynomial model in order four of the W interaction parameter vs. solubility parameter of the mixtures (overall deviation was 8.9%). Although the mean deviation obtained was similar to that obtained directly by means of an empiric regression of the experimental solubility vs. mixtures solubility parameters, the advantages of EHSA are evident because it requires physicochemical properties easily available for drugs.