926 resultados para Dirichlet Regression compositional model.
Resumo:
We are concerned with providing more empirical evidence on forecast failure, developing forecast models, and examining the impact of events such as audit reports. A joint consideration of classic financial ratios and relevant external indicators leads us to build a basic prediction model focused in non-financial Galician SMEs. Explanatory variables are relevant financial indicators from the viewpoint of the financial logic and financial failure theory. The paper explores three mathematical models: discriminant analysis, Logit, and linear multivariate regression. We conclude that, even though they both offer high explanatory and predictive abilities, Logit and MDA models should be used and interpreted jointly.
Resumo:
Dissertação de Mestrado, Gestão de Empresa (MBA), 16 de Julho de 2013, Universidade dos Açores.
Resumo:
Composition is a practice of key importance in software engineering. When real-time applications are composed it is necessary that their timing properties (such as meeting the deadlines) are guaranteed. The composition is performed by establishing an interface between the application and the physical platform. Such an interface does typically contain information about the amount of computing capacity needed by the application. In multiprocessor platforms, the interface should also present information about the degree of parallelism. Recently there have been quite a few interface proposals. However, they are either too complex to be handled or too pessimistic.In this paper we propose the Generalized Multiprocessor Periodic Resource model (GMPR) that is strictly superior to the MPR model without requiring a too detailed description. We describe a method to generate the interface from the application specification. All these methods have been implemented in Matlab routines that are publicly available.
Resumo:
This paper introduces a new unsupervised hyperspectral unmixing method conceived to linear but highly mixed hyperspectral data sets, in which the simplex of minimum volume, usually estimated by the purely geometrically based algorithms, is far way from the true simplex associated with the endmembers. The proposed method, an extension of our previous studies, resorts to the statistical framework. The abundance fraction prior is a mixture of Dirichlet densities, thus automatically enforcing the constraints on the abundance fractions imposed by the acquisition process, namely, nonnegativity and sum-to-one. A cyclic minimization algorithm is developed where the following are observed: 1) The number of Dirichlet modes is inferred based on the minimum description length principle; 2) a generalized expectation maximization algorithm is derived to infer the model parameters; and 3) a sequence of augmented Lagrangian-based optimizations is used to compute the signatures of the endmembers. Experiments on simulated and real data are presented to show the effectiveness of the proposed algorithm in unmixing problems beyond the reach of the geometrically based state-of-the-art competitors.
Resumo:
In this article, we calibrate the Vasicek interest rate model under the risk neutral measure by learning the model parameters using Gaussian processes for machine learning regression. The calibration is done by maximizing the likelihood of zero coupon bond log prices, using mean and covariance functions computed analytically, as well as likelihood derivatives with respect to the parameters. The maximization method used is the conjugate gradients. The only prices needed for calibration are zero coupon bond prices and the parameters are directly obtained in the arbitrage free risk neutral measure.
Resumo:
The prediction of the time and the efficiency of the remediation of contaminated soils using soil vapor extraction remain a difficult challenge to the scientific community and consultants. This work reports the development of multiple linear regression and artificial neural network models to predict the remediation time and efficiency of soil vapor extractions performed in soils contaminated separately with benzene, toluene, ethylbenzene, xylene, trichloroethylene, and perchloroethylene. The results demonstrated that the artificial neural network approach presents better performances when compared with multiple linear regression models. The artificial neural network model allowed an accurate prediction of remediation time and efficiency based on only soil and pollutants characteristics, and consequently allowing a simple and quick previous evaluation of the process viability.
Resumo:
Composition is a practice of key importance in software engineering. When real-time applications are composed, it is necessary that their timing properties (such as meeting the deadlines) are guaranteed. The composition is performed by establishing an interface between the application and the physical platform. Such an interface typically contains information about the amount of computing capacity needed by the application. For multiprocessor platforms, the interface should also present information about the degree of parallelism. Several interface proposals have recently been put forward in various research works. However, those interfaces are either too complex to be handled or too pessimistic. In this paper we propose the generalized multiprocessor periodic resource model (GMPR) that is strictly superior to the MPR model without requiring a too detailed description. We then derive a method to compute the interface from the application specification. This method has been implemented in Matlab routines that are publicly available.
Resumo:
In health related research it is common to have multiple outcomes of interest in a single study. These outcomes are often analysed separately, ignoring the correlation between them. One would expect that a multivariate approach would be a more efficient alternative to individual analyses of each outcome. Surprisingly, this is not always the case. In this article we discuss different settings of linear models and compare the multivariate and univariate approaches. We show that for linear regression models, the estimates of the regression parameters associated with covariates that are shared across the outcomes are the same for the multivariate and univariate models while for outcome-specific covariates the multivariate model performs better in terms of efficiency.
Resumo:
OBJECTIVE: The objective of the study was to develop a model for estimating patient 28-day in-hospital mortality using 2 different statistical approaches. DESIGN: The study was designed to develop an outcome prediction model for 28-day in-hospital mortality using (a) logistic regression with random effects and (b) a multilevel Cox proportional hazards model. SETTING: The study involved 305 intensive care units (ICUs) from the basic Simplified Acute Physiology Score (SAPS) 3 cohort. PATIENTS AND PARTICIPANTS: Patients (n = 17138) were from the SAPS 3 database with follow-up data pertaining to the first 28 days in hospital after ICU admission. INTERVENTIONS: None. MEASUREMENTS AND RESULTS: The database was divided randomly into 5 roughly equal-sized parts (at the ICU level). It was thus possible to run the model-building procedure 5 times, each time taking four fifths of the sample as a development set and the remaining fifth as the validation set. At 28 days after ICU admission, 19.98% of the patients were still in the hospital. Because of the different sampling space and outcome variables, both models presented a better fit in this sample than did the SAPS 3 admission score calibrated to vital status at hospital discharge, both on the general population and in major subgroups. CONCLUSIONS: Both statistical methods can be used to model the 28-day in-hospital mortality better than the SAPS 3 admission model. However, because the logistic regression approach is specifically designed to forecast 28-day mortality, and given the high uncertainty associated with the assumption of the proportionality of risks in the Cox model, the logistic regression approach proved to be superior.
Resumo:
Objectives: To characterize the epidemiology and risk factors for acute kidney injury (AKI) after pediatric cardiac surgery in our center, to determine its association with poor short-term outcomes, and to develop a logistic regression model that will predict the risk of AKI for the study population. Methods: This single-center, retrospective study included consecutive pediatric patients with congenital heart disease who underwent cardiac surgery between January 2010 and December 2012. Exclusion criteria were a history of renal disease, dialysis or renal transplantation. Results: Of the 325 patients included, median age three years (1 day---18 years), AKI occurred in 40 (12.3%) on the first postoperative day. Overall mortality was 13 (4%), nine of whom were in the AKI group. AKI was significantly associated with length of intensive care unit stay, length of mechanical ventilation and in-hospital death (p<0.01). Patients’ age and postoperative serum creatinine, blood urea nitrogen and lactate levels were included in the logistic regression model as predictor variables. The model accurately predicted AKI in this population, with a maximum combined sensitivity of 82.1% and specificity of 75.4%. Conclusions: AKI is common and is associated with poor short-term outcomes in this setting. Younger age and higher postoperative serum creatinine, blood urea nitrogen and lactate levels were powerful predictors of renal injury in this population. The proposed model could be a useful tool for risk stratification of these patients.
Resumo:
INTRODUCTION: Malaria is a serious problem in the Brazilian Amazon region, and the detection of possible risk factors could be of great interest for public health authorities. The objective of this article was to investigate the association between environmental variables and the yearly registers of malaria in the Amazon region using Bayesian spatiotemporal methods. METHODS: We used Poisson spatiotemporal regression models to analyze the Brazilian Amazon forest malaria count for the period from 1999 to 2008. In this study, we included some covariates that could be important in the yearly prediction of malaria, such as deforestation rate. We obtained the inferences using a Bayesian approach and Markov Chain Monte Carlo (MCMC) methods to simulate samples for the joint posterior distribution of interest. The discrimination of different models was also discussed. RESULTS: The model proposed here suggests that deforestation rate, the number of inhabitants per km², and the human development index (HDI) are important in the prediction of malaria cases. CONCLUSIONS: It is possible to conclude that human development, population growth, deforestation, and their associated ecological alterations are conducive to increasing malaria risk. We conclude that the use of Poisson regression models that capture the spatial and temporal effects under the Bayesian paradigm is a good strategy for modeling malaria counts.
Resumo:
This project focuses on the study of different explanatory models for the behavior of CDS security, such as Fixed-Effect Model, GLS Random-Effect Model, Pooled OLS and Quantile Regression Model. After determining the best fitness model, trading strategies with long and short positions in CDS have been developed. Due to some specifications of CDS, I conclude that the quantile regression is the most efficient model to estimate the data. The P&L and Sharpe Ratio of the strategy are analyzed using a backtesting analogy, where I conclude that, mainly for non-financial companies, the model allows traders to take advantage of and profit from arbitrages.
Resumo:
Extreme value models are widely used in different areas. The Birnbaum–Saunders distribution is receiving considerable attention due to its physical arguments and its good properties. We propose a methodology based on extreme value Birnbaum–Saunders regression models, which includes model formulation, estimation, inference and checking. We further conduct a simulation study for evaluating its performance. A statistical analysis with real-world extreme value environmental data using the methodology is provided as illustration.
Resumo:
This paper discusses models, associations and causation in psychiatry. The different types of association (linear, positive, negative, exponential, partial, U shaped relationship, hidden and spurious) between variables involved in mental disorders are presented as well as the use of multiple regression analysis to disentangle interrelatedness amongst multiple variables. A useful model should have internal consistency, external validity and predictive power; be dynamic in order to accommodate new sound knowledge; and should fit facts rather than they other way around. It is argued that whilst models are theoretical constructs they also convey a style of reasoning and can change clinical practice. Cause and effect are complex phenomena in that the same cause can yield different effects. Conversely, the same effect can have a different range of causes. In mental disorders and human behaviour there is always a chain of events initiated by the indirect and remote cause; followed by intermediate causes; and finally the direct and more immediate cause. Causes of mental disorders are grouped as those: (i) which are necessary and sufficient; (ii) which are necessary but not sufficient; and (iii) which are neither necessary nor sufficient, but when present increase the risk for mental disorders.
Resumo:
Background:Previous reports have inferred a linear relationship between LDL-C and changes in coronary plaque volume (CPV) measured by intravascular ultrasound. However, these publications included a small number of studies and did not explore other lipid markers.Objective:To assess the association between changes in lipid markers and regression of CPV using published data.Methods:We collected data from the control, placebo and intervention arms in studies that compared the effect of lipidlowering treatments on CPV, and from the placebo and control arms in studies that tested drugs that did not affect lipids. Baseline and final measurements of plaque volume, expressed in mm3, were extracted and the percentage changes after the interventions were calculated. Performing three linear regression analyses, we assessed the relationship between percentage and absolute changes in lipid markers and percentage variations in CPV.Results:Twenty-seven studies were selected. Correlations between percentage changes in LDL-C, non-HDL-C, and apolipoprotein B (ApoB) and percentage changes in CPV were moderate (r = 0.48, r = 0.47, and r = 0.44, respectively). Correlations between absolute differences in LDL-C, non‑HDL-C, and ApoB with percentage differences in CPV were stronger (r = 0.57, r = 0.52, and r = 0.79). The linear regression model showed a statistically significant association between a reduction in lipid markers and regression of plaque volume.Conclusion:A significant association between changes in different atherogenic particles and regression of CPV was observed. The absolute reduction in ApoB showed the strongest correlation with coronary plaque regression.