928 resultados para switching regression model
Resumo:
Also issued as thesis (M.S.) University of Illinois.
Finite mixture regression model with random effects: application to neonatal hospital length of stay
Resumo:
A two-component mixture regression model that allows simultaneously for heterogeneity and dependency among observations is proposed. By specifying random effects explicitly in the linear predictor of the mixture probability and the mixture components, parameter estimation is achieved by maximising the corresponding best linear unbiased prediction type log-likelihood. Approximate residual maximum likelihood estimates are obtained via an EM algorithm in the manner of generalised linear mixed model (GLMM). The method can be extended to a g-component mixture regression model with the component density from the exponential family, leading to the development of the class of finite mixture GLMM. For illustration, the method is applied to analyse neonatal length of stay (LOS). It is shown that identification of pertinent factors that influence hospital LOS can provide important information for health care planning and resource allocation. (C) 2002 Elsevier Science B.V. All rights reserved.
Resumo:
Abstract A new LIBS quantitative analysis method based on analytical line adaptive selection and Relevance Vector Machine (RVM) regression model is proposed. First, a scheme of adaptively selecting analytical line is put forward in order to overcome the drawback of high dependency on a priori knowledge. The candidate analytical lines are automatically selected based on the built-in characteristics of spectral lines, such as spectral intensity, wavelength and width at half height. The analytical lines which will be used as input variables of regression model are determined adaptively according to the samples for both training and testing. Second, an LIBS quantitative analysis method based on RVM is presented. The intensities of analytical lines and the elemental concentrations of certified standard samples are used to train the RVM regression model. The predicted elemental concentration analysis results will be given with a form of confidence interval of probabilistic distribution, which is helpful for evaluating the uncertainness contained in the measured spectra. Chromium concentration analysis experiments of 23 certified standard high-alloy steel samples have been carried out. The multiple correlation coefficient of the prediction was up to 98.85%, and the average relative error of the prediction was 4.01%. The experiment results showed that the proposed LIBS quantitative analysis method achieved better prediction accuracy and better modeling robustness compared with the methods based on partial least squares regression, artificial neural network and standard support vector machine.
Resumo:
Data fluctuation in multiple measurements of Laser Induced Breakdown Spectroscopy (LIBS) greatly affects the accuracy of quantitative analysis. A new LIBS quantitative analysis method based on the Robust Least Squares Support Vector Machine (RLS-SVM) regression model is proposed. The usual way to enhance the analysis accuracy is to improve the quality and consistency of the emission signal, such as by averaging the spectral signals or spectrum standardization over a number of laser shots. The proposed method focuses more on how to enhance the robustness of the quantitative analysis regression model. The proposed RLS-SVM regression model originates from the Weighted Least Squares Support Vector Machine (WLS-SVM) but has an improved segmented weighting function and residual error calculation according to the statistical distribution of measured spectral data. Through the improved segmented weighting function, the information on the spectral data in the normal distribution will be retained in the regression model while the information on the outliers will be restrained or removed. Copper elemental concentration analysis experiments of 16 certified standard brass samples were carried out. The average value of relative standard deviation obtained from the RLS-SVM model was 3.06% and the root mean square error was 1.537%. The experimental results showed that the proposed method achieved better prediction accuracy and better modeling robustness compared with the quantitative analysis methods based on Partial Least Squares (PLS) regression, standard Support Vector Machine (SVM) and WLS-SVM. It was also demonstrated that the improved weighting function had better comprehensive performance in model robustness and convergence speed, compared with the four known weighting functions.
Resumo:
2000 Mathematics Subject Classification: 62J12, 62K15, 91B42, 62H99.
Resumo:
2010 Mathematics Subject Classification: 68T50,62H30,62J05.
Resumo:
Multiple linear regression model plays a key role in statistical inference and it has extensive applications in business, environmental, physical and social sciences. Multicollinearity has been a considerable problem in multiple regression analysis. When the regressor variables are multicollinear, it becomes difficult to make precise statistical inferences about the regression coefficients. There are some statistical methods that can be used, which are discussed in this thesis are ridge regression, Liu, two parameter biased and LASSO estimators. Firstly, an analytical comparison on the basis of risk was made among ridge, Liu and LASSO estimators under orthonormal regression model. I found that LASSO dominates least squares, ridge and Liu estimators over a significant portion of the parameter space for large dimension. Secondly, a simulation study was conducted to compare performance of ridge, Liu and two parameter biased estimator by their mean squared error criterion. I found that two parameter biased estimator performs better than its corresponding ridge regression estimator. Overall, Liu estimator performs better than both ridge and two parameter biased estimator.
Resumo:
Adaptability and invisibility are hallmarks of modern terrorism, and keeping pace with its dynamic nature presents a serious challenge for societies throughout the world. Innovations in computer science have incorporated applied mathematics to develop a wide array of predictive models to support the variety of approaches to counterterrorism. Predictive models are usually designed to forecast the location of attacks. Although this may protect individual structures or locations, it does not reduce the threat—it merely changes the target. While predictive models dedicated to events or social relationships receive much attention where the mathematical and social science communities intersect, models dedicated to terrorist locations such as safe-houses (rather than their targets or training sites) are rare and possibly nonexistent. At the time of this research, there were no publically available models designed to predict locations where violent extremists are likely to reside. This research uses France as a case study to present a complex systems model that incorporates multiple quantitative, qualitative and geospatial variables that differ in terms of scale, weight, and type. Though many of these variables are recognized by specialists in security studies, there remains controversy with respect to their relative importance, degree of interaction, and interdependence. Additionally, some of the variables proposed in this research are not generally recognized as drivers, yet they warrant examination based on their potential role within a complex system. This research tested multiple regression models and determined that geographically-weighted regression analysis produced the most accurate result to accommodate non-stationary coefficient behavior, demonstrating that geographic variables are critical to understanding and predicting the phenomenon of terrorism. This dissertation presents a flexible prototypical model that can be refined and applied to other regions to inform stakeholders such as policy-makers and law enforcement in their efforts to improve national security and enhance quality-of-life.
Resumo:
Grade retention practices are at the forefront of the educational debate. In this paper, we use PISA 2009 data for Spain to measure the effect of grade retention on students achievement. One important problem when analyzing this question is that school outcomes and the propensity to repeat a grade are likely to be determined simultaneously. We address this problem by estimating a Switching Regression Model. We find that grade retention has a negative impact on educational outcomes, but we confi rm the importance of endogenous selection, which makes observed differences between repeaters and non-repeaters appear 14.6% lower than they actually are. The effect on PISA scores of repeating is much smaller (-10% of non-repeaters average) than the counterfactual reduction that non-repeaters would suffer had they been retained as repeaters (-24% of their average). Furthermore, those who repeated a grade during primary education suffered more than those who repeated a grade of secondary school, although the effect of repeating at both times is, as expected, much larger.
Resumo:
The decision to settle a motor insurance claim by either negotiation or trial is analysed. This decision may depend on how risk and confrontation adverse or pessimistic the claimant is. The extent to which these behavioural features of the claimant might influence the final compensation amount are examined. An empirical analysis, fitting a switching regression model to a Spanish database, is conducted in order to analyze whether the choice of the conflict resolution procedure is endogenous to the compensation outcomes. The results show that compensations awarded by courts are always higher, although 95% of cases are settled by negotiation. We show that this is because claimants are adverse to risk and confrontation, and are pessimistic about their chances at trial. By contrast, insurers are risk - confrontation neutral and more objective in relation to the expected trial compensation. During the negotiation insurers accept to pay the subjective compensation values of claimants, since these values are lower than their estimates of compensations at trial.
Resumo:
O objetivo desta dissertação é analisar a política de juros do Banco Central durante o período entre os anos de 1995 a 2002, procurando verificar se houve mudança nesta política, isto é se houve mudanças de regimes na condução da política monetária, principalmente com a mudança do regime cambial. Para tanto o modelo estimado mais adequado para a análise foi um switching regression model que determina endogenamente se há mudança de regimes.
Resumo:
A presente tese engloba três artigos sobre diferencial de salários e estimação de demanda no Brasil. O primeiro artigo investiga o diferencial de salários entre os trabalhadores dos setores público e privado. A principal contribuição deste estudo é a estimação de um modelo de regressão com mudança endógena (endogenous switching regression model), que corrige o viés de seleção no processo de escolha setorial realizada pelos trabalhadores e permite a identificação de fatores determinantes na entrada do trabalhador no mercado de trabalho do setor público. O objetivo do segundo trabalho é calcular a elasticidade-preço e a elasticidade-despesa de 25 produtos alimentares das famílias residentes nas áreas rurais e urbanas do Brasil. Para tanto, foram estimados dois sistemas de equações de demanda por alimentos, um referente às famílias residentes nas áreas rurais do país e o outro sistema associado às famílias residentes nas áreas urbanas. O terceiro artigo busca testar a validade do modelo unitário para solteiros(as) e a validade do modelo de racionalidade coletiva de Browning e Chiappori (1998) para casais no Brasil. Para tanto, foi estimado um sistema de demanda do consumo brasileiro com base no modelo QUAIDS, que apresenta uma estrutura de preferências flexível o suficiente para permitir curvas de Engel quadráticas.
Resumo:
The current study applies a two-state switching regression model to examine the behavior of a hypothetical portfolio of ten socially responsible (SRI) equity mutual funds during the expansion and contraction phases of US business cycles between April 1991 and June 2009, based on the Carhart four-factor model, using monthly data. The model identified a business cycle effect on the performance of SRI equity mutual funds. Fund returns were less volatile during expansion/peaks than during contraction/troughs, as indicated by the standard deviation of returns. During contraction/troughs, fund excess returns were explained by the differential in returns between small and large companies, the difference between the returns on stocks trading at high and low Book-to-Market Value, the market excess return over the risk-free rate, and fund objective. During contraction/troughs, smaller companies offered higher returns than larger companies (ci = 0.26, p = 0.01), undervalued stocks out-performed high growth stocks (h i = 0.39, p <0.0001), and funds with growth objectives out-performed funds with other objectives (oi = 0.01, p = 0.02). The hypothetical SRI portfolio was less risky than the market (bi = 0.74, p <0.0001). During expansion/peaks, fund excess returns were explained by the market excess return over the risk-free rate, and fund objective. Funds with other objectives, such as balanced funds and income funds out-performed funds with growth objectives (oi = −0.01, p = 0.03). The hypothetical SRI portfolio exhibited similar risk as the market (bi = 0.93, p <0.0001). The SRI investor adds a third criterion to the risk and return trade-off of traditional portfolio theory. This constraint is social performance. The research suggests that managers of SRI equity mutual funds may diminish value by using social and ethical criteria to select stocks, but add value by superior stock selection. The result is that the performance of SRI mutual funds is very similar to that of the market. There was no difference in the value added among secular SRI, religious SRI, and vice screens.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
The objective of this study was to estimate (co)variance components using random regression on B-spline functions to weight records obtained from birth to adulthood. A total of 82 064 weight records of 8145 females obtained from the data bank of the Nellore Breeding Program (PMGRN/Nellore Brazil) which started in 1987, were used. The models included direct additive and maternal genetic effects and animal and maternal permanent environmental effects as random. Contemporary group and dam age at calving (linear and quadratic effect) were included as fixed effects, and orthogonal Legendre polynomials of age (cubic regression) were considered as random covariate. The random effects were modeled using B-spline functions considering linear, quadratic and cubic polynomials for each individual segment. Residual variances were grouped in five age classes. Direct additive genetic and animal permanent environmental effects were modeled using up to seven knots (six segments). A single segment with two knots at the end points of the curve was used for the estimation of maternal genetic and maternal permanent environmental effects. A total of 15 models were studied, with the number of parameters ranging from 17 to 81. The models that used B-splines were compared with multi-trait analyses with nine weight traits and to a random regression model that used orthogonal Legendre polynomials. A model fitting quadratic B-splines, with four knots or three segments for direct additive genetic effect and animal permanent environmental effect and two knots for maternal additive genetic effect and maternal permanent environmental effect, was the most appropriate and parsimonious model to describe the covariance structure of the data. Selection for higher weight, such as at young ages, should be performed taking into account an increase in mature cow weight. Particularly, this is important in most of Nellore beef cattle production systems, where the cow herd is maintained on range conditions. There is limited modification of the growth curve of Nellore cattle with respect to the aim of selecting them for rapid growth at young ages while maintaining constant adult weight.