35 resultados para multicollinearity


Relevância:

20.00% 20.00%

Publicador:

Resumo:

It is commonly perceived that variables ‘measuring’ different dimensions of teaching (construed as instructional attributes) used in student evaluation of teaching (SET) questionnaires are so highly correlated that they pose a serious multicollinearity problem for quantitative analysis including regression analysis. Using nearly 12000 individual student responses to SET questionnaires and ten key dimensions of teaching and 25 courses at various undergraduate and postgraduate levels for multiple years at a large Australian university, this paper investigates whether this is indeed the case and if so under what circumstances. This paper tests this proposition first by examining variance inflation factors (VIFs), across courses, levels and over time using individual responses; and secondly by using class averages. In the first instance, the paper finds no sustainable evidence of multicollinearity. While, there were one or two isolated cases of VIFs marginally exceeding the conservative threshold of 5, in no cases did the VIFs for any of the instructional attributes come anywhere close to the high threshold value of 10. In the second instance, however, the paper finds that the attributes are highly correlated as all the VIFs exceed 10. These findings have two implications: (a) given the ordinal nature of the data ordered probit analysis using individual student responses can be employed to quantify the impact of instructional attributes on TEVAL score; (b) Data based on class averages cannot be used for probit analysis. An illustrative exercise using level 2 undergraduate courses data suggests higher TEVAL scores depend first and foremost on improving explanation, presentation, and organization of lecture materials.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper addresses the investment decisions considering the presence of financial constraints of 373 large Brazilian firms from 1997 to 2004, using panel data. A Bayesian econometric model was used considering ridge regression for multicollinearity problems among the variables in the model. Prior distributions are assumed for the parameters, classifying the model into random or fixed effects. We used a Bayesian approach to estimate the parameters, considering normal and Student t distributions for the error and assumed that the initial values for the lagged dependent variable are not fixed, but generated by a random process. The recursive predictive density criterion was used for model comparisons. Twenty models were tested and the results indicated that multicollinearity does influence the value of the estimated parameters. Controlling for capital intensity, financial constraints are found to be more important for capital-intensive firms, probably due to their lower profitability indexes, higher fixed costs and higher degree of property diversification.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The objective of this work was to assess the degree of multicollinearity and to identify the variables involved in linear dependence relations in additive-dominant models. Data of birth weight (n=141,567), yearling weight (n=58,124), and scrotal circumference (n=20,371) of Montana Tropical composite cattle were used. Diagnosis of multicollinearity was based on the variance inflation factor (VIF) and on the evaluation of the condition indexes and eigenvalues from the correlation matrix among explanatory variables. The first model studied (RM) included the fixed effect of dam age class at calving and the covariates associated to the direct and maternal additive and non-additive effects. The second model (R) included all the effects of the RM model except the maternal additive effects. Multicollinearity was detected in both models for all traits considered, with VIF values of 1.03 - 70.20 for RM and 1.03 - 60.70 for R. Collinearity increased with the increase of variables in the model and the decrease in the number of observations, and it was classified as weak, with condition index values between 10.00 and 26.77. In general, the variables associated with additive and non-additive effects were involved in multicollinearity, partially due to the natural connection between these covariables as fractions of the biological types in breed composition.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Includes bibliographical references (leaves 19-21).

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Understanding the expected safety performance of rural signalized intersections is critical for (a) identifying high-risk sites where the observed safety performance is substantially worse than the expected safety performance, (b) understanding influential factors associated with crashes, and (c) predicting the future performance of sites and helping plan safety-enhancing activities. These three critical activities are routinely conducted for safety management and planning purposes in jurisdictions throughout the United States and around the world. This paper aims to develop baseline expected safety performance functions of rural signalized intersections in South Korea, which to date have not yet been established or reported in the literature. Data are examined from numerous locations within South Korea for both three-legged and four-legged configurations. The safety effects of a host of operational and geometric variables on the safety performance of these sites are also examined. In addition, supplementary tables and graphs are developed for comparing the baseline safety performance of sites with various geometric and operational features. These graphs identify how various factors are associated with safety. The expected safety prediction tables offer advantages over regression prediction equations by allowing the safety manager to isolate specific features of the intersections and examine their impact on expected safety. The examination of the expected safety performance tables through illustrated examples highlights the need to correct for regression-to-the-mean effects, emphasizes the negative impacts of multicollinearity, shows why multivariate models do not translate well to accident modification factors, and illuminates the need to examine road safety carefully and methodically. Caveats are provided on the use of the safety performance prediction graphs developed in this paper.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Maternal and infant mortality is a global health issue with a significant social and economic impact. Each year, over half a million women worldwide die due to complications related to pregnancy or childbirth, four million infants die in the first 28 days of life, and eight million infants die in the first year. Ninety-nine percent of maternal and infant deaths are in developing countries. Reducing maternal and infant mortality is among the key international development goals. In China, the national maternal mortality ratio and infant mortality rate were reduced greatly in the past two decades, yet a large discrepancy remains between urban and rural areas. To address this problem, a large-scale Safe Motherhood Programme was initiated in 2000. The programme was implemented in Guangxi in 2003. Interventions in the programme included both demand-side and supply side-interventions focusing on increasing health service use and improving birth outcomes. Little is known about the effects and economic outcomes of the Safe Motherhood Programme in Guangxi, although it has been implemented for seven years. The aim of this research is to estimate the effectiveness and cost-effectiveness of the interventions in the Safe Motherhood Programme in Guangxi, China. The objectives of this research include: 1. To evaluate whether the changes of health service use and birth outcomes are associated with the interventions in the Safe Motherhood Programme. 2. To estimate the cost-effectiveness of the interventions in the Safe Motherhood Programme and quantify the uncertainty surrounding the decision. 3. To assess the expected value of perfect information associated with both the whole decision and individual parameters, and interpret the findings to inform priority setting in further research and policy making in this area. A quasi-experimental study design was used in this research to assess the effectiveness of the programme in increasing health service use and improving birth outcomes. The study subjects were 51 intervention counties and 30 control counties. Data on the health service use, birth outcomes and socio-economic factors from 2001 to 2007 were collected from the programme database and statistical yearbooks. Based on the profile plots of the data, general linear mixed models were used to evaluate the effectiveness of the programme while controlling for the effects of baseline levels of the response variables, change of socio-economic factors over time and correlations among repeated measurements from the same county. Redundant multicollinear variables were deleted from the mixed model using the results of the multicollinearity diagnoses. For each response variable, the best covariance structure was selected from 15 alternatives according to the fit statistics including Akaike information criterion, Finite-population corrected Akaike information criterion, and Schwarz.s Bayesian information criterion. Residual diagnostics were used to validate the model assumptions. Statistical inferences were made to show the effect of the programme on health service use and birth outcomes. A decision analytic model was developed to evaluate the cost-effectiveness of the programme, quantify the decision uncertainty, and estimate the expected value of perfect information associated with the decision. The model was used to describe the transitions between health states for women and infants and reflect the change of both costs and health benefits associated with implementing the programme. Result gained from the mixed models and other relevant evidence identified were synthesised appropriately to inform the input parameters of the model. Incremental cost-effectiveness ratios of the programme were calculated for the two groups of intervention counties over time. Uncertainty surrounding the parameters was dealt with using probabilistic sensitivity analysis, and uncertainty relating to model assumptions was handled using scenario analysis. Finally the expected value of perfect information for both the whole model and individual parameters in the model were estimated to inform priority setting in further research in this area.The annual change rates of the antenatal care rate and the institutionalised delivery rate were improved significantly in the intervention counties after the programme was implemented. Significant improvements were also found in the annual change rates of the maternal mortality ratio, the infant mortality rate, the incidence rate of neonatal tetanus and the mortality rate of neonatal tetanus in the intervention counties after the implementation of the programme. The annual change rate of the neonatal mortality rate was also improved, although the improvement was only close to statistical significance. The influences of the socio-economic factors on the health service use indicators and birth outcomes were identified. The rural income per capita had a significant positive impact on the health service use indicators, and a significant negative impact on the birth outcomes. The number of beds in healthcare institutions per 1,000 population and the number of rural telephone subscribers per 1,000 were found to be positively significantly related to the institutionalised delivery rate. The length of highway per square kilometre negatively influenced the maternal mortality ratio. The percentage of employed persons in the primary industry had a significant negative impact on the institutionalised delivery rate, and a significant positive impact on the infant mortality rate and neonatal mortality rate. The incremental costs of implementing the programme over the existing practice were US $11.1 million from the societal perspective, and US $13.8 million from the perspective of the Ministry of Health. Overall, 28,711 life years were generated by the programme, producing an overall incremental cost-effectiveness ratio of US $386 from the societal perspective, and US $480 from the perspective of the Ministry of Health, both of which were below the threshold willingness-to-pay ratio of US $675. The expected net monetary benefit generated by the programme was US $8.3 million from the societal perspective, and US $5.5 million from the perspective of the Ministry of Health. The overall probability that the programme was cost-effective was 0.93 and 0.89 from the two perspectives, respectively. The incremental cost-effectiveness ratio of the programme was insensitive to the different estimates of the three parameters relating to the model assumptions. Further research could be conducted to reduce the uncertainty surrounding the decision, in which the upper limit of investment was US $0.6 million from the societal perspective, and US $1.3 million from the perspective of the Ministry of Health. It is also worthwhile to get a more precise estimate of the improvement of infant mortality rate. The population expected value of perfect information for individual parameters associated with this parameter was US $0.99 million from the societal perspective, and US $1.14 million from the perspective of the Ministry of Health. The findings from this study have shown that the interventions in the Safe Motherhood Programme were both effective and cost-effective in increasing health service use and improving birth outcomes in rural areas of Guangxi, China. Therefore, the programme represents a good public health investment and should be adopted and further expanded to an even broader area if possible. This research provides economic evidence to inform efficient decision making in improving maternal and infant health in developing countries.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Over past few decades, frog species have been experiencing dramatic decline around the world. The reason for this decline includes habitat loss, invasive species, climate change and so on. To better know the status of frog species, classifying frogs has become increasingly important. In this study, acoustic features are investigated for multi-level classification of Australian frogs: family, genus and species, including three families, eleven genera and eighty five species which are collected from Queensland, Australia. For each frog species, six instances are selected from which ten acoustic features are calculated. Then, the multicollinearity between ten features are studied for selecting non-correlated features for subsequent analysis. A decision tree (DT) classifier is used to visually and explicitly determine which acoustic features are relatively important for classifying family, which for genus, and which for species. Finally, a weighted support vector machines (SVMs) classifier is used for the multi- level classification with three most important acoustic features respectively. Our experiment results indicate that using different acoustic feature sets can successfully classify frogs at different levels and the average classification accuracy can be up to 85.6%, 86.1% and 56.2% for family, genus and species respectively.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

To protect and restore lake ecosystems under threats posed by the increasing human population, information on their ecological quality is needed. Lake sediments provide a data rich archive that allows identification of various biological components present prior to anthropogenic alterations as well as a constant record of changes. By providing a longer dimension of time than any ongoing monitoring programme, palaeolimnological methods can help in understanding natural variability and long-term ecological changes in lakes. As zooplankton have a central role in the lake food web, their remains can potentially provide versatile information on past trophic structure. However, various taphonomic processes operating in the lakes still raise questions concerning how subfossil assemblages reflect living communities. This thesis work aimed at improving the use of sedimentary zooplankton remains in the reconstruction of past zooplankton communities and the trophic structure in lakes. To quantify interspecific differences in the accumulation of remains, the subfossils of nine pelagic zooplankton taxa in annually laminated sediments were compared with monitoring results for live zooplankton in Lake Vesijärvi. This lake has a known history of eutrophication and recovery, which resulted from reduced external loading and effective fishing of plankti-benthivorous fish. The response of zooplankton assemblages to these known changes was resolved using annually laminated sediments. The generality of the responses observed in Lake Vesijärvi were further tested with a set of 31 lakes in Southern Finland, relating subfossils in surface sediments to contemporary water quality and fish density, as well as to lake morphometry. The results demonstrated differential preservation and retention of cladoceran species in the sediment. Daphnia, Diaphanosoma and Ceriodaphnia were clearly underrepresented in the sediment samples in comparison to well-preserved Bosmina species, Chydorus, Limnosida and Leptodora. For well-preserved species, the annual net accumulation rate was similar to or above the expected values, reflecting effective sediment focusing and accumulation in the deepest part of the lake. The decreased fish density and improved water quality led to subtle changes in zooplankton community composition. The abundance of Diaphanosoma and Limnosida increased after the reduction in fish density, while Ceriodaphnia and rotifers decreased. The most sensitive indicator of fish density was the mean size of Daphnia ephippia and Bosmina (E.) crassicornis ephippia and carapaces. The concentration of plant-associated species increased, reflecting expanding littoral vegetation along with increasing transparency. Several of the patterns observed in Lake Vesijärvi could also be found within the set of 31 lakes. According to this thesis work, the most useful cladoceran-based indices for nutrient status and planktivorous fish density in Finnish lakes were the relative abundances of certain pelagic taxa, and the mean size of Bosmina spp. carapaces, especially those of Bosmina (E.) cf. coregoni. The abundance of plant-associated species reflected the potential area for aquatic plants. Lake morphometry and sediment organic content, however, explained a relatively high proportion of the variance in the species data, and more studies are needed to quantify lake-specific differences in the accumulation and preservation of remains. Commonly occurring multicollinearity between environmental variables obstructs the cladoceran-based reconstruction of single environmental variables. As taphonomic factors and several direct and indirect structuring forces in lake ecosystems simultaneously affect zooplankton, the subfossil assemblages should be studied in a holistic way before making final conclusions about the trophic structure and the change in lake ecological quality.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper we compare conceptualising single factor technical and allocative efficiency as indicators of a single latent variable, or as separate observed variables. In the former case, the impacts on both efficiency types are analysed by means of structural equation modeling (SEM), in the latter by seemingly unrelated regression (SUR). We compare estimation results of the two approaches based on a dataset on single factor irrigation water use efficiency obtained from a survey of 360 farmers in the Guanzhong Plain, China. The main methodological findings are that SEM allows identification of the most important dimension of irrigation water efficiency (technical efficiency) via comparison of their factor scores and reliability. Moreover, it reduces multicollinearity and attenuation bias. It thus is preferable to SUR. The SEM estimates show that perception of water scarcity is the most important positive determinant of both types of efficiency, followed by irrigation infrastructure, income and water price. Furthermore, there is a strong negative reverse effect from efficiency on perception.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Previous studies have reported associations between conspiracist ideation and domain-level facets of schizotypy, but less is known about associations with lower-order facets. In the present study, 447 adults completed measures of conspiracist ideation and the Schizotypal Personality Questionnaire (SPQ), consisting of nine subscales grouped into four domains. Results of a multiple regression showed that two domains of the SPQ significantly predicted conspiracist ideation, but multicollinearity was a limiting factor. In a second regression, we found that the subscales of Odd Beliefs or Magical Thinking and Ideas of Reference significantly predicted conspiracist ideation, without any multicollinearity constraints. We interpret these results as implicating two specific lower-order facets of schizotypy in belief in conspiracy theories. We further contrast the present results with previous studies indicating associations between conspiracist ideation and paranormal beliefs.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

En este estudio se realizó un análisis predictivo de la aparición de eventos adversos de los pacientes de una IPS de Bogotá, Mederi Hospital Universitario de Barrios Unidos (HUBU) durante el año 2013; relacionados con los indicadores de eficiencia hospitalaria (Porcentaje de ocupación hospitalaria, número de egresos hospitalarios, promedio de estancia hospitalaria, número de egresos de urgencias, promedio de estancia en urgencias). Los datos fueron exportados a una matriz de análisis de las variables cualitativas; fueron presentadas con frecuencias absolutas y relativas, las variables cuantitativas (edad, tiempos de estancia) fueron presentadas con media, desviaciones estándar. Se agruparon los datos de eventos adversos y de eficiencia hospitalaria en una nueva matriz que permitiera el análisis predictivo la nueva matriz fue exportada al software de modelación estadístico Eviews 6.5; se especificaron modelos predictivos multivariados para la variable número de eventos adversos, respecto de los indicadores de eficiencia hospitalaria y se estimaron las probabilidades de ocurrencia, análisis de correlación y multicolinealidad; los resultados se presentaron en tablas de estimación para cada modelo, se restringieron los eventos adversos prevenibles y no prevenibles información obtenida a través de un sistema de información que registra los factores relacionados con la ocurrencia de eventos adversos en salud, a través del sistema de reporte de eventos en salud, reporte en las historias clínicas, reporte individual, reporte por servicio, análisis de datos y estudios de caso, de la misma forma fueron extraídos los datos de eficiencia hospitalaria para el mismo periodo. El análisis y gestión de eventos adversos pretende establecer estrategias de mejoramiento continuo y análisis de resultados frente a los indicadores de eficiencia que permitan intervención de los factores de riesgo operativo de los servicios del Hospital Universitario de Barrios Unidos (HUBU), relacionados con eventos adversos en la atención de los pacientes en especial se debe enfocar en la gestión de los egresos de pacientes de acuerdo a los resultados obtenidos con el fin de alinearse y fortalecer las políticas de seguridad del paciente para brindar una atención integral con calidad y eficiencia, disminuyendo las quejas en la atención, las glosas, los riesgos jurídicos, de acuerdo al modelo predictivo estudiado.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

In this paper, we study the influence of the National Telecom Business Volume by the data in 2008 that have been published in China Statistical Yearbook of Statistics. We illustrate the procedure of modeling “National Telecom Business Volume” on the following eight variables, GDP, Consumption Levels, Retail Sales of Social Consumer Goods Total Renovation Investment, the Local Telephone Exchange Capacity, Mobile Telephone Exchange Capacity, Mobile Phone End Users, and the Local Telephone End Users. The testing of heteroscedasticity and multicollinearity for model evaluation is included. We also consider AIC and BIC criterion to select independent variables, and conclude the result of the factors which are the optimal regression model for the amount of telecommunications business and the relation between independent variables and dependent variable. Based on the final results, we propose several recommendations about how to improve telecommunication services and promote the economic development.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper performs a thorough statistical examination of the time-series properties of the daily market volatility index (VIX) from the Chicago Board Options Exchange (CBOE). The motivation lies not only on the widespread consensus that the VIX is a barometer of the overall market sentiment as to what concerns investors' risk appetite, but also on the fact that there are many trading strategies that rely on the VIX index for hedging and speculative purposes. Preliminary analysis suggests that the VIX index displays long-range dependence. This is well in line with the strong empirical evidence in the literature supporting long memory in both options-implied and realized variances. We thus resort to both parametric and semiparametric heterogeneous autoregressive (HAR) processes for modeling and forecasting purposes. Our main ndings are as follows. First, we con rm the evidence in the literature that there is a negative relationship between the VIX index and the S&P 500 index return as well as a positive contemporaneous link with the volume of the S&P 500 index. Second, the term spread has a slightly negative long-run impact in the VIX index, when possible multicollinearity and endogeneity are controlled for. Finally, we cannot reject the linearity of the above relationships, neither in sample nor out of sample. As for the latter, we actually show that it is pretty hard to beat the pure HAR process because of the very persistent nature of the VIX index.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)