297 resultados para LOGISTIC REGRESSION
Resumo:
The high morbidity and mortality associated with atherosclerotic coronary vascular disease (CVD) and its complications are being lessened by the increased knowledge of risk factors, effective preventative measures and proven therapeutic interventions. However, significant CVD morbidity remains and sudden cardiac death continues to be a presenting feature for some subsequently diagnosed with CVD. Coronary vascular disease is also the leading cause of anaesthesia related complications. Stress electrocardiography/exercise testing is predictive of 10 year risk of CVD events and the cardiovascular variables used to score this test are monitored peri-operatively. Similar physiological time-series datasets are being subjected to data mining methods for the prediction of medical diagnoses and outcomes. This study aims to find predictors of CVD using anaesthesia time-series data and patient risk factor data. Several pre-processing and predictive data mining methods are applied to this data. Physiological time-series data related to anaesthetic procedures are subjected to pre-processing methods for removal of outliers, calculation of moving averages as well as data summarisation and data abstraction methods. Feature selection methods of both wrapper and filter types are applied to derived physiological time-series variable sets alone and to the same variables combined with risk factor variables. The ability of these methods to identify subsets of highly correlated but non-redundant variables is assessed. The major dataset is derived from the entire anaesthesia population and subsets of this population are considered to be at increased anaesthesia risk based on their need for more intensive monitoring (invasive haemodynamic monitoring and additional ECG leads). Because of the unbalanced class distribution in the data, majority class under-sampling and Kappa statistic together with misclassification rate and area under the ROC curve (AUC) are used for evaluation of models generated using different prediction algorithms. The performance based on models derived from feature reduced datasets reveal the filter method, Cfs subset evaluation, to be most consistently effective although Consistency derived subsets tended to slightly increased accuracy but markedly increased complexity. The use of misclassification rate (MR) for model performance evaluation is influenced by class distribution. This could be eliminated by consideration of the AUC or Kappa statistic as well by evaluation of subsets with under-sampled majority class. The noise and outlier removal pre-processing methods produced models with MR ranging from 10.69 to 12.62 with the lowest value being for data from which both outliers and noise were removed (MR 10.69). For the raw time-series dataset, MR is 12.34. Feature selection results in reduction in MR to 9.8 to 10.16 with time segmented summary data (dataset F) MR being 9.8 and raw time-series summary data (dataset A) being 9.92. However, for all time-series only based datasets, the complexity is high. For most pre-processing methods, Cfs could identify a subset of correlated and non-redundant variables from the time-series alone datasets but models derived from these subsets are of one leaf only. MR values are consistent with class distribution in the subset folds evaluated in the n-cross validation method. For models based on Cfs selected time-series derived and risk factor (RF) variables, the MR ranges from 8.83 to 10.36 with dataset RF_A (raw time-series data and RF) being 8.85 and dataset RF_F (time segmented time-series variables and RF) being 9.09. The models based on counts of outliers and counts of data points outside normal range (Dataset RF_E) and derived variables based on time series transformed using Symbolic Aggregate Approximation (SAX) with associated time-series pattern cluster membership (Dataset RF_ G) perform the least well with MR of 10.25 and 10.36 respectively. For coronary vascular disease prediction, nearest neighbour (NNge) and the support vector machine based method, SMO, have the highest MR of 10.1 and 10.28 while logistic regression (LR) and the decision tree (DT) method, J48, have MR of 8.85 and 9.0 respectively. DT rules are most comprehensible and clinically relevant. The predictive accuracy increase achieved by addition of risk factor variables to time-series variable based models is significant. The addition of time-series derived variables to models based on risk factor variables alone is associated with a trend to improved performance. Data mining of feature reduced, anaesthesia time-series variables together with risk factor variables can produce compact and moderately accurate models able to predict coronary vascular disease. Decision tree analysis of time-series data combined with risk factor variables yields rules which are more accurate than models based on time-series data alone. The limited additional value provided by electrocardiographic variables when compared to use of risk factors alone is similar to recent suggestions that exercise electrocardiography (exECG) under standardised conditions has limited additional diagnostic value over risk factor analysis and symptom pattern. The effect of the pre-processing used in this study had limited effect when time-series variables and risk factor variables are used as model input. In the absence of risk factor input, the use of time-series variables after outlier removal and time series variables based on physiological variable values’ being outside the accepted normal range is associated with some improvement in model performance.
Resumo:
This paper presents the results of a structural equation model (SEM) for describing and quantifying the fundamental factors that affect contract disputes between owners and contractors in the construction industry. Through this example, the potential impact of SEM analysis in construction engineering and management research is illustrated. The purpose of the specific model developed in this research is to explain how and why contract related construction problems occur. This study builds upon earlier work, which developed a disputes potential index, and the likelihood of construction disputes was modeled using logistic regression. In this earlier study, questionnaires were completed on 159 construction projects, which measured both qualitative and quantitative aspects of contract disputes, management ability, financial planning, risk allocation, and project scope definition for both owners and contractors. The SEM approach offers several advantages over the previously employed logistic regression methodology. The final set of structural equations provides insight into the interaction of the variables that was not apparent in the original logistic regression modeling methodology.
Resumo:
This dissertation is primarily an applied statistical modelling investigation, motivated by a case study comprising real data and real questions. Theoretical questions on modelling and computation of normalization constants arose from pursuit of these data analytic questions. The essence of the thesis can be described as follows. Consider binary data observed on a two-dimensional lattice. A common problem with such data is the ambiguity of zeroes recorded. These may represent zero response given some threshold (presence) or that the threshold has not been triggered (absence). Suppose that the researcher wishes to estimate the effects of covariates on the binary responses, whilst taking into account underlying spatial variation, which is itself of some interest. This situation arises in many contexts and the dingo, cypress and toad case studies described in the motivation chapter are examples of this. Two main approaches to modelling and inference are investigated in this thesis. The first is frequentist and based on generalized linear models, with spatial variation modelled by using a block structure or by smoothing the residuals spatially. The EM algorithm can be used to obtain point estimates, coupled with bootstrapping or asymptotic MLE estimates for standard errors. The second approach is Bayesian and based on a three- or four-tier hierarchical model, comprising a logistic regression with covariates for the data layer, a binary Markov Random field (MRF) for the underlying spatial process, and suitable priors for parameters in these main models. The three-parameter autologistic model is a particular MRF of interest. Markov chain Monte Carlo (MCMC) methods comprising hybrid Metropolis/Gibbs samplers is suitable for computation in this situation. Model performance can be gauged by MCMC diagnostics. Model choice can be assessed by incorporating another tier in the modelling hierarchy. This requires evaluation of a normalization constant, a notoriously difficult problem. Difficulty with estimating the normalization constant for the MRF can be overcome by using a path integral approach, although this is a highly computationally intensive method. Different methods of estimating ratios of normalization constants (N Cs) are investigated, including importance sampling Monte Carlo (ISMC), dependent Monte Carlo based on MCMC simulations (MCMC), and reverse logistic regression (RLR). I develop an idea present though not fully developed in the literature, and propose the Integrated mean canonical statistic (IMCS) method for estimating log NC ratios for binary MRFs. The IMCS method falls within the framework of the newly identified path sampling methods of Gelman & Meng (1998) and outperforms ISMC, MCMC and RLR. It also does not rely on simplifying assumptions, such as ignoring spatio-temporal dependence in the process. A thorough investigation is made of the application of IMCS to the three-parameter Autologistic model. This work introduces background computations required for the full implementation of the four-tier model in Chapter 7. Two different extensions of the three-tier model to a four-tier version are investigated. The first extension incorporates temporal dependence in the underlying spatio-temporal process. The second extensions allows the successes and failures in the data layer to depend on time. The MCMC computational method is extended to incorporate the extra layer. A major contribution of the thesis is the development of a fully Bayesian approach to inference for these hierarchical models for the first time. Note: The author of this thesis has agreed to make it open access but invites people downloading the thesis to send her an email via the 'Contact Author' function.
Resumo:
Background/Rationale Guided by the need-driven dementia-compromised behavior (NDB) model, this study examined influences of the physical environment on wandering behavior. Methods Using a descriptive, cross-sectional design, 122 wanderers from 28 long-term care (LTC) facilities were videotaped 10 to 12 times; data on wandering, light, sound, temperature and humidity levels, location, ambiance, and crowding were obtained. Associations between environmental variables and wandering were evaluated with chi-square and t tests; the model was evaluated using logistic regression. Results In all, 80% of wandering occurred in the resident’s own room, dayrooms, hallways, or dining rooms. When observed in other residents’ rooms, hallways, shower/baths, or off-unit locations, wanderers were likely (60%-92% of observations) to wander. The data were a good fit to the model overall (LR [logistic regression] χ2 (5) = 50.38, P < .0001) and by wandering type. Conclusions Location, light, sound, proximity of others, and ambiance are associated with wandering and may serve to inform environmental designs and care practices.
Resumo:
Persistent use of safety restraints prevents deaths and reduces the severity and number of injuries resulting from motor vehicle crashes. However, safety-restraint use rates in the United States have been below those of other nations with safety-restraint enforcement laws. With a better understanding of the relationship between safety-restraint law enforcement and safety-restraint use, programs can be implemented to decrease the number of deaths and injuries resulting from motor vehicle crashes. Does safety-restraint use increase as enforcement increases? Do motorists increase their safety-restraint use in response to the general presence of law enforcement or to targeted law enforcement efforts? Does a relationship between enforcement and restraint use exist at the countywide level? A logistic regression model was estimated by using county-level safety-restraint use data and traffic citation statistics collected in 13 counties within the state of Florida in 1997. The model results suggest that safety-restraint use is positively correlated with enforcement intensity, is negatively correlated with safety-restraint enforcement coverage (in lanemiles of enforcement coverage), and is greater in urban than rural areas. The quantification of these relationships may assist Florida and other law enforcement agencies in raising safety-restraint use rates by allocating limited funds more efficiently either by allocating additional time for enforcement activities of the existing force or by increasing enforcement staff. In addition, the research supports a commonsense notion that enforcement activities do result in behavioral response.
Resumo:
The relationship between the quality of parent-child interactions and positive child developmental trajectories is well established (Guralnick, 2006; Shonkoff & Meissels, 2000; Zubrick et al., 2008). However, a range of parental, family, and socio-economic factors can pose risks to parents’ capacity to participate in quality interactions with their children. In particular, families with a child with a disability have been found to have higher levels of parenting stress, and are more likely to experience economic disadvantage, as well as social isolation. The importance of early interventions to promote positive parenting and child development for these families is widely recognised (Shonkoff & Meissels, 2000). However, to date, there is a lack of evidence about the effectiveness of early parenting programs for families who have a young child with a disability. This thesis investigates the impact of a music therapy parenting program, Sing & Grow, on 201 parent-child dyads who attended programs specifically targeted to parents who had a young child with a disability. Sing & Grow is an Australian national early parenting intervention funded by the Australian Government Department of Families, Housing, Community Services and Indigenous Affairs and delivered by Playgroup Queensland. It is designed and delivered by Registered Music Therapists for families with children aged from birth to three years. It aims to improve parenting skills and confidence, improve family functioning (positive parent-child interactions), enhance child development, and provide social networking opportunities to socially isolated families. The intervention targets a range of families in circumstances that have the potential to impact negatively on family functioning. This thesis uses data from the National Evaluation Study of Sing & Grow from programs which were targeted at families who had a young child with a disability. Three studies were conducted to address the objectives of this thesis. Study 1 examines the effects of the Sing & Grow intervention on parent reported pre and post parent mental health, parenting confidence, parenting skills, and child development, and other parent reported outcomes including social support, use of intervention resources, satisfaction with the intervention and perceived benefits of and barriers to participation. Significant improvements from pre to post were found for parent mental health and parent reported child communication and social skills, along with evidence that parents were very satisfied with the program and that it brought social benefits to families. Study 2 explored the pre to post effects of the intervention on children’s developmental skills and parent-child interactions using observational ratings made by clinicians. Significant pre to post improvements were found for parenting sensitivity, parental engagement with child and acceptance of child as well as for child responsiveness to parent, interest, and participation in the intervention, and social skills. Study 3 examined the nature of child and family characteristics that predicted better outcomes for families while taking account of the level of participation in the program. An overall outcome index was calculated and served as the dependent variable in a logistic regression analysis. Families who attended six or more sessions and mothers who had not completed high school were more likely to have higher outcome scores at post intervention than those who attended fewer sessions and those with more educated mothers respectively. The findings of this research indicate that the intervention had a positive impact on participants’ mental health, parenting behaviours and child development and that level of attendance was associated with better outcomes. There was also evidence that the program reached its target of high risk families (i.e., families in which mothers had lower educational levels) and that for these families better outcomes were achieved. There were also indications that the program was accessible and highly regarded by families and that it promoted social connections for participants. A theoretical model of how the intervention is currently working for families is proposed to explain the connections between early parenting, child development and maternal wellbeing. However, more research is required to further elucidate the mechanisms by which the intervention creates change for families. This research presents promising evidence that a short term group music therapy program can elicit important therapeutic benefits for families who have a child with a disability.
Resumo:
Pedal cyclists are over-represented in traffic crash injuries in Australia. This study examined correlates of cycling injuries in a sample of Queensland cyclists. Members of Bicycle Queensland (n=1976) were asked about cycling injuries as part of an online survey. They also reported demographic characteristics, reasons for cycling, years of cycling as an adult, and cycling frequency. Multivariate logistic regression modelling was used to examine the association between these variables and experiencing cycling injuries last year (yes/no). Thirty-one percent of respondents (n=617) reported at least one cycling injury. Respondents had greater likelihood of injury if they cycled more frequently, had cycled <5 years, or cycled for recreation or competition. These findings suggest that injuries are mostly likely to occur among less experienced cyclists, those cycling the most, and those cycling for sport and recreation. Injury prevention interventions should include cycle skills training along with fostering safer cycling environments.
Resumo:
Now in its second edition, this book describes tools that are commonly used in transportation data analysis. The first part of the text provides statistical fundamentals while the second part presents continuous dependent variable models. With a focus on count and discrete dependent variable models, the third part features new chapters on mixed logit models, logistic regression, and ordered probability models. The last section provides additional coverage of Bayesian statistical modeling, including Bayesian inference and Markov chain Monte Carlo methods. Data sets are available online to use with the modeling techniques discussed.
Resumo:
Background: Physical activity (PA) is recommended for managing osteoarthritis (OA). However, few people with OA are physically active. Understanding the factors associated with PA is necessary to increase PA in this population. This cross-sectional study examined factors associated with leisure-time PA, stretching exercises, and strengthening exercises in people with OA. Methods: For a mail survey, 485 individuals, aged 68.0 y (SD=10.6) with hip or knee OA, were asked about factors that may influence PA participation, including use of non-PA OA management strategies and both psychological and physical health-related factors. Associations between factors and each PA outcome were examined in multivariable logistic regression models. Results: Non-PA management strategies were the main factors associated with the outcomes. Information/education courses, heat/cold treatments, and paracetamol were associated with stretching and strengthening exercises (P<0.05). Hydrotherapy and magnet therapy were associated with leisure-time PA; using orthotics and massage therapy, with stretching exercises; and occupational therapy, with strengthening exercises (P<0.05). Few psychological or health15 related factors were associated with the outcomes. Conclusions: Some management strategies may make it easier for people with OA to be physically active, and could be promoted to encourage PA. Providers of strategies are potential avenues for recruiting people with OA into PA programs.
Resumo:
Background: The C allele of a common polymorphism of the serotonin 2A receptor (HTR2A) gene, T102C, results in reduced synthesis of 5-HT2A receptors and has been associated with current smoking status in adults. The -1438A/G polymorphism, located in the regulatory region of this gene, is in linkage disequilibrium with T102C, and the A allele is associated with increased promoter activity and with smoking in adult males. We investigated the contributions of the HTR2A gene, chronic psychological stress, and impulsivity to the prediction of cigarette smoking status and dependence in young adults. Methods: T102C and -1438A/G genotyping was conducted on 132 healthy Caucasian young adults (47 smokers) who completed self-report measures of chronic stress, depressive symptoms, impulsive personality and cigarette use. Results: A logistic regression analysis of current cigarette smoker user status, after adjusting for gender, depressive symptom severity and chronic stress, indicated that the T102C TT genotype relative to the CC genotype (OR = 7.53), and lower punishment sensitivity (OR = 0.91) were each significant predictive risk factors. However, for number of cigarettes smoked, only lower punishment sensitivity was a significant predictor (OR = 0.81). Conclusions: These data indicate the importance of the T102C polymorphism to tobacco use but not number of cigarettes smoked for Caucasian young adults. Future studies should examine whether this is explained by effects of nicotine on the serotonin system. Lower punishment sensitivity increased risk of both smoking and of greater consumption, perhaps via a reduced sensitivity to cigarette health warnings and negative physiological effects.
Resumo:
Recent epidemiologic studies have suggested that ultraviolet radiation (UV) may protect against non-Hodgkin lymphoma (NHL), but few, if any, have assessed multiple indicators of ambient and personal UV exposure. Using the US Radiologic Technologists study, we examined the association between NHL and self-reported time outdoors in summer, as well as average year-round and seasonal ambient exposures based on satellite estimates for different age periods, and sun susceptibility in participants who had responded to two questionnaires (1994–1998, 2003–2005) and who were cancer-free as of the earlier questionnaire. Using unconditional logistic regression, we estimated the odds ratio (OR) and 95% confidence intervals for 64,103 participants with 137 NHL cases. Self-reported time outdoors in summer was unrelated to risk. Lower risk was somewhat related to higher average year-round and winter ambient exposure for the period closest in time, and prior to, diagnosis (ages 20–39). Relative to 1.0 for the lowest quartile of average year-round ambient UV, the estimated OR for successively higher quartiles was 0.68 (0.42–1.10); 0.82 (0.52–1.29); and 0.64 (0.40–1.03), p-trend = 0.06), for this age period. The lower NHL risk associated with higher year-round average and winter ambient UV provides modest additional support for a protective relationship between UV and NHL.
Resumo:
Background and Aim: To investigate participation in a second round of colorectal cancer screening using a fecal occult blood test (FOBT) in an Australian rural community, and to assess the demographic characteristics and individual perspectives associated with repeat screening. ---------- Methods: Potential participants from round 1 (50–74 years of age) were sent an intervention package and asked to return a completed FOBT (n = 3406). Doctors of participants testing positive referred to colonoscopy as appropriate. Following screening, 119 participants completed qualitative telephone interviews. Multivariable logistic regression models evaluated the association between round-2 participation and other variables.---------- Results: Round-2 participation was 34.7%; the strongest predictor was participation in round 1. Repeat participants were more likely to be female; inconsistent screeners were more likely to be younger (aged 50–59 years). The proportion of positive FOBT was 12.7%, that of colonoscopy compliance was 98.6%, and the positive predictive value for cancer or adenoma of advanced pathology was 23.9%. Reasons for participation included testing as a precautionary measure or having family history/friends with colorectal cancer; reasons for non-participation included apathy or doctors’ advice against screening.---------- Conclusion: Participation was relatively low and consistent across rounds. Unless suitable strategies are identified to overcome behavioral trends and/or to screen out ineligible participants, little change in overall participation rates can be expected across rounds.
Resumo:
Since the 1970s the internationalisation process of firms has attracted wide research interest. One of the dominant explanations of firm internationalisation resulting from this research activity is the Uppsala stages model. In this paper, a pre-internationalisation phase is incorporated into the traditional Uppsala model to address the question: What are the antecedents of this model? Four concepts are proposed as the key components that define the experiential learning process underlying a firm’s pre-export phase: export stimuli, attitudinal/psychological commitment, resources and lateral rigidity. Through a survey of 290 Australian exporting and non-exporting small-medium sized firms, data relating to the four pre-internationalisation concepts is collected and an Export Readiness Index (ERI) is constructed through factor analysis. Using logistic regression, the ERI is tested as a tool for analysing export readiness among Australian SMEs.