297 resultados para reverse logistic regression

em Queensland University of Technology - ePrints Archive


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Numerous expert elicitation methods have been suggested for generalised linear models (GLMs). This paper compares three relatively new approaches to eliciting expert knowledge in a form suitable for Bayesian logistic regression. These methods were trialled on two experts in order to model the habitat suitability of the threatened Australian brush-tailed rock-wallaby (Petrogale penicillata). The first elicitation approach is a geographically assisted indirect predictive method with a geographic information system (GIS) interface. The second approach is a predictive indirect method which uses an interactive graphical tool. The third method uses a questionnaire to elicit expert knowledge directly about the impact of a habitat variable on the response. Two variables (slope and aspect) are used to examine prior and posterior distributions of the three methods. The results indicate that there are some similarities and dissimilarities between the expert informed priors of the two experts formulated from the different approaches. The choice of elicitation method depends on the statistical knowledge of the expert, their mapping skills, time constraints, accessibility to experts and funding available. This trial reveals that expert knowledge can be important when modelling rare event data, such as threatened species, because experts can provide additional information that may not be represented in the dataset. However care must be taken with the way in which this information is elicited and formulated.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The benefits of applying tree-based methods to the purpose of modelling financial assets as opposed to linear factor analysis are increasingly being understood by market practitioners. Tree-based models such as CART (classification and regression trees) are particularly well suited to analysing stock market data which is noisy and often contains non-linear relationships and high-order interactions. CART was originally developed in the 1980s by medical researchers disheartened by the stringent assumptions applied by traditional regression analysis (Brieman et al. [1984]). In the intervening years, CART has been successfully applied to many areas of finance such as the classification of financial distress of firms (see Frydman, Altman and Kao [1985]), asset allocation (see Sorensen, Mezrich and Miller [1996]), equity style timing (see Kao and Shumaker [1999]) and stock selection (see Sorensen, Miller and Ooi [2000])...

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This dissertation is primarily an applied statistical modelling investigation, motivated by a case study comprising real data and real questions. Theoretical questions on modelling and computation of normalization constants arose from pursuit of these data analytic questions. The essence of the thesis can be described as follows. Consider binary data observed on a two-dimensional lattice. A common problem with such data is the ambiguity of zeroes recorded. These may represent zero response given some threshold (presence) or that the threshold has not been triggered (absence). Suppose that the researcher wishes to estimate the effects of covariates on the binary responses, whilst taking into account underlying spatial variation, which is itself of some interest. This situation arises in many contexts and the dingo, cypress and toad case studies described in the motivation chapter are examples of this. Two main approaches to modelling and inference are investigated in this thesis. The first is frequentist and based on generalized linear models, with spatial variation modelled by using a block structure or by smoothing the residuals spatially. The EM algorithm can be used to obtain point estimates, coupled with bootstrapping or asymptotic MLE estimates for standard errors. The second approach is Bayesian and based on a three- or four-tier hierarchical model, comprising a logistic regression with covariates for the data layer, a binary Markov Random field (MRF) for the underlying spatial process, and suitable priors for parameters in these main models. The three-parameter autologistic model is a particular MRF of interest. Markov chain Monte Carlo (MCMC) methods comprising hybrid Metropolis/Gibbs samplers is suitable for computation in this situation. Model performance can be gauged by MCMC diagnostics. Model choice can be assessed by incorporating another tier in the modelling hierarchy. This requires evaluation of a normalization constant, a notoriously difficult problem. Difficulty with estimating the normalization constant for the MRF can be overcome by using a path integral approach, although this is a highly computationally intensive method. Different methods of estimating ratios of normalization constants (N Cs) are investigated, including importance sampling Monte Carlo (ISMC), dependent Monte Carlo based on MCMC simulations (MCMC), and reverse logistic regression (RLR). I develop an idea present though not fully developed in the literature, and propose the Integrated mean canonical statistic (IMCS) method for estimating log NC ratios for binary MRFs. The IMCS method falls within the framework of the newly identified path sampling methods of Gelman & Meng (1998) and outperforms ISMC, MCMC and RLR. It also does not rely on simplifying assumptions, such as ignoring spatio-temporal dependence in the process. A thorough investigation is made of the application of IMCS to the three-parameter Autologistic model. This work introduces background computations required for the full implementation of the four-tier model in Chapter 7. Two different extensions of the three-tier model to a four-tier version are investigated. The first extension incorporates temporal dependence in the underlying spatio-temporal process. The second extensions allows the successes and failures in the data layer to depend on time. The MCMC computational method is extended to incorporate the extra layer. A major contribution of the thesis is the development of a fully Bayesian approach to inference for these hierarchical models for the first time. Note: The author of this thesis has agreed to make it open access but invites people downloading the thesis to send her an email via the 'Contact Author' function.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We investigate the utility to computational Bayesian analyses of a particular family of recursive marginal likelihood estimators characterized by the (equivalent) algorithms known as "biased sampling" or "reverse logistic regression" in the statistics literature and "the density of states" in physics. Through a pair of numerical examples (including mixture modeling of the well-known galaxy dataset) we highlight the remarkable diversity of sampling schemes amenable to such recursive normalization, as well as the notable efficiency of the resulting pseudo-mixture distributions for gauging prior-sensitivity in the Bayesian model selection context. Our key theoretical contributions are to introduce a novel heuristic ("thermodynamic integration via importance sampling") for qualifying the role of the bridging sequence in this procedure, and to reveal various connections between these recursive estimators and the nested sampling technique.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Expert elicitation is the process of retrieving and quantifying expert knowledge in a particular domain. Such information is of particular value when the empirical data is expensive, limited, or unreliable. This paper describes a new software tool, called Elicitator, which assists in quantifying expert knowledge in a form suitable for use as a prior model in Bayesian regression. Potential environmental domains for applying this elicitation tool include habitat modeling, assessing detectability or eradication, ecological condition assessments, risk analysis, and quantifying inputs to complex models of ecological processes. The tool has been developed to be user-friendly, extensible, and facilitate consistent and repeatable elicitation of expert knowledge across these various domains. We demonstrate its application to elicitation for logistic regression in a geographically based ecological context. The underlying statistical methodology is also novel, utilizing an indirect elicitation approach to target expert knowledge on a case-by-case basis. For several elicitation sites (or cases), experts are asked simply to quantify their estimated ecological response (e.g. probability of presence), and its range of plausible values, after inspecting (habitat) covariates via GIS.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Associations between single nucleotide polymorphisms (SNPs) at 5p15 and multiple cancer types have been reported. We have previously shown evidence for a strong association between prostate cancer (PrCa) risk and rs2242652 at 5p15, intronic in the telomerase reverse transcriptase (TERT) gene that encodes TERT. To comprehensively evaluate the association between genetic variation across this region and PrCa, we performed a fine-mapping analysis by genotyping 134 SNPs using a custom Illumina iSelect array or Sequenom MassArray iPlex, followed by imputation of 1094 SNPs in 22 301 PrCa cases and 22 320 controls in The PRACTICAL consortium. Multiple stepwise logistic regression analysis identified four signals in the promoter or intronic regions of TERT that independently associated with PrCa risk. Gene expression analysis of normal prostate tissue showed evidence that SNPs within one of these regions also associated with TERT expression, providing a potential mechanism for predisposition to disease.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Males of lek-breeding species defend clustered territories from which they display to visiting females. However, the mechanisms leading to the adoption of clustered male display sites are often unknown. In this study, we examined the possibility of a resource-based lek in New Zealand’s lesser short-tailed bat (Mystacina tuberculata) (Mammalia: Chiroptera), by assessing the placement of “singing roosts” used by males in relation to communal roosting sites used by females. The “resource-based lek” model posits that males settle near resources required by females to increase female encounter rates. For most bat species, where females are highly mobile and widely dispersed across landscapes while foraging, communal daytime roosts dominated by females may represent such a resource. Through use of video footage, spatial analyses of singing-roost locations, and passive-integrated transponder tags we confirmed that M. tuberculata employs a lek mating system. We found that male singing roosts were significantly clustered in space, were defended by resident individuals, and were visited by females (who did not receive resources from males) for mating purposes. Transponder records also indicated that some singing roosts were shared between multiple males. Spatial logistic regression indicated that singing-roost locations were associated with communal roosting sites. Communal roosts are selected based on criteria independent of the locations of singing roosts, suggesting that males responded to the location of communal roosts and not the reverse. Mystacina tuberculata thus provides evidence of a resource-based lek, and is only the second bat species worldwide confirmed to use a lek-mating system.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background There is evidence that certain mutations in the double-strand break repair pathway ataxia-telangiectasia mutated gene act in a dominant-negative manner to increase the risk of breast cancer. There are also some reports to suggest that the amino acid substitution variants T2119C Ser707Pro and C3161G Pro1054Arg may be associated with breast cancer risk. We investigate the breast cancer risk associated with these two nonconservative amino acid substitution variants using a large Australian population-based case–control study. Methods The polymorphisms were genotyped in more than 1300 cases and 600 controls using 5' exonuclease assays. Case–control analyses and genotype distributions were compared by logistic regression. Results The 2119C variant was rare, occurring at frequencies of 1.4 and 1.3% in cases and controls, respectively (P = 0.8). There was no difference in genotype distribution between cases and controls (P = 0.8), and the TC genotype was not associated with increased risk of breast cancer (adjusted odds ratio = 1.08, 95% confidence interval = 0.59–1.97, P = 0.8). Similarly, the 3161G variant was no more common in cases than in controls (2.9% versus 2.2%, P = 0.2), there was no difference in genotype distribution between cases and controls (P = 0.1), and the CG genotype was not associated with an increased risk of breast cancer (adjusted odds ratio = 1.30, 95% confidence interval = 0.85–1.98, P = 0.2). This lack of evidence for an association persisted within groups defined by the family history of breast cancer or by age. Conclusion The 2119C and 3161G amino acid substitution variants are not associated with moderate or high risks of breast cancer in Australian women.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

There is increased recognition that determinants of health should be investigated in a life-course perspective. Retirement is a major transition in the life course and offers opportunities for changes in physical activity that may improve health in the aging population. The authors examined the effect of retirement on changes in physical activity in the GLOBE Study, a prospective cohort study known by the Dutch acronym for "Health and Living Conditions of the Population of Eindhoven and surroundings," 1991–2004. They followed respondents (n = 971) by postal questionnaire who were employed and aged 40–65 years in 1991 for 13 years, after which they were still employed (n = 287) or had retired (n = 684). Physical activity included 1) work-related transportation, 2) sports participation, and 3) nonsports leisure-time physical activity. Multinomial logistic regression analyses indicated that retirement was associated with a significantly higher odds for a decline in physical activity from work-related transportation (odds ratio (OR) = 3.03, 95% confidence interval (CI): 1.97, 4.65), adjusted for sex, age, marital status, chronic diseases, and education, compared with remaining employed. Retirement was not associated with an increase in sports participation (OR = 1.12, 95% CI: 0.71, 1.75) or nonsports leisure-time physical activity (OR = 0.80, 95% CI: 0.54, 1.19). In conclusion, retirement introduces a reduction in physical activity from work-related transportation that is not compensated for by an increase in sports participation or an increase in nonsports leisure-time physical activity.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

It is unclear which theoretical dimension of psychological stress affects health status. We hypothesized that both distress and coping mediate the relationship between socio-economic position and tooth loss. Cross-sectional data from 2915 middle-aged adults evaluated retention of < 20 teeth, behaviors, psychological stress, and sociodemographic characteristics. Principal components analysis of the Perceived Stress Scale (PSS) extracted 'distress' (a = 0.85) and 'coping' (a =0.83) factors, consistent with theory. Hierarchical entry of explanatory variables into age- and sex-adjusted logistic regression models estimated odds ratios (OR) and 95% confidence intervals [95% CI] for retention of < 20 teeth. Analysis of the separate contributions of distress and coping revealed a significant main effect of coping (OR = 0.7 [95% CI = 0.7-0.8]), but no effect for distress (OR = 1.0 [95% CI = 0.9-1.1]) or for the interaction of coping and distress. Behavior and psychological stress only modestly attenuated socio-economic inequality in retention of < 20 teeth, providing evidence to support a mediating role of coping.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background: Although low back pain (LBP) is an important issue for the health profession, few studies have examined LBP among occupational therapy students. Purpose. To investigate the prevalence and distribution of LBP, its adverse sequelae; and to identify potential risk factors.----------- Methods: In 2005, a self-reported questionnaire was administered to occupational therapy students in Northern Queensland.----------- Findings: The 12-month period-prevalence of LBP was 64.6%. Nearly half (46.9%) had experienced pain for over 2 days, 38.8% suffered LBP that affected their daily lives, and 24.5% had sought medical treatment. The prevalence of LBP ranged from 45.5 to 77.1% (p=0.004), while the prevalence of LBP symptoms persisting longer than two days was 34.1 to 62.5% (p=0.020). Logistic regression analysis indicated that year of study and weekly computer usage were statistically-significant LBP risk factors.----------- Implications: The occupational therapy profession will need to further investigate the high prevalence of student LBP identified in this study.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Although upper body musculoskeletal disorders (MSDs) represent an increasingly important issue for university students, few if any studies have targeted the occupational therapy faculty. Given this dearth of information, it was considered necessary to investigate a cross-section of Australian occupational therapy students by means of an established questionnaire survey. Completed replies were obtained from 95.7%, 100% and 97.7% (n = 44, 55 and 48) of students in the first, second and fourth years of a large occupational therapy school in northern Queensland, Australia.---------- The 12-month period prevalence of MSDs was as follows: neck (67.4%), shoulder (46.3%) and upper back (39.5%). Three-quarters of all students (75.5%) reported an MSD occurring in at least one of these body regions. Over half (56.5%) reported an MSD over 2 days' duration in the past year. Almost 40% (39.5%) reported an MSD that had affected their daily life, while one-quarter (25.2%) needed some type of treatment.---------- Logistic regression indicated that students aged over 21 years were almost four times more likely to report shoulder-related MSD (OR 3.7, 95%CI: 1.4-10.2). Year of study in the occupational therapy course was another important MSD correlate, with adjusted odds ratios ranging from 3.3 at the upper back (OR 3.3, 95%CI: 1.2-9.6) to 10.9 at the neck (OR 10.9, 95%CI: 3.2-43.8). Computer usage also incurred a certain degree of risk, with students who spent over 5 hours per week on the computer having an increased risk of MSD at the neck (OR 5.0, 95%CI: 1.3-21.5) and shoulder (OR 4.7, 95%CI: 1.4-18.3).---------- Overall, this study suggests that Australian occupational therapy students have a large burden from MSDs in the upper body region, even more so than other student groups and some working populations. Since the distribution of MSD risk is not uniform among them, interventions to help reduce these conditions need to be carefully targeted. Further longitudinal investigations would also be useful in determining the mechanisms and contributory factors for MSDs among this unique student population.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Aims – To develop local contemporary coefficients for the Trauma Injury Severity Score in New Zealand, TRISS(NZ), and to evaluate their performance at predicting survival against the original TRISS coefficients. Methods – Retrospective cohort study of adults who sustained a serious traumatic injury, and who survived until presentation at Auckland City, Middlemore, Waikato, or North Shore Hospitals between 2002 and 2006. Coefficients were estimated using ordinary and multilevel mixed-effects logistic regression models. Results – 1735 eligible patients were identified, 1672 (96%) injured from a blunt mechanism and 63 (4%) from a penetrating mechanism. For blunt mechanism trauma, 1250 (75%) were male and average age was 38 years (range: 15-94 years). TRISS information was available for 1565 patients of whom 204 (13%) died. Area under the Receiver Operating Characteristic (ROC) curves was 0.901 (95%CI: 0.879-0.923) for the TRISS(NZ) model and 0.890 (95% CI: 0.866-0.913) for TRISS (P<0.001). Insufficient data were available to determine coefficients for penetrating mechanism TRISS(NZ) models. Conclusions – Both TRISS models accurately predicted survival for blunt mechanism trauma. However, TRISS(NZ) coefficients were statistically superior to TRISS coefficients. A strong case exists for replacing TRISS coefficients in the New Zealand benchmarking software with these updated TRISS(NZ) estimates.