956 resultados para likelihood to publication
Resumo:
Unstructured text data, such as emails, blogs, contracts, academic publications, organizational documents, transcribed interviews, and even tweets, are important sources of data in Information Systems research. Various forms of qualitative analysis of the content of these data exist and have revealed important insights. Yet, to date, these analyses have been hampered by limitations of human coding of large data sets, and by bias due to human interpretation. In this paper, we compare and combine two quantitative analysis techniques to demonstrate the capabilities of computational analysis for content analysis of unstructured text. Specifically, we seek to demonstrate how two quantitative analytic methods, viz., Latent Semantic Analysis and data mining, can aid researchers in revealing core content topic areas in large (or small) data sets, and in visualizing how these concepts evolve, migrate, converge or diverge over time. We exemplify the complementary application of these techniques through an examination of a 25-year sample of abstracts from selected journals in Information Systems, Management, and Accounting disciplines. Through this work, we explore the capabilities of two computational techniques, and show how these techniques can be used to gather insights from a large corpus of unstructured text.
Resumo:
In this paper, we apply a simulation based approach for estimating transmission rates of nosocomial pathogens. In particular, the objective is to infer the transmission rate between colonised health-care practitioners and uncolonised patients (and vice versa) solely from routinely collected incidence data. The method, using approximate Bayesian computation, is substantially less computer intensive and easier to implement than likelihood-based approaches we refer to here. We find through replacing the likelihood with a comparison of an efficient summary statistic between observed and simulated data that little is lost in the precision of estimated transmission rates. Furthermore, we investigate the impact of incorporating uncertainty in previously fixed parameters on the precision of the estimated transmission rates.
Resumo:
Motor unit number estimation (MUNE) is a method which aims to provide a quantitative indicator of progression of diseases that lead to loss of motor units, such as motor neurone disease. However the development of a reliable, repeatable and fast real-time MUNE method has proved elusive hitherto. Ridall et al. (2007) implement a reversible jump Markov chain Monte Carlo (RJMCMC) algorithm to produce a posterior distribution for the number of motor units using a Bayesian hierarchical model that takes into account biological information about motor unit activation. However we find that the approach can be unreliable for some datasets since it can suffer from poor cross-dimensional mixing. Here we focus on improved inference by marginalising over latent variables to create the likelihood. In particular we explore how this can improve the RJMCMC mixing and investigate alternative approaches that utilise the likelihood (e.g. DIC (Spiegelhalter et al., 2002)). For this model the marginalisation is over latent variables which, for a larger number of motor units, is an intractable summation over all combinations of a set of latent binary variables whose joint sample space increases exponentially with the number of motor units. We provide a tractable and accurate approximation for this quantity and also investigate simulation approaches incorporated into RJMCMC using results of Andrieu and Roberts (2009).
Resumo:
To enhance workplace safety in the construction industry it is important to understand interrelationships among safety risk factors associated with construction accidents. This study incorporates the systems theory into Heinrich’s domino theory to explore the interrelationships of risks and break the chain of accident causation. Through both empirical and statistical analyses of 9,358 accidents which occurred in the U.S. construction industry between 2002 and 2011, the study investigates relationships between accidents and injury elements (e.g., injury type, part of body, injury severity) and the nature of construction injuries by accident type. The study then discusses relationships between accidents and risks, including worker behavior, injury source, and environmental condition, and identifies key risk factors and risk combinations causing accidents. The research outcomes will assist safety managers to prioritize risks according to the likelihood of accident occurrence and injury characteristics, and pay more attention to balancing significant risk relationships to prevent accidents and achieve safer working environments.
Resumo:
This paper proposes the use of Bayesian approaches with the cross likelihood ratio (CLR) as a criterion for speaker clustering within a speaker diarization system, using eigenvoice modeling techniques. The CLR has previously been shown to be an effective decision criterion for speaker clustering using Gaussian mixture models. Recently, eigenvoice modeling has become an increasingly popular technique, due to its ability to adequately represent a speaker based on sparse training data, as well as to provide an improved capture of differences in speaker characteristics. The integration of eigenvoice modeling into the CLR framework to capitalize on the advantage of both techniques has also been shown to be beneficial for the speaker clustering task. Building on that success, this paper proposes the use of Bayesian methods to compute the conditional probabilities in computing the CLR, thus effectively combining the eigenvoice-CLR framework with the advantages of a Bayesian approach to the diarization problem. Results obtained on the 2002 Rich Transcription (RT-02) Evaluation dataset show an improved clustering performance, resulting in a 33.5% relative improvement in the overall Diarization Error Rate (DER) compared to the baseline system.
Resumo:
A quasi-maximum likelihood procedure for estimating the parameters of multi-dimensional diffusions is developed in which the transitional density is a multivariate Gaussian density with first and second moments approximating the true moments of the unknown density. For affine drift and diffusion functions, the moments are exactly those of the true transitional density and for nonlinear drift and diffusion functions the approximation is extremely good and is as effective as alternative methods based on likelihood approximations. The estimation procedure generalises to models with latent factors. A conditioning procedure is developed that allows parameter estimation in the absence of proxies.
Resumo:
Driver sleepiness is a substantial crash risk factor and as such, is a major contributor to crash statistics. A number of individual factors (i.e., psychological factors) have been suggested to influence driving while sleepy. However, few studies have examined the influence of these individual factors for sleepy driving in combination. The current study sought to examine how various demographic factors, attitudes, perceived legitimacy, personality constructs, and risk taking variables were associated with self-reported likelihood of driving sleepy and pulling over and resting when sleepy. The results show that being a younger driver, having positive attitudes towards driving sleepy, and high levels of emotional stability were related to self-reported likelihood of driving sleepy. Whereas, being an older driver and having negative attitudes towards driving sleepy were associated with self-reported likelihood of pulling over and resting when sleepy. Overall, the obtained results suggest that the age and attitudes of the driver have greater influence than personality traits or risk taking factors. Campaigns focused on changing attitudes to reflect the dangerousness of sleepy driving could be important for road safety outcomes.
Resumo:
The method of generalized estimating equations (GEE) is a popular tool for analysing longitudinal (panel) data. Often, the covariates collected are time-dependent in nature, for example, age, relapse status, monthly income. When using GEE to analyse longitudinal data with time-dependent covariates, crucial assumptions about the covariates are necessary for valid inferences to be drawn. When those assumptions do not hold or cannot be verified, Pepe and Anderson (1994, Communications in Statistics, Simulations and Computation 23, 939–951) advocated using an independence working correlation assumption in the GEE model as a robust approach. However, using GEE with the independence correlation assumption may lead to significant efficiency loss (Fitzmaurice, 1995, Biometrics 51, 309–317). In this article, we propose a method that extracts additional information from the estimating equations that are excluded by the independence assumption. The method always includes the estimating equations under the independence assumption and the contribution from the remaining estimating equations is weighted according to the likelihood of each equation being a consistent estimating equation and the information it carries. We apply the method to a longitudinal study of the health of a group of Filipino children.
Resumo:
This thesis documented pathogenic species of nontuberculous mycobacteria in the Brisbane water distribution system. When water and shower aerosol strains were compared with human strains of mycobacteria, the study found that the likelihood of acquiring infection from municipal water was specific for four main species. The method for isolation of mycobacteria from water was refined, followed by sampling from 220 sites across Brisbane. A variety of species (incl 15 pathogens) were identified and genotypically compared to human strains. For M. abscessus and M. lentiflavum, water strains clustered with human strains. Pathogenic strains of M. kansasii were found, though non-pathogenic strains dominated. Waterborne strains of M. fortuitum differed to human strains. Extensive home sampling of 20 patients with NTM disease, supported the theory that the risk of acquiring NTM from water or shower aerosols appears species specific for M. avium, M. kansasii, M. lentiflavum and M. abscessus.
Resumo:
We investigate the utility to computational Bayesian analyses of a particular family of recursive marginal likelihood estimators characterized by the (equivalent) algorithms known as "biased sampling" or "reverse logistic regression" in the statistics literature and "the density of states" in physics. Through a pair of numerical examples (including mixture modeling of the well-known galaxy dataset) we highlight the remarkable diversity of sampling schemes amenable to such recursive normalization, as well as the notable efficiency of the resulting pseudo-mixture distributions for gauging prior-sensitivity in the Bayesian model selection context. Our key theoretical contributions are to introduce a novel heuristic ("thermodynamic integration via importance sampling") for qualifying the role of the bridging sequence in this procedure, and to reveal various connections between these recursive estimators and the nested sampling technique.
Resumo:
The social cost of road injury and fatalities is still unacceptable. The driver is often mainly responsible for road crashes, therefore changing the driver behaviour is one of the most important and most challenging priority in road transport. This paper presents three innovative visions that articulate the potential of using Vehicle to Vehicle (V2V) communication for supporting the exchange of social information amongst drivers. We argue that there could be tremendous benefits in socialising cars to influence human driving behaviours for the better and that this aspect is still relevant in the age of looming autonomous cars. Our visions provide theoretical grounding how V2V infrastructure and emerging human–machine interfaces (HMI) could persuade drivers to: (i) adopt better (e.g. greener) driving practices, (ii) reduce drivers aggressiveness towards pro-social driving behaviours, and (iii) reduce risk-taking behaviour in young, particularly male, adults. The visions present simple but powerful concepts that reveal ‘good’ aspects of the driver behaviour to other drivers and make them contagious. The use of self-efficacy, social norms, gamification theories and social cues could then increase the likelihood of a widespread adoption of such ‘good’ driving behaviours.
Resumo:
Wastewater containing human sewage is often discharged with little or no treatment into the Antarctic marine environment. Faecal sterols (primarily coprostanol) in sediments have been used for assessment of human sewage contamination in this environment, but in situ production and indigenous faunal inputs can confound such determinations. Using gas chromatography with mass spectral detection profiles of both C27 and C29 sterols, potential sources of faecal sterols were examined in nearshore marine sediments, encompassing sites proximal and distal to the wastewater outfall at Davis Station. Faeces from indigenous seals and penguins were also examined. Faeces from several indigenous species contained significant quantities of coprostanol but not 24-ethylcoprostanol, which is present in human faeces. In situ coprostanol and 24-ethylcoprostanol production was identified by co-production of their respective epi isomers at sites remote from the wastewat er source and in high total organic matter sediments. A C 29 sterols-based polyphasic likelihood assessment matrix for human sewage contamination is presented, which distinguishes human from local fauna faecal inputs and in situ production in the Antarctic environment. Sewage contamination was detected up to 1.5 km from Davis Station.
Resumo:
Public acceptance is consistently listed as having an enormous impact on the implementation and success of a congestion charge scheme. This paper investigates public acceptance of such a scheme in Australia. Surveys were conducted in Brisbane and Melbourne, the two fastest growing Australian cities. Using an ordered logit modeling approach, the survey data including stated preferences were analyzed to pinpoint the important factors influencing people’s attitudes to a congestion charge and, in turn, to their transport mode choices. To accommodate the nature of, and to account for the resulting heterogeneity of the panel data, random effects were considered in the models. As expected, this study found that the amount of the congestion charge and the financial benefits of implementing it have a significant influence on respondents’ support for the charge and on the likelihood of their taking a bus to city areas. However, respondents’ current primary transport mode for travelling to the city areas has a more pronounced impact. Meanwhile, respondents’ perceptions of the congestion charge’s role in protecting the environment by reducing vehicle emissions, and of the extent to which the charge would mean that they travelled less frequently to the city for shopping or entertainment, also have a significant impact on their level of support for its implementation. We also found and explained notable differences across two cities. Finally, findings from this study have been fully discussed in relation to the literature.
Resumo:
Seat belts are one of the most effective passive safety features in vehicles and there is a host of research literature attesting to the effectiveness of seat belts in protecting against death and injury. Even when use rates are high the potential gains in trauma reduction from further improvements in wearing rates are substantial. However, those currently most resistant to restraint use have also proven most difficult to target using conventional countermeasures. It is necessary to address the issues of non-wearing in order to achieve further gains in seat belt wearing. This study provide evidence-based recommendations for the way forward to tackle the problems of adult restraint non-use in light passenger vehicles in the short, medium and longer term in Australia. While there are substantial issues to be addressed for these groups, these are outside the scope of this study.
Resumo:
Background Currently, care providers and policy-makers internationally are working to promote normal birth. In Australia, such initiatives are being implemented without any evidence of the prevalence or determinants of normal birth as a multidimensional construct. This study aimed to better understand the determinants of normal birth (defined as without induction of labour, epidural/spinal/general anaesthesia, forceps/vacuum, caesarean birth, or episiotomy) using secondary analyses of data from a population survey of women in Queensland, Australia. Methods Women who birthed in Queensland during a two-week period in 2009 were mailed a survey approximately three months after birth. Women (n=772) provided retrospective data on their pregnancy, labour and birth preferences and experiences, socio-demographic characteristics, and reproductive history. A series of logistic regressions were conducted to determine factors associated with having labour, having a vaginal birth, and having a normal birth. Findings Overall, 81.9% of women had labour, 66.4% had a vaginal birth, and 29.6% had a normal birth. After adjusting for other significant factors, women had significantly higher odds of having labour if they birthed in a public hospital and had a pre-existing preference for a vaginal birth. Of women who had labour, 80.8% had a vaginal birth. Women who had labour had significantly higher odds of having a vaginal birth if they attended antenatal classes, did not have continuous fetal monitoring, felt able to ‘take their time’ in labour, and had a pre-existing preference for a vaginal birth. Of women who had a vaginal birth, 44.7% had a normal birth. Women who had a vaginal birth had significantly higher odds of having a normal birth if they birthed in a public hospital, birthed outside regular business hours, had mobility in labour, did not have continuous fetal monitoring, and were non-supine during birth. Conclusions These findings provide a strong foundation on which to base resources aimed at increasing informed decision-making for maternity care consumers, providers, and policy-makers alike. Research to evaluate the impact of modifying key clinical practices (e.g., supporting women׳s mobility during labour, facilitating non-supine positioning during birth) on the likelihood of a normal birth is an important next step.