243 resultados para confidence measures

em Queensland University of Technology - ePrints Archive


Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper outlines existing matching diagnostics, which may be used for identifying invalid matches and estimating the probability of a correct match. In addition, it proposes a new diagnostic for error prediction which can be used with the rank and census transforms. Both the existing and the new diagnostics have been evaluated and compared for a number of test images. In each case, a confidence estimate was computed for every location of the disparity map, and disparities having a low confidence estimate removed from the disparity map. Collectively, these confidence estimates may be termed a confidence map. Such information would be useful for potential applications of stereo vision such as automation and navigation.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Speaker verification is the process of verifying the identity of a person by analysing their speech. There are several important applications for automatic speaker verification (ASV) technology including suspect identification, tracking terrorists and detecting a person’s presence at a remote location in the surveillance domain, as well as person authentication for phone banking and credit card transactions in the private sector. Telephones and telephony networks provide a natural medium for these applications. The aim of this work is to improve the usefulness of ASV technology for practical applications in the presence of adverse conditions. In a telephony environment, background noise, handset mismatch, channel distortions, room acoustics and restrictions on the available testing and training data are common sources of errors for ASV systems. Two research themes were pursued to overcome these adverse conditions: Modelling mismatch and modelling uncertainty. To directly address the performance degradation incurred through mismatched conditions it was proposed to directly model this mismatch. Feature mapping was evaluated for combating handset mismatch and was extended through the use of a blind clustering algorithm to remove the need for accurate handset labels for the training data. Mismatch modelling was then generalised by explicitly modelling the session conditions as a constrained offset of the speaker model means. This session variability modelling approach enabled the modelling of arbitrary sources of mismatch, including handset type, and halved the error rates in many cases. Methods to model the uncertainty in speaker model estimates and verification scores were developed to address the difficulties of limited training and testing data. The Bayes factor was introduced to account for the uncertainty of the speaker model estimates in testing by applying Bayesian theory to the verification criterion, with improved performance in matched conditions. Modelling the uncertainty in the verification score itself met with significant success. Estimating a confidence interval for the "true" verification score enabled an order of magnitude reduction in the average quantity of speech required to make a confident verification decision based on a threshold. The confidence measures developed in this work may also have significant applications for forensic speaker verification tasks.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This paper presents an extended study on the implementation of support vector machine(SVM) based speaker verification in systems that employ continuous progressive model adaptation using the weight-based factor analysis model. The weight-based factor analysis model compensates for session variations in unsupervised scenarios by incorporating trial confidence measures in the general statistics used in the inter-session variability modelling process. Employing weight-based factor analysis in Gaussian mixture models (GMM) was recently found to provide significant performance gains to unsupervised classification. Further improvements in performance were found through the integration of SVM-based classification in the system by means of GMM supervectors. This study focuses particularly on the way in which a client is represented in the SVM kernel space using single and multiple target supervectors. Experimental results indicate that training client SVMs using a single target supervector maximises performance while exhibiting a certain robustness to the inclusion of impostor training data in the model. Furthermore, the inclusion of low-scoring target trials in the adaptation process is investigated where they were found to significantly aid performance.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Information and communication technology (ICT) curriculum integration is the apparent goal of an extensive array of educational initiatives in all Australian states and territories. However, ICT curriculum integration is neither value neutral nor universally understood. The literature indicates the complexity of rationales and terminology that underwrite various initiatives; various dimensions and stages of integration; inherent methodological difficulties; obstacles to integration; and significant issues relating to teacher professional development and ICT competencies (Jamieson-Proctor, Watson, & Finger, 2003). This paper investigates the overarching question: Are ICT integration initiatives making a significant impact on teaching and learning in Queensland state schools? It reports the results from a teacher survey that measures the quantity and quality of student use of ICT. Results from 929 teachers across all year levels and from 38 Queensland state schools indicate that female teachers (73% of the full time teachers in Queensland state schools in 2005) are significantly less confident than their male counterparts in using ICT with students for teaching and learning, and there is evidence of significant resistance to using ICT to align curriculum with new times and new technologies. This result supports the hypothesis that current initiatives with ICT are having uneven and less than the desired results system wide. These results require further urgent investigation in order to address the factors that currently constrain the use of ICT for teaching and learning.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Melbourne Decision Making Questionnaire (Mann, Burnett, Radford, & Ford, 1997) measures selfreported decision-making coping patterns. The questionnaire was administered to samples of University students in the US (N = 475), Australia (N = 262), New Zealand (N = 260), Japan (N = 359), Hong Kong (N = 281), and Taiwan (N = 414). As predicted, students from the three Western, individualistic cultures (US, Australia, and New Zealand) were more con® dent of their decision-making ability than students from the three East Asian, group-oriented cultures (Japan, Hong Kong, Taiwan). No cross-cultural differences were found in scores on decision vigilance (a careful decision-making style). However, compared with Western students, the Asian students tended to score higher on buck-passing and procrastination (avoidant styles of decision making) as well as hypervigilance (a panicky style of decision making). Japanese students scored lowest on decision self-esteem and highest on procrastination and hypervigilance. It was argued that the con¯ ict model and its attendant coping patterns is relevant for describing and comparing decision making in both Western and Asian cultures.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Association rule mining is one technique that is widely used when querying databases, especially those that are transactional, in order to obtain useful associations or correlations among sets of items. Much work has been done focusing on efficiency, effectiveness and redundancy. There has also been a focusing on the quality of rules from single level datasets with many interestingness measures proposed. However, with multi-level datasets now being common there is a lack of interestingness measures developed for multi-level and cross-level rules. Single level measures do not take into account the hierarchy found in a multi-level dataset. This leaves the Support-Confidence approach,which does not consider the hierarchy anyway and has other drawbacks, as one of the few measures available. In this paper we propose two approaches which measure multi-level association rules to help evaluate their interestingness. These measures of diversity and peculiarity can be used to help identify those rules from multi-level datasets that are potentially useful.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Forecasts of volatility and correlation are important inputs into many practical financial problems. Broadly speaking, there are two ways of generating forecasts of these variables. Firstly, time-series models apply a statistical weighting scheme to historical measurements of the variable of interest. The alternative methodology extracts forecasts from the market traded value of option contracts. An efficient options market should be able to produce superior forecasts as it utilises a larger information set of not only historical information but also the market equilibrium expectation of options market participants. While much research has been conducted into the relative merits of these approaches, this thesis extends the literature along several lines through three empirical studies. Firstly, it is demonstrated that there exist statistically significant benefits to taking the volatility risk premium into account for the implied volatility for the purposes of univariate volatility forecasting. Secondly, high-frequency option implied measures are shown to lead to superior forecasts of the intraday stochastic component of intraday volatility and that these then lead on to superior forecasts of intraday total volatility. Finally, the use of realised and option implied measures of equicorrelation are shown to dominate measures based on daily returns.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The authors present a qualitative and quantitative comparison of various similarity measures that form the kernel of common area-based stereo-matching systems. The authors compare classical difference and correlation measures as well as nonparametric measures based on the rank and census transforms for a number of outdoor images. For robotic applications, important considerations include robustness to image defects such as intensity variation and noise, the number of false matches, and computational complexity. In the absence of ground truth data, the authors compare the matching techniques based on the percentage of matches that pass the left-right consistency test. The authors also evaluate the discriminatory power of several match validity measures that are reported in the literature for eliminating false matches and for estimating match confidence. For guidance applications, it is essential to have and estimate of confidence in the three-dimensional points generated by stereo vision. Finally, a new validity measure, the rank constraint, is introduced that is capable of resolving ambiguous matches for rank transform-based matching.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background Foot ulcers are a leading cause of diabetes-related hospitalisations. Clinical training has been shown to be beneficial in foot ulcer management. Recently, improved self-confidence in podiatrists was reported immediately after foot ulcer simulation training (FUST) pilot programs. This study aimed to investigate the longer-term impacts of the FUST program on podiatrists’ self-confidence over 12 months in a larger sample. Methods Participants were podiatrists attending a two-day FUST course comprising web-based interactive learning, low-fidelity part-tasks and high-fidelity full clinical scenarios. Primary outcome measures included participants’ self-confidence measured pre-, (immediately) post-, 6-month post- and 12-month post-course via a purpose designed 21-item survey using a five-point Likert scale (1=Very limited, 5=Highly confident). Participants’ perceptions of knowledge gained, satisfaction, relevance and fidelity were also investigated. ANOVA and post hoc tests were used to test any differences between groups. Results Thirty-four participants completed FUST. Survey response rates were 100% (pre), 82% (post), 74% (6-month post), and 47% (12-month post). Overall mean scores were 3.13 (pre), 4.49 (post), 4.35 (6-month post) and 4.30 (12-month post) (p < 0.05); post hoc tests indicated no differences between the immediately, 6-month and 12-month post group scores (p > 0.05). Satisfaction, knowledge, relevance and fidelity were all rated highly. Conclusion This study suggests that significant short-term improvements in self-confidence to manage foot ulcers via simulation training are retained over the longer term. It is likely that improved self-confidence leads to improved foot ulcer clinical practice and outcomes; although this requires further research.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Association rule mining is one technique that is widely used when querying databases, especially those that are transactional, in order to obtain useful associations or correlations among sets of items. Much work has been done focusing on efficiency, effectiveness and redundancy. There has also been a focusing on the quality of rules from single level datasets with many interestingness measures proposed. However, with multi-level datasets now being common there is a lack of interestingness measures developed for multi-level and cross-level rules. Single level measures do not take into account the hierarchy found in a multi-level dataset. This leaves the Support-Confidence approach, which does not consider the hierarchy anyway and has other drawbacks, as one of the few measures available. In this chapter we propose two approaches which measure multi-level association rules to help evaluate their interestingness by considering the database’s underlying taxonomy. These measures of diversity and peculiarity can be used to help identify those rules from multi-level datasets that are potentially useful.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Dietitians have reported a lack of confidence in counselling clients with mental health issues. Standardised tools are needed to evaluate programs aiming to improve confidence. The Dietetic Confidence Scale (DCS) was developed to assess dietitians’perception of their capability about working with clients experiencing depression. Exploratory research revealed a 13-item, two-factor model. Dietetic confidence was associated with: 1) Confidence using the Nutrition Care Process; and 2) Confidence in Advocacy for Self-care and Client-care. This study aimed to validate the DCS using this two-factor model.The DCS was administered to 458 dietitians. Confirmatory factor analysis (CFA) assessed the scale’s psychometric validity. Reliability was measured using Cronbach’s alpha (α) co-efficient. CFA results supported the hypothesised two-factor, 13-item model. The Good Fit Index (GFI = 0.95) indicated a strong fit. Item-factor correlations ranged from r = 0.50 to 0.89. The overall scale and subscales showed good reliability (α = 0.93 to 0.76). This is the first study to validate an instrument that measures dietetic confidence about working with clients experiencing depression. The DCS can be used to measure changes in perceived confidence and identify where further training, mentoring or experience is needed. The findings also suggest that initiatives aimed at building dietitians' confidence about working with clients experiencing depression, should focus on improving client-focused nutrition care, foster advocacy, reflective practice, mentoring and encourage professional support networks. Avenues for future research include further validity and reliability testing to expand the generalisability of results; and modifying the scale for other disease or client populations.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Emotion and cognition are known to interact during human decision processes. In this study we focus on a specific kind of cognition, namely metacognition. Our experiment induces a negative emotion, worry, during a perceptual task. In a numerosity task subjects have to make a two alternative forced choice and then reveal their confidence in this decision. We measure metacognition in terms of discrimination and calibration abilities. Our results show that metacognition, but not choice, is affected by the level of worry anticipatedbefore the decision. Under worry individuals tend to have better metacognition in terms of the two measures. Furthermore understanding the formation of confidence is better explained with taking into account the level of worry in the model. This study shows the importance of an emotional component in the formation and the quality of the subjective probabilities.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Objective The aim of this systematic review and meta-analysis was to determine the overall effect of resistance training (RT) on measures of muscular strength in people with Parkinson’s disease (PD). Methods Controlled trials with parallel-group-design were identified from computerized literature searching and citation tracking performed until August 2014. Two reviewers independently screened for eligibility and assessed the quality of the studies using the Cochrane risk-of-bias-tool. For each study, mean differences (MD) or standardized mean differences (SMD) and 95% confidence intervals (CI) were calculated for continuous outcomes based on between-group comparisons using post-intervention data. Subgroup analysis was conducted based on differences in study design. Results Nine studies met the inclusion criteria; all had a moderate to high risk of bias. Pooled data showed that knee extension, knee flexion and leg press strength were significantly greater in PD patients who undertook RT compared to control groups with or without interventions. Subgroups were: RT vs. control-without-intervention, RT vs. control-with-intervention, RT-with-other-form-of-exercise vs. control-without-intervention, RT-with-other-form-of-exercise vs. control-with-intervention. Pooled subgroup analysis showed that RT combined with aerobic/balance/stretching exercise resulted in significantly greater knee extension, knee flexion and leg press strength compared with no-intervention. Compared to treadmill or balance exercise it resulted in greater knee flexion, but not knee extension or leg press strength. RT alone resulted in greater knee extension and flexion strength compared to stretching, but not in greater leg press strength compared to no-intervention. Discussion Overall, the current evidence suggests that exercise interventions that contain RT may be effective in improving muscular strength in people with PD compared with no exercise. However, depending on muscle group and/or training dose, RT may not be superior to other exercise types. Interventions which combine RT with other exercise may be most effective. Findings should be interpreted with caution due to the relatively high risk of bias of most studies.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper, we use an experimental design to compare the performance of elicitation rules for subjective beliefs. Contrary to previous works in which elicited beliefs are compared to an objective benchmark, we consider a purely subjective belief framework (confidence in one’s own performance in a cognitive task and a perceptual task). The performance of different elicitation rules is assessed according to the accuracy of stated beliefs in predicting success. We measure this accuracy using two main factors: calibration and discrimination. For each of them, we propose two statistical indexes and we compare the rules’ performances for each measurement. The matching probability method provides more accurate beliefs in terms of discrimination, while the quadratic scoring rule reduces overconfidence and the free rule, a simple rule with no incentives, which succeeds in eliciting accurate beliefs. Nevertheless, the matching probability appears to be the best mechanism for eliciting beliefs due to its performances in terms of calibration and discrimination, but also its ability to elicit consistent beliefs across measures and across tasks, as well as its empirical and theoretical properties.