942 resultados para speaker linking


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Presentation Structure: - THEORY - CASE STUDY 1: Southbank Institute of Technology - CASE STUDY 2: QUT Science and Technology Precinct - MORE IDEAS - ACTIVITY

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper we examine automated Chinese to English link discovery in Wikipedia and the effects of Chinese segmentation and Chinese to English translation on the hyperlink recommendation. Our experimental results show that the implemented link discovery framework can effectively recommend Chinese-to-English cross-lingual links. The techniques described here can assist bi-lingual users where a particular topic is not covered in Chinese, is not equally covered in both languages, or is biased in one language; as well as for language learning.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The UN Decade of Action outlines five pillars of activity within a safe system framework to achieve the goal of slowing and then reversing the global growth in road traffic fatalities, especially in low-income and middle-income countries. The first four pillars - road safety management, safer roads and mobility, safer vehicles, and safer road users – have a strong focus on prevention of road traffic crashes and mitigation of energy exchange when a crash occurs. The fifth pillar – post-crash response – is far more specific, focusing only on crash victims in the event of a safe system failure. The victims appear to be relevant to the first four pillars only insofar as their numbers can be used to evaluate the success of road safety programs and identify the target groups and contributing factors. This paper argues that a better understanding of the lived experience of long term disability from traffic crashes has the potential to provide a feedback loop from the fifth pillar to the first. Research conducted in Thailand with male crash victims with spinal injury demonstrates that patterns of attribution and social and cultural factors have important implications for road safety management and for interventions aimed at influencing behaviour. In addition, the mobility constraints experienced by people with long term disability can point to systemic issues that might otherwise go unnoticed. The UN Decade of Action can benefit from a more thorough exploration of the experiences and circumstances of people with long term disability as the result of a road traffic crash. Rather than being evidence of the failure of the safe system, they can inform the development of more effective road safety management on low-income and middle-income countries.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We consider how data from scientific research should be used for decision making in health services. Whether a hand hygiene intervention to reduce risk of nosocomial infection should be widely adopted is the case study. Improving hand hygiene has been described as the most important measure to prevent nosocomial infection. 1 Transmission of microorganisms is reduced, and fewer infections arise, which leads to a reduction in mortality2 and cost savings.3 Implementing a hand hygiene program is itself costly, so the extra investment should be tested for cost-effectiveness.4,5 The first part of our commentary is about cost-effectiveness models and how they inform decision making for health services. The second part is about how data on the effectiveness of hand hygiene programs arising from scientific studies are used, and 2 points are made: the threshold for statistical inference of .05 used to judge effectiveness studies is not important for decision making,6,7 and potentially valuable evidence about effectiveness might be excluded by decision makers because it is deemed low quality.8 The ideas put forward will help researchers and health services decision makers to appraise scientific evidence in a more powerful way.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper investigates advanced channel compensation techniques for the purpose of improving i-vector speaker verification performance in the presence of high intersession variability using the NIST 2008 and 2010 SRE corpora. The performance of four channel compensation techniques: (a) weighted maximum margin criterion (WMMC), (b) source-normalized WMMC (SN-WMMC), (c) weighted linear discriminant analysis (WLDA), and; (d) source-normalized WLDA (SN-WLDA) have been investigated. We show that, by extracting the discriminatory information between pairs of speakers as well as capturing the source variation information in the development i-vector space, the SN-WLDA based cosine similarity scoring (CSS) i-vector system is shown to provide over 20% improvement in EER for NIST 2008 interview and microphone verification and over 10% improvement in EER for NIST 2008 telephone verification, when compared to SN-LDA based CSS i-vector system. Further, score-level fusion techniques are analyzed to combine the best channel compensation approaches, to provide over 8% improvement in DCF over the best single approach, (SN-WLDA), for NIST 2008 interview/ telephone enrolment-verification condition. Finally, we demonstrate that the improvements found in the context of CSS also generalize to state-of-the-art GPLDA with up to 14% relative improvement in EER for NIST SRE 2010 interview and microphone verification and over 7% relative improvement in EER for NIST SRE 2010 telephone verification.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Speaker diarization is the process of annotating an input audio with information that attributes temporal regions of the audio signal to their respective sources, which may include both speech and non-speech events. For speech regions, the diarization system also specifies the locations of speaker boundaries and assign relative speaker labels to each homogeneous segment of speech. In short, speaker diarization systems effectively answer the question of ‘who spoke when’. There are several important applications for speaker diarization technology, such as facilitating speaker indexing systems to allow users to directly access the relevant segments of interest within a given audio, and assisting with other downstream processes such as summarizing and parsing. When combined with automatic speech recognition (ASR) systems, the metadata extracted from a speaker diarization system can provide complementary information for ASR transcripts including the location of speaker turns and relative speaker segment labels, making the transcripts more readable. Speaker diarization output can also be used to localize the instances of specific speakers to pool data for model adaptation, which in turn boosts transcription accuracies. Speaker diarization therefore plays an important role as a preliminary step in automatic transcription of audio data. The aim of this work is to improve the usefulness and practicality of speaker diarization technology, through the reduction of diarization error rates. In particular, this research is focused on the segmentation and clustering stages within a diarization system. Although particular emphasis is placed on the broadcast news audio domain and systems developed throughout this work are also trained and tested on broadcast news data, the techniques proposed in this dissertation are also applicable to other domains including telephone conversations and meetings audio. Three main research themes were pursued: heuristic rules for speaker segmentation, modelling uncertainty in speaker model estimates, and modelling uncertainty in eigenvoice speaker modelling. The use of heuristic approaches for the speaker segmentation task was first investigated, with emphasis placed on minimizing missed boundary detections. A set of heuristic rules was proposed, to govern the detection and heuristic selection of candidate speaker segment boundaries. A second pass, using the same heuristic algorithm with a smaller window, was also proposed with the aim of improving detection of boundaries around short speaker segments. Compared to single threshold based methods, the proposed heuristic approach was shown to provide improved segmentation performance, leading to a reduction in the overall diarization error rate. Methods to model the uncertainty in speaker model estimates were developed, to address the difficulties associated with making segmentation and clustering decisions with limited data in the speaker segments. The Bayes factor, derived specifically for multivariate Gaussian speaker modelling, was introduced to account for the uncertainty of the speaker model estimates. The use of the Bayes factor also enabled the incorporation of prior information regarding the audio to aid segmentation and clustering decisions. The idea of modelling uncertainty in speaker model estimates was also extended to the eigenvoice speaker modelling framework for the speaker clustering task. Building on the application of Bayesian approaches to the speaker diarization problem, the proposed approach takes into account the uncertainty associated with the explicit estimation of the speaker factors. The proposed decision criteria, based on Bayesian theory, was shown to generally outperform their non- Bayesian counterparts.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Linking Karumba: Creating Sustainable Connections This exhibition showcases the work of 3rd -4th year undergraduate landscape architecture, architecture, Industrial Design, Environmental Engineering, Civil Engineering students in response to issues of sustainability in the Gulf of Carpentaria town of Karumba. It presented the work to the Karumba and Carpentaria Shire community. 16 students and four staff set off on a 2488km journey to undertake the first half of the Carpentaria Project: a fortnight-long strategic planning project entitled Linking Karumba to encourage social, economic, environmental and cultural linkages across the town. Karumba, along with the nearby town of Normanton, is one of Queensland’s most remote settlements. Its economy is based on fishing, tourism, and mining. It has two centres, 2.5km apart by river, or 9km by road. This physical disconnect was identified by Carpentaria Shire Council (CSC) and the Karumba Progress Association (KPA) as a source of socio-cultural disconnection, which formed the basis of our project brief. Student designs were highly responsive to the character of Karumba’s culture and environment, indicating remarkable levels of immersion, and attracting $830 000 in Qld. state government funding for implementation. The Exhibition Four groups of four students produced four strategic planning and design options toward this future: Make the Switch: Alice Anonuevo, Michael Marriott, Carla Priestley & Grant Harvey Realigning the Systems: Claudia Bergs, Rebecca Stephens, Anna Coulson & Lois Kerrigan Diversification of Experience: Rebecca North, Kyle Bush, Debra Sullivan & Jenna Green The River is the Main Street: Ashley Nicholson, Monica Kuiken, Dean Bowen & Bill Schild

Relevância:

20.00% 20.00%

Publicador:

Resumo:

QUT Linking Karumba Project This exhibition showcases the work of 3rd -4th year undergraduate landscape architecture, architecture, Industrial Design, Environmental Engineering, Civil Engineering students in response to issues of sustainability in the Gulf of Carpentaria town of Karumba. It presented the final, polished set of work to the Karumba and Carpentaria Shire community, following revisions in line with feedback from the 2008 exhibition. 16 students and four staff set off on a 2488km journey to undertake the first half of the Carpentaria Project: a fortnight-long strategic planning project entitled Linking Karumba to encourage social, economic, environmental and cultural linkages across the town. Karumba, along with the nearby town of Normanton, is one of Queensland’s most remote settlements. Its economy is based on fishing, tourism, and mining. It has two centres, 2.5km apart by river, or 9km by road. This physical disconnect was identified by Carpentaria Shire Council (CSC) and the Karumba Progress Association (KPA) as a source of socio-cultural disconnection, which formed the basis of our project brief. Student designs were highly responsive to the character of Karumba’s culture and environment, indicating remarkable levels of immersion, and attracting $830 000 in Qld. state government funding for implementation. The Exhibition Four groups of four students produced four strategic planning and design options toward this future: Make the Switch: Alice Anonuevo, Michael Marriott, Carla Priestley & Grant Harvey Realigning the Systems: Claudia Bergs, Rebecca Stephens, Anna Coulson & Lois Kerrigan Diversification of Experience: Rebecca North, Kyle Bush, Debra Sullivan & Jenna Green The River is the Main Street: Ashley Nicholson, Monica Kuiken, Dean Bowen & Bill Schild

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Better management of knowledge assets has the potential to improve business processes and increase productivity. This fact has led to considerable interest in recent years in the knowledge management (KM) phenomenon, and in the main dimensions that can impact on its application in construction. However, a lack of a systematic way of assessing KM initia-tives’ contribution towards achieving organisational business objectives is evident. This paper describes the first stage of a research project intended to develop, and empirically test, a KM input-process-output framework comprising unique and well-defined theoretical constructs representing the KM process and its internal and external determinants in the context of con-struction. The paper presents the underlying principles used in operationally defining each construct through the use of extant KM literature. The KM process itself is explicitly mod-elled via a number of clearly articulated phases that ultimately lead to knowledge utilisation and capitalisation, which in turn adds value or otherwise to meeting defined business objec-tives. The main objective of the model is to reduce the impact of subjectivity in assessing the contribution made by KM practices and initiatives toward achieving performance improvements.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Health outcomes research has developed as a means of evaluating the effectiveness of health care interventions and as an approach to informing resource allocation. The use of a health outcomes approach in health promotion has made increasing demands on evaluation methodologies to demonstrate program effectiveness. However, criticism of the contribution of health promotion to outcomes research has made several assumptions about the use of qualitative methodologies and the content of program objectives largely derived from a biomedical approach. In contrast to the measurement of biomedical interventions in clinical health care, health promotion practice involves social phenomena, wide-reaching cultural, psychological, political and ideological problems and issues. The integration of methodologies of health promotion evaluation will inform further conceptualisation of the health outcomes approach with the differentiation of three types of outcomes: health development outcomes; social health outcomes; and biomedical health outcomes. It is concluded that this differentiation moves away from dualist concepts that advocate the replacement of goals and targets with regional and locally based approaches. Rather, the future direction for health promotion evaluation needs to employ a framework that elaborates multiple methodologies and approaches necessary for establishing what relationships exist between morbidity, mortality, health advancement and equity.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

It is exciting to be living at a time when the big questions in biology can be investigated using modern genetics and computing [1]. Bauzà-Ribot et al.[2] take on one of the fundamental drivers of biodiversity, the effect of continental drift in the formation of the world’s biota 3 and 4, employing next-generation sequencing of whole mitochondrial genomes and modern Bayesian relaxed molecular clock analysis. Bauzà-Ribot et al.[2] conclude that vicariance via plate tectonics best explains the genetic divergence between subterranean metacrangonyctid amphipods currently found on islands separated by the Atlantic Ocean. This finding is a big deal in biogeography, and science generally [3], as many other presumed biotic tectonic divergences have been explained as probably due to more recent transoceanic dispersal events [4]. However, molecular clocks can be problematic 5 and 6 and we have identified three issues with the analyses of Bauzà-Ribot et al.[2] that cast serious doubt on their results and conclusions. When we reanalyzed their mitochondrial data and attempted to account for problems with calibration 5 and 6, modeling rates across branches 5 and 7 and substitution saturation [5], we inferred a much younger date for their key node. This implies either a later trans-Atlantic dispersal of these crustaceans, or more likely a series of later invasions of freshwaters from a common marine ancestor, but either way probably not ancient tectonic plate movements.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A significant amount of speech is typically required for speaker verification system development and evaluation, especially in the presence of large intersession variability. This paper introduces a source and utterance duration normalized linear discriminant analysis (SUN-LDA) approaches to compensate session variability in short-utterance i-vector speaker verification systems. Two variations of SUN-LDA are proposed where normalization techniques are used to capture source variation from both short and full-length development i-vectors, one based upon pooling (SUN-LDA-pooled) and the other on concatenation (SUN-LDA-concat) across the duration and source-dependent session variation. Both the SUN-LDA-pooled and SUN-LDA-concat techniques are shown to provide improvement over traditional LDA on NIST 08 truncated 10sec-10sec evaluation conditions, with the highest improvement obtained with the SUN-LDA-concat technique achieving a relative improvement of 8% in EER for mis-matched conditions and over 3% for matched conditions over traditional LDA approaches.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A significant amount of speech data is required to develop a robust speaker verification system, but it is difficult to find enough development speech to match all expected conditions. In this paper we introduce a new approach to Gaussian probabilistic linear discriminant analysis (GPLDA) to estimate reliable model parameters as a linearly weighted model taking more input from the large volume of available telephone data and smaller proportional input from limited microphone data. In comparison to a traditional pooled training approach, where the GPLDA model is trained over both telephone and microphone speech, this linear-weighted GPLDA approach is shown to provide better EER and DCF performance in microphone and mixed conditions in both the NIST 2008 and NIST 2010 evaluation corpora. Based upon these results, we believe that linear-weighted GPLDA will provide a better approach than pooled GPLDA, allowing for the further improvement of GPLDA speaker verification in conditions with limited development data.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Reliability of the performance of biometric identity verification systems remains a significant challenge. Individual biometric samples of the same person (identity class) are not identical at each presentation and performance degradation arises from intra-class variability and inter-class similarity. These limitations lead to false accepts and false rejects that are dependent. It is therefore difficult to reduce the rate of one type of error without increasing the other. The focus of this dissertation is to investigate a method based on classifier fusion techniques to better control the trade-off between the verification errors using text-dependent speaker verification as the test platform. A sequential classifier fusion architecture that integrates multi-instance and multisample fusion schemes is proposed. This fusion method enables a controlled trade-off between false alarms and false rejects. For statistically independent classifier decisions, analytical expressions for each type of verification error are derived using base classifier performances. As this assumption may not be always valid, these expressions are modified to incorporate the correlation between statistically dependent decisions from clients and impostors. The architecture is empirically evaluated by applying the proposed architecture for text dependent speaker verification using the Hidden Markov Model based digit dependent speaker models in each stage with multiple attempts for each digit utterance. The trade-off between the verification errors is controlled using the parameters, number of decision stages (instances) and the number of attempts at each decision stage (samples), fine-tuned on evaluation/tune set. The statistical validation of the derived expressions for error estimates is evaluated on test data. The performance of the sequential method is further demonstrated to depend on the order of the combination of digits (instances) and the nature of repetitive attempts (samples). The false rejection and false acceptance rates for proposed fusion are estimated using the base classifier performances, the variance in correlation between classifier decisions and the sequence of classifiers with favourable dependence selected using the 'Sequential Error Ratio' criteria. The error rates are better estimated by incorporating user-dependent (such as speaker-dependent thresholds and speaker-specific digit combinations) and class-dependent (such as clientimpostor dependent favourable combinations and class-error based threshold estimation) information. The proposed architecture is desirable in most of the speaker verification applications such as remote authentication, telephone and internet shopping applications. The tuning of parameters - the number of instances and samples - serve both the security and user convenience requirements of speaker-specific verification. The architecture investigated here is applicable to verification using other biometric modalities such as handwriting, fingerprints and key strokes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

By presenting an overview of institutional theory, specifically the concepts of organizational fields, institutional pressures, and legitimacy, in addition to classical rhetoric, we have sought to highlight that there are links within the literature between the concepts of institutional theory and legitimacy, and also legitimacy and classical rhetoric. To date however, the three concepts – institutional pressures, legitimacy, and rhetoric – have not been explicitly linked. Through building on the current literature, and using the notion of legitimacy as the axis to connect institutional pressures with rhetoric, we argue that certain rhetorical devices may in fact be used to build and construct legitimacy in relation to the different institutional pressures an organization may face within a field. We believe that this preliminary framework may be useful to the field of CSR communication, whereby it may assist in constructing legitimate CSR communication in response to the various pressures an organization may face in relation to CSR.