Biblioteca Digital

947 resultados para Social Recognition

New Media and Public Communication: Mapping Australian User-Created Content in Online Social Networks

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Presentation describling a project in data intensive research in the humanities. Measuring activity of publically available data in social networks such as Blogosphere, Twitter, Flickr, YouTube

Veja mais

Towards improved speech recognition for resource poor languages

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In recent times, the improved levels of accuracy obtained by Automatic Speech Recognition (ASR) technology has made it viable for use in a number of commercial products. Unfortunately, these types of applications are limited to only a few of the world’s languages, primarily because ASR development is reliant on the availability of large amounts of language specific resources. This motivates the need for techniques which reduce this language-specific, resource dependency. Ideally, these approaches should generalise across languages, thereby providing scope for rapid creation of ASR capabilities for resource poor languages. Cross Lingual ASR emerges as a means for addressing this need. Underpinning this approach is the observation that sound production is largely influenced by the physiological construction of the vocal tract, and accordingly, is human, and not language specific. As a result, a common inventory of sounds exists across languages; a property which is exploitable, as sounds from a resource poor, target language can be recognised using models trained on resource rich, source languages. One of the initial impediments to the commercial uptake of ASR technology was its fragility in more challenging environments, such as conversational telephone speech. Subsequent improvements in these environments has gained consumer confidence. Pragmatically, if cross lingual techniques are to considered a viable alternative when resources are limited, they need to perform under the same types of conditions. Accordingly, this thesis evaluates cross lingual techniques using two speech environments; clean read speech and conversational telephone speech. Languages used in evaluations are German, Mandarin, Japanese and Spanish. Results highlight that previously proposed approaches provide respectable results for simpler environments such as read speech, but degrade significantly when in the more taxing conversational environment. Two separate approaches for addressing this degradation are proposed. The first is based on deriving better target language lexical representation, in terms of the source language model set. The second, and ultimately more successful approach, focuses on improving the classification accuracy of context-dependent (CD) models, by catering for the adverse influence of languages specific phonotactic properties. Whilst the primary research goal in this thesis is directed towards improving cross lingual techniques, the catalyst for investigating its use was based on expressed interest from several organisations for an Indonesian ASR capability. In Indonesia alone, there are over 200 million speakers of some Malay variant, provides further impetus and commercial justification for speech related research on this language. Unfortunately, at the beginning of the candidature, limited research had been conducted on the Indonesian language in the field of speech science, and virtually no resources existed. This thesis details the investigative and development work dedicated towards obtaining an ASR system with a 10000 word recognition vocabulary for the Indonesian language.

Veja mais

Speaker recognition modelling with artificial neural networks

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Planning for social equality in the urban environment : a case study of planning for elderly women in Brisbane

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Design and implementation of an Arabic optical character recognition system

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Speech recognition ...

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Multiple classifier structures for automatic speaker recognition under adverse conditions

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Multi-modal speech processing for automatic speaker recognition

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Robust speech recognition using microphone arrays

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Representation and recognition of 3-D free-form objects incorporating statistical techniques

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Document sorting and logo recognition using image processing techniques

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Automatic speaker recognition under adverse conditions

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Speaker verification is the process of verifying the identity of a person by analysing their speech. There are several important applications for automatic speaker verification (ASV) technology including suspect identification, tracking terrorists and detecting a person’s presence at a remote location in the surveillance domain, as well as person authentication for phone banking and credit card transactions in the private sector. Telephones and telephony networks provide a natural medium for these applications. The aim of this work is to improve the usefulness of ASV technology for practical applications in the presence of adverse conditions. In a telephony environment, background noise, handset mismatch, channel distortions, room acoustics and restrictions on the available testing and training data are common sources of errors for ASV systems. Two research themes were pursued to overcome these adverse conditions: Modelling mismatch and modelling uncertainty. To directly address the performance degradation incurred through mismatched conditions it was proposed to directly model this mismatch. Feature mapping was evaluated for combating handset mismatch and was extended through the use of a blind clustering algorithm to remove the need for accurate handset labels for the training data. Mismatch modelling was then generalised by explicitly modelling the session conditions as a constrained offset of the speaker model means. This session variability modelling approach enabled the modelling of arbitrary sources of mismatch, including handset type, and halved the error rates in many cases. Methods to model the uncertainty in speaker model estimates and verification scores were developed to address the difficulties of limited training and testing data. The Bayes factor was introduced to account for the uncertainty of the speaker model estimates in testing by applying Bayesian theory to the verification criterion, with improved performance in matched conditions. Modelling the uncertainty in the verification score itself met with significant success. Estimating a confidence interval for the "true" verification score enabled an order of magnitude reduction in the average quantity of speech required to make a confident verification decision based on a threshold. The confidence measures developed in this work may also have significant applications for forensic speaker verification tasks.

Veja mais

Implementation and comparison of time-frequency representations for speech recognition

Relevância:

20.00% 20.00%

Publicador:

Veja mais

Morphological scale-space with application to three-dimensional object recognition

Relevância:

20.00% 20.00%

Publicador:

Veja mais

AIDS and pro-social television : industry, policy and Australian television drama

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This thesis examines the intersection of popular cultural representations of HIV and AIDS and the discourses of public health campaigns. Part Two provides a comprehensive record of all HIV related storylines in Australian television drama from the first AIDS episode of The Flying Doctors in 1986 to the ongoing narrative of Pacific Drive, with its core HIV character, in 1996. Textual representations are examined alongside the agency of "cultural technicians" working within the television industry. The framework for this analysis is established in Part One of the thesis, which examines the discursive contexts for speaking about HIV and AIDS established through national health policy and the regulatory and industry framework for broadcasting in Australia. The thesis examines the dominant liberal democratic framework for representation of HIV I AIDS and adopts a Foucauldian understanding of the processes of governmentality to argue that during the period of the 1980s and 1990s a strand of social democratic discourse combined with practices of self management and the management of the Australian population. The actions of committed agents within both domains of popular culture and health education ensured that more challenging expressions of HIV found their way into public culture.

Veja mais

947 resultados para Social Recognition

Filtro por publicador