75 resultados para indexing


Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper investigates the effect of topic dependent language models (TDLM) on phonetic spoken term detection (STD) using dynamic match lattice spotting (DMLS). Phonetic STD consists of two steps: indexing and search. The accuracy of indexing audio segments into phone sequences using phone recognition methods directly affects the accuracy of the final STD system. If the topic of a document in known, recognizing the spoken words and indexing them to an intermediate representation is an easier task and consequently, detecting a search word in it will be more accurate and robust. In this paper, we propose the use of TDLMs in the indexing stage to improve the accuracy of STD in situations where the topic of the audio document is known in advance. It is shown that using TDLMs instead of the traditional general language model (GLM) improves STD performance according to figure of merit (FOM) criteria.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The aim of spoken term detection (STD) is to find all occurrences of a specified query term in a large audio database. This process is usually divided into two steps: indexing and search. In a previous study, it was shown that knowing the topic of an audio document would help to improve the accuracy of indexing step which results in a better performance for STD system. In this paper, we propose the use of topic information not only in the indexing step, but also in the search step. Results of our experiments show that topic information could also be used in search step to improve the STD accuracy.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

As Editor of Economic Analysis and Policy (EAP) I am delighted to announce that EAP is now published by Elsevier. EAP is the journal of the Economic Society of Australia (Queensland branch). As a result of this move, four issues of EAP will be published per year instead of the current three. This will include special issues. EAP will now receive wider coverage in the relevant abstracting and indexing services...

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A key concept in many Information Retrieval (IR) tasks, e.g. document indexing, query language modelling, aspect and diversity retrieval, is the relevance measurement of topics, i.e. to what extent an information object (e.g. a document or a query) is about the topics. This paper investigates the interference of relevance measurement of a topic caused by another topic. For example, consider that two user groups are required to judge whether a topic q is relevant to a document d, and q is presented together with another topic (referred to as a companion topic). If different companion topics are used for different groups, interestingly different relevance probabilities of q given d can be reached. In this paper, we present empirical results showing that the relevance of a topic to a document is greatly affected by the companion topic’s relevance to the same document, and the extent of the impact differs with respect to different companion topics. We further analyse the phenomenon from classical and quantum-like interference perspectives, and connect the phenomenon to nonreality and contextuality in quantum mechanics. We demonstrate that quantum like model fits in the empirical data, could be potentially used for predicting the relevance when interference exists.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We used our TopSig open-source indexing and retrieval tool to produce runs for the ShARe/CLEF eHealth 2013 track. TopSig was used to produce runs using the query fields and provided discharge summaries, where appropriate. Although the improvement was not great TopSig was able to gain some benefit from utilising the discharge summaries, although the software needed to be modified to support this. This was part of a larger experiment involving determining the applicability and limits to signature-based approaches.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This study investigates whether academics can capitalize on their external prominence (measured by the number of pages indexed on Google, TED talk invitations or New York Times bestselling book successes) and internal success within academia (measured by publication and citation performance) in the speakers’ market. The results indicate that the larger the number of web pages indexing a particular scholar, the higher the minimum speaking fee. Invitations to speak at a TED event, or making the New York Times Best Seller list is also positively correlated with speaking fees. Scholars with a stronger internal impact or success also achieve higher speaking fees. However, once external impact is controlled, most metrics used to measure internal impact are no longer statistically significant.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The use of ‘topic’ concepts has shown improved search performance, given a query, by bringing together relevant documents which use different terms to describe a higher level concept. In this paper, we propose a method for discovering and utilizing concepts in indexing and search for a domain specific document collection being utilized in industry. This approach differs from others in that we only collect focused concepts to build the concept space and that instead of turning a user’s query into a concept based query, we experiment with different techniques of combining the original query with a concept query. We apply the proposed approach to a real-world document collection and the results show that in this scenario the use of concept knowledge at index and search can improve the relevancy of results.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Rapid urbanization has brought environmentally, socially, and economically great challenges to cities and societies. To build a sustainable city, these challenges need to be faced efficiently and successfully. This paper focuses on the environmental issues and investigates the ecological approaches for planning sustainable cities through a comprehensive review of the relevant literature. The review focuses on several differing aspects of sustainable city formation. The paper provides insights on the interaction between the natural environment and human activities by identifying environmental effects resulting from this interaction; provides an introduction to the concept of sustainable urban development by underlining the important role of ecological planning in achieving sustainable cities; introduces the notion of urban ecosystems by establishing principles for the management of their sustainability; describes urban ecosystem sustainability assessment by introducing a review of current assessment methods, and; offers an outline of indexing urban environmental sustainability. The paper concludes with a summary of the findings.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Affect is an important feature of multimedia content and conveys valuable information for multimedia indexing and retrieval. Most existing studies for affective content analysis are limited to low-level features or mid-level representations, and are generally criticized for their incapacity to address the gap between low-level features and high-level human affective perception. The facial expressions of subjects in images carry important semantic information that can substantially influence human affective perception, but have been seldom investigated for affective classification of facial images towards practical applications. This paper presents an automatic image emotion detector (IED) for affective classification of practical (or non-laboratory) data using facial expressions, where a lot of “real-world” challenges are present, including pose, illumination, and size variations etc. The proposed method is novel, with its framework designed specifically to overcome these challenges using multi-view versions of face and fiducial point detectors, and a combination of point-based texture and geometry. Performance comparisons of several key parameters of relevant algorithms are conducted to explore the optimum parameters for high accuracy and fast computation speed. A comprehensive set of experiments with existing and new datasets, shows that the method is effective despite pose variations, fast, and appropriate for large-scale data, and as accurate as the method with state-of-the-art performance on laboratory-based data. The proposed method was also applied to affective classification of images from the British Broadcast Corporation (BBC) in a task typical for a practical application providing some valuable insights.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Spoken term detection (STD) is the task of looking up a spoken term in a large volume of speech segments. In order to provide fast search, speech segments are first indexed into an intermediate representation using speech recognition engines which provide multiple hypotheses for each speech segment. Approximate matching techniques are usually applied at the search stage to compensate the poor performance of automatic speech recognition engines during indexing. Recently, using visual information in addition to audio information has been shown to improve phone recognition performance, particularly in noisy environments. In this paper, we will make use of visual information in the form of lip movements of the speaker in indexing stage and will investigate its effect on STD performance. Particularly, we will investigate if gains in phone recognition accuracy will carry through the approximate matching stage to provide similar gains in the final audio-visual STD system over a traditional audio only approach. We will also investigate the effect of using visual information on STD performance in different noise environments.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Bioacoustic monitoring has become a significant research topic for species diversity conservation. Due to the development of sensing techniques, acoustic sensors are widely deployed in the field to record animal sounds over a large spatial and temporal scale. With large volumes of collected audio data, it is essential to develop semi-automatic or automatic techniques to analyse the data. This can help ecologists make decisions on how to protect and promote the species diversity. This paper presents generic features to characterize a range of bird species for vocalisation retrieval. In the implementation, audio recordings are first converted to spectrograms using short-time Fourier transform, then a ridge detection method is applied to the spectrogram for detecting points of interest. Based on the detected points, a new region representation are explored for describing various bird vocalisations and a local descriptor including temporal entropy, frequency bin entropy and histogram of counts of four ridge directions is calculated for each sub-region. To speed up the retrieval process, indexing is carried out and the retrieved results are ranked according to similarity scores. The experiment results show that our proposed feature set can achieve 0.71 in term of retrieval success rate which outperforms spectral ridge features alone (0.55) and Mel frequency cepstral coefficients (0.36).

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Some statistical procedures already available in literature are employed in developing the water quality index, WQI. The nature of complexity and interdependency that occur in physical and chemical processes of water could be easier explained if statistical approaches were applied to water quality indexing. The most popular statistical method used in developing WQI is the principal component analysis (PCA). In literature, the WQI development based on the classical PCA mostly used water quality data that have been transformed and normalized. Outliers may be considered in or eliminated from the analysis. However, the classical mean and sample covariance matrix used in classical PCA methodology is not reliable if the outliers exist in the data. Since the presence of outliers may affect the computation of the principal component, robust principal component analysis, RPCA should be used. Focusing in Langat River, the RPCA-WQI was introduced for the first time in this study to re-calculate the DOE-WQI. Results show that the RPCA-WQI is capable to capture similar distribution in the existing DOE-WQI.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This research has made contributions to the area of spoken term detection (STD), defined as the process of finding all occurrences of a specified search term in a large collection of speech segments. The use of visual information in the form of lip movements of the speaker in addition to audio and the use of topic of the speech segments, and the expected frequency of words in the target speech domain, are proposed. By using these complementary information, improvement in the performance of STD has been achieved which enables efficient search of key words in large collection of multimedia documents.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Background Australian policy mandates consumer and carer participation in mental health services at all levels including research. Inspired by a UK model - Service Users Group Advising on Research [SUGAR] - we conducted a scoping project in 2013 with a view to create a consumer and carer led research process that moves beyond stigma and tokenism, that values the unique knowledge of lived experience and leads to people being treated better when accessing services. This poster presents the initial findings. Aims The project’s purpose was to explore with consumers, consumer companions and carers at the Metro North Mental Health-RBWH their interest in and views about research partnerships with academic and clinical colleagues. Methods This poster overviews the initial findings from three audio-recorded focus groups conducted with a total of 14 consumers, carers and consumer companions at the Brisbane site. Analysis Our work was guided by framework analysis (Gale et al. 2013). It defines 5 steps for analysing narrative data: familiarising; development of categories; indexing; charting and interpretation. Eight main ideas were initially developed and were divided between the authors to further index. This process identified 37 related analytic ideas. The authors integrated these by combining, removing and redefining them by consensus though a mapping process. The final step is the return of the analysis to the participants for feedback and input into the interpretation of the focus group discussions. Results 1. Value & Respect: Feeling Valued & Respected, Tokenism, Stigma, Governance, Valuing prior knowledge / background 2. Pathways to Knowledge and Involvement in Research: ‘Where to begin’, Support, Unity & partnership, Communication, Co-ordination, Flexibility due to fluctuating capacity 3. Personal Context: Barriers regarding Commitments & the nature of mental illness, Wellbeing needs, Prior experience of research, Motivators, Attributes 4. What is research? Developing Knowledge, What to do research on, how and why? Conclusion and Discussion Initial analysis suggests that participants saw potential for ‘amazing things’ in mental health research such as reflecting their priorities and moving beyond stigma and tokenism. The main needs identified were education, mentoring, funding support and research processes that fitted consumers’ and carers’limitations and fluctuating capacities. They identified maintaining motivation and interest as an issue since research processes are often extended by ethics and funding applications. Participants felt that consumer and carer led research would value the unique knowledge that the lived experience of consumers and carers brings and lead to people being treated better when accessing services.