Biblioteca Digital

297 resultados para agglomerative clustering

em Queensland University of Technology - ePrints Archive

Speaker linking using complete-linkage clustering

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Speaker diarization determines instances of the same speaker within a recording. Extending this task to a collection of recordings for linking together segments spoken by a unique speaker requires speaker linking. In this paper we propose a speaker linking system using linkage clustering and state-of-the-art speaker recognition techniques. We evaluate our approach against two baseline linking systems using agglomerative cluster merging (AC) and agglomerative clustering with model retraining (ACR). We demonstrate that our linking method, using complete-linkage clustering, provides a relative improvement of 20% and 29% in attribution error rate (AER), over the AC and ACR systems, respectively.

Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach

Relevância:

70.00% 70.00%

Publicador:

Resumo:

In this paper we propose and evaluate a speaker attribution system using a complete-linkage clustering method. Speaker attribution refers to the annotation of a collection of spoken audio based on speaker identities. This can be achieved using diarization and speaker linking. The main challenge associated with attribution is achieving computational efficiency when dealing with large audio archives. Traditional agglomerative clustering methods with model merging and retraining are not feasible for this purpose. This has motivated the use of linkage clustering methods without retraining. We first propose a diarization system using complete-linkage clustering and show that it outperforms traditional agglomerative and single-linkage clustering based diarization systems with a relative improvement of 40% and 68%, respectively. We then propose a complete-linkage speaker linking system to achieve attribution and demonstrate a 26% relative improvement in attribution error rate (AER) over the single-linkage speaker linking approach.

Robust automatic face clustering in news video

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Clustering identities in a video is a useful task to aid in video search, annotation and retrieval, and cast identification. However, reliably clustering faces across multiple videos is challenging task due to variations in the appearance of the faces, as videos are captured in an uncontrolled environment. A person's appearance may vary due to session variations including: lighting and background changes, occlusions, changes in expression and make up. In this paper we propose the novel Local Total Variability Modelling (Local TVM) approach to cluster faces across a news video corpus; and incorporate this into a novel two stage video clustering system. We first cluster faces within a single video using colour, spatial and temporal cues; after which we use face track modelling and hierarchical agglomerative clustering to cluster faces across the entire corpus. We compare different face recognition approaches within this framework. Experiments on a news video database show that the Local TVM technique is able effectively model the session variation observed in the data, resulting in improved clustering performance, with much greater computational efficiency than other methods.

Robust automatic speaker linking and attribution

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This research makes a major contribution which enables efficient searching and indexing of large archives of spoken audio based on speaker identity. It introduces a novel technique dubbed as “speaker attribution” which is the task of automatically determining ‘who spoke when?’ in recordings and then automatically linking the unique speaker identities within each recording across multiple recordings. The outcome of the research will also have significant impact in improving the performance of automatic speech recognition systems through the extracted speaker identities.

Detecting approximate clones in business process model repositories

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Empirical evidence shows that repositories of business process models used in industrial practice contain significant amounts of duplication. This duplication arises for example when the repository covers multiple variants of the same processes or due to copy-pasting. Previous work has addressed the problem of efficiently retrieving exact clones that can be refactored into shared subprocess models. This article studies the broader problem of approximate clone detection in process models. The article proposes techniques for detecting clusters of approximate clones based on two well-known clustering algorithms: DBSCAN and Hi- erarchical Agglomerative Clustering (HAC). The article also defines a measure of standardizability of an approximate clone cluster, meaning the potential benefit of replacing the approximate clones with a single standardized subprocess. Experiments show that both techniques, in conjunction with the proposed standardizability measure, accurately retrieve clusters of approximate clones that originate from copy-pasting followed by independent modifications to the copied fragments. Additional experiments show that both techniques produce clusters that match those produced by human subjects and that are perceived to be standardizable.

Clustering of Protein Structures Using Hydrophobic Free Energy And Solvent Accessibility of Proteins

Relevância:

20.00% 20.00%

Publicador:

Bearing Parameter Identification of Rotor-Bearing System Using Clustering-Based Evolutionary Algorithm

Relevância:

20.00% 20.00%

Publicador:

Clustering-Based Hybrid Evolutionary Algorithm for Optimization

Relevância:

20.00% 20.00%

Publicador:

XML Document Clustering by Structures

Relevância:

20.00% 20.00%

Publicador:

Multilingual Phone Clustering for Recognition of Spontaneous Indonesian Speech Utilising Pronunciation Modelling Techniques

Relevância:

20.00% 20.00%

Publicador:

Improvement of Web Data Clustering Using Web Page Contents

Relevância:

20.00% 20.00%

Publicador:

Markov Model-Based Clustering for Efficient Patient Care

Relevância:

20.00% 20.00%

Publicador:

Investigating Usage of the Vivisimo Clustering Search Interface

Relevância:

20.00% 20.00%

Publicador:

Bioregion Classification Using Model-Based Clustering: A Case Study in North Eastern Queensland

Relevância:

20.00% 20.00%

Publicador:

Colour Image Segmentation Using Optimal Fuzzy Clustering

Relevância:

20.00% 20.00%

Publicador:

«
1
2
3
4
5
6
7
8
...
19
20
»