Biblioteca Digital

74 resultados para bag-of-features

em QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast

Executing bag of distributed tasks on the cloud: Investigating the trade-offs between performance and cost

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Bag of Distributed Tasks (BoDT) can benefit from decentralised execution on the Cloud. However, there is a trade-off between the performance that can be achieved by employing a large number of Cloud VMs for the tasks and the monetary constraints that are often placed by a user. The research reported in this paper is motivated towards investigating this trade-off so that an optimal plan for deploying BoDT applications on the cloud can be generated. A heuristic algorithm, which considers the user's preference of performance and cost is proposed and implemented. The feasibility of the algorithm is demonstrated by generating execution plans for a sample application. The key result is that the algorithm generates optimal execution plans for the application over 91% of the time.

Minimising the Execution of Unknown Bag-of-Task Jobs with Deadlines on the Cloud

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Scheduling jobs with deadlines, each of which defines the latest time that a job must be completed, can be challenging on the cloud due to incurred costs and unpredictable performance. This problem is further complicated when there is not enough information to effectively schedule a job such that its deadline is satisfied, and the cost is minimised. In this paper, we present an approach to schedule jobs, whose performance are unknown before execution, with deadlines on the cloud. By performing a sampling phase to collect the necessary information about those jobs, our approach delivers the scheduling decision within 10% cost and 16% violation rate when compared to the ideal setting, which has complete knowledge about each of the jobs from the beginning. It is noted that our proposed algorithm outperforms existing approaches, which use a fixed amount of resources by reducing the violation cost by at least two times.

Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We present results of a study into the performance of a variety of different image transform-based feature types for speaker-independent visual speech recognition of isolated digits. This includes the first reported use of features extracted using a discrete curvelet transform. The study will show a comparison of some methods for selecting features of each feature type and show the relative benefits of both static and dynamic visual features. The performance of the features will be tested on both clean video data and also video data corrupted in a variety of ways to assess each feature type's robustness to potential real-world conditions. One of the test conditions involves a novel form of video corruption we call jitter which simulates camera and/or head movement during recording.

Comparison of positron-impact vibrational excitation cross sections with the Born-dipole model

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The development of cold trap-based positron beams and new scattering techniques has recently enabled the ?rst measurements of state-resolved positron-impact vibrational excitation cross sections. These measurements revealed a number of features worth further consideration, such as relatively sharp increases near threshold. This paper describes a comparison of the magnitudes and shapes of these cross sections with the predictions of the Born-dipole model. Agreement of the magnitudes of the cross sections varies widely, ranging from reasonable to excellent agreement for CO2 and CF4 to poor agreement for CO and CH4. In contrast, the energy dependence of these cross sections in all these cases is close to that predicted by the Born model.

The nervous system of Procerodes littoralis (Maricola, Tricladida). An ultrastructural and immunoelectron microscopical study

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The ultrastructure of the nervous system of a planarian, Procerodes littoralis, belonging to the taxon Maricola is described for the first time. The study has revealed the presence of two neuronal cell types and a glia-like cell. Immunogold labelling with antibodies to two native flatworm neuropeptides-neuropeptide F and GNFFRFamide-has been localised to one neuronal cell type and associated processes and synapses, thus indicating its peptidergic nature. The ultrastructural features are compared to those of other investigated turbellarian species. The number of features shared by species from the Proseriata, Lecitoepitheliata and Tricladida show that in respect of the nervous system these taxa form a closely related group. (C) 1997 The Royal Swedish Academy of Sciences. Published by Elsevier Science Ltd.

Invariant Information Local Sub-map Filter (IILSF) for Efficient Simultaneous Localisation and Mapping of Large Environments

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This paper presents an Invariant Information Local Sub-map Filter (IILSF) as a technique for consistent Simultaneous Localisation and Mapping (SLAM) in a large environment. It harnesses the benefits of sub-map technique to improve the consistency and efficiency of Extended Kalman Filter (EKF) based SLAM. The IILSF makes use of invariant information obtained from estimated locations of features in independent sub-maps, instead of incorporating every observation directly into the global map. Then the global map is updated at regular intervals. Applying this technique to the EKF based SLAM algorithm: (a) reduces the computational complexity of maintaining the global map estimates and (b) simplifies transformation complexities and data association ambiguities usually experienced in fusing sub-maps together. Simulation results show that the method was able to accurately fuse local map observations to generate an efficient and consistent global map, in addition to significantly reducing computational cost and data association ambiguities.

Generating a Word-Emotion Lexicon from #Emotional Tweets

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Research in emotion analysis of text suggest that emotion lexicon based features are superior to corpus based n-gram features. However the static nature of the general purpose emotion lexicons make them less suited to social media analysis, where the need to adopt to changes in vocabulary usage and context is crucial. In this paper we propose a set of methods to extract a word-emotion lexicon automatically from an emotion labelled corpus of tweets. Our results confirm that the features derived from these lexicons outperform the standard Bag-of-words features when applied to an emotion classification task. Furthermore, a comparative analysis with both manually crafted lexicons and a state-of-the-art lexicon generated using Point-Wise Mutual Information, show that the lexicons generated from the proposed methods lead to significantly better classi- fication performance.

Fast Mining of Interesting Phrases from Subsets of Text Corpora

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We address the problem of mining interesting phrases from subsets of a text corpus where the subset is specified using a set of features such as keywords that form a query. Previous algorithms for the problem have proposed solutions that involve sifting through a phrase dictionary based index or a document-based index where the solution is linear in either the phrase dictionary size or the size of the document subset. We propose the usage of an independence assumption between query keywords given the top correlated phrases, wherein the pre-processing could be reduced to discovering phrases from among the top phrases per each feature in the query. We then outline an indexing mechanism where per-keyword phrase lists are stored either in disk or memory, so that popular aggregation algorithms such as No Random Access and Sort-merge Join may be adapted to do the scoring at real-time to identify the top interesting phrases. Though such an approach is expected to be approximate, we empirically illustrate that very high accuracies (of over 90%) are achieved against the results of exact algorithms. Due to the simplified list-aggregation, we are also able to provide response times that are orders of magnitude better than state-of-the-art algorithms. Interestingly, our disk-based approach outperforms the in-memory baselines by up to hundred times and sometimes more, confirming the superiority of the proposed method.

Face Recognition in the Scrambled Domain via Salience-Aware Ensembles of Many Kernels

Relevância:

90.00% 90.00%

Publicador:

Resumo:

With the rapid development of internet-of-things (IoT), face scrambling has been proposed for privacy protection during IoT-targeted image/video distribution. Consequently in these IoT applications, biometric verification needs to be carried out in the scrambled domain, presenting significant challenges in face recognition. Since face models become chaotic signals after scrambling/encryption, a typical solution is to utilize traditional data-driven face recognition algorithms. While chaotic pattern recognition is still a challenging task, in this paper we propose a new ensemble approach – Many-Kernel Random Discriminant Analysis (MK-RDA) to discover discriminative patterns from chaotic signals. We also incorporate a salience-aware strategy into the proposed ensemble method to handle chaotic facial patterns in the scrambled domain, where random selections of features are made on semantic components via salience modelling. In our experiments, the proposed MK-RDA was tested rigorously on three human face datasets: the ORL face dataset, the PIE face dataset and the PUBFIG wild face dataset. The experimental results successfully demonstrate that the proposed scheme can effectively handle chaotic signals and significantly improve the recognition accuracy, making our method a promising candidate for secure biometric verification in emerging IoT applications.

Multi-scale colorectal tumour segmentation using a novel coarse to fine strategy

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This paper addresses the problem of colorectal tumour segmentation in complex real world imagery. For efficient segmentation, a multi-scale strategy is developed for extracting the potentially cancerous region of interest (ROI) based on colour histograms while searching for the best texture resolution. To achieve better segmentation accuracy, we apply a novel bag-of-visual-words method based on rotation invariant raw statistical features and random projection based l2-norm sparse representation to classify tumour areas in histopathology images. Experimental results on 20 real world digital slides demonstrate that the proposed algorithm results in better recognition accuracy than several state of the art segmentation techniques.

What is Irish Standard English?

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Standard English need not be a matter of prescriptivism or any attempt to ‘create’ a particular standard, but, rather, can be a matter of observation of actual linguistic behaviour. For Hudson (2000), standard English is the kind of English which is written in published work, which is spoken in situations where published writing is most influential – especially in university level education and so in post-university professions – and which is spoken ‘natively’ at home by the ‘professional class’, i.e. people who are most influenced by published writing. In the papers in Bex and Watts (eds, 1999), it is recurrently claimed that, when speaking English, what the ‘social group with highest degree of power, wealth or prestige’ or more neutrally ‘educated people’ or ‘socially admired people’ speak is the variety known as ‘standard English’. However, ‘standard English’ may also mean that shared aspect of English which makes global communication possible. This latter perspective allows for two meanings of ‘standard’: it may refer both to an idealised set of shared features, and also to different sets of national features, reflecting different demographic and political histories and language influences. The methodology adopted in the International Corpus of English (henceforth ICE – cf. Greenbaum, 1996) enables us to observe and investigate each set of features, showing what everybody shares and also what makes each national variety of English different.

An efficient feature selection method for mobile devices with application to activity recognition

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper presents a feature selection method for data classification, which combines a model-based variable selection technique and a fast two-stage subset selection algorithm. The relationship between a specified (and complete) set of candidate features and the class label is modelled using a non-linear full regression model which is linear-in-the-parameters. The performance of a sub-model measured by the sum of the squared-errors (SSE) is used to score the informativeness of the subset of features involved in the sub-model. The two-stage subset selection algorithm approaches a solution sub-model with the SSE being locally minimized. The features involved in the solution sub-model are selected as inputs to support vector machines (SVMs) for classification. The memory requirement of this algorithm is independent of the number of training patterns. This property makes this method suitable for applications executed in mobile devices where physical RAM memory is very limited. An application was developed for activity recognition, which implements the proposed feature selection algorithm and an SVM training procedure. Experiments are carried out with the application running on a PDA for human activity recognition using accelerometer data. A comparison with an information gain based feature selection method demonstrates the effectiveness and efficiency of the proposed algorithm.

SVM Training Phase Reduction Using Dataset Feature Filtering for Malware Detection

Relevância:

80.00% 80.00%

Publicador:

Resumo:

N-gram analysis is an approach that investigates the structure of a program using bytes, characters, or text strings. A key issue with N-gram analysis is feature selection amidst the explosion of features that occurs when N is increased. The experiments within this paper represent programs as operational code (opcode) density histograms gained through dynamic analysis. A support vector machine is used to create a reference model, which is used to evaluate two methods of feature reduction, which are 'area of intersect' and 'subspace analysis using eigenvectors.' The findings show that the relationships between features are complex and simple statistics filtering approaches do not provide a viable approach. However, eigenvector subspace analysis produces a suitable filter.

COMPLETELY ITERATIVE, PIPELINED MULTIPLIER ARRAY SUITABLE FOR VLSI.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A pipelined array multiplier which has been derived by applying 'systolic array' principles at the bit level is described. Attention is focused on a circuit which is used to multiply streams of parallel unsigned data. Then an algorithm is given which demonstrates that, with only a simple modification to the basic cell, the same array can cope with two's complement numbers. The resulting structure has a number of features whch make it attractive to LSI and VLSI. These include regularity and modularity.

Common-sense reasoning for human action recognition

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper presents a novel method that leverages reasoning capabilities in a computer vision system dedicated to human action recognition. The proposed methodology is decomposed into two stages. First, a machine learning based algorithm - known as bag of words - gives a first estimate of action classification from video sequences, by performing an image feature analysis. Those results are afterward passed to a common-sense reasoning system, which analyses, selects and corrects the initial estimation yielded by the machine learning algorithm. This second stage resorts to the knowledge implicit in the rationality that motivates human behaviour. Experiments are performed in realistic conditions, where poor recognition rates by the machine learning techniques are significantly improved by the second stage in which common-sense knowledge and reasoning capabilities have been leveraged. This demonstrates the value of integrating common-sense capabilities into a computer vision pipeline. © 2012 Elsevier B.V. All rights reserved.

«
1
2
3
4
5
»