42 resultados para classification aided by clustering

em QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast


Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper we study the classification of spatiotemporal pattern of one-dimensional cellular automata (CA) whereas the classification comprises CA rules including their initial conditions. We propose an exploratory analysis method based on the normalized compression distance (NCD) of spatiotemporal patterns which is used as dissimilarity measure for a hierarchical clustering. Our approach is different with respect to the following points. First, the classification of spatiotemporal pattern is comparative because the NCD evaluates explicitly the difference of compressibility among two objects, e.g., strings corresponding to spatiotemporal patterns. This is in contrast to all other measures applied so far in a similar context because they are essentially univariate. Second, Kolmogorov complexity, which underlies the NCD, was used in the classification of CA with respect to their spatiotemporal pattern. Third, our method is semiautomatic allowing us to investigate hundreds or thousands of CA rules or initial conditions simultaneously to gain insights into their organizational structure. Our numerical results are not only plausible confirming previous classification attempts but also shed light on the intricate influence of random initial conditions on the classification results.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Clusters of text documents output by clustering algorithms are often hard to interpret. We describe motivating real-world scenarios that necessitate reconfigurability and high interpretability of clusters and outline the problem of generating clusterings with interpretable and reconfigurable cluster models. We develop two clustering algorithms toward the outlined goal of building interpretable and reconfigurable cluster models. They generate clusters with associated rules that are composed of conditions on word occurrences or nonoccurrences. The proposed approaches vary in the complexity of the format of the rules; RGC employs disjunctions and conjunctions in rule generation whereas RGC-D rules are simple disjunctions of conditions signifying presence of various words. In both the cases, each cluster is comprised of precisely the set of documents that satisfy the corresponding rule. Rules of the latter kind are easy to interpret, whereas the former leads to more accurate clustering. We show that our approaches outperform the unsupervised decision tree approach for rule-generating clustering and also an approach we provide for generating interpretable models for general clusterings, both by significant margins. We empirically show that the purity and f-measure losses to achieve interpretability can be as little as 3 and 5%, respectively using the algorithms presented herein.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The identification and classification of network traffic and protocols is a vital step in many quality of service and security systems. Traffic classification strategies must evolve, alongside the protocols utilising the Internet, to overcome the use of ephemeral or masquerading port numbers and transport layer encryption. This research expands the concept of using machine learning on the initial statistics of flow of packets to determine its underlying protocol. Recognising the need for efficient training/retraining of a classifier and the requirement for fast classification, the authors investigate a new application of k-means clustering referred to as 'two-way' classification. The 'two-way' classification uniquely analyses a bidirectional flow as two unidirectional flows and is shown, through experiments on real network traffic, to improve classification accuracy by as much as 18% when measured against similar proposals. It achieves this accuracy while generating fewer clusters, that is, fewer comparisons are needed to classify a flow. A 'two-way' classification offers a new way to improve accuracy and efficiency of machine learning statistical classifiers while still maintaining the fast training times associated with the k-means.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Automatic gender classification has many security and commercial applications. Various modalities have been investigated for gender classification with face-based classification being the most popular. In some real-world scenarios the face may be partially occluded. In these circumstances a classification based on individual parts of the face known as local features must be adopted. We investigate gender classification using lip movements. We show for the first time that important gender specific information can be obtained from the way in which a person moves their lips during speech. Furthermore our study indicates that the lip dynamics during speech provide greater gender discriminative information than simply lip appearance. We also show that the lip dynamics and appearance contain complementary gender information such that a model which captures both traits gives the highest overall classification result. We use Discrete Cosine Transform based features and Gaussian Mixture Modelling to model lip appearance and dynamics and employ the XM2VTS database for our experiments. Our experiments show that a model which captures lip dynamics along with appearance can improve gender classification rates by between 16-21% compared to models of only lip appearance.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Chronic kidney disease (CKD) has become a serious public health problem because of its associated morbidity, premature mortality and attendant healthcare costs. The rising number of persons with CKD is linked with ageing population structure and an increased prevalence of diabetes, hypertension and obesity. There is an inherited risk associated with developing CKD as evidenced by familial clustering and differing prevalence rates across ethnic groups. Earlier studies to determine the inherited risk factors for CKD rarely identified genetic variants that were robustly replicated. However, improvements in genotyping technologies and analytical methods are now helping to identify promising genetic loci aided by international collaboration and multi-consortia efforts. More recently, epigenetic modifications have been proposed to play a role in both the inherited susceptibility to CKD and, importantly, to explain how the environment dynamically interacts with the genome to alter an individual's disease risk. Genome-wide, epigenome-wide and whole transcriptome studies have been performed and optimal approaches for integrative analysis are being developed. This review summarises recent research and the current status of genetic and epigenetic risk factors influencing CKD using population-based information.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Aims: We investigated the physical properties and dynamical evolution of near-Earth asteroid (NEA) (190491) 2000 FJ10 in order to assess the suitability of this accessible NEA as a space mission target. Methods: Photometry and colour determination were carried out with the 1.54 m Kuiper Telescope (Mt Bigelow, USA) and the 10 m Southern African Large Telescope (SALT; Sutherland, South Africa) during the object's recent favourable apparition in 2011-12. During the earlier 2008 apparition, a spectrum of the object in the 6000-9000 Angstrom region was obtained with the 4.2 m William Herschel Telescope (WHT; Canary Islands, Spain). Interpretation of the observational results was aided by numerical simulations of 1000 dynamical clones of 2000 FJ10 up to 106 yr in the past and in the future. Results: The asteroid's spectrum and colours determined by our observations suggest a taxonomic classification within the S-complex although other classifications (V, D, E, M, P) cannot be ruled out. On this evidence, it is unlikely to be a primitive, relatively unaltered remnant from the early history of the solar system and thus a low priority target for robotic sample return. Our photometry placed a lower bound of 2 h to the asteroid's rotation period. Its absolute magnitude was estimated to be 21.54 ± 0.1 which, for a typical S-complex albedo, translates into a diameter of 130 ± 20 m. Our dynamical simulations show that it has likely been an Amor for the past 105 yr. Although currently not Earth-crossing, it will likely become so during the period 50-100 kyr in the future. It may have arrived from the inner or central main belt >1 Myr ago as a former member of a low-inclination S-class asteroid family. Its relatively slow rotation and large size make it a suitable destination for a human mission. We show that ballistic Earth-190491-Earth transfer trajectories with ΔV <2 km s-1 at the asteroid exist between 2052 and 2061. Based on observations made with the Southern African Large Telescope (SALT).

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper presents a new review of our knowledge of the ancient forest beetle fauna from Holocene archaeological and palaeoecological sites in Great Britain and Ireland. It examines the colonisation, dispersal and decline of beetle species, highlighting the scale and nature of human activities in the shaping of the landscape of the British Isles. In particular, the paper discusses effects upon the insect fauna, and examines in detail the fossil record from the Humberhead Levels, eastern England. It discusses the local extirpation of up to 40 species in Britain and 15 species in Ireland. An evaluation of the timing of extirpations is made, suggesting that many species in Britain disappear from the fossil record between c. 3000 cal BC and 1000 cal BC (c. 5000-3000 cal BP), although some taxa may well have survived until considerably later. In Ireland, there are two distinct trends, with a group of species which seem to be absent after c. 2000 cal BC (c. 4000 cal BP) and a further group which survives until at least as late as the medieval period. The final clearance of the Irish landscape over the last few hundred years was so dramatic, however, that some species which are not especially unusual in a British context were decimated. Reasons behind the extirpation of taxa are examined in detail, and include a combination of forest clearance and human activities, isolation of populations, lack of temporal continuity of habitats, edaphic and competition factors affecting distribution of host trees (particularly pine), lack of forest fires and a decline in open forest systems. The role of climate change in extirpations is also evaluated. Consideration is given to the significance of these specialised ancient forest inhabitants in Ireland in the absence of an early Holocene land-bridge which suggests that colonisation was aided by other mechanisms, such as human activities and wood-rafting. Finally, the paper discusses the Continental origins of the British and Irish fauna and its hosts and the role played by European glacial refugia.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We provide an analysis of basic quantum-information processing protocols under the effect of intrinsic nonidealities in cluster states. These nonidealities are based on the introduction of randomness in the entangling steps that create the cluster state and are motivated by the unavoidable imperfections faced in creating entanglement using condensed-matter systems. Aided by the use of an alternative and very efficient method to construct cluster-state configurations, which relies on the concatenation of fundamental cluster structures, we address quantum-state transfer and various fundamental gate simulations through noisy cluster states. We find that a winning strategy to limit the effects of noise is the management of small clusters processed via just a few measurements. Our study also reinforces recent ideas related to the optical implementation of a one-way quantum computer.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Query processing over the Internet involving autonomous data sources is a major task in data integration. It requires the estimated costs of possible queries in order to select the best one that has the minimum cost. In this context, the cost of a query is affected by three factors: network congestion, server contention state, and complexity of the query. In this paper, we study the effects of both the network congestion and server contention state on the cost of a query. We refer to these two factors together as system contention states. We present a new approach to determining the system contention states by clustering the costs of a sample query. For each system contention state, we construct two cost formulas for unary and join queries respectively using the multiple regression process. When a new query is submitted, its system contention state is estimated first using either the time slides method or the statistical method. The cost of the query is then calculated using the corresponding cost formulas. The estimated cost of the query is further adjusted to improve its accuracy. Our experiments show that our methods can produce quite accurate cost estimates of the submitted queries to remote data sources over the Internet.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Mammalian transient receptor potential melastatin (TRPM) non-selective cation channels, the largest TRP subfamily, are widely expressed in excitable and non-excitable cells where they perform diverse functions ranging from detection of cold, taste, osmolarity, redox state and pH to control of Mg(2+) homeostasis and cell proliferation or death. Recently, TRPM gene expression has been identified in vascular smooth muscles with dominance of the TRPM8 channel. There has been in parallel considerable progress in decoding the functional roles of several TRPMs in the vasculature. This research on native cells is aided by the knowledge of the activation mechanisms and pharmacological properties of heterologously expressed TRPM subtypes. This paper summarizes the present state of knowledge of vascular TRPM channels and outlines several anticipated directions of future research in this area.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Previous research shows that approximately half of the coagulase-negative staphylococci (CNS) isolated from patients in the intensive care unit (ICU) at Belfast City Hospital were resistant to methicillin. The presence of this relatively high proportion of methicillin-resistance genetic material gives rise to speculation that these organisms may act as potential reservoirs of methicillinresistance genetic material to methicillin-sensitive Staphylococcus aureus (MSSA). Mechanisms of horizontal gene transfer from PBP2a-positive CNS to MSSA, potentially transforming MSSA to MRSA, aided by electroporation-type activities such as transcutaneous electrical nerve stimulation (TENS), should be considered. Methicillin-resistant CNS (MR-CNS) isolates are collected over a two-month period from a variety of clinical specimen types, particularly wound swabs. The species of all isolates are confirmed, as well as their resistance to oxacillin by standard disc diffusion assays. In addition, MSSA isolates are collected over the same period and confirmed as PBP2a-negative. Electroporation experiments are designed to mimic the time/voltage combinations used commonly in the clinical application of TENS. No transformed MRSA were isolated and all viable S. aureus cells remained susceptible to oxacillin and PBP2a-negative. Experiments using MSSA pre-exposed to sublethal concentrations of oxacillin (0.25 µg/mL) showed no evidence of methicillin gene transfer and the generation of an MRSA. The study showed no evidence of horizontal transfer of methicillin resistance genetic material from MR-CNS to MSSA. These data support the belief that TENS and the associated time/voltage combinations used do not increase conjugational transposons or facilitate horizontal gene transfer from MR-CNS to MSSA.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Understanding and predicting the dynamics of multispecies systems generally require estimates of interaction strength among species. Measuring interaction strength is difficult because of the large number of interactions in any natural system, long-term feedback, multiple pathways of effects between species pairs, and possible nonlinearities in interaction-strength functions. Presently, the few studies that extensively estimate interaction strength suggest that distributions of interaction strength tend to be skewed toward few strong and many weak interactions. Modeling studies indicate that such skewed patterns tend to promote system stability and arise during assembly of persistent communities. Methods for estimating interaction strength efficiently from traits of organisms, such as allometric relationships, show some promise. Methods for estimating community response to environmental perturbations without an estimate of interaction strength may also be of use. Spatial and temporal scale may affect patterns of interaction strength, but these effects require further investigation and new multispecies modeling frameworks. Future progress will be aided by development of long-term multispecies time series of natural communities, by experimental tests of different methods for estimating interaction strength, and by increased understanding of nonlinear functional forms.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The nematodes Trichinella spiralis and Trichinella pseudospiralis are both intracellular parasites of skeletal muscle cells and induce profound alterations in the host cell resulting in a re-alignment of muscle-specific gene expression. While T. spiralis induces the production of a collagen capsule surrounding the host-parasite complex, T. pseudospiralis exists in a non-encapsulated form and is also characterised by suppression of the host inflammatory response in the muscle. These observed differences between the two species are thought to be due to variation in the proteins excreted or secreted (ES proteins) by the muscle larva. In this study, we use a global proteomics approach to compare the ES protein profiles from both species and to identify individual T. pseudospiralis proteins that complement earlier studies with T. spiralis. Following two-dimensional gel electrophoresis, tandem mass spectrometry was used to identify the peptide spots. In many cases identification was aided by the determination of partial peptide sequence from selected mass ions. The T. pseudospiralis spots identified included the major secreted glycoproteins and the secreted 5'-nucleotidase. Furthermore, two major groups of T. spiralis-specific proteins and several T. pseudospiralis-specific proteins were identified. Our results demonstrate the value of proteomics as a tool for the identification of ES proteins that are differentially expressed between Trichinella species and as an aid to identifying key parasite proteins that are involved in the host-parasite interaction. The value of this approach will be further enhanced by data arising out the current T. spiralis genome sequencing project.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Multiple cue probability learning (MCPL) involves learning to predict a criterion based on a set of novel cues when feedback is provided in response to each judgment made. But to what extent does MCPL require controlled attention and explicit hypothesis testing? The results of two experiments show that this depends on cue polarity. Learning about cues that predict positively is aided by automatic cognitive processes, whereas learning about cues that predict negatively is especially demanding on controlled attention and hypothesis testing processes. In the studies reported here, negative, but not positive cue learning related to individual differences in working memory capacity both on measures of overall judgment performance and modelling of the implicit learning process. However, the introduction of a novel method to monitor participants' explicit beliefs about a set of cues on a trial-by-trial basis revealed that participants were engaged in explicit hypothesis testing about positive and negative cues, and explicit beliefs about both types of cues were linked to working memory capacity. Taken together, our results indicate that while people are engaged in explicit hypothesis testing during cue learning, explicit beliefs are applied to judgment only when cues are negative. © 2012 Elsevier Inc.