942 resultados para Significant matched pattern


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Data clustering is a common technique for statistical data analysis, which is used in many fields, including machine learning and data mining. Clustering is grouping of a data set or more precisely, the partitioning of a data set into subsets (clusters), so that the data in each subset (ideally) share some common trait according to some defined distance measure. In this paper we present the genetically improved version of particle swarm optimization algorithm which is a population based heuristic search technique derived from the analysis of the particle swarm intelligence and the concepts of genetic algorithms (GA). The algorithm combines the concepts of PSO such as velocity and position update rules together with the concepts of GA such as selection, crossover and mutation. The performance of the above proposed algorithm is evaluated using some benchmark datasets from Machine Learning Repository. The performance of our method is better than k-means and PSO algorithm.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Frequent episode discovery is a popular framework for pattern discovery from sequential data. It has found many applications in domains like alarm management in telecommunication networks, fault analysis in the manufacturing plants, predicting user behavior in web click streams and so on. In this paper, we address the discovery of serial episodes. In the episodes context, there have been multiple ways to quantify the frequency of an episode. Most of the current algorithms for episode discovery under various frequencies are apriori-based level-wise methods. These methods essentially perform a breadth-first search of the pattern space. However currently there are no depth-first based methods of pattern discovery in the frequent episode framework under many of the frequency definitions. In this paper, we try to bridge this gap. We provide new depth-first based algorithms for serial episode discovery under non-overlapped and total frequencies. Under non-overlapped frequency, we present algorithms that can take care of span constraint and gap constraint on episode occurrences. Under total frequency we present an algorithm that can handle span constraint. We provide proofs of correctness for the proposed algorithms. We demonstrate the effectiveness of the proposed algorithms by extensive simulations. We also give detailed run-time comparisons with the existing apriori-based methods and illustrate scenarios under which the proposed pattern-growth algorithms perform better than their apriori counterparts. (C) 2013 Elsevier B.V. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

For compressive sensing, we endeavor to improve the atom selection strategy of the existing orthogonal matching pursuit (OMP) algorithm. To achieve a better estimate of the underlying support set progressively through iterations, we use a least squares solution based atom selection method. From a set of promising atoms, the choice of an atom is performed through a new method that uses orthogonal projection along-with a standard matched filter. Through experimental evaluations, the effect of projection based atom selection strategy is shown to provide a significant improvement for the support set recovery performance, in turn, the compressive sensing recovery.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, we consider the setting of the pattern maximum likelihood (PML) problem studied by Orlitsky et al. We present a well-motivated heuristic algorithm for deciding the question of when the PML distribution of a given pattern is uniform. The algorithm is based on the concept of a ``uniform threshold''. This is a threshold at which the uniform distribution exhibits an interesting phase transition in the PML problem, going from being a local maximum to being a local minimum.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In addition to the biologically active monomer of the protein insulin circulating in human blood, the molecule also exists in dimeric and hexameric forms that are used as storage. The insulin monomer contains two distinct surfaces, namely, the dimer forming surface (DFS) and the hexamer forming surface (HFS), that are specifically designed to facilitate the formation of the dimer and the hexamer, respectively. In order to characterize the structural and dynamical behavior of interfacial water molecules near these two surfaces (DFS and HFS), we performed atomistic molecular dynamics simulations of insulin with explicit water. Dynamical characterization reveals that the structural relaxation of the hydrogen bonds formed between the residues of DFS and the interfacial water molecules is faster than those formed between water and that of the HFS. Furthermore, the residence times of water molecules in the protein hydration layer for both the DFS and HFS are found to be significantly higher than those for some of the other proteins studied so far, such as HP-36 and lysozyme. In particular, we find that more structured water molecules, with higher residence times (similar to 300-500 ps), are present near HFS than those near DFS. A significant slowing down is observed in the decay of associated rotational auto time correlation functions of O-H bond vector of water in the vicinity of HFS. The surface topography and the arrangement of amino acid residues work together to organize the water molecules in the hydration layer in order to provide them with a preferred orientation. HFS having a large polar solvent accessible surface area and a convex extensive nonpolar region, drives the surrounding water molecules to acquire predominantly an outward H-atoms directed, clathrate-like structure. In contrast, near the DFS, the surrounding water molecules acquire an inward H-atoms directed orientation owing to the flat curvature of hydrophobic surface and the interrupted hydrophilic residual alignment. We have followed escape trajectory of several such quasi-bound water molecules from both the surfaces that reveal the significant differences between the two hydration layers.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Group VB and VIB M-Si systems are considered to show an interesting pattern in the diffusion of components with the change in atomic number in a particular group (M = V, Nb, Ta or M = Mo, W, respectively). Mainly two phases, MSi2 and M5Si3 are considered for this discussion. Except for Ta-silicides, the activation energy for the integrated diffusion of MSi2 is always lower than M5Si3. In both phases, the relative mobilities measured by the ratio of the tracer diffusion coefficients, , decrease with an increasing atomic number in the given group. If determined at the same homologous temperature, the interdiffusion coefficients increase with the atomic number of the refractory metal in the MSi2 phases and decrease in the M5Si3 ones. This behaviour features the basic changes in the defect concentrations on different sublattices with a change in the atomic number of the refractory components.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper presents the shaking table studies to investigate the factors that influence the liquefaction resistance of sand. A uniaxial shaking table with a perspex model container was used for the model tests, and saturated sand beds were prepared using wet pluviation method. The models were subjected to horizontal base shaking, and the variation of pore water pressure was measured. Three series of tests varying the acceleration and frequency of base shaking and density of the soil were carried out on sand beds simulating free field condition. Liquefaction was visualized in some model tests, which was also established through pore water pressure ratios. Effective stress was calculated at the point of pore water pressure measurement, and the number of cycles required to liquefy the sand bed were estimated and matched with visual observations. It was observed that there was a gradual variation in pore water pressure with change in base acceleration at a given frequency of shaking. The variation in pore water pressure is not significant for the range of frequency used in the tests. The frequency of base shaking at which the sand starts to liquefy when the sand bed is subjected to any specific base acceleration depends on the density of sand, and it was observed that the sand does not liquefy at any other frequency less than this. A substantial improvement in liquefaction resistance of the sand was observed with the increase in soil density, inferring that soil densification is a simple technique that can be applied to increase the liquefaction resistance.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We demonstrate here that supramolecular interactions enhance the sensitivity towards detection of electron-deficient nitro-aromatic compounds (NACs) over discrete analogues. NACs are the most commonly used explosive ingredients and are common constituents of many unexploded landmines used during World WarII. In this study, we have synthesised a series of pyrene-based polycarboxylic acids along with their corresponding discrete esters. Due to the electron richness and the fluorescent behaviour of the pyrene moiety, all the compounds act as sensors for electron-deficient NACs through a fluorescence quenching mechanism. A Stern-Volmer quenching constant determination revealed that the carboxylic acids are more sensitive than the corresponding esters towards NACs in solution. The high sensitivity of the acids was attributed to supramolecular polymer formation through hydrogen bonding in the case of the acids, and the enhancement mechanism is based on an exciton energy migration upon excitation along the hydrogen-bond backbone. The presence of intermolecular hydrogen bonding in the acids in solution was established by solvent-dependent fluorescence studies and dynamic light scattering (DLS) experiments. In addition, the importance of intermolecular hydrogen bonds in solid-state sensing was further explored by scanning tunnelling microscopy (STM) experiments at the liquid-solid interface, in which structures of self-assembled monolayer of the acids and the corresponding esters were compared. The sensitivity tests revealed that these supramolecular sensors can even detect picric acid and trinitrotoluene in solution at levels as low as parts per trillion (ppt), which is much below the recommended permissible level of these constituents in drinking water.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: The function of a protein can be deciphered with higher accuracy from its structure than from its amino acid sequence. Due to the huge gap in the available protein sequence and structural space, tools that can generate functionally homogeneous clusters using only the sequence information, hold great importance. For this, traditional alignment-based tools work well in most cases and clustering is performed on the basis of sequence similarity. But, in the case of multi-domain proteins, the alignment quality might be poor due to varied lengths of the proteins, domain shuffling or circular permutations. Multi-domain proteins are ubiquitous in nature, hence alignment-free tools, which overcome the shortcomings of alignment-based protein comparison methods, are required. Further, existing tools classify proteins using only domain-level information and hence miss out on the information encoded in the tethered regions or accessory domains. Our method, on the other hand, takes into account the full-length sequence of a protein, consolidating the complete sequence information to understand a given protein better. Results: Our web-server, CLAP (Classification of Proteins), is one such alignment-free software for automatic classification of protein sequences. It utilizes a pattern-matching algorithm that assigns local matching scores (LMS) to residues that are a part of the matched patterns between two sequences being compared. CLAP works on full-length sequences and does not require prior domain definitions. Pilot studies undertaken previously on protein kinases and immunoglobulins have shown that CLAP yields clusters, which have high functional and domain architectural similarity. Moreover, parsing at a statistically determined cut-off resulted in clusters that corroborated with the sub-family level classification of that particular domain family. Conclusions: CLAP is a useful protein-clustering tool, independent of domain assignment, domain order, sequence length and domain diversity. Our method can be used for any set of protein sequences, yielding functionally relevant clusters with high domain architectural homogeneity. The CLAP web server is freely available for academic use at http://nslab.mbu.iisc.ernet.in/clap/.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Terrestrial water storage (TWS) plays a key role in the global water cycle and is highly influenced by climate variability and human activities. In this study, monthly TWS, rainfall and Ganga-Brahmaputra river discharge (GBRD) are analysed over India for the period of 2003-12 using remote sensing satellite data. The spatial pattern of mean TWS shows a decrease over a large and populous region of Northern India comprising the foothills of the Himalayas, the Indo-Gangetic Plains and North East India. Over this region, the mean monthly TWS exhibits a pronounced seasonal cycle and a large interannual variability, highly correlated with rainfall and GBRD variations (r > 0.8) with a lag time of 2 months and 1 month respectively. The time series of monthly TWS shows a consistent and statistically significant decrease of about 1 cm year(-1) over Northern India, which is not associated with changes in rainfall and GBRD. This recent change in TWS suggests a possible impact of rapid industrialization, urbanization and increase in population on land water resources. Our analysis highlights the potential of the Earth-observation satellite data for hydrological applications.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Themono-alkylation of DPP derivatives leads to cofacial pi-pi stacking via H-bonding unlike their di-alkylated counterparts, which exhibit a classical herringbone packing pattern. Single crystal organic field-effect transistor (OFET) measurements reveal a significant enhancement of charge carrier mobility for mono-hexyl DPP derivatives.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Developments in the statistical extreme value theory, which allow non-stationary modeling of changes in the frequency and severity of extremes, are explored to analyze changes in return levels of droughts for the Colorado River. The transient future return levels (conditional quantiles) derived from regional drought projections using appropriate extreme value models, are compared with those from observed naturalized streamflows. The time of detection is computed as the time at which significant differences exist between the observed and future extreme drought levels, accounting for the uncertainties in their estimates. Projections from multiple climate model-scenario combinations are considered; no uniform pattern of changes in drought quantiles is observed across all the projections. While some projections indicate shifting to another stationary regime, for many projections which are found to be non-stationary, detection of change in tail quantiles of droughts occurs within the 21st century with no unanimity in the time of detection. Earlier detection is observed in droughts levels of higher probability of exceedance. (C) 2014 Elsevier Ltd. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We propose two-photon excitation-based light-sheet technique for nano-lithography. The system consists of 2 -configured cylindrical lens system with a common geometrical focus. Upon superposition, the phase-matched counter-propagating light-sheets result in the generation of identical and equi spaced nano-bump pattern. Study shows a feature size of as small as few tens of nanometers with a inter-bump distance of few hundred nanometers. This technique overcomes some of the limitations of existing nano-lithography techniques, thereby, may pave the way for mass-production of nano-structures. Potential applications can also be found in optical microscopy, plasmonics, and nano-electronics. Microsc. Res. Tech. 78:1-7, 2015. (c) 2014 Wiley Periodicals, Inc.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Frequent episode discovery is one of the methods used for temporal pattern discovery in sequential data. An episode is a partially ordered set of nodes with each node associated with an event type. For more than a decade, algorithms existed for episode discovery only when the associated partial order is total (serial episode) or trivial (parallel episode). Recently, the literature has seen algorithms for discovering episodes with general partial orders. In frequent pattern mining, the threshold beyond which a pattern is inferred to be interesting is typically user-defined and arbitrary. One way of addressing this issue in the pattern mining literature has been based on the framework of statistical hypothesis testing. This paper presents a method of assessing statistical significance of episode patterns with general partial orders. A method is proposed to calculate thresholds, on the non-overlapped frequency, beyond which an episode pattern would be inferred to be statistically significant. The method is first explained for the case of injective episodes with general partial orders. An injective episode is one where event-types are not allowed to repeat. Later it is pointed out how the method can be extended to the class of all episodes. The significance threshold calculations for general partial order episodes proposed here also generalize the existing significance results for serial episodes. Through simulations studies, the usefulness of these statistical thresholds in pruning uninteresting patterns is illustrated. (C) 2014 Elsevier Inc. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Significant changes are reported in extreme rainfall characteristics over India in recent studies though there are disagreements on the spatial uniformity and causes of trends. Based on recent theoretical advancements in the Extreme Value Theory (EVT), we analyze changes in extreme rainfall characteristics over India using a high-resolution daily gridded (1 degrees latitude x 1 degrees longitude) dataset. Intensity, duration and frequency of excess rain over a high threshold in the summer monsoon season are modeled by non-stationary distributions whose parameters vary with physical covariates like the El-Nino Southern Oscillation index (ENSO-index) which is an indicator of large-scale natural variability, global average temperature which is an indicator of human-induced global warming and local mean temperatures which possibly indicate more localized changes. Each non-stationary model considers one physical covariate and the best chosen statistical model at each rainfall grid gives the most significant physical driver for each extreme rainfall characteristic at that grid. Intensity, duration and frequency of extreme rainfall exhibit non-stationarity due to different drivers and no spatially uniform pattern is observed in the changes in them across the country. At most of the locations, duration of extreme rainfall spells is found to be stationary, while non-stationary associations between intensity and frequency and local changes in temperature are detected at a large number of locations. This study presents the first application of nonstationary statistical modeling of intensity, duration and frequency of extreme rainfall over India. The developed models are further used for rainfall frequency analysis to show changes in the 100-year extreme rainfall event. Our findings indicate the varying nature of each extreme rainfall characteristic and their drivers and emphasize the necessity of a comprehensive framework to assess resulting risks of precipitation induced flooding. (C) 2014 Elsevier B.V. All rights reserved.