989 resultados para sequential frequent pattern
Resumo:
We present a improved language modeling technique for Lempel-Ziv-Welch (LZW) based LID scheme. The previous approach to LID using LZW algorithm prepares the language pattern table using LZW algorithm. Because of the sequential nature of the LZW algorithm, several language specific patterns of the language were missing in the pattern table. To overcome this, we build a universal pattern table, which contains all patterns of different length. For each language it's corresponding language specific pattern table is constructed by retaining the patterns of the universal table whose frequency of appearance in the training data is above the threshold.This approach reduces the classification score (Compression Ratio [LZW-CR] or the weighted discriminant score[LZW-WDS]) for non native languages and increases the LID performance considerably.
Resumo:
With the emergence of large-volume and high-speed streaming data, the recent techniques for stream mining of CFIpsilas (closed frequent itemsets) will become inefficient. When concept drift occurs at a slow rate in high speed data streams, the rate of change of information across different sliding windows will be negligible. So, the user wonpsilat be devoid of change in information if we slide window by multiple transactions at a time. Therefore, we propose a novel approach for mining CFIpsilas cumulatively by making sliding width(ges1) over high speed data streams. However, it is nontrivial to mine CFIpsilas cumulatively over stream, because such growth may lead to the generation of exponential number of candidates for closure checking. In this study, we develop an efficient algorithm, stream-close, for mining CFIpsilas over stream by exploring some interesting properties. Our performance study reveals that stream-close achieves good scalability and has promising results.
Resumo:
Wetlands are the most productive and biologically diverse but very fragile ecosystems. They are vulnerable to even small changes in their biotic and abiotic factors. In recent years, there has been concern over the continuous degradation of wetlands due to unplanned developmental activities. This necessitates inventorying, mapping, and monitoring of wetlands to implement sustainable management approaches. The principal objective of this work is to evolve a strategy to identify and monitor wetlands using temporal remote sensing (RS) data. Pattern classifiers were used to extract wetlands automatically from NIR bands of MODIS, Landsat MSS and Landsat TM remote sensing data. MODIS provided data for 2002 to 2007, while for 1973 and 1992 IR Bands of Landsat MSS and TM (79m and 30m spatial resolution) data were used. Principal components of IR bands of MODIS (250 m) were fused with IRS LISS-3 NIR (23.5 m). To extract wetlands, statistical unsupervised learning of IR bands for the respective temporal data was performed using Bayesian approach based on prior probability, mean and covariance. Temporal analysis of wetlands indicates a sharp decline of 58% in Greater Bangalore attributing to intense urbanization processes, evident from a 466% increase in built-up area from 1973 to 2007.
Resumo:
Urbanisation is the increase in the population of cities in proportion to the region's rural population. Urbanisation in India is very rapid with urban population growing at around 2.3 percent per annum. Urban sprawl refers to the dispersed development along highways or surrounding the city and in rural countryside with implications such as loss of agricultural land, open space and ecologically sensitive habitats. Sprawl is thus a pattern and pace of land use in which the rate of land consumed for urban purposes exceeds the rate of population growth resulting in an inefficient and consumptive use of land and its associated resources. This unprecedented urbanisation trend due to burgeoning population has posed serious challenges to the decision makers in the city planning and management process involving plethora of issues like infrastructure development, traffic congestion, and basic amenities (electricity, water, and sanitation), etc. In this context, to aid the decision makers in following the holistic approaches in the city and urban planning, the pattern, analysis, visualization of urban growth and its impact on natural resources has gained importance. This communication, analyses the urbanisation pattern and trends using temporal remote sensing data based on supervised learning using maximum likelihood estimation of multivariate normal density parameters and Bayesian classification approach. The technique is implemented for Greater Bangalore – one of the fastest growing city in the World, with Landsat data of 1973, 1992 and 2000, IRS LISS-3 data of 1999, 2006 and MODIS data of 2002 and 2007. The study shows that there has been a growth of 466% in urban areas of Greater Bangalore across 35 years (1973 to 2007). The study unravels the pattern of growth in Greater Bangalore and its implication on local climate and also on the natural resources, necessitating appropriate strategies for the sustainable management.
Resumo:
Frequent episode discovery is a popular framework for mining data available as a long sequence of events. An episode is essentially a short ordered sequence of event types and the frequency of an episode is some suitable measure of how often the episode occurs in the data sequence. Recently,we proposed a new frequency measure for episodes based on the notion of non-overlapped occurrences of episodes in the event sequence, and showed that, such a definition, in addition to yielding computationally efficient algorithms, has some important theoretical properties in connecting frequent episode discovery with HMM learning. This paper presents some new algorithms for frequent episode discovery under this non-overlapped occurrences-based frequency definition. The algorithms presented here are better (by a factor of N, where N denotes the size of episodes being discovered) in terms of both time and space complexities when compared to existing methods for frequent episode discovery. We show through some simulation experiments, that our algorithms are very efficient. The new algorithms presented here have arguably the least possible orders of spaceand time complexities for the task of frequent episode discovery.
Resumo:
We consider the classical problem of sequential detection of change in a distribution (from hypothesis 0 to hypothesis 1), where the fusion centre receives vectors of periodic measurements, with the measurements being i.i.d. over time and across the vector components, under each of the two hypotheses. In our problem, the sensor devices ("motes") that generate the measurements constitute an ad hoc wireless network. The motes contend using a random access protocol (such as CSMA/CA) to transmit their measurement packets to the fusion centre. The fusion centre waits for vectors of measurements to accumulate before taking decisions. We formulate the optimal detection problem, taking into account the network delay experienced by the vectors of measurements, and find that, under periodic sampling, the detection delay decouples into network delay and decision delay. We obtain a lower bound on the network delay, and propose a censoring scheme, where lagging sensors drop their delayed observations in order to mitigate network delay. We show that this scheme can achieve the lower bound. This approach is explored via simulation. We also use numerical evaluation and simulation to study issues such as: the optimal sampling rate for a given number of sensors, and the optimal number of sensors for a given measurement rate
Resumo:
Ripe fruit need to signal their presence to attract dispersal agents. Plants may employ visual and/or olfactory sensory channels to signal the presence of ripe fruit. Visual signals of ripe fruit have been extensively investigated. However, the volatile signatures of ripe fruit that use olfactorily-oriented dispersers have been scarcely investigated. Moreover, as in flowers, where floral scents are produced at times when pollinators are active (diurnal versus nocturnal), whether plants can modulate the olfactory signal to produce fruit odours when dispersers are active in the diel cycle is completely unknown. We investigated day night differences in fruit odours in two species of figs, Ficus racemosa and Ficus benghalensis. The volatile bouquet of fruit of F.racemosa that are largely dispersed by bats and other mammals was dominated by fatty acid derivatives such as esters. In this species in which the ripe fig phase is very short, and where the figs drop off soon after ripening, there were no differences between day and night in fruit volatile signature. The volatile bouquet of fruit of F. benghalensis that has a long ripening period, however, and that remain attached to the tree for extended periods when ripe, showed an increase in fatty acid derivatives such as esters and of benzenoids such as benzaldehyde at night when they are dispersed by bats, and an elevation of sesquiterpenes during the day when they are dispersed by birds. For the first time we provide data that suggest that the volatile signal produced by fruit can show did l differences based on the activity period of the dispersal agent. (C) 2011 Elsevier Masson SAS. All rights reserved.
Resumo:
Among squamate reptiles, lizards exhibit an impressive array of sex-determining modes viz. genotypic sex determination, temperature-dependent sex determination, co-occurrence of both these and those that reproduce parthenogenetically. The oviparous lizard, Calotes versicolor, lacks heteromorphic sex chromosomes and there are no reports on homomorphic chromosomes. Earlier studies on this species presented little evidence to the sex-determining mechanism. Here we provide evidences for the potential role played by incubation temperature that has a significant effect (P<0.01) on gonadal sex and sex ratio. The eggs were incubated at 14 different incubation temperatures. Interestingly, 100% males were produced at low (25.5 +/- 0.5 degrees C) as well as high (34 +/- 0.5 degrees C) incubation temperatures and 100% females were produced at low (23.5 +/- 0.5 degrees C) and high (31.5 +/- 0.5 degrees C) temperatures, clearly indicating the occurrence of TSD in this species. Sex ratios of individual clutches did not vary at any of the critical male-producing or female-producing temperatures within as well as across the seasons. However, clutch sex ratios were female- or male-biased at intermediate temperatures. Thermosensitive period occurred during the embryonic stages 3033. Three pivotal temperatures operate producing 1:1 sex ratio. Histology of gonad and accessory reproductive structures provide additional evidence for TSD. The sex-determining pattern, observed for the first time in this species, that neither compares to Pattern I [Ia (MF) and Ib (FM)] nor to Pattern II (FMF), is being referred to as FMFM pattern of TSD. This novel FMFM pattern of sex ratio exhibited by C. versicolor may have an adaptive significance in maintaining sex ratio. J. Exp. Zool. 317:3246, 2012. (c) 2011 Wiley Periodicals, Inc.
Resumo:
In this paper, we give a brief review of pattern classification algorithms based on discriminant analysis. We then apply these algorithms to classify movement direction based on multivariate local field potentials recorded from a microelectrode array in the primary motor cortex of a monkey performing a reaching task. We obtain prediction accuracies between 55% and 90% using different methods which are significantly above the chance level of 12.5%.