274 results for Sequential patterns

in Queensland University of Technology - ePrints Archive


Relevance:

100.00%

Abstract:

In many applications, such as bioinformatics, web access traces and system utilisation logs, the data is naturally in the form of sequences. There has been great interest in analysing sequential data and finding its inherent characteristics or relationships. Sequential association rule mining is one of the methods used to analyse this data. Because conventional sequential association rule mining very often generates a huge number of association rules, many of which are redundant, it is desirable to eliminate the unnecessary rules. Owing to the complexity and temporally ordered nature of sequential data, current research on sequential association rule mining is limited. Although several sequential association rule prediction models using either sequence constraints or temporal constraints have been proposed, none of them considers the redundancy problem in rule mining. The main contribution of this research is a non-redundant association rule mining method based on closed frequent sequences and minimal sequential generators. We also give a definition of non-redundant sequential rules, which are sequential rules with minimal antecedents but maximal consequents. A new algorithm called CSGM (closed sequential and generator mining) for generating closed sequences and minimal sequential generators is also introduced. A further experiment compares the performance of generating non-redundant sequential rules with that of generating full sequential rules, and the performance of CSGM is evaluated against other closed sequential pattern mining and generator mining algorithms. We also use the generated non-redundant sequential rules for query expansion in order to improve recommendations for infrequently purchased products.
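The two notions this abstract builds on can be illustrated with a small brute-force sketch. It is not the CSGM algorithm, whose details the abstract does not give. It assumes sequences of single items, treats a frequent sequence as closed when no proper super-sequence has the same support, and as a minimal generator when no proper non-empty sub-sequence has the same support. All function names and the toy database are hypothetical.

```python
from itertools import combinations

def is_subsequence(pattern, sequence):
    """True if pattern occurs in sequence in order (gaps allowed)."""
    it = iter(sequence)
    return all(item in it for item in pattern)

def support(pattern, database):
    """Number of database sequences that contain the pattern."""
    return sum(is_subsequence(pattern, seq) for seq in database)

def frequent_patterns(database, min_support):
    """Enumerate all frequent subsequences by brute force (toy sizes only)."""
    candidates = set()
    for seq in database:
        for length in range(1, len(seq) + 1):
            for idx in combinations(range(len(seq)), length):
                candidates.add(tuple(seq[i] for i in idx))
    freq = {}
    for pattern in candidates:
        s = support(pattern, database)
        if s >= min_support:
            freq[pattern] = s
    return freq

def closed_and_generators(freq):
    """Split frequent patterns into closed sequences and minimal generators."""
    closed, generators = set(), set()
    for p, s in freq.items():
        # Closed: no longer frequent pattern containing p has the same support.
        if not any(len(q) > len(p) and freq[q] == s and is_subsequence(p, q)
                   for q in freq):
            closed.add(p)
        # Generator: no shorter pattern contained in p has the same support.
        if not any(len(q) < len(p) and freq[q] == s and is_subsequence(q, p)
                   for q in freq):
            generators.add(p)
    return closed, generators

# Hypothetical toy database of four sequences.
db = [("a", "b", "c"), ("a", "b", "c"), ("a", "c"), ("b",)]
freq = frequent_patterns(db, min_support=2)
closed, generators = closed_and_generators(freq)
```

On this toy database the generator ("a", "b") and the closed sequence ("a", "b", "c") share a support of 2, so a rule with antecedent ("a", "b") and consequent ("c",) has a minimal antecedent and a maximal consequent in the sense described above.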

Relevance:

100.00%

Abstract:

With the overwhelming increase in the amount of text on the web, it is almost impossible for people to keep abreast of up-to-date information. Text mining is a process by which interesting information is derived from text through the discovery of patterns and trends. Text mining algorithms are used to guarantee the quality of extracted knowledge. However, the patterns extracted by text or data mining algorithms are often noisy and inconsistent. Thus, different challenges arise, such as the question of how to understand these patterns, whether the model that has been used is suitable, and whether all the extracted patterns are relevant. Furthermore, the research raises the question of how to give a correct weight to the extracted knowledge. To address these issues, this paper presents a text post-processing method, which uses a pattern co-occurrence matrix to find the relations between extracted patterns in order to reduce noisy patterns. The main objective of this paper is not only to reduce the number of closed sequential patterns, but also to improve the performance of pattern mining. The experimental results on the Reuters Corpus Volume 1 data collection and TREC filtering topics show that the proposed method is promising.
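As a rough illustration of what a pattern co-occurrence matrix might look like, the sketch below counts, for every pair of patterns, the number of documents in which both appear. The abstract does not specify how the matrix is built or used, so the sparse-counter representation, the function names, the sample data, and the "never co-occurs" filter are all assumptions made for illustration only.

```python
from collections import Counter
from itertools import combinations

def cooccurrence_matrix(doc_patterns):
    """Count the documents in which each unordered pair of patterns co-occurs.

    doc_patterns: one collection of patterns per document, where each
    pattern is a tuple of terms. Returns a sparse matrix as a Counter
    keyed by (pattern_a, pattern_b) with pattern_a < pattern_b.
    """
    counts = Counter()
    for patterns in doc_patterns:
        for a, b in combinations(sorted(set(patterns)), 2):
            counts[(a, b)] += 1
    return counts

def isolated_patterns(doc_patterns, counts):
    """Patterns that never co-occur with any other pattern (assumed noise rule)."""
    all_patterns = {p for patterns in doc_patterns for p in patterns}
    connected = {p for pair in counts for p in pair}
    return all_patterns - connected

# Hypothetical patterns extracted from four documents.
docs = [
    [("oil", "price"), ("opec",), ("price", "rise")],
    [("oil", "price"), ("opec",)],
    [("oil", "price"), ("price", "rise")],
    [("football",)],
]
matrix = cooccurrence_matrix(docs)
noise = isolated_patterns(docs, matrix)
```

Here ("football",) is the only pattern that shares no document with another pattern, so this simple rule flags it as a candidate for removal.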

Relevance:

70.00%

Abstract:

With the overwhelming increase in the amount of data on the web and in databases, many text mining techniques have been proposed for mining useful patterns in text documents. Extracting closed sequential patterns using the Pattern Taxonomy Model (PTM) is one of the pruning methods used to remove noisy, inconsistent, and redundant patterns. However, PTM treats each extracted pattern as a whole without considering the terms it contains, which could affect the quality of the extracted patterns. This paper proposes an innovative and effective method that extends the random set to accurately weigh patterns based on their distribution in the documents and the distribution of their terms in patterns. The proposed approach then finds the specific closed sequential patterns (SCSP) based on the newly calculated weight. The experimental results on the Reuters Corpus Volume 1 (RCV1) data collection and TREC topics show that the proposed method significantly outperforms other state-of-the-art methods on several popular measures.

Relevance:

60.00%

Abstract:

It is a big challenge to guarantee the quality of discovered relevance features in text documents for describing user preferences because of the large number of terms, patterns, and noise. Most existing popular text mining and classification methods have adopted term-based approaches. However, they have all suffered from the problems of polysemy and synonymy. Over the years, people have often held the hypothesis that pattern-based methods should perform better than term-based ones in describing user preferences, but many experiments do not support this hypothesis. This research presents a promising method, Relevance Feature Discovery (RFD), for solving this challenging issue. It discovers both positive and negative patterns in text documents as high-level features in order to accurately weight low-level features (terms) based on their specificity and their distributions in the high-level features. The thesis also introduces an adaptive model (called ARFD) to enhance the flexibility of using RFD in adaptive environments. ARFD automatically updates the system's knowledge based on a sliding window over new incoming feedback documents. It can efficiently decide which incoming documents can bring new knowledge into the system. Substantial experiments using the proposed models on Reuters Corpus Volume 1 and TREC topics show that the proposed models significantly outperform both the state-of-the-art term-based methods underpinned by Okapi BM25, Rocchio or Support Vector Machine and other pattern-based methods.

Relevance:

30.00%

Abstract:

Fusion techniques have received considerable attention for achieving lower error rates with biometrics. A fused classifier architecture based on sequential integration of multi-instance and multi-sample fusion schemes allows a controlled trade-off between false alarms and false rejects. Expressions for each type of error for the fused system have previously been derived for the case of statistically independent classifier decisions. It is shown in this paper that the performance of this architecture can be improved by modelling the correlation between classifier decisions. Correlation modelling also enables better tuning of the fusion model parameters (‘N’, the number of classifiers, and ‘M’, the number of attempts/samples) and facilitates the determination of error bounds for false rejects and false accepts for each specific user. Error trade-off performance of the architecture is evaluated using HMM-based speaker verification on utterances of individual digits. Results show that performance is improved for the case of favourably correlated decisions. The architecture investigated here is directly applicable to speaker verification from spoken digit strings, such as credit card numbers, in telephone or voice over internet protocol based applications. It is also applicable to other biometric modalities such as fingerprints and handwriting samples.

Relevance:

30.00%

Abstract:

Knowledge of the amounts and types of fatty acids in groundnut oil is beneficial, particularly from a nutritional standpoint. Germplasm evaluation data for fatty acid composition on 819 accessions of groundnut (Arachis hypogaea L.) from the Australian Tropical Field Crops Genetic Resource Centre, Biloela, Queensland were examined. Data for eight quantitative fatty acid descriptors have been documented. Statistical assessment, via methods of pattern analysis, summarised and described the patterns of variation in fatty acid composition of the groundnut accessions in the Australian germplasm collection. Presentation of the results from principal components analysis and hierarchical cluster analysis using a biplot was shown to be a very useful interpretative tool. Such a biplot enables a simultaneous examination of the relationships among all the accessions and the fatty acids. Unlike the information available via database searches, the results from contribution analysis together with the biplot provide a global picture of the diversity available for use in plant breeding programs. The use of standardised data for eight fatty acids, compared to using three specific fatty acids, provided a better description of the total diversity available because it remains relevant with possible changes in the nutritional preferences for fatty acids. Fatty acid composition was found to vary in relation to the branching pattern of the accessions. This pattern is generally indicative of the botanical types of groundnuts: Virginia (alternate) compared to Spanish and Valencia (sequential) botanical types.

Relevance:

30.00%

Abstract:

Introduction. Clinically, the Cobb angle method measures the overall scoliotic curve in the coronal plane but does not measure individual vertebra and disc wedging. The contributions of the vertebrae and discs in the growing scoliotic spine were measured to investigate coronal plane deformity progression with growth. Methods. A 0.49 mm isotropic 3D MRI technique was developed to investigate the level-by-level changes that occur in the growing spine of a group of Adolescent Idiopathic Scoliosis (AIS) patients, who received two to four sequential scans (spaced 3-12 months apart). The coronal plane wedge angles of each vertebra and disc in the major curve were measured to capture any changes that occurred during their adolescent growth phase. Results. Seventeen patients had at least two scans. Mean patient age was 12.9 years (SD 1.5 years). Sixteen were classified as right-sided major thoracic Lenke Type 1 (one left sided). Mean standing Cobb angle at initial presentation was 31° (SD 12°). Six received two scans, nine three scans and two four scans, with 65% showing a Cobb angle progression of 5° or more between scans. Overall, there was no clear pattern of deformity progression of individual vertebrae and discs, nor between patients who progressed and those who didn’t. There were measurable changes in the wedging of the vertebrae and discs in all patients. In sequential scans, change in direction of wedging was also seen. In several patients there was reverse wedging in the discs that counteracted increased wedging of the vertebrae such that no change in overall Cobb angle was seen. Conclusion. Sequential MRI data showed complex patterns of deformity progression. Changes to the wedging of individual vertebrae and discs may occur in patients who have no increase in Cobb angle measure; the Cobb method alone may be insufficient to capture the complex mechanisms of deformity progression.

Relevance:

30.00%

Abstract:

INTRODUCTION. Clinically, the Cobb angle method measures the overall scoliotic curve in the coronal plane but does not measure individual vertebra and disc wedging. The contributions of the vertebrae and discs in the growing scoliotic spine were measured to investigate coronal plane deformity progression with growth. METHODS. A 0.49 mm isotropic 3D MRI technique was developed to investigate the level-by-level changes that occur in the growing spine of a group of Adolescent Idiopathic Scoliosis (AIS) patients, who received two to four sequential scans (spaced 3-12 months apart). The coronal plane wedge angles of each vertebra and disc in the major curve were measured to capture any changes that occurred during their adolescent growth phase. RESULTS. Seventeen patients had at least two scans. Mean patient age was 12.9 years (SD 1.5 years). Sixteen were classified as right-sided major thoracic Lenke Type 1 (one left sided). Mean standing Cobb angle at initial presentation was 31° (SD 12°). Six received two scans, nine three scans and two four scans, with 65% showing a Cobb angle progression of 5° or more between scans. Overall, there was no clear pattern of deformity progression of individual vertebrae and discs, nor between patients who progressed and those who didn’t. There were measurable changes in the wedging of the vertebrae and discs in all patients. In sequential scans, change in direction of wedging was also seen. In several patients there was reverse wedging in the discs that counteracted increased wedging of the vertebrae such that no change in overall Cobb angle was seen. CONCLUSION. Sequential MRI data showed complex patterns of deformity progression. Changes to the wedging of individual vertebrae and discs may occur in patients who have no increase in Cobb angle measure; the Cobb method alone may be insufficient to capture the complex mechanisms of deformity progression.

Relevance:

30.00%

Abstract:

Clinically, the Cobb angle method measures the overall scoliotic curve in the coronal plane but does not measure individual vertebra and disc wedging. The contributions of the vertebrae and discs in the growing scoliotic spine were measured to investigate coronal plane deformity progression with growth. Sequential MRI data in this project showed complex patterns of deformity progression. Changes to the wedging of individual vertebrae and discs may occur in patients who have no increase in Cobb angle measure; the Cobb method alone may be insufficient to capture the complex mechanisms of deformity progression.

Relevance:

30.00%

Abstract:

Interdependence is a central concept in systems and organizations, yet our methods for measuring it are not well developed. Here, we report on a novel method for transforming digital trace data into networks of events that can be used to visualize and measure interdependence. The edges in the network represent sequential flow and the vertices represent actors, actions and artifacts. We refer to this representation as an affordance network. As with conventional approaches such as process mining, our method uses input from a stream of time-stamped occurrences, but the representation is simpler and more appropriate for exploration and theory building. As digital trace data becomes more widely available, this method may become more useful in information systems research and practice. Like a thermometer, it helps us measure a basic property of a system that would otherwise be difficult to see.
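A minimal sketch of the idea, under stated assumptions: each time-stamped occurrence is taken to be a tuple of (timestamp, actor, action, artifact), each vertex is the (actor, action, artifact) combination, and a directed edge links each occurrence to the one that follows it in time. The abstract does not define the vertices or edge weights this precisely, so the representation, the function name, and the sample trace are hypothetical.

```python
from collections import Counter

def sequential_flow_network(events):
    """Build weighted directed edges between consecutive events.

    events: iterable of (timestamp, actor, action, artifact) tuples.
    Returns a Counter mapping (source_vertex, target_vertex) to the
    number of times the target directly followed the source.
    """
    ordered = sorted(events, key=lambda e: e[0])
    vertices = [(actor, action, artifact) for _, actor, action, artifact in ordered]
    return Counter(zip(vertices, vertices[1:]))

# Hypothetical digital trace, deliberately given out of time order.
trace = [
    (3, "nurse", "update", "chart"),
    (1, "clerk", "open", "record"),
    (2, "nurse", "read", "record"),
    (4, "clerk", "open", "record"),
    (5, "nurse", "read", "record"),
]
edges = sequential_flow_network(trace)
for (source, target), weight in edges.items():
    print(source, "->", target, weight)
```

In this trace the clerk opening a record is followed twice by the nurse reading it, so that edge carries a weight of 2. Edge weights of this kind are one simple way to make a recurring hand-off between actors visible.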

Relevance:

20.00%

Abstract:

The one-dimensional propagation of a combustion wave through a premixed solid fuel for two-stage kinetics is studied. We re-examine the analysis of a single-reaction travelling wave and extend it to the case of two-stage reactions. We derive an expression for the travelling wave speed in the limit of large activation energy for both reactions. The analysis shows that when both reactions are exothermic, the wave structure is similar to the single-reaction case. However, when the second reaction is endothermic, the wave structure can be significantly different from the single-reaction case. In particular, as might be expected, a travelling wave does not necessarily exist in this case. We establish conditions, in the large activation energy limit, for the non-existence of a travelling wave and for monotonicity of the temperature profile in the travelling wave.

Relevance:

20.00%

Abstract:

Introduction. This is a pilot study of quantitative electro-encephalographic (QEEG) comodulation analysis, which is used to assist in identifying regional brain differences in people suffering from chronic fatigue syndrome (CFS) compared to a normative database. The QEEG comodulation analysis examines spatial-temporal cross-correlation of spectral estimates in the resting dominant frequency band. A pattern shown by Sterman and Kaiser (2001) and referred to as the anterior posterior dissociation (APD) discloses a significant reduction in shared functional modulation between frontal and centro-parietal areas of the cortex. This research attempts to examine whether this pattern is evident in CFS. Method. Eleven adult participants, diagnosed by a physician as having CFS, were involved in QEEG data collection. Nineteen-channel cap recordings were made in five conditions: eyes-closed baseline, eyes-open, reading (task one), math computations (task two), and a second eyes-closed baseline. Results. Four of the 11 participants showed an anterior posterior dissociation pattern for the eyes-closed resting dominant frequency. However, seven of the 11 participants did not show this pattern. Examination of the mean 8-12 Hz amplitudes across three cortical regions (frontal, central and parietal) indicated a trend of higher overall alpha levels in the parietal region in CFS patients who showed the APD pattern compared to those who did not have this pattern. All patients showing the pattern were free of medication, while 71% of those without the pattern were using antidepressant medications. Conclusions. Although the sample is small, it is suggested that this method of evaluating the disorder holds promise. The fact that this pattern was not consistently represented in the CFS sample could be explained by the possibility of subtypes of CFS, or perhaps co-morbid conditions. Further, the use of antidepressant medications may mask the pattern by altering the temporal characteristics of the EEG. The results of this pilot study indicate that further research is warranted to verify that the pattern holds across the wider population of CFS sufferers.