988 resultados para Auto-association
Resumo:
For most of the work done in developing association rule mining, the primary focus has been on the efficiency of the approach and to a lesser extent the quality of the derived rules has been emphasized. Often for a dataset, a huge number of rules can be derived, but many of them can be redundant to other rules and thus are useless in practice. The extremely large number of rules makes it difficult for the end users to comprehend and therefore effectively use the discovered rules and thus significantly reduces the effectiveness of rule mining algorithms. If the extracted knowledge can’t be effectively used in solving real world problems, the effort of extracting the knowledge is worth little. This is a serious problem but not yet solved satisfactorily. In this paper, we propose a concise representation called Reliable Approximate basis for representing non-redundant approximate association rules. We prove that the redundancy elimination based on the proposed basis does not reduce the belief to the extracted rules. We also prove that all approximate association rules can be deduced from the Reliable Approximate basis. Therefore the basis is a lossless representation of approximate association rules.
Resumo:
Background: The seasonality of suicide has long been recognised. However, little is known about the relative importance of socio-environmental factors in the occurrence of suicide in different geographical areas. This study examined the association of climate, socioeconomic and demographic factors with suicide in Queensland, Australia, using a spatiotemporal approach. Methods: Seasonal data on suicide, demographic variables and socioeconomic indexes for areas in each Local Government Area (LGA) between 1999 and 2003 were acquired from the Australian Bureau of Statistics. Climate data were supplied by the Australian Bureau of Meteorology. A multivariable generalized estimating equation model was used to examine the impact of socio-environmental factors on suicide. Results: The preliminary data analyses show that far north Queensland had the highest suicide incidence (e.g., Cook and Mornington Shires), while the south-western areas had the lowest incidence (e.g., Barcoo and Bauhinia Shires) in all the seasons. Maximum temperature, unemployment rate, the proportion of Indigenous population and the proportion of population with low individual income were statistically significantly and positively associated with suicide. There were weaker but not significant associations for other variables. Conclusions: Maximum temperature, the proportion of Indigenous population and unemployment rate appeared to be major determinants of suicide at a LGA level in Queensland.
Resumo:
Association rule mining is one technique that is widely used when querying databases, especially those that are transactional, in order to obtain useful associations or correlations among sets of items. Much work has been done focusing on efficiency, effectiveness and redundancy. There has also been a focusing on the quality of rules from single level datasets with many interestingness measures proposed. However, with multi-level datasets now being common there is a lack of interestingness measures developed for multi-level and cross-level rules. Single level measures do not take into account the hierarchy found in a multi-level dataset. This leaves the Support-Confidence approach,which does not consider the hierarchy anyway and has other drawbacks, as one of the few measures available. In this paper we propose two approaches which measure multi-level association rules to help evaluate their interestingness. These measures of diversity and peculiarity can be used to help identify those rules from multi-level datasets that are potentially useful.
Resumo:
Association rule mining has made many advances in the area of knowledge discovery. However, the quality of the discovered association rules is a big concern and has drawn more and more attention recently. One problem with the quality of the discovered association rules is the huge size of the extracted rule set. Often for a dataset, a huge number of rules can be extracted, but many of them can be redundant to other rules and thus useless in practice. Mining non-redundant rules is a promising approach to solve this problem. In this paper, we firstly propose a definition for redundancy; then we propose a concise representation called Reliable basis for representing non-redundant association rules for both exact rules and approximate rules. An important contribution of this paper is that we propose to use the certainty factor as the criteria to measure the strength of the discovered association rules. With the criteria, we can determine the boundary between redundancy and non-redundancy to ensure eliminating as many redundant rules as possible without reducing the inference capacity of and the belief to the remaining extracted non-redundant rules. We prove that the redundancy elimination based on the proposed Reliable basis does not reduce the belief to the extracted rules. We also prove that all association rules can be deduced from the Reliable basis. Therefore the Reliable basis is a lossless representation of association rules. Experimental results show that the proposed Reliable basis can significantly reduce the number of extracted rules.
Resumo:
Recommender systems are widely used online to help users find other products, items etc that they may be interested in based on what is known about that user in their profile. Often however user profiles may be short on information and thus when there is not sufficient knowledge on a user it is difficult for a recommender system to make quality recommendations. This problem is often referred to as the cold-start problem. Here we investigate whether association rules can be used as a source of information to expand a user profile and thus avoid this problem, leading to improved recommendations to users. Our pilot study shows that indeed it is possible to use association rules to improve the performance of a recommender system. This we believe can lead to further work in utilising appropriate association rules to lessen the impact of the cold-start problem.
Resumo:
Two areas of particular importance in prostate cancer progression are primary tumour development and metastasis. These processes involve a number of physiological events, the mediators of which are still being discovered and characterised. Serine proteases have been shown to play a major role in cancer invasion and metastasis. The recently discovered phenomenon of their activation of a receptor family known as the protease activated receptors (PARs) has extended their physiological role to that of signaling molecule. Several serine proteases are expressed by malignant prostate cancer cells, including members of the kallikreinrelated peptidase (KLK) serine protease family, and increasingly these are being shown to be associated with prostate cancer progression. KLK4 is highly expressed in the prostate and expression levels increase during prostate cancer progression. Critically, recent studies have implicated KLK4 in processes associated with cancer. For example, the ectopic over-expression of KLK4 in prostate cancer cell lines results in an increased ability of these cells to form colonies, proliferate and migrate. In addition, it has been demonstrated that KLK4 is a potential mediator of cellular interactions between prostate cancer cells and osteoblasts (bone forming cells). The ability of KLK4 to influence cellular behaviour is believed to be through the selective cleavage of specific substrates. Identification of relevant in vivo substrates of KLK4 is critical to understanding the pathophysiological roles of this enzyme. Significantly, recent reports have demonstrated that several members of the KLK family are able to activate PARs. The PARs are relatively new members of the seven transmembrane domain containing G protein coupled receptor (GPCR) family. PARs are activated through proteolytic cleavage of their N-terminus by serine proteases, the resulting nascent N-terminal binds intramolecularly to initiate receptor activation. PARs are involved in a number of patho-physiological processes, including vascular repair and inflammation, and a growing body of evidence suggests roles in cancer. While expression of PAR family members has been documented in several types of cancers, including prostate, the role of these GPCRs in prostate cancer development and progression is yet to be examined. Interestingly, several studies have suggested potential roles in cellular invasion through the induction of cytoskeletal reorganisation and expression of basement membrane-degrading enzymes. Accordingly, this program of research focussed on the activation of the PARs by the prostate cancer associated enzyme KLK4, cellular processing of activated PARs and the expression pattern of receptor and agonist in prostate cancer. For these studies KLK4 was purified from the conditioned media of stably transfected Sf9 insect cells expressing a construct containing the complete human KLK4 coding sequence in frame with a V5 epitope and poly-histidine encoding sequences. The first aspect of this study was the further characterisation of this recombinant zymogen form of KLK4. The recombinant KLK4 zymogen was demonstrated to be activatable by the metalloendopeptidase thermolysin and amino terminal sequencing indicated that thermolysin activated KLK4 had the predicted N-terminus of mature active KLK4 (31IINED). Critically, removal of the pro-region successfully generated a catalytically active enzyme, with comparable activity to a previously published recombinant KLK4 produced from S2 insect cells. The second aspect of this study was the activation of the PARs by KLK4 and the initiation of signal transduction. This study demonstrated that KLK4 can activate PAR-1 and PAR-2 to mobilise intracellular Ca2+, but failed to activate PAR-4. Further, KLK4 activated PAR-1 and PAR-2 over distinct concentration ranges, with KLK4 activation and mobilisation of Ca2+ demonstrating higher efficacy through PAR-2. Thus, the remainder of this study focussed on PAR-2. KLK4 was demonstrated to directly cleave a synthetic peptide that mimicked the PAR-2 Nterminal activation sequence. Further, KLK4 mediated Ca2+ mobilisation through PAR-2 was accompanied by the initiation of the extra-cellular regulated kinase (ERK) cascade. The specificity of intracellular signaling mediated through PAR-2 by KLK4 activation was demonstrated by siRNA mediated protein depletion, with a reduction in PAR-2 protein levels correlating to a reduction in KLK4 mediated Ca2+mobilisation and ERK phosphorylation. The third aspect of this study examined cellular processing of KLK4 activated PAR- 2 in a prostate cancer cell line. PAR-2 was demonstrated to be expressed by five prostate derived cell lines including the prostate cancer cell line PC-3. It was also demonstrated by flow cytometry and confocal microscopy analyses that activation of PC-3 cell surface PAR-2 by KLK4 leads to internalisation of this receptor in a time dependent manner. Critically, in vivo relevance of the interaction between KLK4 and PAR-2 was established by the observation of the co-expression of receptor and agonist in primary prostate cancer and prostate cancer bone lesion samples by immunohistochemical analysis. Based on the results of this study a number of exciting future studies have been proposed, including, delineating differences in KLK4 cellular signaling via PAR-1 and PAR-2 and the role of PAR-1 and PAR-2 activation by KLK4 in prostate cancer cells and bone cells in prostate cancer progression.
Resumo:
The roles of weather variability and sunspots in the occurrence of cyanobacteria blooms, were investigated using cyanobacteria cell data collected from the Fred Haigh Dam, Queensland, Australia. Time series generalized linear model and classification and regression (CART) model were used in the analysis. Data on notified cell numbers of cyanobacteria and weather variables over the periods 2001 and 2005 were provided by the Australian Department of Natural Resources and Water, and Australian Bureau of Meteorology, respectively. The results indicate that monthly minimum temperature (relative risk [RR]: 1.13, 95% confidence interval [CI]: 1.02-1.25) and rainfall (RR: 1.11; 95% CI: 1.03-1.20) had a positive association, but relative humidity (RR: 0.94; 95% CI: 0.91-0.98) and wind speed (RR:0.90; 95% CI: 0.82-0.98) were negatively associated with the cyanobacterial numbers, after adjustment for seasonality and auto-correlation. The CART model showed that the cyanobacteria numbers were best described by an interaction between minimum temperature, relative humidity, and sunspot numbers. When minimum temperature exceeded 18%C and relative humidity was under 66%, the number of cyanobacterial cells rose by 2.15-fold. We conclude that the weather variability and sunspot activity may affect cyanobacterial blooms in dams.
Resumo:
Abstract With the phenomenal growth of electronic data and information, there are many demands for the development of efficient and effective systems (tools) to perform the issue of data mining tasks on multidimensional databases. Association rules describe associations between items in the same transactions (intra) or in different transactions (inter). Association mining attempts to find interesting or useful association rules in databases: this is the crucial issue for the application of data mining in the real world. Association mining can be used in many application areas, such as the discovery of associations between customers’ locations and shopping behaviours in market basket analysis. Association mining includes two phases. The first phase, called pattern mining, is the discovery of frequent patterns. The second phase, called rule generation, is the discovery of interesting and useful association rules in the discovered patterns. The first phase, however, often takes a long time to find all frequent patterns; these also include much noise. The second phase is also a time consuming activity that can generate many redundant rules. To improve the quality of association mining in databases, this thesis provides an alternative technique, granule-based association mining, for knowledge discovery in databases, where a granule refers to a predicate that describes common features of a group of transactions. The new technique first transfers transaction databases into basic decision tables, then uses multi-tier structures to integrate pattern mining and rule generation in one phase for both intra and inter transaction association rule mining. To evaluate the proposed new technique, this research defines the concept of meaningless rules by considering the co-relations between data-dimensions for intratransaction-association rule mining. It also uses precision to evaluate the effectiveness of intertransaction association rules. The experimental results show that the proposed technique is promising.
Resumo:
These papers were presented at “Industrial Relations”, the Australasian Drama Studies Association conference hosted by Theatre & Teaching Studies in the Academy of the Arts, Queensland University of Technology, from the 5th to the 9th of July, 1999. Conference delegates included scholars and artists from across the tertiary education and professional theatre sectors, including, of course, many individuals who work across and between both those worlds. More than a hundred delegates from Australia, New Zealand, England, Belgium and Canada attended the week’s events, which included: • Over sixty conference papers covering a variety of topics from project reports to academy/industry partnerships, theatre history, audience reception studies, health & safety, cultural policy, performance theory, theatre technology and more; • Performances ranging from drama to dance, music and cabaret; • Workshops, panel discussions, forums and interviews; • Keynote addresses from Wesley Enoch, Josette Feral and Keith Johnstone; and • A special “Links with Industry” day, which included the launch of ADSA’s “Links with Industry” brochure, an interview between Mark Radvan and David Williamson, and a panel session featuring Jules Holledge, Zane Trow, Katharine Brisbane, John Kotzas, Gay McAuley and David Watt.
Resumo:
Compares the Chinese Securities and Regulatory Commission's guidelines for articles of association of listed companies issued in 2006 with 'replaceable' rules in the Australian Corporations Act 2001. Discusses the provisions of the Chinese guidelines and the Australian rules on corporate constitution, interpretation, a company's representative, object clauses, shareholders' powers and meetings and directors. Questions whether the Chinese guidelines facilitate effective corporate governance.
Resumo:
This paper investigates the use of the FAB-MAP appearance-only SLAM algorithm as a method for performing visual data association for RatSLAM, a semi-metric full SLAM system. While both systems have shown the ability to map large (60-70km) outdoor locations of approximately the same scale, for either larger areas or across longer time periods both algorithms encounter difficulties with false positive matches. By combining these algorithms using a mapping between appearance and pose space, both false positives and false negatives generated by FAB-MAP are significantly reduced during outdoor mapping using a forward-facing camera. The hybrid FAB-MAP-RatSLAM system developed demonstrates the potential for successful SLAM over large periods of time.
Resumo:
The 2010 Native American Indigenous Studies Conference was held at The Westin La Paloma Resort, Tucson, Arizona, USA from 20-22 May. The conference was scholarly and interdisciplinary and intended for Indigenous and non-Indigenous scholars who work in American Indian/ Native American/ First Nations/ Aboriginal/ Indigenous Studies. The 2010 gathering attracted 768 registrations from the USA, Canada, Hawaii, Mexico, New Zealand and Australia and other countries. This paper is a personal reflection and overview of the 2010 Conference.
Resumo:
Background Length of hospital stay (LOS) is a surrogate marker for patients' well-being during hospital treatment and is associated with health care costs. Identifying pretreatment factors associated with LOS in surgical patients may enable early intervention in order to reduce postoperative LOS. Methods This cohort study enrolled 157 patients with suspected or proven gynecological cancer at a tertiary cancer centre (2004-2006). Before commencing treatment, the scored Patient Generated - Subjective Global Assessment (PG-SGA) measuring nutritional status and the Functional Assessment of Cancer Therapy-General (FACT-G) scale measuring quality of life (QOL) were completed. Clinical and demographic patient characteristics were prospectively obtained. Patients were grouped into those with prolonged LOS if their hospital stay was greater than the median LOS and those with average or below average LOS. Results Patients' mean age was 58 years (SD 14 years). Preoperatively, 81 (52%) patients presented with suspected benign disease/pelvic mass, 23 (15%) with suspected advanced ovarian cancer, 36 (23%) patients with suspected endometrial and 17 (11%) with cervical cancer, respectively. In univariate models prolonged LOS was associated with low serum albumin or hemoglobin, malnutrition (PG-SGA score and PG-SGA group B or C), low pretreatment FACT-G score, and suspected diagnosis of cancer. In multivariable models, PG-SGA group B or C, FACT-G score and suspected diagnosis of advanced ovarian cancer independently predicted LOS. Conclusions Malnutrition, low quality of life scores and being diagnosed with advanced ovarian cancer are the major determinants of prolonged LOS amongst gynecological cancer patients. Interventions addressing malnutrition and poor QOL may decrease LOS in gynecological cancer patients.