971 results for Association mining
Abstract:
Keyword Spotting is the task of detecting keywords of interest within continuous speech. The applications of this technology range from call centre dialogue systems to covert speech surveillance devices. Keyword spotting is particularly well suited to data mining tasks such as real-time keyword monitoring and unrestricted vocabulary audio document indexing. However, to date, many keyword spotting approaches have suffered from poor detection rates, high false alarm rates, or slow execution times, thus reducing their commercial viability. This work investigates the application of keyword spotting to data mining tasks. The thesis makes a number of major contributions to the field of keyword spotting. The first major contribution is the development of a novel keyword verification method named Cohort Word Verification. This method combines high level linguistic information with cohort-based verification techniques to obtain dramatic improvements in verification performance, in particular for the problematic short duration target word class. The second major contribution is the development of a novel audio document indexing technique named Dynamic Match Lattice Spotting. This technique augments lattice-based audio indexing principles with dynamic sequence matching techniques to provide robustness to erroneous lattice realisations. The resulting algorithm obtains significant improvement in detection rate over lattice-based audio document indexing while still maintaining extremely fast search speeds. The third major contribution is the study of multiple verifier fusion for the task of keyword verification. The reported experiments demonstrate that substantial improvements in verification performance can be obtained through the fusion of multiple keyword verifiers. The research focuses on combinations of speech background model based verifiers and cohort word verifiers.
The final major contribution is a comprehensive study of the effects of limited training data for keyword spotting. This study is performed with consideration as to how these effects impact the immediate development and deployment of speech technologies for non-English languages.
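The idea of dynamic sequence matching against lattice output can be illustrated with a minimal sketch. The abstract does not specify the matching algorithm, so the code below assumes a plain Levenshtein (edit) distance with unit costs over phone sequences; the function names, the phone symbols, and the `max_cost` threshold are hypothetical and are not taken from the thesis.

```python
def edit_distance(target, observed):
    """Levenshtein distance between two phone sequences (unit costs, an assumption)."""
    # prev[j] holds the cost of aligning the processed part of target with observed[:j].
    prev = list(range(len(observed) + 1))
    for i, t_phone in enumerate(target, 1):
        curr = [i]
        for j, o_phone in enumerate(observed, 1):
            substitution = prev[j - 1] + (0 if t_phone == o_phone else 1)
            deletion = prev[j] + 1       # phone of target missing in observed
            insertion = curr[j - 1] + 1  # extra phone in observed
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


def spot_keyword(target, candidates, max_cost=1):
    """Keep candidate phone sequences within max_cost edits of the target.

    `candidates` stands in for sequences read from a lattice (hypothetical input).
    """
    return [c for c in candidates if edit_distance(target, c) <= max_cost]
```

Allowing a small non-zero `max_cost` is what gives tolerance to recognition errors in this sketch: a candidate with one misrecognised phone is still reported as a putative hit, at the price of more false alarms as the threshold grows.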
Abstract:
Background: Length of hospital stay (LOS) is a surrogate marker for patients' well-being during hospital treatment and is associated with health care costs. Identifying pretreatment factors associated with LOS in surgical patients may enable early intervention in order to reduce postoperative LOS. Methods: This cohort study enrolled 157 patients with suspected or proven gynecological cancer at a tertiary cancer centre (2004-2006). Before commencing treatment, the scored Patient-Generated Subjective Global Assessment (PG-SGA) measuring nutritional status and the Functional Assessment of Cancer Therapy-General (FACT-G) scale measuring quality of life (QOL) were completed. Clinical and demographic patient characteristics were prospectively obtained. Patients were grouped into those with prolonged LOS if their hospital stay was greater than the median LOS and those with average or below average LOS. Results: Patients' mean age was 58 years (SD 14 years). Preoperatively, 81 (52%) patients presented with suspected benign disease/pelvic mass, 23 (15%) with suspected advanced ovarian cancer, 36 (23%) patients with suspected endometrial and 17 (11%) with cervical cancer, respectively. In univariate models prolonged LOS was associated with low serum albumin or hemoglobin, malnutrition (PG-SGA score and PG-SGA group B or C), low pretreatment FACT-G score, and suspected diagnosis of cancer. In multivariable models, PG-SGA group B or C, FACT-G score and suspected diagnosis of advanced ovarian cancer independently predicted LOS. Conclusions: Malnutrition, low quality of life scores and being diagnosed with advanced ovarian cancer are the major determinants of prolonged LOS amongst gynecological cancer patients. Interventions addressing malnutrition and poor QOL may decrease LOS in gynecological cancer patients.
Abstract:
This review evaluated the strength of the evidence for a causal relationship between physical activity (PA) and colorectal cancer (CRC). A systematic review of databases through February 2008 was conducted to identify studies that assessed the association between total or recreational PA and incidence or mortality of CRC (including CRC, rectal cancer, colon cancer, and proximal or distal colon cancer). Studies were evaluated for significant associations between PA and risk of CRC endpoints and for evidence of dose–response relationships in the highest quality studies. Twenty cohort studies were evaluated; 11 were high-quality. Fifty percent of all studies and 64% of highest quality studies reported at least one significant association between PA and risk of a CRC endpoint (P < 0.05). However, only 28% of all analyses (31% of analyses of highest quality studies) were significant (P < 0.05). Only 40% of analyses of highest quality studies resulted in a significant P for trend (P < 0.05); however, a non-significant inverse linear association between PA and colon cancer risk was apparent. Heterogeneity in the evidence from all studies and from the highest quality studies was evident. Evidence from cohort studies is not sufficient to claim a convincing relationship exists between PA and CRC risk.
Abstract:
In a seminal data mining article, Leo Breiman [1] argued that to develop effective predictive classification and regression models, we need to move away from the sole dependency on statistical algorithms and embrace a wider toolkit of modeling algorithms that include data mining procedures. Nevertheless, many researchers still rely solely on statistical procedures when undertaking data modeling tasks; the sole reliance on these procedures has led to the development of irrelevant theory and questionable research conclusions ([1], p.199). We will outline initiatives that the HPC & Research Support group is undertaking to engage researchers with data mining tools and techniques, including a new range of seminars, workshops, and one-on-one consultations covering data mining algorithms, the relationship between data mining and the research cycle, and limitations and problems with these new algorithms. Organisational limitations and restrictions to these initiatives are also discussed.
Abstract:
This paper examines Australian media representations of the male managers of two global mining corporations, Rio Tinto and BHP Billiton. These organizations are transnational (or multinational) corporations with assets and/or operations across national boundaries (Dunning and Lundan, 2008), and indeed their respective Chief Executive Officers, Tom Albanese and Marius Kloppers, are two of the most economically (and arguably politically) powerful in the world, overseeing 37 000 and 39 000 employees internationally. With a 2008 profit of US$15.962 billion and assets of US$75.889 billion, BHP Billiton is the world's largest mining company. In terms of its profits and assets, Rio Tinto ranks fourth in the world, but with operations in six countries (mainly Canada and Australia) and a 2008 profit of US$10.3 billion it is also emblematic of the transnational in that its ‘budget is larger than that of all but a few nations’ (Giddens, 2003, p. 62).
Abstract:
Information Overload and Mismatch are two fundamental problems affecting the effectiveness of information filtering systems. Even though both term-based and pattern-based approaches have been proposed to address the problems of overload and mismatch, neither of these approaches alone can provide a satisfactory solution. This paper presents a novel two-stage information filtering model which combines the merits of term-based and pattern-based approaches to effectively filter large volumes of information. In particular, the first filtering stage is supported by a novel rough analysis model which efficiently removes a large number of irrelevant documents, thereby addressing the overload problem. The second filtering stage is empowered by a semantically rich pattern taxonomy mining model which effectively fetches incoming documents according to the specific information needs of a user, thereby addressing the mismatch problem. The experimental results based on the RCV1 corpus show that the proposed two-stage filtering model significantly outperforms both the term-based and the pattern-based information filtering models.
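The two-stage structure described in this abstract can be sketched in a few lines. This is only an illustration of the control flow, not the paper's method: a weighted term score with a cut-off stands in for the rough analysis model, and a sum over matched term sets stands in for the pattern taxonomy mining model. All names, weights, and the `threshold` parameter are hypothetical.

```python
def term_score(doc_terms, term_weights):
    """Stage 1 score: sum of weights of the distinct known terms in a document."""
    return sum(term_weights.get(term, 0.0) for term in set(doc_terms))


def pattern_score(doc_terms, patterns):
    """Stage 2 score: sum of weights of patterns fully contained in the document.

    `patterns` maps a tuple of terms to a weight (a stand-in for mined patterns).
    """
    present = set(doc_terms)
    return sum(weight for pattern, weight in patterns.items()
               if set(pattern) <= present)


def two_stage_filter(docs, term_weights, patterns, threshold):
    """Drop low-scoring documents first, then rank the survivors by pattern score."""
    # Stage 1: cheap term-based cut to reduce the volume of documents (overload).
    survivors = {doc_id: terms for doc_id, terms in docs.items()
                 if term_score(terms, term_weights) >= threshold}
    # Stage 2: more specific pattern-based ranking of what remains (mismatch).
    return sorted(survivors,
                  key=lambda doc_id: pattern_score(survivors[doc_id], patterns),
                  reverse=True)
```

The design point the sketch tries to convey is ordering: the inexpensive stage runs over every document, while the more selective stage only runs over the documents that survive it.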
Abstract:
Background: A number of studies have examined the relationship between high ambient temperature and mortality. Recently, concern has arisen about whether this relationship is modified by socio-demographic factors. However, data for this type of study is relatively scarce in subtropical/tropical regions where people are well accustomed to warm temperatures. Objective: To investigate whether the relationship between daily mean temperature and daily all-cause mortality is modified by age, gender and socio-economic status (SES) in Brisbane, Australia. Methods: We obtained daily mean temperature and all-cause mortality data for Brisbane, Australia during 1996–2004. A generalised additive model was fitted to assess the percentage increase in all deaths with every one degree increment above the threshold temperature. Different age, gender and SES groups were included in the model as categorical variables and their modification effects were estimated separately. Results: A total of 53,316 non-external deaths were included during the study period. There was a clear increasing trend in the harmful effect of high temperature on mortality with age. The effect estimate among women was more than 20 times that among men. We did not find an SES effect on the percent increase associated with temperature. Conclusions: The effects of high temperature on all deaths were modified by age and gender but not by SES in Brisbane, Australia.
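The quantity reported in this abstract, the percentage increase in deaths per one-degree increment above a threshold temperature, can be made concrete with a short sketch. It assumes a log-link (Poisson-type) model in which temperature enters as a linear term above the threshold; the abstract states only that a generalised additive model was fitted, so the link function, the coefficient `beta`, and the threshold value used below are assumptions for illustration.

```python
import math


def excess_degrees(temp_c, threshold_c):
    """Degrees above the threshold; zero at or below it (a 'hockey-stick' term)."""
    return max(temp_c - threshold_c, 0.0)


def percent_increase_per_degree(beta):
    """Convert a log-scale coefficient into a percentage increase per degree."""
    return (math.exp(beta) - 1.0) * 100.0


def relative_risk(temp_c, threshold_c, beta):
    """Expected deaths at temp_c relative to a day at or below the threshold."""
    return math.exp(beta * excess_degrees(temp_c, threshold_c))
```

Under these assumptions a coefficient of `math.log(1.05)` corresponds to a 5% increase per degree, and the increase compounds multiplicatively for each further degree above the threshold.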
Abstract:
There is increasing epidemiological and molecular evidence that cutaneous melanomas arise through multiple causal pathways. The purpose of this study was to explore the relationship between germline and somatic mutations in a population-based series of melanoma patients to reshape and refine the divergent pathway model for melanoma. Melanomas collected from 123 Australian patients were analyzed for melanocortin-1 receptor (MC1R) variants and mutations in the BRAF and NRAS genes. Detailed phenotypic and sun exposure data were systematically collected from all patients. We found that BRAF-mutant melanomas were significantly more likely in younger patients and those with high nevus counts, and were more likely in melanomas with adjacent neval remnants. Conversely, BRAF-mutant melanomas were significantly less likely in people with high levels of lifetime sun exposure. We observed no association between germline MC1R status and somatic BRAF mutations in melanomas from this population. BRAF-mutant melanomas have different origins from other cutaneous melanomas. These data support the divergent pathways hypothesis for melanoma, which may require a reappraisal of targeted cancer prevention activities.
Abstract:
Advances in data mining have provided techniques for automatically discovering underlying knowledge and extracting useful information from large volumes of data. Data mining offers tools for quick discovery of relationships, patterns and knowledge in large complex databases. Application of data mining to manufacturing is relatively limited, mainly because of the complexity of manufacturing data. The growing self-organizing map (GSOM) algorithm has been proven to be an efficient algorithm for unsupervised analysis of DNA data. However, it produced unsatisfactory clustering when used on some large manufacturing data. In this paper a data mining methodology is proposed using a GSOM tool which was developed using a modified GSOM algorithm. The proposed method is used to generate clusters for good and faulty products from a manufacturing dataset. The clustering quality (CQ) measure proposed in the paper is used to evaluate the performance of the cluster maps. The paper also proposes an automatic identification of variables to find the most probable causative factor(s) that discriminate between good and faulty products by quickly examining the historical manufacturing data. The proposed method enables manufacturers to smooth the production flow and improve the quality of the products. Simulation results on small and large manufacturing data show the effectiveness of the proposed method.
Abstract:
A pressing concern within the literature on anticipatory perceptual-motor behaviour is the lack of clarity on the applicability of data, observed under video-simulation task constraints, to actual performance in which actions are coupled to perception, as captured during in-situ experimental conditions. We developed an in-situ experimental paradigm which manipulated the duration of anticipatory visual information from a penalty taker’s actions to examine experienced goalkeepers’ vulnerability to deception for the penalty kick in association football. Irrespective of the penalty taker’s kick strategy, goalkeepers initiated movement responses earlier across consecutively earlier presentation points. Overall goalkeeping performance was better in non-deception trials than in deception conditions. In deception trials, the kinematic information presented up until the penalty taker initiated his/her kicking action had a negative effect on goalkeepers’ performance. It is concluded that goalkeepers are likely to benefit from not anticipating a penalty taker’s performance outcome based on information from the run-up, in preference to later information that emerges just before the initiation of the penalty taker’s kicking action.
Abstract:
The purpose of this research is to report preliminary empirical evidence regarding the association between common physical performance measures and health-related quality of life (HRQoL) of hospitalized older adults recovering from illness and injury. Frequently, these patients do not return to premorbid levels of independence and physical ability. Rehabilitation for this population often focuses on improving physical functioning and mobility with the intention of maximizing their HRQoL for discharge and thereafter. For this reason, longitudinal use of physical performance measures as an indicator of improvement in physical functioning (and thus HRQoL) is common. Although this is a logical approach, there have been mixed results from previous investigations into the association between common measures of physical function and HRQoL amongst other adult patient populations.1,2 There has been no previous investigation reporting the association between HRQoL and a variety of common physical performance measures in hospitalized older adults.
Abstract:
A method of selecting land in any region of Queensland for offsetting purposes is devised, employing uniform standards. The procedure first requires that any core natural asset lands, Crown environmental lands, prime urban and agricultural lands, and highly contentious sites in the region be eliminated from consideration. Other land is then sought that is located between existing large reservations and the centre of greatest potential regional development/disturbance. Using the criteria of rehabilitation (rather than preservation) plus proximity to those officially defined Regional Ecosystems that are most threatened, adjacent sites that are described as ‘Cleared’ are identified in terms of agricultural land capability. Class IV lands – defined as those ‘which may be safely used for occasional cultivation with careful management’,2 ‘where it is favourably located for special usage’,3 and where it is ‘helpful to those who are interested in industry or regional planning or in reconstruction’4 – are examined for their appropriate area, for current tenure and for any conditions such as Mining Leases that may exist. The positive impacts from offsets on adjoining lands can then be designed to be significant; examples are also offered in respect of riparian areas and of Marine Parks. Criteria against which to measure performance for trading purposes include functional lift, with other case studies about this matter reported separately in this issue. The procedure takes no account of demand side economics (financial additionality), which requires commercial rather than environmental analysis.
Abstract:
In the late 20th century, a value-shift began to influence political thinking, recognising the need for environmentally, socially and culturally sustainable resource development. This shift entailed moves away from thinking of nature and culture as separate entities, the former existing merely to serve the latter. Cultural landscape theory recognises 'nature' as at once both 'natural', and as a 'cultural' construct. As such it may offer a framework through which to progress in the quest for 'sustainable development'. This 2005 Masters thesis makes a contribution to that quest by asking whether contemporary developments in cultural landscape theory can contribute to rehabilitation strategies for Australian open-cut coal mining landscapes, an exemplar resource development landscape. A thematic historical overview of landscape values and resource development in Australia post-1788, and a review of cultural landscape theory literature contribute to the formation of the theoretical framework: "reconnecting the interrupted landscape". The author then explores a possible application of this framework within the Australian open-cut coal mining landscape.