25 results for Cluster Analysis. Information Theory. Entropy. Cross Information Potential. Complex Data
Abstract:
The schema of an information system can significantly impact the ability of end users to efficiently and effectively retrieve the information they need. Quickly obtaining the appropriate data increases the likelihood that an organization will make good decisions and respond adeptly to challenges. This research presents and validates a methodology for evaluating, ex ante, the relative desirability of alternative instantiations of a model of data. In contrast to prior research, each instantiation is based on a different formal theory. This research theorizes that the instantiation that yields the lowest weighted average query complexity for a representative sample of information requests is the most desirable instantiation for end-user queries. The theory was validated by an experiment that compared end-user performance using an instantiation of a data structure based on the relational model of data with performance using the corresponding instantiation of the data structure based on the object-relational model of data. Complexity was measured using three different Halstead metrics: program length, difficulty, and effort. For a representative sample of queries, the average complexity using each instantiation was calculated. As theorized, end users querying the instantiation with the lower average complexity made fewer semantic errors, i.e., were more effective at composing queries. (c) 2005 Elsevier B.V. All rights reserved.
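As a rough illustration of the three Halstead metrics named in this abstract, the sketch below computes program length, difficulty, and effort from the standard Halstead definitions, plus a weighted average across queries. How the authors tokenized queries and chose weights is not stated in the abstract, so the operator/operand split, the function names, and the example weights here are all assumptions.

```python
import math

def halstead(operators, operands):
    """Standard Halstead metrics from lists of operator and operand tokens.

    The split of a query into operators and operands is supplied by the
    caller; the split used in the example below is an assumption.
    """
    n1, n2 = len(set(operators)), len(set(operands))  # distinct counts
    N1, N2 = len(operators), len(operands)            # total counts
    length = N1 + N2                                  # program length N
    volume = length * math.log2(n1 + n2)              # volume V
    difficulty = (n1 / 2) * (N2 / n2)                 # difficulty D
    effort = difficulty * volume                      # effort E = D * V
    return {"length": length, "difficulty": difficulty, "effort": effort}

def weighted_average(values, weights):
    """Weighted mean, e.g. complexity weighted by how often a query is asked."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)

# Hypothetical query: SELECT name FROM employee WHERE dept = 'Sales'
metrics = halstead(
    operators=["SELECT", "FROM", "WHERE", "="],
    operands=["name", "employee", "dept", "'Sales'"],
)
```

Under this reading, the schema instantiation with the lower `weighted_average` of a chosen metric over the sample queries would be the one the abstract calls more desirable.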
Abstract:
Onsite wastewater treatment systems aim to assimilate domestic effluent into the environment. Unfortunately, failure of such systems is common, and inadequate effluent treatment can have serious environmental implications. The capacity of a particular soil to treat wastewater will change over time. Its physical properties influence the rate of effluent movement through the soil, and its chemical properties dictate its ability to renovate effluent. A research project was undertaken to determine the role that physical and chemical soil properties play in predicting the long-term behaviour of soil under effluent irrigation and to determine if they have a potential function as early indicators of adverse effects of effluent irrigation on treatment sustainability. Principal Component Analysis (PCA) and Cluster Analysis grouped the soils independently of their soil classifications and allowed us to distinguish the most suitable soils for sustainable long-term effluent irrigation and determine the most influential soil parameters to characterise them. Multivariate analysis allowed a clear distinction between soils based on their cation exchange capacities. This in turn correlated well with the soil mineralogy. Mixed mineralogy soils, in particular sodium- or magnesium-dominant soils, are the most susceptible to dispersion under effluent irrigation. The soil Exchangeable Sodium Percentage (ESP) was identified as a crucial parameter and was highly correlated with percentage clay, electrical conductivity, exchangeable sodium, exchangeable magnesium and low Ca:Mg ratios (less than 0.5).
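For readers unfamiliar with ESP, the sketch below uses the conventional definition: exchangeable sodium as a percentage of cation exchange capacity. The abstract does not give the formula or units, so the function names and the assumption that all inputs share one unit (for example cmol(+)/kg) are illustrative only. The 0.5 threshold is the Ca:Mg value quoted in the abstract.

```python
def exchangeable_sodium_percentage(exch_na, cec):
    """ESP = 100 * exchangeable Na / cation exchange capacity.

    Both arguments are assumed to be in the same unit, e.g. cmol(+)/kg.
    """
    return 100.0 * exch_na / cec

def has_low_ca_mg_ratio(exch_ca, exch_mg, threshold=0.5):
    """True when the Ca:Mg ratio falls below the threshold from the abstract."""
    return exch_ca / exch_mg < threshold

# Hypothetical soil sample values, not data from the study.
esp = exchangeable_sodium_percentage(exch_na=1.5, cec=10.0)
low_ratio = has_low_ca_mg_ratio(exch_ca=2.0, exch_mg=5.0)
```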
Abstract:
Objective: To describe the characteristics of self-described 'occasional' and 'social' Australian smokers. Design: Analysis of a national cross-sectional survey of smoking patterns, conducted in Australia in 2004. Setting and participants: Australian adults in 2004 who responded to a survey question about self-described smoking status. Main outcome measures: Demographic characteristics, patterns of alcohol and tobacco use, smoking cessation attempts in the past year, and interest in cessation. Results: Smokers who described themselves as 'occasional' and 'social' smokers comprised 29% of all smokers. A significant proportion of occasional and social smokers had been daily smokers, but the majority either believed that they had 'already quit' or had no intention of quitting smoking. Conclusions: Self-ascribed occasional and social smokers potentially represent an important target group for cessation. These types of smokers may be more resistant to public health messages regarding cessation because they do not view their smoking behaviour as presenting a high risk.
Abstract:
In early generation variety trials, large numbers of new breeders' lines need to be compared, and usually there is little seed available for each new line. A so-called unreplicated trial has each new line on just one plot at a site, but includes several (often around five) replicated check or control (or standard) varieties. The total proportion of check plots is usually between 10% and 20%. The aim of the trial is to choose some good performing lines (usually around 1/3 of those tested) to go on for further testing, rather than precise estimation of their mean yield. Now that spatial analyses of data from field experiments are becoming more common, there is interest in an efficient layout of an experiment given a proposed spatial analysis. Some possible design criteria are discussed, and efficient layouts under spatial dependence are considered.
Abstract:
A complete workflow specification requires careful integration of many different process characteristics. Decisions must be made as to the definitions of individual activities, their scope, the order of execution that maintains the overall business process logic, the rules governing the discipline of work list scheduling to performers, identification of time constraints and more. The goal of this paper is to address an important issue in workflow modelling and specification: data flow, and its modelling, specification and validation. Researchers have neglected this dimension of process analysis for some time, mainly focussing on structural considerations with limited verification checks. In this paper, we identify and justify the importance of data modelling in overall workflow specification and verification. We illustrate and define several potential data flow problems that, if not detected prior to workflow deployment, may prevent the process from executing correctly, cause the process to execute on inconsistent data, or even lead to process suspension. A discussion of the essential requirements of the workflow data model in order to support data validation is also given.
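The abstract does not list the data flow problems it defines. As one plausible example of such a check, the sketch below scans a purely sequential workflow for activities that read a data item no earlier activity has produced. The workflow representation, the function name, and the sample activities are all hypothetical, and a real validator would also need to handle branching and parallel paths.

```python
def find_unwritten_reads(activities):
    """Return (activity, item) pairs where an item is read before it is written.

    `activities` is an ordered list of (name, reads, writes) tuples that
    describes a strictly sequential workflow (an assumed simplification).
    """
    written = set()
    problems = []
    for name, reads, writes in activities:
        for item in reads:
            if item not in written:
                problems.append((name, item))
        written.update(writes)
    return problems

# Hypothetical order-handling workflow; "customer" is never produced.
workflow = [
    ("receive_order", [], ["order"]),
    ("check_credit", ["order", "customer"], ["credit_ok"]),
    ("ship_goods", ["order", "credit_ok"], []),
]
problems = find_unwritten_reads(workflow)
```

Running a check like this before deployment would flag `check_credit` as depending on data that no upstream activity supplies.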
Abstract:
In this paper we consider the co-evolutionary dynamics of IS engagement, where episodic change of implementation increasingly occurs within the context of linkages and interdependencies between systems and processes within and across organisations. Although there are many theories that interpret the various motors of change, be it lifecycle, teleological, dialectic or evolutionary, our paper attempts to move towards a unifying view of change by studying co-evolutionary dynamics from a complex systems perspective. To understand how systems and organisations co-evolve in practice and how order emerges, or fails to emerge, we adopt complex adaptive systems theory to incorporate evolutionary and teleological motors, and actor-network theory to incorporate dialectic motors. We illustrate this through the analysis of the implementation of a novel academic scheduling system at a large research-intensive Australian university.