188 resultados para Model Testing
em University of Queensland eSpace - Australia
Resumo:
To simulate cropping systems, crop models must not only give reliable predictions of yield across a wide range of environmental conditions, they must also quantify water and nutrient use well, so that the status of the soil at maturity is a good representation of the starting conditions for the next cropping sequence. To assess the suitability for this task a range of crop models, currently used in Australia, were tested. The models differed in their design objectives, complexity and structure and were (i) tested on diverse, independent data sets from a wide range of environments and (ii) model components were further evaluated with one detailed data set from a semi-arid environment. All models were coded into the cropping systems shell APSIM, which provides a common soil water and nitrogen balance. Crop development was input, thus differences between simulations were caused entirely by difference in simulating crop growth. Under nitrogen non-limiting conditions between 73 and 85% of the observed kernel yield variation across environments was explained by the models. This ranged from 51 to 77% under varying nitrogen supply. Water and nitrogen effects on leaf area index were predicted poorly by all models resulting in erroneous predictions of dry matter accumulation and water use. When measured light interception was used as input, most models improved in their prediction of dry matter and yield. This test highlighted a range of compensating errors in all modelling approaches. Time course and final amount of water extraction was simulated well by two models, while others left up to 25% of potentially available soil water in the profile. Kernel nitrogen percentage was predicted poorly by all models due to its sensitivity to small dry matter changes. Yield and dry matter could be estimated adequately for a range of environmental conditions using the general concepts of radiation use efficiency and transpiration efficiency. However, leaf area and kernel nitrogen dynamics need to be improved to achieve better estimates of water and nitrogen use if such models are to be use to evaluate cropping systems. (C) 1998 Elsevier Science B.V.
Resumo:
1. Although population viability analysis (PVA) is widely employed, forecasts from PVA models are rarely tested. This study in a fragmented forest in southern Australia contrasted field data on patch occupancy and abundance for the arboreal marsupial greater glider Petauroides volans with predictions from a generic spatially explicit PVA model. This work represents one of the first landscape-scale tests of its type. 2. Initially we contrasted field data from a set of eucalypt forest patches totalling 437 ha with a naive null model in which forecasts of patch occupancy were made, assuming no fragmentation effects and based simply on remnant area and measured densities derived from nearby unfragmented forest. The naive null model predicted an average total of approximately 170 greater gliders, considerably greater than the true count (n = 81). 3. Congruence was examined between field data and predictions from PVA under several metapopulation modelling scenarios. The metapopulation models performed better than the naive null model. Logistic regression showed highly significant positive relationships between predicted and actual patch occupancy for the four scenarios (P = 0.001-0.006). When the model-derived probability of patch occupancy was high (0.50-0.75, 0.75-1.00), there was greater congruence between actual patch occupancy and the predicted probability of occupancy. 4. For many patches, probability distribution functions indicated that model predictions for animal abundance in a given patch were not outside those expected by chance. However, for some patches the model either substantially over-predicted or under-predicted actual abundance. Some important processes, such as inter-patch dispersal, that influence the distribution and abundance of the greater glider may not have been adequately modelled. 5. Additional landscape-scale tests of PVA models, on a wider range of species, are required to assess further predictions made using these tools. This will help determine those taxa for which predictions are and are not accurate and give insights for improving models for applied conservation management.
Resumo:
The Agricultural Production Systems Simulator (APSIM) is a modular modelling framework that has been developed by the Agricultural Production Systems Research Unit in Australia. APSIM was developed to simulate biophysical process in farming systems, in particular where there is interest in the economic and ecological outcomes of management practice in the face of climatic risk. The paper outlines APSIM's structure and provides details of the concepts behind the different plant, soil and management modules. These modules include a diverse range of crops, pastures and trees, soil processes including water balance, N and P transformations, soil pH, erosion and a full range of management controls. Reports of APSIM testing in a diverse range of systems and environments are summarised. An example of model performance in a long-term cropping systems trial is provided. APSIM has been used in a broad range of applications, including support for on-farm decision making, farming systems design for production or resource management objectives, assessment of the value of seasonal climate forecasting, analysis of supply chain issues in agribusiness activities, development of waste management guidelines, risk assessment for government policy making and as a guide to research and education activity. An extensive citation list for these model testing and application studies is provided. Crown Copyright (C) 2002 Published by Elsevier Science B.V. All rights reserved.
Resumo:
Testing ecological models for management is an increasingly important part of the maturation of ecology as an applied science. Consequently, we need to work at applying fair tests of models with adequate data. We demonstrate that a recent test of a discrete time, stochastic model was biased towards falsifying the predictions. If the model was a perfect description of reality, the test falsified the predictions 84% of the time. We introduce an alternative testing procedure for stochastic models, and show that it falsifies the predictions only 5% of the time when the model is a perfect description of reality. The example is used as a point of departure to discuss some of the philosophical aspects of model testing.
Resumo:
Risk-ranking protocols are used widely to classify the conservation status of the world's species. Here we report on the first empirical assessment of their reliability by using a retrospective study of 18 pairs of bird and mammal species (one species extinct and the other extant) with eight different assessors. The performance of individual assessors varied substantially, but performance was improved by incorporating uncertainty in parameter estimates and consensus among the assessors. When this was done, the ranks from the protocols were consistent with the extinction outcome in 70-80% of pairs and there were mismatches in only 10-20% of cases. This performance was similar to the subjective judgements of the assessors after they had estimated the range and population parameters required by the protocols, and better than any single parameter. When used to inform subjective judgement, the protocols therefore offer a means of reducing unpredictable biases that may be associated with expert input and have the advantage of making the logic behind assessments explicit. We conclude that the protocols are useful for forecasting extinctions, although they are prone to some errors that have implications for conservation. Some level of error is to be expected, however, given the influence of chance on extinction. The performance of risk assessment protocols may be improved by providing training in the application of the protocols, incorporating uncertainty in parameter estimates and using consensus among multiple assessors, including some who are experts in the application of the protocols. Continued testing and refinement of the protocols may help to provide better absolute estimates of risk, particularly by re-evaluating how the protocols accommodate missing data.
Resumo:
In this paper we present a model of specification-based testing of interactive systems. This model provides the basis for a framework to guide such testing. Interactive systems are traditionally decomposed into a functionality component and a user interface component; this distinction is termed dialogue separation and is the underlying basis for conceptual and architectural models of such systems. Correctness involves both proper behaviour of the user interface and proper computation by the underlying functionality. Specification-based testing is one method used to increase confidence in correctness, but it has had limited application to interactive system development to date.
Resumo:
Formal specifications can precisely and unambiguously define the required behavior of a software system or component. However, formal specifications are complex artifacts that need to be verified to ensure that they are consistent, complete, and validated against the requirements. Specification testing or animation tools exist to assist with this by allowing the specifier to interpret or execute the specification. However, currently little is known about how to do this effectively. This article presents a framework and tool support for the systematic testing of formal, model-based specifications. Several important generic properties that should be satisfied by model-based specifications are first identified. Following the idea of mutation analysis, we then use variants or mutants of the specification to check that these properties are satisfied. The framework also allows the specifier to test application-specific properties. All properties are tested for a range of states that are defined by the tester in the form of a testgraph, which is a directed graph that partially models the states and transitions of the specification being tested. Tool support is provided for the generation of the mutants, for automatically traversing the testgraph and executing the test cases, and for reporting any errors. The framework is demonstrated on a small specification and its application to three larger specifications is discussed. Experience indicates that the framework can be used effectively to test small to medium-sized specifications and that it can reveal a significant number of problems in these specifications.
Resumo:
Explants of the hard coral Seriatopora hystrix were exposed to sublethal concentrations of the herbicide diuron DCMU (N'-(3,4-dichlorophenyl,-N,N-dimethylurea)) and the heavy metal copper. Pulse amplitude modulated (PAM) chlorophyll fluorescence techniques were used to assess the effects on the photosynthetic efficiency of the algal symbionts in the tissue (in Symbio), and chlorophyll fluorescence and counts of symbiotic algae (normalised to surface area) were used to assess the extent of coral bleaching. At 30 mug DCMU l(-1), there was a reduction in both the maximum effective quantum yield (DeltaF/F-m') and maximum potential quantum yield (F-v/F-m) of the algal symbionts in symbio. Corals subsequently lost their algal symbionts and discoloured (bleached), especially on their upper sunlight-exposed surfaces. At the same DCMU concentration but under low light (5% of growth irradiance), there was a marked reduction in DeltaF/F-m' but only a slight reduction in F-v/F-m and slight loss of algae. Loss of algal symbionts was also noted after a 7 d exposure to concentrations as low as 10 mug DCMU l(-1) under normal growth irradiance, and after 14 d exposure to 10 mug DCMU l(-1) under reduced irradiance. Collectively the results indicate that DCMU-induced bleaching is caused by a light-dependent photoinactivation of algal symbionts, and that bleaching occurs when F-v/F-n, (measured 2 h after sunset) is reduced to a value of less than or equal to 0.6. Elevated copper concentrations (60 mug Cu l(-1) for 10 h) also induced a rapid bleaching in S. hystrix but without affecting the quantum yield of the algae in symbio. Tests with isolated algae indicated that substantially higher concentrations (300 mug Cu l(-1) for 8 h) were needed to significantly reduce the quantum yield. Thus, copper-induced bleaching occurs without affecting the algal photosynthesis and may be related to effects on the host (animal). It is argued that warm-water bleaching of corals resembles both types of chemically induced bleaching, suggesting the need for an integrated model of coral bleaching involving the effect of temperature on both host (coral) and algal symbionts.
Resumo:
This study examined the utility of a stress/coping model in explaining adaptation in two groups of people at-risk for Huntington's Disease (HD): those who have not approached genetic testing services (non-testees) and those who have engaged a testing service (testees). The aims were (1) to compare testees and non-testees on stress/coping variables, (2) to examine relations between adjustment and the stress/coping predictors in the two groups, and (3) to examine relations between the stress/coping variables and testees' satisfaction with their first counselling session. Participants were 44 testees and 40 non-testees who completed questionnaires which measured the stress/coping variables: adjustment (global distress, depression, health anxiety, social and dyadic adjustment), genetic testing concerns, testing context (HD contact, experience, knowledge), appraisal (control, threat, self-efficacy), coping strategies (avoidance, self-blame, wishful thinking, seeking support, problem solving), social support and locus of control. Testees also completed a genetic counselling session satisfaction scale. As expected, non-testees reported lower self-efficacy and control appraisals, higher threat and passive avoidant coping than testees. Overall, results supported the hypothesis that within each group poorer adjustment would be related to higher genetic testing concerns, contact with HD, threat appraisals, passive avoidant coping and external locus of control, and lower levels of positive experiences with HD, social support, internal locus of control, self-efficacy, control appraisals, problem solving, emotional approach and seeking social support coping. Session satisfaction scores were positively correlated with dyadic adjustment, problem solving and positive experience with HD, and inversely related to testing concerns, and threat and control appraisals. Findings support the utility of the stress/coping model in explaining adaptation in people who have decided not to seek genetic testing for HD and those who have decided to engage a genetic testing service.
Resumo:
Traditional measures of termite food preference assess consequences of foraging behavior such as wood consumption, aggregation and/or termite survivorship. Although studies have been done to investigate the specifics of foraging behavior this is not generally integrated into choice assay experiments. Here choice assays were conducted with small isolated (orphaned) groups of workers and compared with choice assays involving foragers from whole nests (non-orphaned) in the laboratory. Aggregation to two different wood types was used as a measure of preference. Specific worker caste and instars participating in initial exploration were compared between assay methods, with samples of termites taken from nest carton material and sites where termites were feeding. Aggregation results differ between choice assay techniques. Castes and instars responsible for initial exploration, as determined in whole nest trials, were not commonly found exploring in isolated group trials, nor were they numerous in termites taken from active feeding sites. Consequently the use of small groups of M. turneri worker termites extracted from active feeding sites may not be appropriate for use in choice assays.
Resumo:
Objective: Our aim was to determine if insomnia severity, dysfunctional beliefs about sleep, and depression predicted sleep-related safety behaviors. Method: Standard sleep-related measures (such as the Insomnia Severity Index; the Dysfunctional Beliefs About Sleep scale; the Depression, Anxiety, and Stress Scale; and the Sleep-Related Behaviors Questionnaire) were administered. Additionally, 14 days of sleep diary (Pittsburg Sleep Diary) data and actual use of sleep-related behaviors were collected. Results: Regression analysis revealed that dysfunctional beliefs about sleep predicted sleep-related safety behaviors. Insomnia severity did not predict sleep-related safety behaviors. Depression accounted for the greatest amount of unique variance in the prediction of safety behaviors, followed by dysfunctional beliefs. Exploratory analysis revealed that participants with higher levels of depression used more sleep-related behaviors and reported greater dysfunctional beliefs about their sleep. Conclusion: The findings underlie the significant influence that dysfunctional beliefs have on individuals' behaviors. Moreover, the results suggest that depression may need to be considered as an explicit component of cognitive-behavioral models of insomnia. (c) 2006 Elsevier Inc. All rights reserved.
Intuitive and analytical decision-making in a high risk industry: Development and testing of a model