55 results for Software testing. Test generation. Grammars
in CentAUR: Central Archive University of Reading - UK
Abstract:
This paper considers methods for testing for superiority or non-inferiority in active-control trials with binary data, when the relative treatment effect is expressed as an odds ratio. Three asymptotic tests for the log-odds ratio based on the unconditional binary likelihood are presented, namely the likelihood ratio, Wald and score tests. All three tests can be implemented straightforwardly in standard statistical software packages, as can the corresponding confidence intervals. Simulations indicate that the three alternatives are similar in terms of the Type I error, with values close to the nominal level. However, when the non-inferiority margin becomes large, the score test slightly exceeds the nominal level. In general, the highest power is obtained from the score test, although all three tests are similar and the observed differences in power are not of practical importance. Copyright (C) 2007 John Wiley & Sons, Ltd.
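As an illustration of the Wald test mentioned above, the following is a minimal sketch for a one-sided test on the log-odds ratio, assuming a 2x2 table of responders and a non-inferiority margin expressed as an odds ratio. The function name, argument names and example counts are hypothetical and are not taken from the paper; the likelihood ratio and score tests are not shown.

```python
import math

def wald_noninferiority(x_new, n_new, x_ref, n_ref, margin_or):
    """Wald z-test of H0: OR <= margin_or against H1: OR > margin_or.

    x_* are responder counts, n_* are group sizes (hypothetical names).
    margin_or = 1 gives a superiority test; margin_or < 1 a non-inferiority test.
    """
    a, b = x_new, n_new - x_new          # new treatment: responders, non-responders
    c, d = x_ref, n_ref - x_ref          # active control: responders, non-responders
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for the Wald statistic")
    log_or = math.log((a * d) / (b * c))             # estimated log-odds ratio
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)    # large-sample standard error
    z = (log_or - math.log(margin_or)) / se
    p_value = 0.5 * math.erfc(z / math.sqrt(2))      # upper-tail normal probability
    return log_or, se, z, p_value

log_or, se, z, p = wald_noninferiority(80, 100, 80, 100, margin_or=0.5)
print(f"log OR = {log_or:.3f}, SE = {se:.3f}, z = {z:.3f}, one-sided p = {p:.4f}")
```

A small one-sided p-value here would support non-inferiority relative to the chosen margin; the margin itself is a clinical choice that the sketch simply takes as given.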
Abstract:
Short-term memory (STM) impairments are prevalent in adults with acquired brain injuries. While there are several published tests to assess these impairments, the majority require speech production, e.g. digit span (Wechsler, 1987). This feature may make them unsuitable for people with aphasia and motor speech disorders because of word-finding difficulties and speech demands, respectively. If patients perceive the speech demands of the test to be high, they may not engage with testing. Furthermore, existing STM tests are mainly ‘pen-and-paper’ tests, which can jeopardise accuracy. To address these shortcomings, we designed and standardised a novel computerised test that does not require speech output and, because of its computerised delivery, would enable clinicians to identify STM impairments with greater precision than current tests. The matching listening span task, similar to the non-normed PALPA 13 (Kay, Lesser & Coltheart, 1992), is used to test short-term memory for the serial order of spoken items. Sequences of digits are presented in pairs. The person hears the first sequence, followed by the second sequence, and decides whether the two sequences are the same or different. In the computerised test, the sequences are presented as live voice recordings on a portable computer through a software application (Molero Martin, Laird, Hwang & Salis 2013). We collected normative data from healthy older adults (N=22-24) using digits, real words (one- and two-syllable) and non-words (one- and two-syllable). Their performance was scored following two systems. The Highest Span system was the highest span length (e.g. 2-8) at which a participant correctly responded to over 7 out of 10 trials at the highest sequence length. Test-retest reliability was also tested in a subgroup of participants. The test will be available free of charge for clinicians and researchers to use.
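The Highest Span rule described above can be sketched as follows. This is an illustration only: the function name and the example scores are hypothetical, and it assumes that the score is simply the longest sequence length with more than 7 correct responses out of 10, regardless of performance at shorter lengths.

```python
def highest_span(correct_by_length, threshold=7):
    """Return the longest sequence length with more than `threshold` correct trials.

    `correct_by_length` maps a sequence length to the number of correct
    responses out of 10 trials. Returns None if no length meets the criterion.
    """
    passed = [length for length, correct in correct_by_length.items()
              if correct > threshold]
    return max(passed) if passed else None

# Hypothetical participant: correct responses out of 10 at each span length.
scores = {2: 10, 3: 10, 4: 9, 5: 8, 6: 6, 7: 4}
print(highest_span(scores))
```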
Abstract:
There has been an ongoing concern about the lack of reliable data on disabled children in schools. To date there has been no consistent way of identifying and categorising disabilities. Schools in England are currently required to collect data on children with Special Educational Need (SEN), but this does not capture information about all disabled children. The lack of this information may seriously restrict capacity at all levels of policy and practice to understand and respond to the needs of disabled children and their families in line with the Disability Discrimination Act (2005) and the single Equality Act (2010). The aim of the project was to test the draft tools for identifying disability, and the accompanying guidance, in a sample of all types of maintained schools in order to assess their usability and reliability and whether they resulted in the generation of robust and consistent data that could reliably inform school returns for the annual School Census.
Abstract:
Mature (clitellate) Eisenia andrei Bouche (ultra epigeic), Lumbricus rubellus Hoffmeister (epigeic), and Aporrectodea caliginosa (Savigny) (endogeic) earthworms were placed in soils treated with Pb(NO3)2 to have concentrations in the range 1000 to 10 000 mg Pb kg⁻¹. After 28 days, LC50(-95% confidence limit)(+95% confidence limit) values were E. andrei 5824(-361)(+898) mg Pb kg⁻¹, L. rubellus 2867(-193)(+145) mg Pb kg⁻¹ and A. caliginosa 2747(-304)(+239) mg Pb kg⁻¹, and EC50s for weight change were E. andrei 2841(-68)(+150) mg Pb kg⁻¹, L. rubellus 1303(-201)(+204) mg Pb kg⁻¹ and A. caliginosa 1208(-206)(+212) mg Pb kg⁻¹. At any given soil Pb concentration, Pb tissue concentrations after 28 days were the same for all three earthworm species. In a soil avoidance test there was no difference between the behaviour of the different species. The lower sensitivity to Pb exhibited by E. andrei is most likely due to physiological adaptations associated with the modes of life of the earthworms, and could have serious implications for the use of this earthworm as the species of choice in standard toxicological testing. (c) 2005 Elsevier Ltd. All rights reserved.
Abstract:
Ecological risk assessments must increasingly consider the effects of chemical mixtures on the environment as anthropogenic pollution continues to grow in complexity. Yet testing every possible mixture combination is impractical; thus, there is an urgent need for models that can accurately predict mixture toxicity from single-compound data. Currently, two models are frequently used for this purpose: concentration addition and independent action (IA). The accuracy of the predictions generated by these models is currently debated and needs to be resolved before their use in risk assessments can be fully justified. The present study addresses this issue by determining whether the IA model adequately described the toxicity of binary mixtures of five pesticides and other environmental contaminants (cadmium, chlorpyrifos, diuron, nickel, and prochloraz), each with dissimilar modes of action, on the reproduction of the nematode Caenorhabditis elegans. In three out of 10 cases, the IA model failed to describe mixture toxicity adequately, with significant synergy or antagonism being observed. In a further three cases, there was an indication of synergy, antagonism, and effect-level-dependent deviations, respectively, but these were not statistically significant. The extent of the significant deviations that were found varied, but all were such that the predicted percentage effect seen on reproductive output would have been wrong by 18 to 35% (i.e., the effect concentration expected to cause a 50% effect led to an 85% effect). The presence of such a high number and variety of deviations has important implications for the use of existing mixture toxicity models for risk assessments, especially where all or part of the deviation is synergistic.
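For readers unfamiliar with the independent action model, a minimal sketch of its standard form is given below: the predicted mixture effect is one minus the product of the unaffected fractions of the individual components. The function name and the example effect values are hypothetical, and effects are assumed to be expressed as fractions between 0 and 1; this is not the fitting procedure used in the study.

```python
def independent_action(effects):
    """Predict the combined fractional effect of a mixture under independent action.

    `effects` holds the fractional effect (0..1) that each component would
    cause alone at its concentration in the mixture.
    """
    unaffected = 1.0
    for effect in effects:
        if not 0.0 <= effect <= 1.0:
            raise ValueError("each effect must be a fraction between 0 and 1")
        unaffected *= 1.0 - effect      # fraction untouched by this component
    return 1.0 - unaffected

# Hypothetical binary mixture: each compound alone reduces reproduction by 50%.
predicted = independent_action([0.5, 0.5])
observed = 0.85                          # hypothetical observed mixture effect
print(f"predicted = {predicted:.2f}, deviation = {observed - predicted:+.2f}")
```

An observed effect above the prediction would point towards synergy and one below it towards antagonism, although judging whether a deviation is statistically significant needs the kind of analysis the study reports.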
Abstract:
An eddy current testing system consists of a multi-sensor probe, a computer, and a special expansion card and software for data collection and analysis. The probe incorporates an excitation coil and sensor coils; at least one sensor coil is a lateral current-normal coil and at least one is a current perturbation coil.
Abstract:
An eddy current testing system consists of a multi-sensor probe, a computer, and a special expansion card and software for data collection and analysis. The probe incorporates an excitation coil and sensor coils; at least one sensor coil is a lateral current-normal coil and at least one is a current perturbation coil.
Abstract:
We present a method to enhance fault localization for software systems based on a frequent pattern mining algorithm. Our method is based on a large set of test cases for a given set of programs in which faults can be detected. The test executions are recorded as function call trees. Based on test oracles the tests can be classified into successful and failing tests. A frequent pattern mining algorithm is used to identify frequent subtrees in successful and failing test executions. This information is used to rank functions according to their likelihood of containing a fault. The ranking suggests an order in which to examine the functions during fault analysis. We validate our approach experimentally using a subset of Siemens benchmark programs.
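The ranking idea can be illustrated with a deliberately simplified sketch. Instead of a full frequent subtree mining algorithm, it uses only caller-to-callee edges (the smallest possible subtrees) and scores each function by how much more often its edges occur in failing runs than in passing runs. The tree representation, function names and scoring rule are assumptions made for this illustration and are not the authors' algorithm.

```python
from collections import Counter

def edges(tree):
    """Yield (caller, callee) pairs from a call tree given as (name, [children])."""
    name, children = tree
    for child in children:
        yield (name, child[0])
        yield from edges(child)

def support(runs):
    """Fraction of runs in which each call edge occurs at least once."""
    counts = Counter()
    for tree in runs:
        counts.update(set(edges(tree)))
    return {edge: n / len(runs) for edge, n in counts.items()}

def rank_functions(passing, failing):
    """Order functions by the largest (failing - passing) support of their edges."""
    sup_pass, sup_fail = support(passing), support(failing)
    scores = {}
    for edge in set(sup_pass) | set(sup_fail):
        diff = sup_fail.get(edge, 0.0) - sup_pass.get(edge, 0.0)
        for function in edge:
            scores[function] = max(scores.get(function, float("-inf")), diff)
    # Highest score first; ties broken alphabetically for a stable order.
    return sorted(scores, key=lambda f: (-scores[f], f))

# Hypothetical call trees classified by a test oracle.
passing = [("main", [("parse", []), ("eval", [])])]
failing = [("main", [("parse", []), ("eval", [("div", [])])])]
print(rank_functions(passing, failing))
```

The resulting list suggests an order in which to inspect functions; a real implementation would mine larger subtrees and would need a support threshold to keep the pattern set manageable.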
Abstract:
While over-dispersion in capture–recapture studies is well known to lead to poor estimation of population size, current diagnostic tools to detect the presence of heterogeneity have not been specifically developed for capture–recapture studies. To address this, a simple and efficient method of testing for over-dispersion in zero-truncated count data is developed and evaluated. The proposed method generalizes an over-dispersion test previously suggested for un-truncated count data and may also be used for testing residual over-dispersion in zero-inflated data. Simulations suggest that the asymptotic distribution of the test statistic is standard normal and that this approximation is also reasonable for small sample sizes. The method is also shown to be more efficient than an existing test for over-dispersion adapted for the capture–recapture setting. Studies with zero-truncated and zero-inflated count data are used to illustrate the test procedures.
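The abstract does not give the test statistic, so the sketch below is not the proposed test. It only illustrates the underlying idea under stated assumptions: fit a zero-truncated Poisson model by matching the sample mean, then compare the sample variance with the variance that model implies. A ratio well above 1 hints at over-dispersion. All names and the example counts are hypothetical.

```python
import math

def fit_zero_truncated_poisson(mean):
    """Solve mean = lam / (1 - exp(-lam)) for lam by bisection (requires mean > 1)."""
    if mean <= 1.0:
        raise ValueError("a zero-truncated Poisson mean must exceed 1")
    lo, hi = 1e-9, mean                  # the solution always lies below the mean
    for _ in range(200):
        mid = (lo + hi) / 2.0
        if mid / -math.expm1(-mid) < mean:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

def dispersion_ratio(counts):
    """Sample variance divided by the variance of the fitted zero-truncated Poisson."""
    n = len(counts)
    mean = sum(counts) / n
    sample_var = sum((x - mean) ** 2 for x in counts) / (n - 1)
    lam = fit_zero_truncated_poisson(mean)
    model_var = mean * (1.0 + lam - mean)    # variance of a zero-truncated Poisson
    return sample_var / model_var

# Hypothetical capture counts per individual (zeros are unobservable).
counts = [1] * 8 + [10, 12]
print(f"dispersion ratio = {dispersion_ratio(counts):.2f}")
```

Unlike the method in the abstract, this ratio comes with no reference distribution, so it is a descriptive diagnostic rather than a formal test.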
Abstract:
Constructing biodiversity richness maps from Environmental Niche Models (ENMs) of thousands of species is time-consuming. A separate species occurrence data pre-processing phase enables the experimenter to control test AUC score variance due to species dataset size. Besides removing duplicate occurrences and points with missing environmental data, we discuss the need for coordinate precision, wide dispersion, temporal and synonymity filters. After species data filtering, the final task of a pre-processing phase should be the automatic generation of species occurrence datasets which can then be directly 'plugged in' to the ENM. A software application capable of carrying out all these tasks will be a valuable time-saver, particularly for large-scale biodiversity studies.
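Three of the filters named above (duplicates, missing environmental data, coordinate precision) can be sketched as a single pass over occurrence records. The record layout, function names and the two-decimal precision threshold are assumptions chosen for illustration; the dispersion, temporal and synonymity filters are not shown.

```python
def decimals(coord):
    """Count decimal places in a coordinate given as a string, e.g. '51.44' -> 2."""
    return len(coord.split(".")[1]) if "." in coord else 0

def filter_occurrences(records, min_decimals=2):
    """Keep records that are unique, precise enough and have complete environmental data.

    Each record is (species, latitude, longitude, env), with coordinates as
    strings so that their recorded precision is preserved.
    """
    seen = set()
    kept = []
    for species, lat, lon, env in records:
        if env is None or any(value is None for value in env):
            continue                      # missing environmental data
        if decimals(lat) < min_decimals or decimals(lon) < min_decimals:
            continue                      # coordinates too coarse
        key = (species, lat, lon)
        if key in seen:
            continue                      # duplicate occurrence
        seen.add(key)
        kept.append((species, lat, lon, env))
    return kept

# Hypothetical records: (species, lat, lon, (temperature, rainfall)).
records = [
    ("Species A", "51.4412", "-0.9431", (10.2, 650)),
    ("Species A", "51.4412", "-0.9431", (10.2, 650)),   # duplicate
    ("Species A", "51.4", "-0.9", (10.1, 640)),         # low precision
    ("Species A", "52.1000", "-1.2000", (None, 700)),   # missing value
]
print(len(filter_occurrences(records)))
```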
Abstract:
The Organisation for Economic Co-operation and Development (OECD) Terrestrial plant test is often used for the ecological risk assessment of contaminated land. However, its origins in plant protection product testing mean that the species recommended in the OECD guidelines are unlikely to occur on contaminated land. Six alternative species were tested on contaminated soils from a former Zn smelter and a metal fragmentizer with elevated concentrations of Cd, Cu, Pb, and Zn. The response of the alternative species was compared to that of two species recommended by the OECD: Lolium perenne (perennial ryegrass) and Trifolium pratense (red clover). Urtica dioica (stinging nettle) and Poa annua (annual meadow-grass) had low emergence rates in the control soil, so may be considered unsuitable. Festuca rubra (chewings fescue), Holcus lanatus (Yorkshire fog), Senecio vulgaris (common groundsel), and Verbascum thapsus (great mullein) offer good alternatives to the OECD species. In particular, H. lanatus and S. vulgaris were more sensitive to the soils with moderate concentrations of Cd, Cu, Pb, and Zn than the OECD species.
Abstract:
In this paper we review the experimental development of agri-environment measures for use on grasslands. Sward structure has been shown to have a strong influence on birds' ability to forage in grasslands, but the effects of food abundance on foraging behaviour are poorly understood and this hinders development of grassland conservation measures. The experiments described have a dual purpose: to investigate the foraging ecology of birds on grasslands and to test candidate management measures. Most of the work featured focuses on increasing invertebrate food resources during the summer by increasing habitat heterogeneity. We also identify important gaps in the habitats provided by existing or experimental measures, where similar dual-purpose experiments are required.