103 resultados para Hypothesis tests
Resumo:
We propose new methods for evaluating predictive densities that focus on the models' actual predictive ability in finite samples. The tests offer a simple way of evaluatingthe correct specification of predictive densities, either parametric or non-parametric.The results indicate that our tests are well sized and have good power in detecting mis-specification in predictive densities. An empirical application to the Survey ofProfessional Forecasters and a baseline Dynamic Stochastic General Equilibrium modelshows the usefulness of our methodology.
Resumo:
A change in paradigm is needed in the prevention of toxic effects on the nervous system, moving from its present reliance solely on data from animal testing to a prediction model mostly based on in vitro toxicity testing and in silico modeling. According to the report published by the National Research Council (NRC) of the US National Academies of Science, high-throughput in vitro tests will provide evidence for alterations in"toxicity pathways" as the best possible method of large scale toxicity prediction. The challenges to implement this proposal are enormous, and provide much room for debate. While many efforts address the technical aspects of implementing the vision, many questions around it need also to be addressed. Is the overall strategy the only one to be pursued? How can we move from current to future paradigms? Will we ever be able to reliably model for chronic and developmental neurotoxicity in vitro? This paper summarizes four presentations from a symposium held at the International Neurotoxicology Conference held in Xi"an, China, in June 2011. A. Li reviewed the current guidelines for neurotoxicity and developmental neurotoxicity testing, and discussed the major challenges existing to realize the NCR vision for toxicity testing. J. Llorens reviewed the biology of mammalian toxic avoidance in view of present knowledge on the physiology and molecular biology of the chemical senses, taste and smell. This background information supports the hypothesis that relating in vivo toxicity to chemical epitope descriptors that mimic the chemical encoding performed by the olfactory system may provide a way to the long term future of complete in silico toxicity prediction. S. Ceccatelli reviewed the implementation of rodent and human neural stem cells (NSCs) as models for in vitro toxicity testing that measures parameters such as cell proliferation, differentiation and migration. These appear to be sensitive endpoints that can identify substances with developmental neurotoxic potential. C. Sun ol reviewed the use of primary neuronal cultures in testing for neurotoxicity of environmental pollutants, including the study of the effects of persistent exposures and/or in differentiating cells, which allow recording of effects that can be extrapolated to human developmental neurotoxicity.
Resumo:
Este trabajo se divide en tres partes: contextualización, estado del arte en evaluación de la usabilidad en dispositivos móviles y propuesta y validación de un método que combina eyetracker de sobremesa y dispositivos móviles. El trabajo culmina con un estudio experimental con un doble propósito: realizar un primer estudio de la validez del método y analizar empíricamente cómo sacarle el máximo rendimiento tratando en todo momento de equipararlo al uso real de un dispositivo físico.
A priori parameterisation of the CERES soil-crop models and tests against several European data sets
Resumo:
Mechanistic soil-crop models have become indispensable tools to investigate the effect of management practices on the productivity or environmental impacts of arable crops. Ideally these models may claim to be universally applicable because they simulate the major processes governing the fate of inputs such as fertiliser nitrogen or pesticides. However, because they deal with complex systems and uncertain phenomena, site-specific calibration is usually a prerequisite to ensure their predictions are realistic. This statement implies that some experimental knowledge on the system to be simulated should be available prior to any modelling attempt, and raises a tremendous limitation to practical applications of models. Because the demand for more general simulation results is high, modellers have nevertheless taken the bold step of extrapolating a model tested within a limited sample of real conditions to a much larger domain. While methodological questions are often disregarded in this extrapolation process, they are specifically addressed in this paper, and in particular the issue of models a priori parameterisation. We thus implemented and tested a standard procedure to parameterize the soil components of a modified version of the CERES models. The procedure converts routinely-available soil properties into functional characteristics by means of pedo-transfer functions. The resulting predictions of soil water and nitrogen dynamics, as well as crop biomass, nitrogen content and leaf area index were compared to observations from trials conducted in five locations across Europe (southern Italy, northern Spain, northern France and northern Germany). In three cases, the model’s performance was judged acceptable when compared to experimental errors on the measurements, based on a test of the model’s root mean squared error (RMSE). Significant deviations between observations and model outputs were however noted in all sites, and could be ascribed to various model routines. In decreasing importance, these were: water balance, the turnover of soil organic matter, and crop N uptake. A better match to field observations could therefore be achieved by visually adjusting related parameters, such as field-capacity water content or the size of soil microbial biomass. As a result, model predictions fell within the measurement errors in all sites for most variables, and the model’s RMSE was within the range of published values for similar tests. We conclude that the proposed a priori method yields acceptable simulations with only a 50% probability, a figure which may be greatly increased through a posteriori calibration. Modellers should thus exercise caution when extrapolating their models to a large sample of pedo-climatic conditions for which they have only limited information.
Resumo:
Differences amongst wheat cultivars in the rate of reproductive development are largely dependent on differences in their sensitivity to photoperiod and vernalization. However, when these responses are accounted for, by growing vernalized seedlings under long photoperiods, cultivars can still differ markedly in time to ear emergence. Control of rate of development by this ‘third factor’ has been poorly understood and is variously referred to as intrinsic earliness, earliness in the narrow sense, basic vegetative period, earliness per se, and basic development rate. Certain assumptions are made in the concept of intrinsic earliness. They are that differences in intrinsic earliness (i) are independent of the responses of the cultivars to photoperiod and vernalization, (ii) apply only to the length of the vegetative period up to floral initiation (as suggested by several authors), (iii) are maintained under different temperatures, measured either in days or degree days. As a consequence of this, the ranking of cultivars (from intrinsically early to intrinsically late) must be maintained at different temperatures. This paper, by the re-analysis of published data, examines the extent to which these assumptions can be supported. Although it is shown that intrinsic earliness operates independently of photoperiod and vernalization responses, the other assumptions were not supported. The differences amongst genotypes in time to ear emergence, grown under above-optimum vernalization and photoperiod (that is when the response to these factors is saturated), were not exclusively due to parallel differences in the length of the vegetative phase, and the length of the reproductive phase was independent of that of the vegetative phase. Thus, it would be possible to change the relative allocation of time to vegetative and reproductive periods with no change in the full period to ear emergence. The differences in intrinsic earliness between cultivars were modified by the temperature regime under which they were grown, i.e. the difference between cultivars (both considering the full phase to ear emergence or some sub-phases) was not a constant amount of time or thermal time at different temperatures. In addition, in some instances genotypes changed their ranking for ‘intrinsic earliness’ depending on the temperature regime. This was interpreted to mean that while all genotypes are sensitive to temperature they differ amongst themselves in the extent of that sensitivity. Therefore, ‘intrinsic earliness’ should not be considered as a static genotypic characteristic, but the result of the interaction between the genotype and temperature. Intrinsic earliness is therefore likely to be related to temperature sensitivity. Some implications of these conclusions for plant breeding and crop simulation modelling are discussed.
Resumo:
Evaluar una arquitectura de la información en un sitio web ya desplegado no resulta una tarea sencilla. La mayoría de las técnicas se centran en examinar la usabilidad del sistema que, aunque afecta a la arquitectura de la información, no es el único factor que influye en ella. La principal técnica que se utiliza es el test de estrés de navegación. Se muestra un aporte metodológico para hacer dicha técnica más informativa, llevándola más allá de la simple anotación en papel por parte del usuario de respuestas a las preguntas de navegación planteadas. Se propone la combinación de ésta con otras técnicas de evaluación de la usabilidad: la técnica de pensar en voz alta o thinking aloud y un cuestionario de usabilidad. Se ha utilizado un sistema de seguimiento de la mirada o eye tracking para complementar la información obtenida mediante las técnicas aplicadas. El enfoque metodológico planteado se ha puesto a prueba analizando una serie de sitios web de bibliotecas de universidades públicas españolas. Se muestra en los resultados la validez del enfoque empleado, así como el valor que dicho enfoque y el uso del eye tracking aportan al análisis de la arquitectura de la información respecto al test de estrés de navegación tradicional.
Resumo:
In this work, a LIDAR-based 3D Dynamic Measurement System is presented and evaluated for the geometric characterization of tree crops. Using this measurement system, trees were scanned from two opposing sides to obtain two three-dimensional point clouds. After registration of the point clouds, a simple and easily obtainable parameter is the number of impacts received by the scanned vegetation. The work in this study is based on the hypothesis of the existence of a linear relationship between the number of impacts of the LIDAR sensor laser beam on the vegetation and the tree leaf area. Tests performed under laboratory conditions using an ornamental tree and, subsequently, in a pear tree orchard demonstrate the correct operation of the measurement system presented in this paper. The results from both the laboratory and field tests confirm the initial hypothesis and the 3D Dynamic Measurement System is validated in field operation. This opens the door to new lines of research centred on the geometric characterization of tree crops in the field of agriculture and, more specifically, in precision fruit growing.
Resumo:
Many studies have shown that IQs have been increasing over the last half century. These increases have come to be known as «the Flynn effect». The «Flynn effect» represents a difference on ability-level between groups of the same age but different cohort. The ability-level differentiation hypothesis represents a difference on the relevance of cognitive factors between groups of high and low ability. Hence, it should be possible to imitate the ability-level differentiation effect by comparing groups of the same age but different cohort. The indifferentiation hypothesis represents no differences on the relevance of cognitive abilities in all age groups within the same cohort. The aim of the present study is to test the relationships between these phenomena. For this purpose we analyzed the American standardisation samples of the WISC, WISC-R and WISC-III. Results support the link between the Flynn effect and the differentiation hypothesis. Also, reported evidence replicate previous findings supporting the indifferentiation hypothesis. Implications for the assessment of the intelligence are discussed.
Resumo:
In this paper we describe a taxonomy of task demands which distinguishes between Task Complexity, Task Condition and Task Difficulty. We then describe three theoretical claims and predictions of the Cognition Hypothesis (Robinson 2001, 2003b, 2005a) concerning the effects of task complexity on: (a) language production; (b) interaction and uptake of information available in the input to tasks; and (c) individual differences-task interactions. Finally we summarize the findings of the empirical studies in this special issue which all address one or more of these predictions and point to some directions for continuing, future research into the effects of task complexity on learning and performance.
Resumo:
The Spreading of the Introduced Seaweed Caulerpa taxifolia (Vahl) C. Agardh in the Mediterranean Sea: Testing the Boat Transportation Hypothesis
Resumo:
This study deals with the statistical properties of a randomization test applied to an ABAB design in cases where the desirable random assignment of the points of change in phase is not possible. In order to obtain information about each possible data division we carried out a conditional Monte Carlo simulation with 100,000 samples for each systematically chosen triplet. Robustness and power are studied under several experimental conditions: different autocorrelation levels and different effect sizes, as well as different phase lengths determined by the points of change. Type I error rates were distorted by the presence of autocorrelation for the majority of data divisions. Satisfactory Type II error rates were obtained only for large treatment effects. The relationship between the lengths of the four phases appeared to be an important factor for the robustness and the power of the randomization test.
Resumo:
It is not known whether rainfall increases the risk of sporadic cases of Legionella pneumonia. We sought to test this hypothesis in a prospective observational cohort study of non-immunosuppressed adults hospitalized for community-acquired pneumonia (1995-2011). Cases with Legionella pneumonia were compared with those with non-Legionella pneumonia. Using daily rainfall data obtained from the regional meteorological service we examined patterns of rainfall over the days prior to admission in each study group. Of 4168 patients, 231 (5.5%) had Legionella pneumonia. The diagnosis was based on one or more of the following: sputum (41 cases), antigenuria (206) and serology (98). Daily rainfall average was 0.556 liters/m2 in the Legionella pneumonia group vs. 0.328 liters/m2 for non-Legionella pneumonia cases (p = 0.04). A ROC curve was plotted to compare the incidence of Legionella pneumonia and the weighted median rainfall. The cut-off point was 0.42 (AUC 0.54). Patients who were admitted to hospital with a prior weighted median rainfall higher than 0.42 were more likely to have Legionella pneumonia (OR 1.35; 95% CI 1.02-1.78; p = .03). Spearman Rho correlations revealed a relationship between Legionella pneumonia and rainfall average during each two-week reporting period (0.14; p = 0.003). No relationship was found between rainfall average and non-Legionella pneumonia cases (−0.06; p = 0.24). As a conclusion, rainfall is a significant risk factor for sporadic Legionella pneumonia. Physicians should carefully consider Legionella pneumonia when selecting diagnostic tests and antimicrobial therapy for patients presenting with CAP after periods of rainfall.
Resumo:
Submarine canyons are sites of intense energy and material exchange between the shelf and the deep adjacent basins. To test the hypothesis that active submarine canyons represent preferential conduits of available food for the deep-sea benthos, two mooring lines were deployed at 1200 m depth from November 2008 to November 2009 inside the Blanes canyon and on the adjacent open slope (Catalan Margin, NW Mediterranean Sea). We investigated the fluxes, biochemical composition and food quality of sinking organic carbon (OC). OC fluxes in the canyon and the open slope varied among sampling periods, though not onsistently in the two sites. In particular, while in the open slope the highest OC fluxes were observed in August 2009, in the canyon the highest OC fluxes occurred in April-May 2009. For almost the entire study period, the OC fluxes in the canyon were significantly higher than those in the open slope, whereas OC contents of sinking particles collected in the open slope were consistently higher than those in the canyon. This result confirms that submarine canyons are effective conveyors of OC to the deep sea. Particles transferred to the deep sea floor through the canyons are predominantly of inorganic origin, significantly higher than that reaching the open slope at a similar water depth. Using multivariate statistical tests, two major clusters of sampling periods were identified: one in the canyon that grouped trap samples collected in December 2008, oncurrently with the occurrence of a major storm at the sea surface, and associated with increased fluxes of nutritionally available particles from the upper shelf. Another cluster grouped samples from both the canyon and the open slope collected in March 2009, concurrently with the occurrence of the seasonal phytoplankton bloom at the sea surface, and associated with increased fluxes of total phytopigments. Our results confirm the key ecological role of submarine canyons for the functioning of deep-sea ecosystems, and highlight the importance of canyons in linking episodic storms and primary production occurring at the sea surface to the deep sea floor.
Resumo:
A robust finding of studies investigating the Aspect Hypothesis is that learners at early stages of acquisition show a strong preference for using the progressive aspect as associated with activity verbs. As they advance in their acquisition of the second or foreign language, learners move from this prototypical association to associations traditionally considered to be more peripheral (e.g.-ing with accomplishments or achievements). Within this framework, the goal of this paper is to provide further evidence from groups of learners with different proficiency levels with regard to the acquisition of progressive aspect by tutored learners of English who are bilingual Catalan-Spanish. This is done by eliciting data by means of two different task types and by looking at both tokens and types. Our results are consistent with previous research according to which-ing morphology is closely associated with durative lexical aspect, although not necessarily with activity predicates. The study also shows that the type of task has an influence on the frequency and the distribution of learners" progressive forms.
Resumo:
[cat] Estudiem les propietats teòriques que una funció d.emparellament ha de satisfer per tal de representar un mercat laboral amb friccions dins d'un model d'equilibri general amb emparellament aleatori. Analitzem el cas Cobb-Douglas, CES i altres formes funcionals per a la funció d.emparellament. Els nostres resultats estableixen restriccions sobre els paràmetres d'aquests formes funcionals per assegurar que l.equilibri és interior. Aquestes restriccions aporten raons teòriques per escollir entre diverses formes funcionals i permeten dissenyar tests d'error d'especificació de model en els treballs empírics.