9 results for Illusory Biases
in Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Abstract:
Cosmic shear requires high precision measurement of galaxy shapes in the presence of the observational point spread function (PSF) that smears out the image. The PSF must therefore be known for each galaxy to a high accuracy. However, for several reasons, the PSF is usually wavelength dependent; therefore, the differences between the spectral energy distribution of the observed objects introduce further complexity. In this paper, we investigate the effect of the wavelength dependence of the PSF, focusing on instruments in which the PSF size is dominated by the diffraction limit of the telescope and which use broad-band filters for shape measurement. We first calculate biases on cosmological parameter estimation from cosmic shear when the stellar PSF is used uncorrected. Using realistic galaxy and star spectral energy distributions and populations and a simple three-component circular PSF, we find that the colour dependence must be taken into account for the next generation of telescopes. We then consider two different methods for removing the effect: (i) the use of stars of the same colour as the galaxies and (ii) estimation of the galaxy spectral energy distribution using multiple colours and using a telescope model for the PSF. We find that both of these methods correct the effect to levels below the tolerances required for per cent level measurements of dark energy parameters. Comparison of the two methods favours the template-fitting method because its efficiency is less dependent on galaxy redshift than the broad-band colour method and takes full advantage of deeper photometry.
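As a rough illustration of why object colour matters for a diffraction-limited PSF, the sketch below computes a flux-weighted RMS PSF size inside a broad band. Everything in it is an assumption made for illustration and not the paper's model: the top-hat filter limits, the aperture, the power-law toy SEDs, and the function name `effective_psf_sigma` are hypothetical, and the PSF width is simply taken as proportional to wavelength over aperture with the constant set to 1, so only ratios between objects are meaningful.

```python
def effective_psf_sigma(sed, lam_min_nm=550.0, lam_max_nm=900.0,
                        aperture_m=1.2, n_steps=200):
    """Flux-weighted RMS PSF size in a top-hat band (arbitrary units).

    sed: callable giving relative photon flux at a wavelength in nm
         (a toy stand-in for a real spectral energy distribution).
    The per-wavelength PSF width is assumed to scale as lambda / D.
    """
    step = (lam_max_nm - lam_min_nm) / n_steps
    weighted_sq = 0.0
    total_weight = 0.0
    for i in range(n_steps):
        lam_nm = lam_min_nm + (i + 0.5) * step   # midpoint rule
        sigma = lam_nm * 1e-9 / aperture_m       # width proportional to lambda / D
        weight = sed(lam_nm)                     # flux weight inside the band
        weighted_sq += weight * sigma ** 2       # second moments add linearly
        total_weight += weight
    return (weighted_sq / total_weight) ** 0.5


# Toy SEDs: a blue object, a flat spectrum, and a red object.
blue = effective_psf_sigma(lambda lam: lam ** -2.0)
flat = effective_psf_sigma(lambda lam: 1.0)
red = effective_psf_sigma(lambda lam: lam ** 2.0)
print(blue / flat, red / flat)
```

In this toy setting a redder object sees a larger effective PSF than a bluer one through the same filter, which is the kind of mismatch that arises when a stellar PSF is applied to galaxies of a different colour.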
Abstract:
Extinction risk has not been evaluated for 96% of all described plant species. Given that the Global Strategy for Plant Conservation proposes preliminary conservation assessments of all described plant species by 2010, herbarium specimens (i.e., primary occurrence data) are increasingly being used to infer threat components from estimates of geographic range size. Nevertheless, estimates of range size based on herbarium data may be inaccurate due to collection bias associated with interspecific variation in detectability. We used data on 377 species of Bignonieae to test the hypothesis that there is a positive relationship between detectability and estimates of geographic range size derived from herbarium specimens. This relationship is expected if the proportion of the true geographic range size of a species that is documented by herbarium specimens is given by the product of the true geographic range size and the detectability of the species, assuming no relationship between true geographic range size and detectability. We developed 4 measures of detectability that can be estimated from herbarium data and examined the relationship between detectability and 2 types of estimates of geographic range size: area of occupancy and extent of occurrence. Our results from regressing estimates of extent of occurrence and area of occupancy on detectability across genera provided no support for this hypothesis. The same was true for regressions of estimated extent of occurrence on detectability across species within genera. Nevertheless, regressions of estimated area of occupancy on detectability across species within genera provided partial support for our hypothesis. 
We considered 3 possible explanations for this mixed outcome: (1) the assumption of no relationship between true geographic range size and detectability was violated; (2) the relationship between estimated geographic range size and detectability may be an artifact of a negative relationship between estimated area of occupancy and the sampling variance of detectability; and (3) detectability may have had 2 opposite effects on estimated species range sizes, one determining the proportion of the true range of a species documented by herbarium specimens and the other determining the distribution of true range size for the species actually observed with herbarium data. Our findings should help improve understanding of the potential biases incurred with the use of herbarium data.
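The two range-size estimates named in this abstract can be sketched from occurrence points as follows. This is a minimal illustration, not the authors' procedure: it assumes specimen localities are already projected to planar coordinates in km, takes extent of occurrence as the area of the convex hull, and takes area of occupancy as the total area of occupied grid cells, with the cell size left as an assumed parameter. All function names are hypothetical.

```python
import math


def convex_hull(points):
    """Convex hull by Andrew's monotone chain; returns vertices in order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def extent_of_occurrence(points):
    """Area of the convex hull of the occurrence points (shoelace formula)."""
    hull = convex_hull(points)
    twice_area = 0.0
    for i in range(len(hull)):
        x1, y1 = hull[i]
        x2, y2 = hull[(i + 1) % len(hull)]
        twice_area += x1 * y2 - x2 * y1
    return abs(twice_area) / 2.0


def area_of_occupancy(points, cell_km=2.0):
    """Total area of grid cells containing at least one occurrence point."""
    cells = {(math.floor(x / cell_km), math.floor(y / cell_km))
             for x, y in points}
    return len(cells) * cell_km * cell_km


# Toy specimen localities in km; the last record duplicates a locality.
records = [(0, 0), (10, 0), (10, 10), (0, 10), (5, 5), (5, 5)]
print(extent_of_occurrence(records), area_of_occupancy(records))
```

Both estimates depend only on where specimens were collected, so a species that is rarely detected contributes fewer localities and can appear to have a smaller range than it really has.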
Abstract:
Rationale: Major coronary vessels derive from the proepicardium, the cellular progenitor of the epicardium, coronary endothelium, and coronary smooth muscle cells (CoSMCs). CoSMCs are delayed in their differentiation relative to coronary endothelial cells (CoEs), such that CoSMCs mature only after CoEs have assembled into tubes. The mechanisms underlying this sequential CoE/CoSMC differentiation are unknown. Retinoic acid (RA) is crucial for vascular development and the main RA-synthesizing enzyme is progressively lost from epicardially derived cells as they differentiate into blood vessel types. In parallel, myocardial vascular endothelial growth factor (VEGF) expression also decreases along coronary vessel muscularization. Objective: We hypothesized that RA and VEGF act coordinately as physiological brakes to CoSMC differentiation. Methods and Results: In vitro assays (proepicardial cultures, cocultures, and RALDH2 [retinaldehyde dehydrogenase-2]/VEGF adenoviral overexpression) and in vivo inhibition of RA synthesis show that RA and VEGF act as repressors of CoSMC differentiation, whereas VEGF biases epicardially derived cell differentiation toward the endothelial phenotype. Conclusion: Experiments support a model in which early high levels of RA and VEGF prevent CoSMC differentiation from epicardially derived cells before RA and VEGF levels decline as an extensive endothelial network is established. We suggest this physiological delay guarantees the formation of a complex, hierarchical tree of coronary vessels. (Circ Res. 2010;107:204-216.)
Abstract:
We used mixtures of genomic DNA from two genetically distinct isolates from Brazil, 42M and 312M, to investigate how accurately 12-locus microsatellite typing describes the overall genetic diversity and characterizes multilocus haplotypes in multiple-clone Plasmodium vivax infections. We found varying PCR amplification efficiencies of microsatellite alleles; for example, from the same 1:1 mixture of 42M and 312M DNA we amplified predominantly 312M-type alleles at 10 loci and 42M-type alleles at 2 loci. All microsatellite alleles were accurately scored in 1:0.5 and 1:0.25 312M:42M DNA mixtures, even when minor peak heights did not meet previously suggested criteria for minor allele detection in multiple-clone infections. Relative proportions of major and minor alleles were unaffected by multiple displacement amplification of template DNA prior to PCR-based microsatellite typing. Although microsatellite typing may detect minor alleles in clone mixtures, amplification biases may lead to inaccurate assignment of predominant haplotypes in multiple-clone P. vivax infections. (C) 2008 Elsevier Inc. All rights reserved.
Abstract:
The substitution of missing values, also called imputation, is an important data preparation task for many domains. Ideally, the substitution of missing values should not insert biases into the dataset. This aspect has usually been assessed by measures of the prediction capability of imputation methods. Such measures assume the simulation of missing entries for some attributes whose values are actually known. These artificially missing values are imputed and then compared with the original values. Although this evaluation is useful, it does not allow the influence of imputed values in the ultimate modelling task (e.g. in classification) to be inferred. We argue that imputation cannot be properly evaluated apart from the modelling task. Thus, alternative approaches are needed. This article elaborates on the influence of imputed values in classification. In particular, a practical procedure for estimating the inserted bias is described. As an additional contribution, we have used such a procedure to empirically illustrate the performance of three imputation methods (majority, naive Bayes and Bayesian networks) in three datasets. Three classifiers (decision tree, naive Bayes and nearest neighbours) have been used as modelling tools in our experiments. The achieved results illustrate a variety of situations that can take place in the data preparation practice.
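The two kinds of evaluation contrasted in this abstract can be sketched on a toy dataset. The code hides known attribute values, fills them with majority imputation, and then reports (a) how often the imputed values match the originals and (b) how much the accuracy of a simple classifier changes. The abstract does not spell out the article's actual bias-estimation procedure, so the accuracy difference used here, the one-rule classifier, the data, and all names are assumptions made for illustration.

```python
from collections import Counter


def majority_impute(values):
    """Replace None entries with the most frequent observed value."""
    mode = Counter(v for v in values if v is not None).most_common(1)[0][0]
    return [mode if v is None else v for v in values]


def one_rule_accuracy(values, labels):
    """Resubstitution accuracy of a rule that predicts, for each
    attribute value, the class most often seen with that value."""
    by_value = {}
    for v, y in zip(values, labels):
        by_value.setdefault(v, Counter())[y] += 1
    correct = sum(c.most_common(1)[0][1] for c in by_value.values())
    return correct / len(labels)


# Toy complete data: one categorical attribute and a class label.
values = ["a"] * 5 + ["b"] * 3
labels = ["pos"] * 5 + ["neg"] * 3

# Simulate missing entries at positions whose true values are known.
hidden = [0, 5, 6]
masked = [None if i in hidden else v for i, v in enumerate(values)]
imputed = majority_impute(masked)

# (a) Prediction capability: share of hidden values recovered exactly.
imputation_accuracy = sum(imputed[i] == values[i] for i in hidden) / len(hidden)

# (b) Effect on the modelling task: accuracy lost by the classifier.
accuracy_drop = one_rule_accuracy(values, labels) - one_rule_accuracy(imputed, labels)
print(imputation_accuracy, accuracy_drop)
```

The two numbers answer different questions: the first scores the imputed values themselves, while the second measures what the imputation does to the classifier, which is the quantity the abstract argues should drive the evaluation.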
Abstract:
The concentrations of the water-soluble inorganic aerosol species, ammonium (NH₄⁺), nitrate (NO₃⁻), chloride (Cl⁻), and sulfate (SO₄²⁻), were measured from September to November 2002 at a pasture site in the Amazon Basin (Rondônia, Brazil) (LBA-SMOCC). Measurements were conducted using a semi-continuous technique (Wet-annular denuder/Steam-Jet Aerosol Collector: WAD/SJAC) and three integrating filter-based methods, namely (1) a denuder-filter pack (DFP: Teflon and impregnated Whatman filters), (2) a stacked-filter unit (SFU: polycarbonate filters), and (3) a High Volume dichotomous sampler (HiVol: quartz fiber filters). Measurements covered the late dry season (biomass burning), a transition period, and the onset of the wet season (clean conditions). Analyses of the particles collected on filters were performed using ion chromatography (IC) and Particle-Induced X-ray Emission spectrometry (PIXE). Season-dependent discrepancies were observed between the WAD/SJAC system and the filter-based samplers. During the dry season, when PM2.5 (Dp ≤ 2.5 µm) concentrations were ~100 µg m⁻³, aerosol NH₄⁺ and SO₄²⁻ measured by the filter-based samplers were on average two times higher than those determined by the WAD/SJAC. Concentrations of aerosol NO₃⁻ and Cl⁻ measured with the HiVol during daytime, and with the DFP during day- and nighttime also exceeded those of the WAD/SJAC by a factor of two. In contrast, aerosol NO₃⁻ and Cl⁻ measured with the SFU during the dry season were nearly two times lower than those measured by the WAD/SJAC. These differences declined markedly during the transition period and towards the cleaner conditions during the onset of the wet season (PM2.5 ~5 µg m⁻³), when filter-based samplers measured on average 40-90% less than the WAD/SJAC.
The differences were not due to consistent systematic biases of the analytical techniques, but were apparently a result of prevailing environmental conditions and different sampling procedures. For the transition period and wet season, the significance of our results is reduced by a low number of data points. We argue that the observed differences are mainly attributable to (a) positive and negative filter sampling artifacts, (b) presence of organic compounds and organosulfates on filter substrates, and (c) a SJAC sampling efficiency of less than 100%.
Abstract:
In this paper we discuss bias-corrected estimators for the regression and the dispersion parameters in an extended class of dispersion models (Jørgensen, 1997b). This class extends the regular dispersion models by letting the dispersion parameter vary throughout the observations, and contains the dispersion models as a particular case. General formulae for the O(n⁻¹) bias are obtained explicitly in dispersion models with dispersion covariates, which generalize previous results obtained by Botter and Cordeiro (1998), Cordeiro and McCullagh (1991), Cordeiro and Vasconcellos (1999), and Paula (1992). The practical use of the formulae is that we can derive closed-form expressions for the O(n⁻¹) biases of the maximum likelihood estimators of the regression and dispersion parameters when the information matrix has a closed form. Various expressions for the O(n⁻¹) biases are given for special models. The formulae have advantages for numerical purposes because they require only a supplementary weighted linear regression. We also compare these bias-corrected estimators with two bootstrap-based estimators that are also bias-free to order O(n⁻¹). These estimators are compared by simulation. (C) 2011 Elsevier B.V. All rights reserved.
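The contrast between an analytic bias correction and a bootstrap one can be sketched on the simplest case, the maximum likelihood estimator of a normal variance, whose leading bias is -sigma^2/n. This is only a stand-in for the extended dispersion models of the paper: the generic nonparametric bootstrap correction, the sample, and all function names are assumptions made for illustration, and the paper's two bootstrap estimators may differ.

```python
import random


def variance_mle(sample):
    """Maximum likelihood estimate of a normal variance (divides by n)."""
    n = len(sample)
    mean = sum(sample) / n
    return sum((x - mean) ** 2 for x in sample) / n


def analytic_corrected(sample):
    """Subtract the plug-in estimate of the leading bias, -sigma^2 / n."""
    n = len(sample)
    estimate = variance_mle(sample)
    estimated_bias = -estimate / n
    return estimate - estimated_bias


def bootstrap_corrected(sample, n_boot=2000, seed=0):
    """Subtract a nonparametric bootstrap estimate of the bias."""
    rng = random.Random(seed)
    n = len(sample)
    estimate = variance_mle(sample)
    replicates = [
        variance_mle([rng.choice(sample) for _ in range(n)])
        for _ in range(n_boot)
    ]
    estimated_bias = sum(replicates) / n_boot - estimate
    return estimate - estimated_bias


# Simulated data: 20 draws from a standard normal distribution.
data_rng = random.Random(1)
sample = [data_rng.gauss(0.0, 1.0) for _ in range(20)]
print(variance_mle(sample), analytic_corrected(sample), bootstrap_corrected(sample))
```

The analytic version needs a closed-form bias expression but costs almost nothing, whereas the bootstrap version needs no formula but refits the estimator many times, which is the trade-off behind comparing the two approaches by simulation.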
Abstract:
We introduce a new class of Birnbaum-Saunders nonlinear regression models potentially useful in lifetime data analysis. The class generalizes the regression model described by Rieck and Nedelman [Rieck, J.R., Nedelman, J.R., 1991. A log-linear model for the Birnbaum-Saunders distribution. Technometrics 33, 51-60]. We discuss maximum-likelihood estimation for the parameters of the model, and derive closed-form expressions for the second-order biases of these estimates. Our formulae are easily computed as ordinary linear regressions and are then used to define bias-corrected maximum-likelihood estimates. Some simulation results show that the bias correction scheme yields nearly unbiased estimates without increasing the mean squared errors. Two empirical applications are analysed and discussed. Crown Copyright (C) 2009 Published by Elsevier B.V. All rights reserved.
Abstract:
This paper derives the second-order biases of maximum likelihood estimates from a multivariate normal model where the mean vector and the covariance matrix have parameters in common. We show that the second-order bias can always be obtained by means of ordinary weighted least-squares regressions. We conduct simulation studies which indicate that the bias correction scheme yields nearly unbiased estimators. (C) 2009 Elsevier B.V. All rights reserved.
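As a worked illustration of what a second-order (order 1/n) bias correction does, consider the univariate normal variance. This special case is chosen here for illustration and is not taken from the paper; the symbols B for the bias and the tilde for the corrected estimator are notation introduced for this example.

```latex
\begin{align}
\hat\sigma^2 &= \frac{1}{n}\sum_{i=1}^{n}(x_i-\bar x)^2,
  & E[\hat\sigma^2] &= \sigma^2 - \frac{\sigma^2}{n}, \\
B(\sigma^2) &= -\frac{\sigma^2}{n},
  & \tilde\sigma^2 &= \hat\sigma^2 - B(\hat\sigma^2)
     = \hat\sigma^2\left(1+\frac{1}{n}\right), \\
E[\tilde\sigma^2] &= \sigma^2\left(1-\frac{1}{n}\right)\left(1+\frac{1}{n}\right)
     = \sigma^2\left(1-\frac{1}{n^2}\right).
\end{align}
```

Subtracting the estimated bias removes the term of order 1/n and leaves a remaining bias of order 1/n^2, which is the sense in which such corrected estimators are described as nearly unbiased.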