930 resultados para Phylogenetic Inference
Resumo:
In this paper, we discuss inferential aspects for the Grubbs model when the unknown quantity x (latent response) follows a skew-normal distribution, extending early results given in Arellano-Valle et al. (J Multivar Anal 96:265-281, 2005b). Maximum likelihood parameter estimates are computed via the EM-algorithm. Wald and likelihood ratio type statistics are used for hypothesis testing and we explain the apparent failure of the Wald statistics in detecting skewness via the profile likelihood function. The results and methods developed in this paper are illustrated with a numerical example.
Resumo:
The Birnbaum-Saunders regression model is commonly used in reliability studies. We address the issue of performing inference in this class of models when the number of observations is small. Our simulation results suggest that the likelihood ratio test tends to be liberal when the sample size is small. We obtain a correction factor which reduces the size distortion of the test. Also, we consider a parametric bootstrap scheme to obtain improved critical values and improved p-values for the likelihood ratio test. The numerical results show that the modified tests are more reliable in finite samples than the usual likelihood ratio test. We also present an empirical application. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
In this article, we study some results related to a specific class of distributions, called skew-curved-symmetric family of distributions that depends on a parameter controlling the skewness and kurtosis at the same time. Special elements of this family which are studied include symmetric and well-known asymmetric distributions. General results are given for the score function and the observed information matrix. It is shown that the observed information matrix is always singular for some special cases. We illustrate the flexibility of this class of distributions with an application to a real dataset on characteristics of Australian athletes.
Resumo:
Mixed linear models are commonly used in repeated measures studies. They account for the dependence amongst observations obtained from the same experimental unit. Often, the number of observations is small, and it is thus important to use inference strategies that incorporate small sample corrections. In this paper, we develop modified versions of the likelihood ratio test for fixed effects inference in mixed linear models. In particular, we derive a Bartlett correction to such a test, and also to a test obtained from a modified profile likelihood function. Our results generalize those in [Zucker, D.M., Lieberman, O., Manor, O., 2000. Improved small sample inference in the mixed linear model: Bartlett correction and adjusted likelihood. Journal of the Royal Statistical Society B, 62,827-838] by allowing the parameter of interest to be vector-valued. Additionally, our Bartlett corrections allow for random effects nonlinear covariance matrix structure. We report simulation results which show that the proposed tests display superior finite sample behavior relative to the standard likelihood ratio test. An application is also presented and discussed. (C) 2008 Elsevier B.V. All rights reserved.
Resumo:
This paper investigates common nonlinear features in multivariate nonlinear autore-gressive models via testing the estimated residuals. A Wald-type test is proposed and itis asymptotically Chi-squared distributed. Simulation studies are given to examine thefinite-sample properties of the proposed test.
Resumo:
We study semiparametric two-step estimators which have the same structure as parametric doubly robust estimators in their second step. The key difference is that we do not impose any parametric restriction on the nuisance functions that are estimated in a first stage, but retain a fully nonparametric model instead. We call these estimators semiparametric doubly robust estimators (SDREs), and show that they possess superior theoretical and practical properties compared to generic semiparametric two-step estimators. In particular, our estimators have substantially smaller first-order bias, allow for a wider range of nonparametric first-stage estimates, rate-optimal choices of smoothing parameters and data-driven estimates thereof, and their stochastic behavior can be well-approximated by classical first-order asymptotics. SDREs exist for a wide range of parameters of interest, particularly in semiparametric missing data and causal inference models. We illustrate our method with a simulation exercise.
Resumo:
Differences-in-Differences (DID) is one of the most widely used identification strategies in applied economics. However, how to draw inferences in DID models when there are few treated groups remains an open question. We show that the usual inference methods used in DID models might not perform well when there are few treated groups and errors are heteroskedastic. In particular, we show that when there is variation in the number of observations per group, inference methods designed to work when there are few treated groups tend to (under-) over-reject the null hypothesis when the treated groups are (large) small relative to the control groups. This happens because larger groups tend to have lower variance, generating heteroskedasticity in the group x time aggregate DID model. We provide evidence from Monte Carlo simulations and from placebo DID regressions with the American Community Survey (ACS) and the Current Population Survey (CPS) datasets to show that this problem is relevant even in datasets with large numbers of observations per group. We then derive an alternative inference method that provides accurate hypothesis testing in situations where there are few treated groups (or even just one) and many control groups in the presence of heteroskedasticity. Our method assumes that we can model the heteroskedasticity of a linear combination of the errors. We show that this assumption can be satisfied without imposing strong assumptions on the errors in common DID applications. With many pre-treatment periods, we show that this assumption can be relaxed. Instead, we provide an alternative inference method that relies on strict stationarity and ergodicity of the time series. Finally, we consider two recent alternatives to DID when there are many pre-treatment periods. We extend our inference methods to linear factor models when there are few treated groups. We also derive conditions under which a permutation test for the synthetic control estimator proposed by Abadie et al. (2010) is robust to heteroskedasticity and propose a modification on the test statistic that provided a better heteroskedasticity correction in our simulations.
Resumo:
Differences-in-Differences (DID) is one of the most widely used identification strategies in applied economics. However, how to draw inferences in DID models when there are few treated groups remains an open question. We show that the usual inference methods used in DID models might not perform well when there are few treated groups and errors are heteroskedastic. In particular, we show that when there is variation in the number of observations per group, inference methods designed to work when there are few treated groups tend to (under-) over-reject the null hypothesis when the treated groups are (large) small relative to the control groups. This happens because larger groups tend to have lower variance, generating heteroskedasticity in the group x time aggregate DID model. We provide evidence from Monte Carlo simulations and from placebo DID regressions with the American Community Survey (ACS) and the Current Population Survey (CPS) datasets to show that this problem is relevant even in datasets with large numbers of observations per group. We then derive an alternative inference method that provides accurate hypothesis testing in situations where there are few treated groups (or even just one) and many control groups in the presence of heteroskedasticity. Our method assumes that we know how the heteroskedasticity is generated, which is the case when it is generated by variation in the number of observations per group. With many pre-treatment periods, we show that this assumption can be relaxed. Instead, we provide an alternative application of our method that relies on assumptions about stationarity and convergence of the moments of the time series. Finally, we consider two recent alternatives to DID when there are many pre-treatment groups. We extend our inference method to linear factor models when there are few treated groups. We also propose a permutation test for the synthetic control estimator that provided a better heteroskedasticity correction in our simulations than the test suggested by Abadie et al. (2010).
Resumo:
The maternal and paternal genetic profile of Guineans is markedly sub-Saharan West African, with the majority of lineages belonging to L0-L3 mtDNA sub-clusters and E3a-M2 and E1-M33 Y chromosome haplogroups. Despite the sociocultural differences among Guinea-Bissau ethnic groups,marked by the supposedly strict admixture barriers, their genetic pool remains largely common. Their extant variation coalesces at distinct timeframes, from the initial occupation of the area to later inputs of people. Signs of recent expansion in mtDNA haplogroups L2a-L2c and NRY E3a-M2 suggest population growth in the equatorial western fringe, possibly supported by an early local agricultural centre, and to which the Mandenka and the Balanta people may relate. Non-West African signatures are traceable in less frequent extant haplogroups, fitting well with the linguistic and historical evidence regarding particular ethnic groups: the Papel and Felupe-Djola people retain traces of their putative East African relatives; U6 and M1b among Guinea-Bissau Bak-speakers indicate partial diffusion to Sahel of North African lineages; U5b1b lineages in Fulbe and Papel represent a link to North African Berbers, emphasizing the great importance of post-glacial expansions; exact matches of R1b-P25 and E3b1-M78 with Europeans likely trace back to the times of the slave trade.
Resumo:
Andryala (Asteraceae: Cichorieae) is a little-known Mediterranean-Macaronesian genus whose taxonomy is much in need of revision. The aim of the present biosystematic study was to elucidate species relationships within this genus based on morphological and molecular data. In this study several taxa are recognised: 17 species, 14 subspecies, and 3 hybrids. Among these, 5 species are Macaronesian endemics (A. glandulosa, A. sparsiflora, A. crithmifolia Aiton, A. pinnatifida, and A. perezii), 4 species are Northwest African endemics (A. mogadorensis, A. maroccana, A. chevallieri, and A. nigricans) and one species is endemic to Romania (A. laevitomentosa). Historical background regarding taxonomic delimitation in the genus is addressed from Linnaean to present day concepts, as well as the origin of the name Andryala. The origin of Asteraceae and the systematic position of Andryala is shortly summarised. The morphological study was based on a bibliographic review and the revision of 1066 specimens of 13 herbaria as well as additional material collected during fieldwork. The variability of the morphological characters of the genus, including both vegetative taxonomic characters (root, stem, leaf and indumentum characters) and reproductive ones (inflorescence, floret, fruit and pappus characters), is assessed. Numerical analysis of the morphological data was performed using different similarity or dissimilarity measures and coefficients, as well as ordination and clustering methods. Results support the segregation of the recognised taxa and the congruence of the several analyses in the separation of the recognised taxa (using quantitative, binary or multi-state characters). The proposed taxonomy for Andryala includes a new infra-generic classification, new taxa and new combinations and ranks, typifications and diagnostic keys (one for the species and several for subspecies). For each taxon a list of synonyms, typification comments and a detailed description are provided, just as comments on taxonomy and nomenclature, and a brief discussion on karyology. Additionally, information on ecology and conservation status as well as on distribution and a list of studied material are also presented. Phylogenetic analyses based on different nuclear and chloroplast DNA markers, using Bayesian and maximum parsimony methods of inference, were performed. Results support three main lineages: separate ones for the relict species A. agardhii and A. laevitomentosa and a third including the majority of the Andryala species that underwent a relatively rapid and recent speciation. They also suggest a single colonization event of Madeira and the Canary Islands from the Mediterranean region, followed by insular speciation. Biogeography and speciation within the genus are briefly discussed, including a proposal for the centre of origin of the genus and possible dispersal routes.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)