Biblioteca Digital

117 resultados para Maximum-likelihood-estimation

Evaluating the efficiency of fractional integration parameter estimators

Relevância:

90.00% 90.00%

Publicador:

Resumo:

This article deals with the efficiency of fractional integration parameter estimators. This study was based on Monte Carlo experiments involving simulated stochastic processes with integration orders in the range]-1,1[. The evaluated estimation methods were classified into two groups: heuristics and semiparametric/maximum likelihood (ML). The study revealed that the comparative efficiency of the estimators, measured by the lesser mean squared error, depends on the stationary/non-stationary and persistency/anti-persistency conditions of the series. The ML estimator was shown to be superior for stationary persistent processes; the wavelet spectrum-based estimators were better for non-stationary mean reversible and invertible anti-persistent processes; the weighted periodogram-based estimator was shown to be superior for non-invertible anti-persistent processes.

Evolution of Dendrocolaptes platyrostris (Aves: Furnariidae) between the South American open vegetation corridor and the Atlantic forest

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The open vegetation corridor of South America is a region dominated by savanna biomes. It contains forests (i.e. riverine forests) that may act as corridors for rainforest specialists between the open vegetation corridor and its neighbouring biomes (i.e. the Amazonian and Atlantic forests). A prediction for this scenario is that populations of rainforest specialists in the open vegetation corridor and in the forested biomes show no significant genetic divergence. We addressed this hypothesis by studying plumage and genetic variation of the Planalto woodcreeper Dendrocolaptes platyrostris Spix (1824) (Aves: Furnariidae), a forest specialist that occurs in both open habitat and in the Atlantic forest. The study questions were: (1) is there any evidence of genetic continuity between populations of the open habitat and the Atlantic forest and (2) is plumage variation congruent with patterns of neutral genetic structure or with ecological factors related to habitat type? We used cytochrome b and mitochondrial DNA control region sequences to show that D. platyrostris is monophyletic and presents substantial intraspecific differentiation. We found two areas of plumage stability: one associated with Cerrado and the other associated with southern Atlantic Forest. Multiple Mantel tests showed that most of the plumage variation followed the transition of habitats but not phylogeographical gaps, suggesting that selection may be related to the evolution of the plumage of the species. The results were not compatible with the idea that forest specialists in the open vegetation corridor and in the Atlantic forest are linked at the population level because birds from each region were not part of the same genetic unit. Divergence in the presence of gene flow across the ecotone between both regions might explain our results. Also, our findings indicate that the southern Atlantic forest may have been significantly affected by Pleistocene climatic alteration, although such events did not cause local extinction of most taxa, as occurred in other regions of the globe where forests were significantly affected by global glaciations. Finally, our results neither support plumage stability areas, nor subspecies as full species. (C) 2011 The Linnean Society of London, Biological Journal of the Linnean Society, 2011, 103, 801-820.

On the influence of imputation in classification: practical issues

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The substitution of missing values, also called imputation, is an important data preparation task for many domains. Ideally, the substitution of missing values should not insert biases into the dataset. This aspect has been usually assessed by some measures of the prediction capability of imputation methods. Such measures assume the simulation of missing entries for some attributes whose values are actually known. These artificially missing values are imputed and then compared with the original values. Although this evaluation is useful, it does not allow the influence of imputed values in the ultimate modelling task (e.g. in classification) to be inferred. We argue that imputation cannot be properly evaluated apart from the modelling task. Thus, alternative approaches are needed. This article elaborates on the influence of imputed values in classification. In particular, a practical procedure for estimating the inserted bias is described. As an additional contribution, we have used such a procedure to empirically illustrate the performance of three imputation methods (majority, naive Bayes and Bayesian networks) in three datasets. Three classifiers (decision tree, naive Bayes and nearest neighbours) have been used as modelling tools in our experiments. The achieved results illustrate a variety of situations that can take place in the data preparation practice.

Robust inference in an heteroscedastic measurement error model

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In this paper we deal with robust inference in heteroscedastic measurement error models Rather than the normal distribution we postulate a Student t distribution for the observed variables Maximum likelihood estimates are computed numerically Consistent estimation of the asymptotic covariance matrices of the maximum likelihood and generalized least squares estimators is also discussed Three test statistics are proposed for testing hypotheses of interest with the asymptotic chi-square distribution which guarantees correct asymptotic significance levels Results of simulations and an application to a real data set are also reported (C) 2009 The Korean Statistical Society Published by Elsevier B V All rights reserved

The complementary exponential geometric distribution: Model, properties, and a comparison with its counterpart

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In this paper, we proposed a new two-parameter lifetime distribution with increasing failure rate, the complementary exponential geometric distribution, which is complementary to the exponential geometric model proposed by Adamidis and Loukas (1998). The new distribution arises on a latent complementary risks scenario, in which the lifetime associated with a particular risk is not observable; rather, we observe only the maximum lifetime value among all risks. The properties of the proposed distribution are discussed, including a formal proof of its probability density function and explicit algebraic formulas for its reliability and failure rate functions, moments, including the mean and variance, variation coefficient, and modal value. The parameter estimation is based on the usual maximum likelihood approach. We report the results of a misspecification simulation study performed in order to assess the extent of misspecification errors when testing the exponential geometric distribution against our complementary one in the presence of different sample size and censoring percentage. The methodology is illustrated on four real datasets; we also make a comparison between both modeling approaches. (C) 2011 Elsevier B.V. All rights reserved.

Inference and local influence assessment in skew-normal null intercept measurement error model

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In this article, we discuss inferential aspects of the measurement error regression models with null intercepts when the unknown quantity x (latent variable) follows a skew normal distribution. We examine first the maximum-likelihood approach to estimation via the EM algorithm by exploring statistical properties of the model considered. Then, the marginal likelihood, the score function and the observed information matrix of the observed quantities are presented allowing direct inference implementation. In order to discuss some diagnostics techniques in this type of models, we derive the appropriate matrices to assessing the local influence on the parameter estimates under different perturbation schemes. The results and methods developed in this paper are illustrated considering part of a real data set used by Hadgu and Koch [1999, Application of generalized estimating equations to a dental randomized clinical trial. Journal of Biopharmaceutical Statistics, 9, 161-178].

Likelihood ratio tests for variance components in linear mixed models

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Although the asymptotic distributions of the likelihood ratio for testing hypotheses of null variance components in linear mixed models derived by Stram and Lee [1994. Variance components testing in longitudinal mixed effects model. Biometrics 50, 1171-1177] are valid, their proof is based on the work of Self and Liang [1987. Asymptotic properties of maximum likelihood estimators and likelihood tests under nonstandard conditions. J. Amer. Statist. Assoc. 82, 605-610] which requires identically distributed random variables, an assumption not always valid in longitudinal data problems. We use the less restrictive results of Vu and Zhou [1997. Generalization of likelihood ratio tests under nonstandard conditions. Ann. Statist. 25, 897-916] to prove that the proposed mixture of chi-squared distributions is the actual asymptotic distribution of such likelihood ratios used as test statistics for null variance components in models with one or two random effects. We also consider a limited simulation study to evaluate the appropriateness of the asymptotic distribution of such likelihood ratios in moderately sized samples. (C) 2008 Elsevier B.V. All rights reserved.

Influence diagnostics for linear models with first-order autoregressive elliptical errors

Relevância:

90.00% 90.00%

Publicador:

Resumo:

We introduce in this paper the class of linear models with first-order autoregressive elliptical errors. The score functions and the Fisher information matrices are derived for the parameters of interest and an iterative process is proposed for the parameter estimation. Some robustness aspects of the maximum likelihood estimates are discussed. The normal curvatures of local influence are also derived for some usual perturbation schemes whereas diagnostic graphics to assess the sensitivity of the maximum likelihood estimates are proposed. The methodology is applied to analyse the daily log excess return on the Microsoft whose empirical distributions appear to have AR(1) and heavy-tailed errors. (C) 2008 Elsevier B.V. All rights reserved.

Hypothesis testing in an errors-in-variables model with heteroscedastic measurement errors

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In many epidemiological studies it is common to resort to regression models relating incidence of a disease and its risk factors. The main goal of this paper is to consider inference on such models with error-prone observations and variances of the measurement errors changing across observations. We suppose that the observations follow a bivariate normal distribution and the measurement errors are normally distributed. Aggregate data allow the estimation of the error variances. Maximum likelihood estimates are computed numerically via the EM algorithm. Consistent estimation of the asymptotic variance of the maximum likelihood estimators is also discussed. Test statistics are proposed for testing hypotheses of interest. Further, we implement a simple graphical device that enables an assessment of the model`s goodness of fit. Results of simulations concerning the properties of the test statistics are reported. The approach is illustrated with data from the WHO MONICA Project on cardiovascular disease. Copyright (C) 2008 John Wiley & Sons, Ltd.

The log-bimodal-skew-normal model. A geochemical application

Relevância:

90.00% 90.00%

Publicador:

Resumo:

The main objective of this paper is to study a logarithm extension of the bimodal skew normal model introduced by Elal-Olivero et al. [1]. The model can then be seen as an alternative to the log-normal model typically used for fitting positive data. We study some basic properties such as the distribution function and moments, and discuss maximum likelihood for parameter estimation. We report results of an application to a real data set related to nickel concentration in soil samples. Model fitting comparison with several alternative models indicates that the model proposed presents the best fit and so it can be quite useful in real applications for chemical data on substance concentration. Copyright (C) 2011 John Wiley & Sons, Ltd.

DIAGNOSTIC TECHNIQUES OF LOCAL INFLUENCE IN SPATIAL ANALYSIS OF SOYBEAN YIELD

Relevância:

90.00% 90.00%

Publicador:

Resumo:

Modeling of spatial dependence structure, concerning geoestatistics approach, is an indispensable tool for fixing parameters that define this structure, applied on interpolation of values in places that are not sampled, by kriging techniques. However, the estimation of parameters can be greatly affected by the presence of atypical observations on sampled data. Thus, this trial aimed at using diagnostics techniques of local influence in spatial linear Gaussians models, applied at geoestatistics in order to evaluate sensitivity of maximum likelihood estimators and restrict maximum likelihood to small perturbations in these data. So, studies with simulated and experimental data were performed. Those results, obtained from the study of real data, allowed us to conclude that the presence of atypical values among the sampled data can have a strong influence on thematic maps, changing, therefore, the spatial dependence. The application of diagnostics techniques of local influence should be part of any geoestatistic analysis, ensuring that the information from thematic maps has better quality and can be used with greater security by farmers.

A compound class of Weibull and power series distributions

Relevância:

90.00% 90.00%

Publicador:

Resumo:

In this paper we introduce the Weibull power series (WPS) class of distributions which is obtained by compounding Weibull and power series distributions where the compounding procedure follows same way that was previously carried out by Adamidis and Loukas (1998) This new class of distributions has as a particular case the two-parameter exponential power series (EPS) class of distributions (Chahkandi and Gawk 2009) which contains several lifetime models such as exponential geometric (Adamidis and Loukas 1998) exponential Poisson (Kus 2007) and exponential logarithmic (Tahmasbi and Rezaei 2008) distributions The hazard function of our class can be increasing decreasing and upside down bathtub shaped among others while the hazard function of an EPS distribution is only decreasing We obtain several properties of the WPS distributions such as moments order statistics estimation by maximum likelihood and inference for a large sample Furthermore the EM algorithm is also used to determine the maximum likelihood estimates of the parameters and we discuss maximum entropy characterizations under suitable constraints Special distributions are studied in some detail Applications to two real data sets are given to show the flexibility and potentiality of the new class of distributions (C) 2010 Elsevier B V All rights reserved

Thaptomys Thomas 1915 (Rodentia, Sigmodontinae, Akodontini) with karyotypes 2n = 50, FN = 48, and 2n = 52, FN = 52: two monophyletic lineages recovered by molecular phylogeny

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A novel karyotype with 2n = 50, FN = 48, was described for specimens of Thaptomys collected at Una, State of Bahia, Brazil, which are morphologically indistinguishable from Thaptomys nigrita, 2n = 52, FN = 52, found in other localities. It was hence proposed that the 2n = 50 karyotype could belong to a distinct species, cryptic of Thaptomys nigrita, once chromosomal rearrangements observed, along with the geographic distance, might represent a reproductive barrier between both forms. Phylogenetic analyses using maximum parsimony and maximum likelihood based on partial cytochrome b sequences with 1077 bp were performed, attempting to establish the relationships among the individuals with distinct karyotypes along the geographic distribution of the genus; the sample comprised 18 karyotyped specimens of Thaptomys, encompassing 15 haplotypes, from eight different localities of the Atlantic Rainforest. The intra-generic relationships corroborated the distinct diploid numbers, once both phylogenetic reconstructions recovered two monophyletic lineages, a northeastern clade grouping the 2n = 50 and a southeastern clade with three subclades, grouping the 2n = 52 karyotype. The sequence divergence observed between their individuals ranged from 1.9% to 3.5%.

Genetic trend estimates of meat quality traits in a male broiler line

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The present research was conducted to estimate the genetic trends for meat quality traits in a male broiler line. The traits analyzed were initial pH, pH at 6 h after slaughter, final pH, initial range of falling pH, final range of falling pH, lightness, redness, yellowness, weep loss, drip loss, shrink loss, and shear force. The number of observations varied between 618 and 2125 for each trait. Genetic values were obtained by restricted maximum likelihood, and the numerator relationship matrix had 107,154 animals. The genetic trends were estimated by regression of the broiler average genetic values with respect to unit of time (generations), and the average genetic trend was estimated by regression coefficients. Generally, for the traits analyzed, small genetic trends were obtained, except for drip loss and shear force, which were higher. The small magnitude of the trends found could be a consequence of the absence of selection for meat quality traits in the line analyzed. The estimates of genetic trends obtained were an indication of an improvement in the meat quality traits in the line analyzed, except for drip loss.

Genomic islands in the pathogenic filamentous fungus Aspergillus fumigatus

Relevância:

80.00% 80.00%

Publicador:

Resumo:

We present the genome sequences of a new clinical isolate of the important human pathogen, Aspergillus fumigatus, A1163, and two closely related but rarely pathogenic species, Neosartorya fischeri NRRL181 and Aspergillus clavatus NRRL1. Comparative genomic analysis of A1163 with the recently sequenced A. fumigatus isolate Af293 has identified core, variable and up to 2% unique genes in each genome. While the core genes are 99.8% identical at the nucleotide level, identity for variable genes can be as low 40%. The most divergent loci appear to contain heterokaryon incompatibility ( het) genes associated with fungal programmed cell death such as developmental regulator rosA. Cross-species comparison has revealed that 8.5%, 13.5% and 12.6%, respectively, of A. fumigatus, N. fischeri and A. clavatus genes are species-specific. These genes are significantly smaller in size than core genes, contain fewer exons and exhibit a subtelomeric bias. Most of them cluster together in 13 chromosomal islands, which are enriched for pseudogenes, transposons and other repetitive elements. At least 20% of A. fumigatus-specific genes appear to be functional and involved in carbohydrate and chitin catabolism, transport, detoxification, secondary metabolism and other functions that may facilitate the adaptation to heterogeneous environments such as soil or a mammalian host. Contrary to what was suggested previously, their origin cannot be attributed to horizontal gene transfer ( HGT), but instead is likely to involve duplication, diversification and differential gene loss (DDL). The role of duplication in the origin of lineage-specific genes is further underlined by the discovery of genomic islands that seem to function as designated ""gene dumps'' and, perhaps, simultaneously, as "" gene factories''.

«
1
2
3
4
5
6
7
8
»