91 results for MAXIMUM LIKELIHOOD ESTIMATOR
Abstract:
The DNA of three biological variants, G1, Ic and G2, which originated from the same greenhouse isolate of rice tungro bacilliform virus (RTBV) at the International Rice Research Institute (IRRI), was cloned and sequenced. Comparison of the sequences revealed small differences in genome sizes. The variants were between 95 and 99% identical at the nucleotide and amino acid levels. Alignment of the three genome sequences with those of three published RTBV sequences (Phi-1, Phi-2 and Phi-3) revealed numerous nucleotide substitutions and some insertions and deletions. The published RTBV sequences originated from the same greenhouse isolate at IRRI 20, 11 and 9 years ago. All open reading frames (ORFs) and known functional domains were conserved across the six variants. The cysteine-rich region of ORF3 showed the greatest variation. When the six DNA sequences from IRRI were compared with that of an isolate from Malaysia (Serdang), similar changes were observed in the cysteine-rich region in addition to other nucleotide substitutions and deletions across the genome. The aligned nucleotide sequences of the IRRI variants and Serdang were used to analyse phylogenetic relationships by the bootstrapped parsimony, distance and maximum-likelihood methods. The isolates clustered in three groups: Serdang alone; Ic and G1; and Phi-1, Phi-2, Phi-3 and G2. The distribution of phylogenetically informative residues in the IRRI sequences shared with the Serdang sequence and the differing tree topologies for segments of the genome suggested that recombination, as well as substitutions and insertions or deletions, has played a role in the evolution of RTBV variants. The significance and implications of these evolutionary forces are discussed in comparison with badnaviruses and caulimoviruses.
Abstract:
Maximum-likelihood estimates of the parameters of stochastic differential equations are consistent and asymptotically efficient, but unfortunately difficult to obtain if a closed-form expression for the transitional probability density function of the process is not available. As a result, a large number of competing estimation procedures have been proposed. This article provides a critical evaluation of the various estimation techniques. Special attention is given to the ease of implementation and comparative performance of the procedures when estimating the parameters of the Cox–Ingersoll–Ross and Ornstein–Uhlenbeck equations respectively.
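As an illustration of the favourable case in which the transitional density is available in closed form, the following is a minimal sketch of exact maximum likelihood for the Ornstein–Uhlenbeck equation dX = theta (mu - X) dt + sigma dW observed at equally spaced times. It is not taken from the article: the function names, the parameterisation and the simulated data are assumptions made for the example. The likelihood is conditional on the first observation, in which case the Gaussian transition reduces the problem to a least-squares autoregression whose coefficients map back to (theta, mu, sigma).

```python
import math
import random


def simulate_ou(theta, mu, sigma, x0, dt, n, seed=0):
    """Simulate an OU path using its exact Gaussian transition density."""
    rng = random.Random(seed)
    b = math.exp(-theta * dt)                                # one-step autoregression slope
    sd = sigma * math.sqrt((1.0 - b * b) / (2.0 * theta))    # one-step conditional std dev
    xs = [x0]
    for _ in range(n):
        xs.append(mu + (xs[-1] - mu) * b + sd * rng.gauss(0.0, 1.0))
    return xs


def fit_ou_mle(xs, dt):
    """Conditional MLE of (theta, mu, sigma) from equally spaced observations.

    The exact transition is X[t+dt] | X[t] ~ N(a + b X[t], v) with
    b = exp(-theta dt), a = mu (1 - b), v = sigma^2 (1 - b^2) / (2 theta),
    so the MLE of (a, b, v) is ordinary least squares with v = RSS / n.
    """
    x, y = xs[:-1], xs[1:]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b = sxy / sxx
    a = mean_y - b * mean_x
    if not 0.0 < b < 1.0:
        raise ValueError("slope outside (0, 1): no mean-reverting fit exists")
    v = sum((yi - a - b * xi) ** 2 for xi, yi in zip(x, y)) / n
    theta = -math.log(b) / dt
    mu = a / (1.0 - b)
    sigma = math.sqrt(v * 2.0 * theta / (1.0 - b * b))
    return theta, mu, sigma


if __name__ == "__main__":
    path = simulate_ou(theta=2.0, mu=1.0, sigma=0.5, x0=1.0, dt=0.1, n=20000, seed=1)
    print(fit_ou_mle(path, dt=0.1))
```

For the Cox–Ingersoll–Ross equation the transition density is a scaled noncentral chi-square, so the same idea applies but the likelihood has to be maximised numerically rather than in closed form.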
Abstract:
This paper discusses the statistical analyses used to derive bridge live load models for Hong Kong from 10 years of weigh-in-motion (WIM) data. The statistical concepts required and the terminology adopted in the development of bridge live load models are introduced. The paper includes studies of representative vehicles drawn from the large amount of WIM data in Hong Kong. Different load-affecting parameters, such as gross vehicle weights, axle weights, axle spacings and average daily number of trucks, are first analyzed by various stochastic processes in order to obtain the mathematical distributions of these parameters. As a prerequisite to determining accurate bridge design loadings in Hong Kong, this study not only takes advantage of code formulation methods used internationally but also presents a new method for modelling collected WIM data using a statistical approach.
Abstract:
Smut fungi are important pathogens of grasses, including the cultivated crops maize, sorghum and sugarcane. Typically, smut fungi infect the inflorescence of their host plants. Three genera of smut fungi (Ustilago, Sporisorium and Macalpinomyces) form a complex with overlapping morphological characters, making species placement problematic. For example, the newly described Macalpinomyces mackinlayi possesses a combination of morphological characters such that it cannot be unambiguously accommodated in any of the three genera. Previous attempts to define Ustilago, Sporisorium and Macalpinomyces using morphology and molecular phylogenetics have highlighted the polyphyletic nature of the genera, but have failed to produce a satisfactory taxonomic resolution. A detailed systematic study of 137 smut species in the Ustilago-Sporisorium-Macalpinomyces complex was completed in the current work. Morphological and DNA sequence data from five loci were assessed with maximum likelihood and Bayesian inference to reconstruct a phylogeny of the complex. The phylogenetic hypotheses generated were used to identify morphological synapomorphies, some of which had previously been dismissed as a useful way to delimit the complex. These synapomorphic characters are the basis for a revised taxonomic classification of the Ustilago-Sporisorium-Macalpinomyces complex, which takes into account their morphological diversity and coevolution with their grass hosts. The new classification is based on a redescription of the type genus Sporisorium, and the establishment of four genera, described from newly recognised monophyletic groups, to accommodate species expelled from Sporisorium. Over 150 taxonomic combinations have been proposed as an outcome of this investigation, which makes a rigorous and objective contribution to the fungal systematics of these important plant pathogens.
Abstract:
This paper presents an approach to building an observation likelihood function from a set of sparse, noisy training observations taken from known locations by a sensor with no obvious geometric model. The basic approach is to fit an interpolant to the training data, representing the expected observation, and to assume additive sensor noise. This paper takes a Bayesian view of the problem, maintaining a posterior over interpolants rather than simply the maximum-likelihood interpolant, giving a measure of uncertainty in the map at any point. This is done using a Gaussian process framework. To validate the approach experimentally, a model of an environment is built using observations from an omni-directional camera. After a model has been built from the training data, a particle filter is used to localise while traversing this environment.
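The idea of keeping a posterior over interpolants can be sketched with a small Gaussian process regression. This is only an illustration under stated assumptions, not the paper's implementation: locations are taken to be one-dimensional, the kernel is a squared-exponential with hand-picked hyperparameters, and all function names are hypothetical. The observation likelihood at a query location is Gaussian, with the posterior mean as the expected observation and the posterior variance plus the sensor noise variance as its spread.

```python
import math


def rbf(a, b, length=1.0, signal_var=1.0):
    """Squared-exponential kernel (an assumed choice for this sketch)."""
    return signal_var * math.exp(-0.5 * ((a - b) / length) ** 2)


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def gp_predict(xs, ys, x_star, noise_var=0.01):
    """Posterior mean and variance of the interpolant at x_star."""
    K = [[rbf(a, b) + (noise_var if i == j else 0.0)
          for j, b in enumerate(xs)] for i, a in enumerate(xs)]
    k_star = [rbf(a, x_star) for a in xs]
    alpha = solve(K, ys)          # (K + noise I)^-1 y
    v = solve(K, k_star)          # (K + noise I)^-1 k_star
    mean = sum(k * a for k, a in zip(k_star, alpha))
    var = rbf(x_star, x_star) - sum(k * vi for k, vi in zip(k_star, v))
    return mean, max(var, 0.0)


def log_likelihood(z, xs, ys, x_star, noise_var=0.01):
    """Log density of observing z at x_star under the GP map plus sensor noise."""
    mean, var = gp_predict(xs, ys, x_star, noise_var)
    total = var + noise_var
    return -0.5 * (math.log(2.0 * math.pi * total) + (z - mean) ** 2 / total)


if __name__ == "__main__":
    train_x = [0.0, 1.0, 2.0, 3.0]
    train_y = [0.0, 0.8, 0.9, 0.1]
    print(gp_predict(train_x, train_y, 1.5))
    print(log_likelihood(0.9, train_x, train_y, 1.5))
```

In a particle filter, each particle's weight would be updated with such a likelihood evaluated at the particle's hypothesised location; far from any training data the variance grows back to the prior, so observations there are only weakly informative.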
Abstract:
Sequence data often have competing signals that are detected by network programs or Lento plots. Such data can be formed by generating sequences on more than one tree, and combining the results, a mixture model. We report that with such mixture models, the estimates of edge (branch) lengths from maximum likelihood (ML) methods that assume a single tree are biased. Based on the observed number of competing signals in real data, such a bias of ML is expected to occur frequently. Because network methods can recover competing signals more accurately, there is a need for ML methods allowing a network. A fundamental problem is that mixture models can have more parameters than can be recovered from the data, so that some mixtures are not, in principle, identifiable. We recommend that network programs be incorporated into best practice analysis, along with ML and Bayesian trees.
Abstract:
Despite recent methodological advances in inferring the time-scale of biological evolution from molecular data, the fundamental question of whether our substitution models are sufficiently well specified to accurately estimate branch-lengths has received little attention. I examine this implicit assumption of all molecular dating methods, on a vertebrate mitochondrial protein-coding dataset. Comparison with analyses in which the data are RY-coded (AG → R; CT → Y) suggests that even rates-across-sites maximum likelihood greatly under-compensates for multiple substitutions among the standard (ACGT) NT-coded data, which has been subject to greater phylogenetic signal erosion. Accordingly, the fossil record indicates that branch-lengths inferred from the NT-coded data translate into divergence time overestimates when calibrated from deeper in the tree. Intriguingly, RY-coding led to the opposite result. The underlying NT and RY substitution model misspecifications likely relate respectively to “hidden” rate heterogeneity and changes in substitution processes across the tree, for which I provide simulated examples. Given the magnitude of the inferred molecular dating errors, branch-length estimation biases may partly explain current conflicts with some palaeontological dating estimates.
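The RY-coding mentioned above (AG → R; CT → Y) can be sketched in a few lines. The helper names are hypothetical, and leaving gaps and ambiguity codes unchanged is an assumption of this example rather than something stated in the abstract. Recoding makes transitions (A↔G, C↔T) invisible, so only transversions contribute to observed differences.

```python
# Purines (A, G) become R; pyrimidines (C, T) become Y.
RY_MAP = {"A": "R", "G": "R", "C": "Y", "T": "Y"}


def ry_code(seq):
    """Recode a nucleotide sequence as purine/pyrimidine.

    Characters other than A, C, G, T (gaps, ambiguity codes) are
    passed through unchanged; that handling is an assumption here.
    """
    return "".join(RY_MAP.get(base, base) for base in seq.upper())


def hamming(a, b):
    """Number of differing sites between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(x != y for x, y in zip(a, b))


if __name__ == "__main__":
    s1, s2 = "ACGT", "GCAT"   # differ by two transitions (A<->G)
    print(hamming(s1, s2), hamming(ry_code(s1), ry_code(s2)))
```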
Abstract:
Butterfly long-wavelength (L) photopigments are interesting for comparative studies of adaptive evolution because of the tremendous phenotypic variation that exists in their wavelength of peak absorbance (lambda(max) value). Here we present a comprehensive survey of L photopigment variation by measuring lambda(max) in 12 nymphalid and 1 riodinid species using epi-microspectrophotometry. Together with previous data, we find that L photopigment lambda(max) varies from 510-565 nm in 22 nymphalids, with an even broader 505- to 600-nm range in riodinids. We then surveyed the L opsin genes for which lambda(max) values are available as well as from related taxa and found 2 instances of L opsin gene duplication within nymphalids, in Hermeuptychia hermes and Amathusia phidippus, and 1 instance within riodinids, in the metalmark butterfly Apodemia mormo. Using maximum parsimony and maximum likelihood ancestral state reconstructions to map the evolution of spectral shifts within the L photopigments of nymphalids, we estimate the ancestral pigment had a lambda(max) = 540 nm +/- 10 nm standard error and that blueshifts in wavelength have occurred at least 4 times within the family. We used ancestral state reconstructions to investigate the importance of several amino acid substitutions (Ile17Met, Ala64Ser, Asn70Ser, and Ser137Ala) previously shown to have evolved under positive selection that are correlated with blue spectral shifts. These reconstructions suggest that the Ala64Ser substitution has indeed occurred along the newly identified blueshifted L photopigment lineages. Substitutions at the other 3 sites may also be involved in the functional diversification of L photopigments. Our data strongly suggest that there are limits to the evolution of L photopigment spectral shifts among species with only one L opsin gene and that opsin gene duplication broadens the potential range of lambda(max) values.
Abstract:
Butterflies and primates are interesting for comparative color vision studies, because both have evolved middle- (M) and long-wavelength- (L) sensitive photopigments with overlapping absorbance spectrum maxima (lambda(max) values). Although positive selection is important for the maintenance of spectral variation within the primate pigments, it remains an open question whether it contributes similarly to the diversification of butterfly pigments. To examine this issue, we performed epimicrospectrophotometry on the eyes of five Limenitis butterfly species and found a 31-nm range of variation in the lambda(max) values of the L-sensitive photopigments (514-545 nm). We cloned partial Limenitis L opsin gene sequences and found a significant excess of replacement substitutions relative to polymorphisms among species. Mapping of these L photopigment lambda(max) values onto a phylogeny revealed two instances within Lepidoptera of convergently evolved L photopigment lineages whose lambda(max) values were blue-shifted. A codon-based maximum-likelihood analysis indicated that, associated with the two blue spectral shifts, four amino acid sites (Ile17Met, Ala64Ser, Asn70Ser, and Ser137Ala) have evolved substitutions in parallel and exhibit significant d(N)/d(S) >1. Homology modeling of the full-length Limenitis arthemis astyanax L opsin placed all four substitutions within the chromophore-binding pocket. Strikingly, the Ser137Ala substitution is in the same position as a site that in primates is responsible for a 5- to 7-nm blue spectral shift. Our data show that some of the same amino acid sites are under positive selection in the photopigments of both butterflies and primates, spanning an evolutionary distance >500 million years.
Abstract:
Beak and feather disease virus (BFDV), the causative agent of psittacine beak and feather disease (PBFD), infects psittaciformes worldwide. We provide an annotated sequence record of three full-length unique genomes of BFDV isolates from budgerigars (Melopsittacus undulatus) from a breeding farm in South Africa. The isolates share >99% nucleotide sequence identity with each other and ~96% nucleotide sequence identity to two recent isolates (Melopsittacus undulatus) from Thailand but only between 91.6 and 86.6% identity with all other full-length BFDV sequences. Maximum-likelihood analysis and recombination analysis suggest that the South African budgerigar BFDV isolates are unique to budgerigars, are non-recombinant in origin, and represent a new genotype of BFDV. © 2010 Springer-Verlag.
Abstract:
Although the relationship between socioeconomic status (SES) and health is well documented for developed countries, less evidence has been presented for developing countries. The aim of this paper is to analyse this relationship at the household level for Fiji, a developing country in the South Pacific, using original household survey data. To allow for the endogeneity of SES in the household health production function, we utilize a simultaneous equation approach where estimates are obtained by full information maximum likelihood. By restricting our sample to one relatively small island, and including area and district hospital effects, physical geography effects are unpacked from income effects. We measure SES as permanent income, which is constructed using principal components analysis. An alternative specification considers transitory household income. We find that a 1% increase in wealth (our measure of permanent income) would lead to a 15% decrease in the probability of an incapacitating illness occurring intra-household. Although the presence of a strong relationship indicates that relatively small improvements in SES can significantly improve health at the household level, it is argued that the design of appropriate policy would also require an understanding of the various mechanisms through which the relationship operates.
Abstract:
In this paper we present substantial evidence for the existence of a bias in the distribution of births of leading US politicians in favor of those who were the oldest in their cohort at school. This “relative age effect” has been proven to influence performance at school and in sports, but evidence on its impact on people’s vocational success has been rare. We find a marked break in the density of politicians’ birthdates using a maximum likelihood test and McCrary’s (2008) nonparametric test. We conjecture that being relatively old in a peer group may create long-term advantages that can play a significant role in the ability to succeed in a highly competitive environment like the race for top political offices in the USA. The magnitude of the effect we estimate is larger than what most other studies on the relative age effect find for a broader (adult) population, but is in general in line with studies that look at populations in high-competition environments.