945 results for Maximum likelihood estimate
Abstract:
The 16S rRNA gene (16S rDNA) is currently the most widely used gene for estimating the evolutionary history of prokaryotes. To date, there are more than 30 000 16S rDNA sequences available from the core databases, GenBank, EMBL and DDBJ. This large number can pose a dilemma when composing datasets for phylogenetic analysis, since the choice and number of reference organisms are known to affect the resulting tree topology. A group of sequences appearing monophyletic in one dataset may not be so in another. This can be especially problematic when establishing the relationships of distantly related sequences at the division (phylum) level. In this study, a multiple-outgroup approach to resolving division-level phylogenetic relationships is suggested using 16S rDNA data. The approach is illustrated by two case studies concerning the monophyly of two recently proposed bacterial divisions, OP9 and OP10.
Abstract:
A hybrid zone between the grasshoppers Chorthippus brunneus and C. jacobsi (Orthoptera: Acrididae) in northern Spain has been analyzed for variation in morphology and ecology. These species are readily distinguished by the number of stridulatory pegs on the hind femur. Both sexes are fully winged and inhabit disturbed habitats throughout the study area. We develop a maximum-likelihood approach to fitting a two-dimensional cline to geographical variation in quantitative traits and for estimating associations of population mean with local habitat. This method reveals a cline in peg number approximately 30 km south of the Picos de Europa Mountains that shows substantial deviations in population mean compared with the expectations of simple tension zone models. The inclusion of variation in local vegetation in the model explains a significant proportion of the residual variation in peg number, indicating that habitat-genotype associations contribute to the observed spatial pattern. However, this association is weak, and a number of populations continue to show strong deviations in mean even after habitat is included in the final model. These outliers may be the result of long-distance colonization of sites distant from the cline center or may be due to a patchy pattern of initial contact during postglacial expansion. As well as contrasting with the smooth hybrid zones described for Chorthippus parallelus, this situation also contrasts with the mosaic hybrid zones observed in Gryllus crickets and in parts of the hybrid zone between Bombina toad species, where habitat-genotype associations account for substantial amounts of among-site variation.
Abstract:
We present a novel maximum-likelihood-based algorithm for estimating the distribution of alignment scores from the scores of unrelated sequences in a database search. Using a new method for measuring the accuracy of p-values, we show that our maximum-likelihood-based algorithm is more accurate than existing regression-based and lookup table methods. We explore a more sophisticated way of modeling and estimating the score distributions (using a two-component mixture model and expectation maximization), but conclude that this does not improve significantly over simply ignoring scores with small E-values during estimation. Finally, we measure the classification accuracy of p-values estimated in different ways and observe that inaccurate p-values can, somewhat paradoxically, lead to higher classification accuracy. We explain this paradox and argue that statistical accuracy, not classification accuracy, should be the primary criterion in comparisons of similarity search methods that return p-values that adjust for target sequence length.
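As a rough illustration of the kind of fit this abstract describes, the sketch below estimates a score distribution by maximum likelihood and converts a score into a p-value. It assumes the scores of unrelated sequences follow a Gumbel (extreme value) distribution, which the abstract does not state; the function names `fit_gumbel_mle` and `gumbel_p_value` are hypothetical and this is not the authors' algorithm. The scores must not all be equal.

```python
import math


def fit_gumbel_mle(scores):
    """Return (mu, beta) maximizing the Gumbel likelihood of the scores.

    Assumption: scores are i.i.d. Gumbel with location mu and scale beta.
    """
    n = len(scores)
    mean = sum(scores) / n
    x_min = min(scores)
    spread = max(scores) - x_min

    def weighted_mean(beta):
        # Mean of the scores weighted by exp(-x / beta). Shifting by x_min
        # keeps every exponent non-positive, so nothing overflows.
        w = [math.exp(-(x - x_min) / beta) for x in scores]
        return sum(wi * x for wi, x in zip(w, scores)) / sum(w)

    # The MLE of beta is the root of g(beta) = beta - mean + weighted_mean(beta).
    # g is increasing in beta, negative near 0 and positive at beta = spread,
    # so plain bisection finds the root.
    lo, hi = 1e-9 * spread, spread
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mid - mean + weighted_mean(mid) < 0:
            lo = mid
        else:
            hi = mid
    beta = 0.5 * (lo + hi)

    # Given beta, the MLE of mu has a closed form.
    log_mean_exp = math.log(
        sum(math.exp(-(x - x_min) / beta) for x in scores) / n
    )
    mu = x_min - beta * log_mean_exp
    return mu, beta


def gumbel_p_value(score, mu, beta):
    """P(S >= score) under the fitted Gumbel distribution."""
    return -math.expm1(-math.exp(-(score - mu) / beta))
```

The abstract's simpler variant, ignoring scores with small E-values during estimation, would correspond here to filtering `scores` before calling `fit_gumbel_mle`; how that threshold is chosen is not given in the abstract.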
Abstract:
Molecular evolution has been considered to be essentially a stochastic process, little influenced by the pace of phenotypic change. This assumption was challenged by a study that demonstrated an association between rates of morphological and molecular change estimated for total-evidence phylogenies, a finding that led some researchers to challenge molecular date estimates of major evolutionary radiations. Here we show that Omland's (1997) result is probably due to methodological bias, particularly phylogenetic nonindependence, rather than being indicative of an underlying evolutionary phenomenon. We apply three new methods specifically designed to overcome phylogenetic bias to 13 published phylogenetic datasets for vertebrate taxa, each of which includes both morphological characters and DNA sequence data. We find no evidence of an association between molecular and morphological rates of change.
Abstract:
The Tully Sugar Mill has collected information about sugarcane supplied for crushing from every block in the mill district from 1970 to 1999. Data from 1988 to 1999 were analysed to understand the extent of the variation in cane yield per hectare and commercial cane sugar in the Tully mill area. The key factors influencing the variation in cane yield and commercial cane sugar in this commercial environment were identified and the variance components computed using a restricted maximum likelihood methodology. Cane yield was predominantly influenced by the year in which it was harvested, the month when the crop was ratooned (month of harvest in the previous year) and the farm of origin. These variables were relatively more important than variety, age of crop or crop class (plant crop, first ratoon through to fourth or older ratoons) and fallowing practice (fallow or ploughout-replant). The month-of-ratooning effect was relatively stable from year-to-year. Commercial cane sugar was influenced by the year of harvest, the month of harvest and their interaction, in that the influence of the month of harvest varied from year to year. Variety and farm differences were also significant but accounted for a much lower portion of the variation in commercial cane sugar. An empirical model was constructed from the key factors that influenced commercial cane sugar and cane yield to quantify their combined influence on sugar yield (t/ha). This may be used to assist mill personnel to predict their activities more accurately, for example to calculate the impact of a late finish to the current harvest season on the following year's crop.
Abstract:
Objectives: The aims of this study were to investigate the population pharmacokinetics of tacrolimus in adult kidney transplant recipients and to identify factors that explain variability. Methods: Population analysis was performed on retrospective data from 70 patients who received oral tacrolimus twice daily. Morning blood trough concentrations were measured by liquid chromatography-tandem mass spectrometry. Maximum likelihood estimates were sought for apparent clearance (CL/F) and apparent volume of distribution (V/F), with the use of NONMEM (GloboMax LLC, Hanover, Md). Factors screened for influence on these parameters were weight, age, gender, postoperative day, days of tacrolimus therapy, liver function tests, creatinine clearance, hematocrit fraction, corticosteroid dose, and potential interacting drugs. Results: CL/F was greater in patients with abnormally low hematocrit fraction (data from 21 patients only), and it decreased with increasing days of therapy and AST concentrations (P
Abstract:
We focus on mixtures of factor analyzers from the perspective of a method for model-based density estimation from high-dimensional data, and hence for the clustering of such data. This approach enables a normal mixture model to be fitted to a sample of n data points of dimension p, where p is large relative to n. The number of free parameters is controlled through the dimension of the latent factor space. By working in this reduced space, it allows a model for each component-covariance matrix with complexity lying between that of the isotropic and full covariance structure models. We shall illustrate the use of mixtures of factor analyzers in a practical example that considers the clustering of cell lines on the basis of gene expressions from microarray experiments. (C) 2002 Elsevier Science B.V. All rights reserved.
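To make the complexity comparison concrete, the sketch below counts free covariance parameters per mixture component. It assumes the usual factor-analytic form, covariance = B Bᵀ + D with a p × q loading matrix B and a diagonal D, which the abstract does not spell out; the function name is hypothetical.

```python
def covariance_parameter_counts(p, q):
    """Free parameters in one component-covariance matrix of dimension p.

    Assumption: the factor-analytic covariance is B @ B.T + D, with B a
    p x q loading matrix and D diagonal; q*(q-1)/2 parameters are removed
    because B is only identified up to a rotation.
    """
    return {
        "isotropic": 1,                               # sigma^2 * I
        "factor_analytic": p * q + p - q * (q - 1) // 2,
        "full": p * (p + 1) // 2,                     # unrestricted symmetric
    }
```

For example, with p = 50 and q = 3 the factor-analytic model needs 197 parameters per component against 1275 for a full covariance matrix. The ordering isotropic < factor-analytic < full holds only when q is small relative to p.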
Abstract:
The extent to which density-dependent processes regulate natural populations is the subject of an ongoing debate. We contribute evidence to this debate showing that density-dependent processes influence the population dynamics of the ectoparasite Aponomma hydrosauri (Acari: Ixodidae), a tick species that infests reptiles in Australia. The first piece of evidence comes from an unusually long-term dataset on the distribution of ticks among individual hosts. If density-dependent processes are influencing either host mortality or vital rates of the parasite population, and those distributions can be approximated with negative binomial distributions, then general host-parasite models predict that the aggregation coefficient of the parasite distribution will increase with the average intensity of infections. We fit negative binomial distributions to the frequency distributions of ticks on hosts, and find that the estimated aggregation coefficient k increases with increasing average tick density. This pattern indirectly implies that one or more vital rates of the tick population must be changing with increasing tick density, because mortality rates of the tick's main host, the sleepy lizard, Tiliqua rugosa, are unaffected by changes in tick burdens. Our second piece of evidence is a re-analysis of experimental data on the attachment success of individual ticks to lizard hosts using generalized linear modelling. The probability of successful engorgement decreases with increasing numbers of ticks attached to a host. This is direct evidence of a density-dependent process that could lead to an increase in the aggregation coefficient of tick distributions described earlier. The population-scale increase in the aggregation coefficient is indirect evidence of a density-dependent process or processes sufficiently strong to produce a population-wide pattern, and thus also likely to influence population regulation. 
The direct observation of a density-dependent process is evidence of at least part of the responsible mechanism.
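The negative binomial fit mentioned in this abstract can be sketched as follows. This is a minimal illustration, not the authors' procedure: it assumes the mean/aggregation parameterization (mean m, aggregation coefficient k), takes the sample mean as the estimate of m, and maximizes the profile likelihood over k with a golden-section search. The function names and the search bounds for k are hypothetical choices.

```python
import math


def nb_loglik(counts, k, m):
    """Log-likelihood of tick counts under a negative binomial(m, k).

    counts maps "ticks on a host" -> "number of hosts with that many ticks".
    Assumes m > 0 and k > 0.
    """
    ll = 0.0
    for x, freq in counts.items():
        ll += freq * (
            math.lgamma(k + x) - math.lgamma(k) - math.lgamma(x + 1)
            + k * math.log(k / (k + m))
            + x * math.log(m / (k + m))
        )
    return ll


def fit_negative_binomial(counts, k_bounds=(1e-3, 1e3), iterations=80):
    """Return (k_hat, m_hat) for the frequency table in counts."""
    n_hosts = sum(counts.values())
    m_hat = sum(x * freq for x, freq in counts.items()) / n_hosts

    def objective(log_k):
        return nb_loglik(counts, math.exp(log_k), m_hat)

    # Golden-section search for the maximum over log(k).
    inv_phi = (math.sqrt(5) - 1) / 2
    a, b = math.log(k_bounds[0]), math.log(k_bounds[1])
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    fc, fd = objective(c), objective(d)
    for _ in range(iterations):
        if fc > fd:
            # The maximum lies in [a, d].
            b, d, fd = d, c, fc
            c = b - inv_phi * (b - a)
            fc = objective(c)
        else:
            # The maximum lies in [c, b].
            a, c, fc = c, d, fd
            d = a + inv_phi * (b - a)
            fd = objective(d)
    return math.exp(0.5 * (a + b)), m_hat
```

Fitting each survey period separately and comparing `k_hat` with `m_hat` across periods would mirror the comparison of aggregation coefficient and average tick density described above.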
Abstract:
This paper deals with an n-fold Weibull competing risk model. A characterisation of the WPP plot is given along with estimation of model parameters when modelling a given data set. These are illustrated through two examples. A study of the different possible shapes for the density and failure rate functions is also presented. (C) 2003 Elsevier Ltd. All rights reserved.
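The quantities named in this abstract can be written down compactly. The sketch below assumes the standard competing risk formulation, in which the system fails at the earliest of n independent two-parameter Weibull failure times, so the reliability is the product of the component reliabilities. The (shape, scale) parameter naming and the function names are illustrative assumptions, not the paper's notation.

```python
import math


def reliability(t, components):
    """R(t) for independent Weibull risks; components = [(shape, scale), ...]."""
    return math.exp(-sum((t / scale) ** shape for shape, scale in components))


def failure_rate(t, components):
    """Hazard h(t): competing risks add their individual Weibull hazards."""
    return sum(
        (shape / scale) * (t / scale) ** (shape - 1)
        for shape, scale in components
    )


def density(t, components):
    """Density f(t) = h(t) * R(t)."""
    return failure_rate(t, components) * reliability(t, components)


def wpp_point(t, components):
    """Weibull probability plot coordinates: x = ln t, y = ln(-ln R(t))."""
    cumulative_hazard = sum((t / scale) ** shape for shape, scale in components)
    return math.log(t), math.log(cumulative_hazard)
```

With a single component the WPP points fall on a straight line with slope equal to the shape parameter; with several components the curve bends between the asymptotes of the individual risks, which is the feature a WPP characterisation exploits when choosing starting values for parameter estimation.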
Abstract:
It has been suggested that twinning may influence handedness through the effects of birth order, intra-uterine crowding and mirror imaging. The influence of these effects on handedness (for writing and throwing) was examined in 3657 Monozygotic (MZ) and 3762 Dizygotic (DZ) twin pairs (born 1893-1992). Maximum likelihood analyses revealed no effects of birth order on the incidence of left-handedness. Twins were no more likely to be left-handed than their singleton siblings (n = 1757), and there were no differences between the DZ co-twin and sibling-twin covariances, suggesting that neither intra-uterine crowding nor the experience of being a twin affects handedness. There was no evidence of mirror imaging; the co-twin correlations of monochorionic and dichorionic MZ twins did not differ. Univariate genetic analyses revealed common environmental factors to be the most parsimonious explanation of familial aggregation for the writing-hand measure, while additive genetic influences provided a better interpretation of the throwing hand data.
Abstract:
Distributed generation, unlike centralized electrical generation, aims to generate electrical energy on a small scale as near as possible to load centers, interchanging electric power with the network. This work presents a probabilistic methodology conceived to assist electric system planning engineers in selecting the location of distributed generation, taking into account the hourly load changes or the daily load cycle. The hourly load centers, for each of the different hourly load scenarios, are calculated deterministically. These location points, properly weighted according to their load magnitude, are used to calculate the best-fit probability distribution. This distribution is used to determine the maximum likelihood perimeter of the area where each distributed generation point should preferably be located by the planning engineers. This takes into account, for example, the availability and cost of land lots, which are factors of special relevance in urban areas, as well as several obstacles important for the final selection of candidate distributed generation points. The proposed methodology has been applied to a real case, assuming three different bivariate probability distributions: the Gaussian distribution, a bivariate version of Freund’s exponential distribution and the Weibull probability distribution. The methodology algorithm has been programmed in MATLAB. Results are presented and discussed for the application of the methodology to a realistic case and demonstrate the ability of the proposed methodology to efficiently determine the best location of the distributed generation and the corresponding distribution networks.
Abstract:
The species abundance distribution (SAD) has been a central focus of community ecology for over fifty years, and is currently the subject of widespread renewed interest. The gambin model has recently been proposed as a model that provides a superior fit to commonly preferred SAD models. It has also been argued that the model's single parameter (α) presents a potentially informative ecological diversity metric, because it summarises the shape of the SAD in a single number. Despite this potential, few empirical tests of the model have been undertaken, perhaps because the necessary methods and software for fitting the model have not existed. Here, we derive a maximum likelihood method to fit the model, and use it to undertake a comprehensive comparative analysis of the fit of the gambin model. The functions and computational code to fit the model are incorporated in a newly developed free-to-download R package (gambin). We test the gambin model using a variety of datasets and compare the fit of the gambin model to fits obtained using the Poisson lognormal, logseries and zero-sum multinomial distributions. We found that gambin almost universally provided a better fit to the data and that the fit was consistent for a variety of sample grain sizes. We demonstrate how α can be used to differentiate intelligibly between community structures of Azorean arthropods sampled in different land use types. We conclude that gambin presents a flexible model capable of fitting a wide variety of observed SAD data, while providing a useful index of SAD form in its single fitted parameter. As such, gambin has wide potential applicability in the study of SADs, and ecology more generally.
Abstract:
Master's dissertation, Plant Biodiversity and Biotechnology, 17 March 2015, Universidade dos Açores.
Abstract:
OBJECTIVE: To evaluate the growth parameters in infants who were born to HIV-1-infected mothers. METHODS: The study was a longitudinal evaluation of the z-scores for the weight-for-age (WAZ), weight-for-length (WLZ) and length-for-age (LAZ) data collected from a cohort. A total of 97 non-infected and 33 HIV-infected infants born to HIV-1-infected mothers in Belo Horizonte, Southeastern Brazil, between 1995 and 2003 were studied. The average follow-up periods for the infected and non-infected children were 15.8 months (range: 6.8 to 18.0 months) and 14.3 months (range: 6.3 to 18.6 months), respectively. A mixed-effects linear regression model was used and was fitted by restricted maximum likelihood. RESULTS: There was an observed decrease over time in the WAZ, LAZ and WLZ among the infected infants. At six months of age, the mean differences in the WAZ, LAZ and WLZ between the HIV-infected and non-infected infants were 1.02, 0.59, and 0.63 standard deviations, respectively. At 12 months, the mean differences in the WAZ, LAZ and WLZ between the HIV-infected and non-infected infants were 1.15, 1.01, and 0.87 standard deviations, respectively. CONCLUSIONS: The early and increasing deterioration of the HIV-infected infants' anthropometric indicators demonstrates the importance of the early identification of HIV-infected infants who are at nutritional risk and the importance of the continuous assessment of nutritional interventions for these infants.