61 results for Maximum-likelihood-estimation


Relevance:

90.00%

Publisher:

Abstract:

In simultaneous analyses of multiple data partitions, the trees relevant when measuring support for a clade are the optimal tree and the best tree lacking the clade (i.e., the most reasonable alternative). The parsimony-based method of partitioned branch support (PBS) forces each data set to arbitrate between the two relevant trees. This value is the amount each data set contributes to clade support in the combined analysis, and can be very different from the support apparent in separate analyses. The approach used in PBS can also be employed in likelihood: a simultaneous analysis of all data retrieves the maximum likelihood tree, and the best tree without the clade of interest is also found. Each data set is fitted to the two trees and the log-likelihood difference calculated, giving partitioned likelihood support (PLS) for each data set. These calculations can be performed regardless of the complexity of the ML model adopted. The significance of PLS can be evaluated using a variety of resampling methods, such as the Kishino-Hasegawa test, the Shimodaira-Hasegawa test, or likelihood weights, although the appropriateness and assumptions of these tests remain debated.
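The PLS calculation described in this abstract reduces to a per-partition difference of log-likelihoods. A minimal sketch in Python, assuming the per-partition log-likelihoods on the two trees have already been obtained from a phylogenetics program; the function name, partition names and numbers are hypothetical and serve only as illustration.

```python
def partitioned_likelihood_support(lnl_ml_tree, lnl_alt_tree):
    """Return (PLS per partition, total support) for one clade.

    lnl_ml_tree:  partition name -> log-likelihood on the ML tree
    lnl_alt_tree: partition name -> log-likelihood on the best tree
                  lacking the clade of interest
    """
    if lnl_ml_tree.keys() != lnl_alt_tree.keys():
        raise ValueError("both trees must be scored on the same partitions")
    # PLS for a partition is its log-likelihood difference between the two trees.
    pls = {name: lnl_ml_tree[name] - lnl_alt_tree[name] for name in lnl_ml_tree}
    return pls, sum(pls.values())


# Hypothetical log-likelihoods, not taken from any study.
lnl_ml = {"mtDNA": -5230.4, "18S": -2110.9, "morphology": -310.2}
lnl_alt = {"mtDNA": -5241.0, "18S": -2108.5, "morphology": -312.0}

pls, total = partitioned_likelihood_support(lnl_ml, lnl_alt)
for name, value in pls.items():
    print(f"{name}: {value:+.1f}")
print(f"total: {total:+.1f}")
```

A negative value for a partition means that partition, taken alone, fits the alternative tree better, even though the combined data support the clade.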

Relevance:

90.00%

Publisher:

Abstract:

Mixture models implemented via the expectation-maximization (EM) algorithm are being increasingly used in a wide range of problems in pattern recognition such as image segmentation. However, the EM algorithm requires considerable computational time in its application to huge data sets such as a three-dimensional magnetic resonance (MR) image of over 10 million voxels. Recently, it was shown that a sparse, incremental version of the EM algorithm could improve its rate of convergence. In this paper, we show how this modified EM algorithm can be speeded up further by adopting a multiresolution kd-tree structure in performing the E-step. The proposed algorithm outperforms some other variants of the EM algorithm for segmenting MR images of the human brain. (C) 2004 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
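For orientation, the E-step and M-step that this abstract refers to can be sketched for a one-dimensional Gaussian mixture. This is the standard EM algorithm only, not the sparse, incremental or kd-tree variant the paper proposes; the function name, the quantile-based initialisation and the simulated data are assumptions made for illustration.

```python
import math
import random


def em_gaussian_mixture(x, k=2, n_iter=50):
    """Fit a k-component 1-D Gaussian mixture with plain EM."""
    n = len(x)
    xs = sorted(x)
    # Assumed heuristic: start the means at evenly spaced quantiles.
    means = [xs[int((j + 0.5) * n / k)] for j in range(k)]
    mean_all = sum(x) / n
    var_all = sum((v - mean_all) ** 2 for v in x) / n
    variances = [var_all] * k
    weights = [1.0 / k] * k
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each point.
        resp = []
        for v in x:
            dens = [
                weights[j]
                * math.exp(-0.5 * (v - means[j]) ** 2 / variances[j])
                / math.sqrt(2 * math.pi * variances[j])
                for j in range(k)
            ]
            total = sum(dens)
            resp.append([d / total for d in dens])
        # M-step: re-estimate weights, means and variances from the posteriors.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            weights[j] = nj / n
            means[j] = sum(r[j] * v for r, v in zip(resp, x)) / nj
            var_j = sum(r[j] * (v - means[j]) ** 2 for r, v in zip(resp, x)) / nj
            variances[j] = max(var_j, 1e-8)  # guard against collapse
    return weights, means, variances


# Simulated intensities from two well-separated classes (illustrative only).
random.seed(0)
data = [random.gauss(0.0, 1.0) for _ in range(300)]
data += [random.gauss(10.0, 1.0) for _ in range(300)]
weights, means, variances = em_gaussian_mixture(data)
print(weights, means, variances)
```

Every pass of the E-step visits every data point, which is the cost that grows with the number of voxels and that the variants discussed in the abstract aim to reduce.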

Relevance:

90.00%

Publisher:

Abstract:

The purpose of this work was to model lung cancer mortality as a function of past exposure to tobacco and to forecast age-sex-specific lung cancer mortality rates. A 3-factor age-period-cohort (APC) model, in which the period variable is replaced by the product of average tar content and adult tobacco consumption per capita, was estimated for the US, UK, Canada and Australia by the maximum likelihood method. Age- and sex-specific tobacco consumption was estimated from historical data on smoking prevalence and total tobacco consumption. Lung cancer mortality was derived from vital registration records. Future tobacco consumption, tar content and the cohort parameter were projected by autoregressive integrated moving average (ARIMA) estimation. The optimal exposure variable was found to be the product of average tar content and adult cigarette consumption per capita, lagged for 25-30 years for both males and females in all 4 countries. The coefficient of the product of average tar content and tobacco consumption per capita differs by age and sex. In all models, there was a statistically significant difference in the coefficient of the period variable by sex. In all countries, male age-standardized lung cancer mortality rates peaked in the 1980s and declined thereafter. Female mortality rates are projected to peak in the first decade of this century. The multiplicative models of age, tobacco exposure and cohort fit the observed data between 1950 and 1999 reasonably well, and time-series models yield plausible past trends of relevant variables. Despite a significant reduction in tobacco consumption and average tar content of cigarettes sold over the past few decades, the effect on lung cancer mortality is affected by the time lag between exposure and established disease. As a result, the burden of lung cancer among females is only just reaching, or soon will reach, its peak but has been declining for 1 to 2 decades in men. Future sex differences in lung cancer mortality are likely to be greater in North America than Australia and the UK due to differences in exposure patterns between the sexes. (c) 2005 Wiley-Liss, Inc.

Relevance:

80.00%

Publisher:

Abstract:

Marine invertebrate sperm proteins are particularly interesting because they are characterized by positive selection and are likely to be involved in prezygotic isolation and, thus, speciation. Here, we present the first survey of inter- and intraspecific variation of a bivalve sperm protein among a group of species that regularly hybridize in nature. M7 lysin is found in sperm acrosomes of mussels and dissolves the egg vitelline coat, permitting fertilization. We sequenced multiple alleles of the mature protein-coding region of M7 lysin from allopatric populations of mussels in the Mytilus edulis species group (M. edulis, M. galloprovincialis, and M. trossulus). A significant McDonald-Kreitman test showed an excess of fixed amino acid-replacing substitutions between species, consistent with positive selection. In addition, Kolmogorov-Smirnov tests showed significant heterogeneity in polymorphism-to-divergence ratios for both synonymous variation and combined synonymous and non-synonymous variation within M. galloprovincialis. These results indicate that there has been adaptive evolution at M7 lysin and, furthermore, show that positive selection on sperm proteins can occur even when post-zygotic reproductive isolation is incomplete.

Relevance:

80.00%

Publisher:

Abstract:

The generalized Gibbs sampler (GGS) is a recently developed Markov chain Monte Carlo (MCMC) technique that enables Gibbs-like sampling of state spaces that lack a convenient representation in terms of a fixed coordinate system. This paper describes a new sampler, called the tree sampler, which uses the GGS to sample from a state space consisting of phylogenetic trees. The tree sampler is useful for a wide range of phylogenetic applications, including Bayesian, maximum likelihood, and maximum parsimony methods. A fast new algorithm to search for a maximum parsimony phylogeny is presented, using the tree sampler in the context of simulated annealing. The mathematics underlying the algorithm is explained and its time complexity is analyzed. The method is tested on two large data sets consisting of 123 sequences and 500 sequences, respectively. The new algorithm is shown to compare very favorably in terms of speed and accuracy to the program DNAPARS from the PHYLIP package.

Relevance:

80.00%

Publisher:

Abstract:

Most populations and some species of ticks of the genera Boophilus (5 spp.) and Rhipicephalus (ca. 75 spp.) cannot be distinguished phenotypically. Moreover, there is doubt about the validity of species in these genera. I studied the entire second internal transcribed spacer (ITS 2) rRNA of 16 populations of rhipicephaline ticks to address these problems: Boophilus microplus from Australia, Kenya, South Africa and Brazil (4 populations); Boophilus decoloratus from Kenya; Rhipicephalus appendiculatus from Kenya, Zimbabwe and Zambia (7 populations); Rhipicephalus zambeziensis from Zimbabwe (3 populations); and Rhipicephalus evertsi from Kenya. Each of the 16 populations had a unique ITS 2, but most of the nucleotide variation occurred among species and genera. ITS 2 rRNA can be used to distinguish the populations and species of Boophilus and Rhipicephalus studied here. Little support was found for the hypothesis that B. microplus from Australia and South Africa are different species. ITS 2 appears useful for phylogenetic inference in the Rhipicephalinae because in genetic distance, maximum likelihood, and maximum parsimony analyses, most branches leading to species had >95% bootstrap support. Rhipicephalus appendiculatus and R. zambeziensis are closely related, yet their ITS 2 sequences could be distinguished unambiguously. This lends weight to a previous proposal that Rhipicephalus sanguineus and Rhipicephalus turanicus, and Rhipicephalus pumilio and Rhipicephalus camicasi, respectively, are conspecific, because each of these pairs of species had identical sequences for ca. 250 bp of ITS 2 rRNA.

Relevance:

80.00%

Publisher:

Abstract:

A mixture model for long-term survivors has been adopted in various fields such as biostatistics and criminology where some individuals may never experience the type of failure under study. It is directly applicable in situations where the only information available from follow-up on individuals who will never experience this type of failure is in the form of censored observations. In this paper, we consider a modification to the model so that it still applies in the case where during the follow-up period it becomes known that an individual will never experience failure from the cause of interest. Unless a model allows for this additional information, a consistent survival analysis will not be obtained. A partial maximum likelihood (ML) approach is proposed that preserves the simplicity of the long-term survival mixture model and provides consistent estimators of the quantities of interest. Some simulation experiments are performed to assess the efficiency of the partial ML approach relative to the full ML approach for survival in the presence of competing risks.
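As background for the model named in this abstract, the standard long-term survivor mixture model can be written as follows. This is a sketch of the usual textbook form under assumed notation (\(\pi\) for the probability of being susceptible, \(f_0\) and \(S_0\) for the density and survival function of susceptibles, \(\delta_i\) for the failure indicator); none of these symbols come from the abstract, and the paper's modification and its partial ML estimator are not reproduced here.

```latex
% Population survival function: a fraction (1 - \pi) never fails.
S(t) = (1 - \pi) + \pi\, S_0(t)

% Likelihood for observations (t_i, \delta_i), i = 1, \dots, n,
% where \delta_i = 1 for an observed failure and \delta_i = 0 for censoring:
L(\pi, \theta) = \prod_{i=1}^{n}
  \bigl[\pi\, f_0(t_i; \theta)\bigr]^{\delta_i}
  \bigl[(1 - \pi) + \pi\, S_0(t_i; \theta)\bigr]^{1 - \delta_i}
```

In this standard form a censored observation cannot distinguish a long-term survivor from a susceptible individual who has not yet failed, which is why information gained during follow-up that an individual will never fail calls for the kind of modification the abstract describes.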

Relevance:

80.00%

Publisher:

Abstract:

Hemichordates were traditionally allied to the chordates, but recent molecular analyses have suggested that hemichordates are a sister group to the echinoderms, a relationship that has important consequences for the interpretation of the evolution of deuterostome body plans. However, the molecular phylogenetic analyses to date have not provided robust support for the hemichordate + echinoderm clade. We use a maximum likelihood framework, including the parametric bootstrap, to reanalyze DNA data from complete mitochondrial genomes and nuclear 18S rRNA. This approach provides the first statistically significant support for the hemichordate + echinoderm clade from molecular data. This grouping implies that the ancestral deuterostome had features that included an adult with a pharynx and a dorsal nerve cord and an indirectly developing dipleurula-like larva.

Relevance:

80.00%

Publisher:

Abstract:

Objective: To measure prevalence and model incidence of HIV infection. Setting: 2013 consecutive pregnant women attending public sector antenatal clinics in 1997 in Hlabisa health district, South Africa. Historical seroprevalence data, 1992-1995. Methods: Serum remaining from syphilis testing was tested anonymously for antibodies to HIV to determine seroprevalence. Two models, allowing for differential mortality between HIV-positive and HIV-negative people, were used. The first used serial seroprevalence data to estimate trends in annual incidence. The second, a maximum likelihood model, took account of changing force of infection and age-dependent risk of infection, to estimate age-specific HIV incidence in 1997. Multiple logistic regression provided adjusted odds ratios (OR) for risk factors for prevalent HIV infection. Results: Estimated annual HIV incidence increased from 4% in 1992/1993 to 10% in 1996/1997. In 1997, highest age-specific incidence was 16% among women aged between 20 and 24 years. In 1997, overall prevalence was 26% (95% confidence interval [CI], 24%-28%) and at 34% was highest among women aged between 20 and 24 years. Young age (<30 years; odds ratio [OR], 2.1; p = .001), unmarried status (OR 2.2; p = .001) and living in less remote parts of the district (OR 1.5; p = .002) were associated with HIV prevalence in univariate analysis. Associations were less strong in multivariate analysis. Partner's migration status was not associated with HIV infection. Substantial heterogeneity of HIV prevalence by clinic was observed (range 17%-31%; test for trend, p = .001). Conclusions: This community is experiencing an explosive HIV epidemic. Young, single women in the more developed parts of the district would form an appropriate cohort to test, and benefit from, interventions such as vaginal microbicides and HIV vaccines.

Relevance:

80.00%

Publisher:

Abstract:

We present a method of estimating HIV incidence rates in epidemic situations from data on age-specific prevalence and changes in the overall prevalence over time. The method is applied to women attending antenatal clinics in Hlabisa, a rural district of KwaZulu/Natal, South Africa, where transmission of HIV is overwhelmingly through heterosexual contact. A model which gives age-specific prevalence rates in the presence of a progressing epidemic is fitted to prevalence data for 1998 using maximum likelihood methods and used to derive the age-specific incidence. Error estimates are obtained using a Monte Carlo procedure. Although the method is quite general, some simplifying assumptions are made concerning the form of the risk function, and sensitivity analyses are performed to explore the importance of these assumptions. The analysis shows that in 1998 the annual incidence of infection per susceptible woman increased from 5.4 per cent (3.3-8.5 per cent; here and elsewhere ranges give 95 per cent confidence limits) at age 15 years to 24.5 per cent (20.6-29.1 per cent) at age 22 years and declined to 1.3 per cent (0.5-2.9 per cent) at age 50 years; standardized to a uniform age distribution, the overall incidence per susceptible woman aged 15 to 59 was 11.4 per cent (10.0-13.1 per cent); per woman in the population it was 8.4 per cent (7.3-9.5 per cent). Standardized to the age distribution of the female population the average incidence per woman was 9.6 per cent (8.4-11.0 per cent); standardized to the age distribution of women attending antenatal clinics, it was 11.3 per cent (9.8-13.3 per cent). The estimated incidence depends on the values used for the epidemic growth rate and the AIDS related mortality. To ensure that, for this population, errors in these two parameters change the age specific estimates of the annual incidence by less than the standard deviation of the estimates of the age specific incidence, the AIDS related mortality should be known to within +/-50 per cent and the epidemic growth rate to within +/-25 per cent, both of which conditions are met. In the absence of cohort studies to measure the incidence of HIV infection directly, useful estimates of the age-specific incidence can be obtained from cross-sectional, age-specific prevalence data and repeat cross-sectional data on the overall prevalence of HIV infection. Several assumptions were made because of the lack of data but sensitivity analyses show that they are unlikely to affect the overall estimates significantly. These estimates are important in assessing the magnitude of the public health problem, for designing vaccine trials and for evaluating the impact of interventions. Copyright (C) 2001 John Wiley & Sons, Ltd.

Relevance:

80.00%

Publisher:

Abstract:

Matrix population models, elasticity analysis and loop analysis can potentially provide powerful techniques for the analysis of life histories. Data from a capture-recapture study on a population of southern highland water skinks (Eulamprus tympanum) were used to construct a matrix population model. Errors in elasticities were calculated by using the parametric bootstrap technique. Elasticity and loop analyses were then conducted to identify the life history stages most important to fitness. The same techniques were used to investigate the relative importance of fast versus slow growth, and rapid versus delayed reproduction. Mature water skinks were long-lived, but there was high immature mortality. The most sensitive life history stage was the subadult stage. It is suggested that life history evolution in E. tympanum may be strongly affected by predation, particularly by birds. Because our population declined over the study, slow growth and delayed reproduction were the optimal life history strategies over this period. Although the techniques of evolutionary demography provide a powerful approach for the analysis of life histories, there are formidable logistical obstacles in gathering enough high-quality data for robust estimates of the critical parameters.
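The elasticity analysis mentioned in this abstract can be illustrated with a small stage-structured matrix. The sketch below uses the standard formula e_ij = (a_ij / lambda) * v_i * w_j / (v . w), where lambda is the dominant eigenvalue and w and v are the right and left eigenvectors; the three-stage matrix and all of its entries are hypothetical, not the skink data, and the parametric bootstrap used in the study for error estimates is not shown.

```python
def dominant_eigen(matrix, n_iter=1000):
    """Power iteration: dominant eigenvalue and eigenvector (summing to 1)."""
    n = len(matrix)
    vec = [1.0 / n] * n
    lam = 0.0
    for _ in range(n_iter):
        new = [sum(matrix[i][j] * vec[j] for j in range(n)) for i in range(n)]
        lam = sum(new)  # vec sums to 1, so this converges to the eigenvalue
        vec = [value / lam for value in new]
    return lam, vec


def elasticities(matrix):
    """Return (growth rate lambda, elasticity matrix) for a projection matrix."""
    n = len(matrix)
    lam, w = dominant_eigen(matrix)  # right eigenvector: stable stage distribution
    transpose = [[matrix[j][i] for j in range(n)] for i in range(n)]
    _, v = dominant_eigen(transpose)  # left eigenvector: reproductive values
    vw = sum(v[i] * w[i] for i in range(n))
    elast = [
        [matrix[i][j] * v[i] * w[j] / (lam * vw) for j in range(n)]
        for i in range(n)
    ]
    return lam, elast


# Hypothetical juvenile / subadult / adult projection matrix (illustrative only).
A = [
    [0.0, 0.0, 1.5],  # fecundity of adults
    [0.3, 0.4, 0.0],  # juvenile survival, subadult stasis
    [0.0, 0.3, 0.9],  # subadult maturation, adult survival
]
lam, elast = elasticities(A)
print(f"lambda = {lam:.4f}")
for row in elast:
    print(["%.3f" % e for e in row])
```

Elasticities sum to 1 across the matrix, so each entry can be read as the proportional contribution of that transition to the population growth rate.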

Relevance:

80.00%

Publisher:

Abstract:

The phylogeny of the Australian legume genus Daviesia was estimated using sequences of the internal transcribed spacers of nuclear ribosomal DNA. Partial congruence was found with previous analyses using morphology, including strong support for monophyly of the genus and for a sister group relationship between the clade D. anceps + D. pachyloma and the rest of the genus. A previously unplaced bird-pollinated species, D. epiphyllum, was well supported as sister to the only other bird-pollinated species in the genus, D. speciosa, indicating a single origin of bird pollination in their common ancestor. Other morphological groups within Daviesia were not supported and require reassessment. A strong and previously unreported sister clade of Daviesia consists of the two monotypic genera Erichsenia and Viminaria. These share phyllode-like leaves and indehiscent fruits. The evolutionary history of cord roots, which have anomalous secondary thickening, was explored using parsimony. Cord roots are limited to three separate clades but have a complex history involving a small number of gains (most likely 0-3) and losses (0-5). The anomalous structure of cord roots (adventitious vascular strands embedded in a parenchymatous matrix) may facilitate nutrient storage, and the roots may be contractile. Both functions may be related to a postfire resprouting adaptation. Alternatively, cord roots may be an adaptation to the low-nutrient lateritic soils of Western Australia. However, tests for association between root type, soil type, and growth habit were equivocal, depending on whether the variables were treated as phylogenetically dependent (insignificant) or independent (significant).

Relevance:

80.00%

Publisher:

Abstract:

Using the classical twin design, this study investigates the influence of genetic factors on the large phenotypic variance in inspection time (IT), and whether the well-established IT-IQ association can be explained by a common genetic factor. Three hundred ninety pairs of twins (184 monozygotic, MZ; 206 dizygotic, DZ) with a mean age of 16 years participated, and 49 pairs returned approximately 3 months later for retesting. As in many IT studies, the pi figure stimulus was used and IT was estimated from the cumulative normal ogive. IT ranged from 39.4 to 774.1 ms (159 +/- 110.1 ms) with faster ITs (by an average of 26.9 ms) found in the retest session, from which a reliability of .69 was estimated. Full-scale IQ (FIQ) was assessed by the Multidimensional Aptitude Battery (MAB) and ranged from 79 to 145 (111 +/- 13). The phenotypic association between IT and FIQ was confirmed (-.35) and bivariate results showed that a common genetic factor accounted for 36% of the variance in IT and 32% of the variance in FIQ. The maximum likelihood estimate of the genetic correlation was -.63. When performance and verbal IQ (PIQ & VIQ) were analysed with IT, a stronger phenotypic and genetic relationship was found between PIQ and IT than with VIQ. A large part of the IT variance (64%) was accounted for by a unique environmental factor. Further genetic factors were needed to explain the remaining variance in IQ, with a small component of unique environmental variance present. The separability of a shared genetic factor influencing IT and IQ from the total genetic variance in IQ suggests that IT affects a specific subcomponent of intelligence rather than a generalised efficiency. (C) 2001 Elsevier Science Inc. All rights reserved.