949 resultados para Maximum likelihood estimator (MLE)
Resumo:
Hybrid zones provide excellent opportunities to study processes and mechanisms underlying reproductive isolation and speciation. Here we investigated sex-specific clines of molecular markers in hybrid zones of morphologically cryptic yet genetically highly-diverged evolutionary lineages of the European common vole (Microtus arvalis). We analyzed the position and width of four secondary contact zones along three independent transects in the region of the Alps using maternally (mitochondrial DNA) and paternally (Y-chromosome) inherited genetic markers. Given male-biased dispersal in the common vole, a selectively neutral secondary contact would show broader paternal marker clines than maternal ones. In a selective case, for example, involving a form of Haldane’s rule, Y-chromosomal clines would not be expected to be broader than maternal markers because they are transmitted by the heterogametic sex and thus gene flow would be restricted. Consistent with the selective case, paternal clines were significantly narrower or at most equal in width to maternal clines in all contact zones. In addition, analyses using maximum likelihood cline-fitting detected a shift of paternal relative to maternal clines in three of four contact zones. These patterns suggest that processes at the contact zones in the common vole are not selectively neutral, and that partial reproductive isolation is already established between these evolutionary lineages. We conclude that hybrid zone movement, sexual selection and/or genetic incompatibilities are likely associated with an unusual unidirectional manifestation of Haldane’s rule in this common European mammal.
Resumo:
The distribution of the number of heterozygous loci in two randomly chosen gametes or in a random diploid zygote provides information regarding the nonrandom association of alleles among different genetic loci. Two alternative statistics may be employed for detection of nonrandom association of genes of different loci when observations are made on these distributions: observed variance of the number of heterozygous loci (s2k) and a goodness-of-fit criterion (X2) to contrast the observed distribution with that expected under the hypothesis of random association of genes. It is shown, by simulation, that s2k is statistically more efficient than X2 to detect a given extent of nonrandom association. Asymptotic normality of s2k is justified, and X2 is shown to follow a chi-square (chi 2) distribution with partial loss of degrees of freedom arising because of estimation of parameters from the marginal gene frequency data. Whenever direct evaluations of linkage disequilibrium values are possible, tests based on maximum likelihood estimators of linkage disequilibria require a smaller sample size (number of zygotes or gametes) to detect a given level of nonrandom association in comparison with that required if such tests are conducted on the basis of s2k. Summarization of multilocus genotype (or haplotype) data, into the different number of heterozygous loci classes, thus, amounts to appreciable loss of information.
Resumo:
Variable number of tandem repeats (VNTR) are genetic loci at which short sequence motifs are found repeated different numbers of times among chromosomes. To explore the potential utility of VNTR loci in evolutionary studies, I have conducted a series of studies to address the following questions: (1) What are the population genetic properties of these loci? (2) What are the mutational mechanisms of repeat number change at these loci? (3) Can DNA profiles be used to measure the relatedness between a pair of individuals? (4) Can DNA fingerprint be used to measure the relatedness between populations in evolutionary studies? (5) Can microsatellite and short tandem repeat (STR) loci which mutate stepwisely be used in evolutionary analyses?^ A large number of VNTR loci typed in many populations were studied by means of statistical methods developed recently. The results of this work indicate that there is no significant departure from Hardy-Weinberg expectation (HWE) at VNTR loci in most of the human populations examined, and the departure from HWE in some VNTR loci are not solely caused by the presence of population sub-structure.^ A statistical procedure is developed to investigate the mutational mechanisms of VNTR loci by studying the allele frequency distributions of these loci. Comparisons of frequency distribution data on several hundreds VNTR loci with the predictions of two mutation models demonstrated that there are differences among VNTR loci grouped by repeat unit sizes.^ By extending the ITO method, I derived the distribution of the number of shared bands between individuals with any kinship relationship. A maximum likelihood estimation procedure is proposed to estimate the relatedness between individuals from the observed number of shared bands between them.^ It was believed that classical measures of genetic distance are not applicable to analysis of DNA fingerprints which reveal many minisatellite loci simultaneously in the genome, because the information regarding underlying alleles and loci is not available. I proposed a new measure of genetic distance based on band sharing between individuals that is applicable to DNA fingerprint data.^ To address the concern that microsatellite and STR loci may not be useful for evolutionary studies because of the convergent nature of their mutation mechanisms, by a theoretical study as well as by computer simulation, I conclude that the possible bias caused by the convergent mutations can be corrected, and a novel measure of genetic distance that makes the correction is suggested. In summary, I conclude that hypervariable VNTR loci are useful in evolutionary studies of closely related populations or species, especially in the study of human evolution and the history of geographic dispersal of Homo sapiens. (Abstract shortened by UMI.) ^
Resumo:
Models of DNA sequence evolution and methods for estimating evolutionary distances are needed for studying the rate and pattern of molecular evolution and for inferring the evolutionary relationships of organisms or genes. In this dissertation, several new models and methods are developed.^ The rate variation among nucleotide sites: To obtain unbiased estimates of evolutionary distances, the rate heterogeneity among nucleotide sites of a gene should be considered. Commonly, it is assumed that the substitution rate varies among sites according to a gamma distribution (gamma model) or, more generally, an invariant+gamma model which includes some invariable sites. A maximum likelihood (ML) approach was developed for estimating the shape parameter of the gamma distribution $(\alpha)$ and/or the proportion of invariable sites $(\theta).$ Computer simulation showed that (1) under the gamma model, $\alpha$ can be well estimated from 3 or 4 sequences if the sequence length is long; and (2) the distance estimate is unbiased and robust against violations of the assumptions of the invariant+gamma model.^ However, this ML method requires a huge amount of computational time and is useful only for less than 6 sequences. Therefore, I developed a fast method for estimating $\alpha,$ which is easy to implement and requires no knowledge of tree. A computer program was developed for estimating $\alpha$ and evolutionary distances, which can handle the number of sequences as large as 30.^ Evolutionary distances under the stationary, time-reversible (SR) model: The SR model is a general model of nucleotide substitution, which assumes (i) stationary nucleotide frequencies and (ii) time-reversibility. It can be extended to SRV model which allows rate variation among sites. I developed a method for estimating the distance under the SR or SRV model, as well as the variance-covariance matrix of distances. Computer simulation showed that the SR method is better than a simpler method when the sequence length $L>1,000$ bp and is robust against deviations from time-reversibility. As expected, when the rate varies among sites, the SRV method is much better than the SR method.^ The evolutionary distances under nonstationary nucleotide frequencies: The statistical properties of the paralinear and LogDet distances under nonstationary nucleotide frequencies were studied. First, I developed formulas for correcting the estimation biases of the paralinear and LogDet distances. The performances of these formulas and the formulas for sampling variances were examined by computer simulation. Second, I developed a method for estimating the variance-covariance matrix of the paralinear distance, so that statistical tests of phylogenies can be conducted when the nucleotide frequencies are nonstationary. Third, a new method for testing the molecular clock hypothesis was developed in the nonstationary case. ^
Resumo:
The use of group-randomized trials is particularly widespread in the evaluation of health care, educational, and screening strategies. Group-randomized trials represent a subset of a larger class of designs often labeled nested, hierarchical, or multilevel and are characterized by the randomization of intact social units or groups, rather than individuals. The application of random effects models to group-randomized trials requires the specification of fixed and random components of the model. The underlying assumption is usually that these random components are normally distributed. This research is intended to determine if the Type I error rate and power are affected when the assumption of normality for the random component representing the group effect is violated. ^ In this study, simulated data are used to examine the Type I error rate, power, bias and mean squared error of the estimates of the fixed effect and the observed intraclass correlation coefficient (ICC) when the random component representing the group effect possess distributions with non-normal characteristics, such as heavy tails or severe skewness. The simulated data are generated with various characteristics (e.g. number of schools per condition, number of students per school, and several within school ICCs) observed in most small, school-based, group-randomized trials. The analysis is carried out using SAS PROC MIXED, Version 6.12, with random effects specified in a random statement and restricted maximum likelihood (REML) estimation specified. The results from the non-normally distributed data are compared to the results obtained from the analysis of data with similar design characteristics but normally distributed random effects. ^ The results suggest that the violation of the normality assumption for the group component by a skewed or heavy-tailed distribution does not appear to influence the estimation of the fixed effect, Type I error, and power. Negative biases were detected when estimating the sample ICC and dramatically increased in magnitude as the true ICC increased. These biases were not as pronounced when the true ICC was within the range observed in most group-randomized trials (i.e. 0.00 to 0.05). The normally distributed group effect also resulted in bias ICC estimates when the true ICC was greater than 0.05. However, this may be a result of higher correlation within the data. ^
Resumo:
(1) A mathematical theory for computing the probabilities of various nucleotide configurations is developed, and the probability of obtaining the correct phylogenetic tree (model tree) from sequence data is evaluated for six phylogenetic tree-making methods (UPGMA, distance Wagner method, transformed distance method, Fitch-Margoliash's method, maximum parsimony method, and compatibility method). The number of nucleotides (m*) necessary to obtain the correct tree with a probability of 95% is estimated with special reference to the human, chimpanzee, and gorilla divergence. m* is at least 4,200, but the availability of outgroup species greatly reduces m* for all methods except UPGMA. m* increases if transitions occur more frequently than transversions as in the case of mitochondrial DNA. (2) A new tree-making method called the neighbor-joining method is proposed. This method is applicable either for distance data or character state data. Computer simulation has shown that the neighbor-joining method is generally better than UPGMA, Farris' method, Li's method, and modified Farris method on recovering the true topology when distance data are used. A related method, the simultaneous partitioning method, is also discussed. (3) The maximum likelihood (ML) method for phylogeny reconstruction under the assumption of both constant and varying evolutionary rates is studied, and a new algorithm for obtaining the ML tree is presented. This method gives a tree similar to that obtained by UPGMA when constant evolutionary rate is assumed, whereas it gives a tree similar to that obtained by the maximum parsimony tree and the neighbor-joining method when varying evolutionary rate is assumed. ^
Resumo:
Approximately 350 base pairs (bp) of the mitochondrial 16S rRNA gene were used to study the phylogenetic relationships among 5 genera of the clawed lobster family Nephropidae (infraorder Astacidea), including Homarus, Homarinus, Metanephrops, Nephrops, and Nephropsis. Maximum-parsimony analysis, using a hermit crab, Pagurus pollicaris (infraorder Anomura), as an outgroup. produced a tree topology in which Homarus and Nephrops formed a well-supported clade that excluded Homarinus. The same tree topology was obtained from both neighbor-joining and maximum-likelihood analyses, Some morphological characters that appear synapomorphic for Nephrops and Metanephrops may be due to convergence rather than symplesiomorphy. The current taxonomy, therefore, does not reflect the phylogeny of this group as suggested by the molecular data. More molecular data and studies using homologous morphological characters me needed to reach a better understanding of the phylogenetic history of clawed lobsters.
Resumo:
The 2014 Ebola virus (EBOV) outbreak in West Africa is the largest outbreak of the genus Ebolavirus to date. To better understand the spread of infection in the affected countries, it is crucial to know the number of secondary cases generated by an infected index case in the absence and presence of control measures, i.e., the basic and effective reproduction number. In this study, I describe the EBOV epidemic using an SEIR (susceptible-exposed-infectious-recovered) model and fit the model to the most recent reported data of infected cases and deaths in Guinea, Sierra Leone and Liberia. The maximum likelihood estimates of the basic reproduction number are 1.51 (95% confidence interval [CI]: 1.50-1.52) for Guinea, 2.53 (95% CI: 2.41-2.67) for Sierra Leone and 1.59 (95% CI: 1.57-1.60) for Liberia. The model indicates that in Guinea and Sierra Leone the effective reproduction number might have dropped to around unity by the end of May and July 2014, respectively. In Liberia, however, the model estimates no decline in the effective reproduction number by end-August 2014. This suggests that control efforts in Liberia need to be improved substantially in order to stop the current outbreak.
Resumo:
We present an image quality assessment and enhancement method for high-resolution Fourier-Domain OCT imaging like in sub-threshold retina therapy. A Maximum-Likelihood deconvolution algorithm as well as a histogram-based quality assessment method are evaluated.
Resumo:
OBJECTIVE: Bell, Marcus, and Goodlad (2013) recently conducted a meta-analysis of randomized controlled additive trials and found that adding an additional component to an existing treatment vis-à-vis the existing treatment produced larger effect sizes on targeted outcomes at 6-months follow-up than at termination, an effect they labeled as a sleeper effect. One of the limitations with Bell et al.'s detection of the sleeper effect was that they did not conduct a statistical test of the size of the effect at follow-up versus termination. METHOD: To statistically test if the differences of effect sizes between the additive conditions and the control conditions at follow-up differed from those at termination, we used a restricted maximum-likelihood random-effect model with known variances to conduct a multilevel longitudinal meta-analysis (k = 30). RESULTS: Although the small effects at termination detected by Bell et al. were replicated (ds = 0.17-0.23), none of the analyses of growth from termination to follow-up produced statistically significant effects (ds < 0.08; p > .20), and when asymmetry was considered using trim-and-fill procedure or the studies after 2000 were analyzed, magnitude of the sleeper effect was negligible (d = 0.00). CONCLUSION: There is no empirical evidence to support the sleeper effect.
Resumo:
A measurement of the mass difference between top and anti-top quarks is presented. In a 4.7 fb−14.7 fb−1 data sample of proton–proton collisions at View the MathML sources=7 TeV recorded with the ATLAS detector at the LHC, events consistent with View the MathML sourcett¯ production and decay into a single charged lepton final state are reconstructed. For each event, the mass difference between the top and anti-top quark candidate is calculated. A two b -tag requirement is used in order to reduce the background contribution. A maximum likelihood fit to these per-event mass differences yields View the MathML sourceΔm≡mt−mt¯=0.67±0.61(stat)±0.41(syst) GeV, consistent with CPT invariance.
Resumo:
For probability distributions on ℝq, a detailed study of the breakdown properties of some multivariate M-functionals related to Tyler's [Ann. Statist. 15 (1987) 234] ‘distribution-free’ M-functional of scatter is given. These include a symmetrized version of Tyler's M-functional of scatter, and the multivariate t M-functionals of location and scatter. It is shown that for ‘smooth’ distributions, the (contamination) breakdown point of Tyler's M-functional of scatter and of its symmetrized version are 1/q and inline image, respectively. For the multivariate t M-functional which arises from the maximum likelihood estimate for the parameters of an elliptical t distribution on ν ≥ 1 degrees of freedom the breakdown point at smooth distributions is 1/(q + ν). Breakdown points are also obtained for general distributions, including empirical distributions. Finally, the sources of breakdown are investigated. It turns out that breakdown can only be caused by contaminating distributions that are concentrated near low-dimensional subspaces.
Resumo:
BACKGROUND Blood pressure (BP) is known to aggregate in families. Yet, heritability estimates are population-specific and no Swiss data have been published so far. We estimated the heritability of ambulatory and office BP in a Swiss population-based sample. METHODS The Swiss Kidney Project on Genes in Hypertension is a population-based family study focusing on BP genetics. Office and ambulatory BP were measured in 1009 individuals from 271 nuclear families. Heritability was estimated for SBP, DBP, and pulse pressure using a maximum likelihood method implanted in the Statistical Analysis in Genetic Epidemiology software. RESULTS The 518 women and 491 men included in this analysis had a mean (±SD) age of 48.3 (±17.4) and 47.3 (±17.7) years, and a mean BMI of 23.8 (±4.2) and 25.9 (±4.1) kg/m, respectively. Narrow-sense heritability estimates (±standard error) for ambulatory SBP, DBP, and pulse pressure were 0.37 ± 0.07, 0.26 ± 0.07, and 0.29 ± 0.07 for 24-h BP; 0.39 ± 0.07, 0.28 ± 0.07, and 0.27 ± 0.07 for day BP; and 0.25 ± 0.07, 0.20 ± 0.07, and 0.30 ± 0.07 for night BP, respectively (all P < 0.001). Heritability estimates for office SBP, DBP, and pulse pressure were 0.21 ± 0.08, 0.25 ± 0.08, and 0.18 ± 0.07 (all P < 0.01). CONCLUSIONS We found significant heritability estimates for both ambulatory and office BP in this Swiss population-based study. Our findings justify the ongoing search for the genetic determinants of BP.
Resumo:
Allostatic load (AL) is a marker of physiological dysregulation which reflects exposure to chronic stress. High AL has been related to poorer health outcomes including mortality. We examine here the association of socioeconomic and lifestyle factors with AL. Additionally, we investigate the extent to which AL is genetically determined. We included 803 participants (52% women, mean age 48±16years) from a population and family-based Swiss study. We computed an AL index aggregating 14 markers from cardiovascular, metabolic, lipidic, oxidative, hypothalamus-pituitary-adrenal and inflammatory homeostatic axes. Education and occupational position were used as indicators of socioeconomic status. Marital status, stress, alcohol intake, smoking, dietary patterns and physical activity were considered as lifestyle factors. Heritability of AL was estimated by maximum likelihood. Women with a low occupational position had higher AL (low vs. high OR=3.99, 95%CI [1.22;13.05]), while the opposite was observed for men (middle vs. high OR=0.48, 95%CI [0.23;0.99]). Education tended to be inversely associated with AL in both sexes(low vs. high OR=3.54, 95%CI [1.69;7.4]/OR=1.59, 95%CI [0.88;2.90] in women/men). Heavy drinking men as well as women abstaining from alcohol had higher AL than moderate drinkers. Physical activity was protective against AL while high salt intake was related to increased AL risk. The heritability of AL was estimated to be 29.5% ±7.9%. Our results suggest that generalized physiological dysregulation, as measured by AL, is determined by both environmental and genetic factors. The genetic contribution to AL remains modest when compared to the environmental component, which explains approximately 70% of the phenotypic variance.
Resumo:
Introduction: According to the ecological view, coordination establishes byvirtueof social context. Affordances thought of as situational opportunities to interact are assumed to represent the guiding principles underlying decisions involved in interpersonal coordination. It’s generally agreed that affordances are not an objective part of the (social) environment but that they depend on the constructive perception of involved subjects. Theory and empirical data hold that cognitive operations enabling domain-specific efficacy beliefs are involved in the perception of affordances. The aim of the present study was to test the effects of these cognitive concepts in the subjective construction of local affordances and their influence on decision making in football. Methods: 71 football players (M = 24.3 years, SD = 3.3, 21 % women) from different divisions participated in the study. Participants were presented scenarios of offensive game situations. They were asked to take the perspective of the person on the ball and to indicate where they would pass the ball from within each situation. The participants stated their decisions in two conditions with different game score (1:0 vs. 0:1). The playing fields of all scenarios were then divided into ten zones. For each zone, participants were asked to rate their confidence in being able to pass the ball there (self-efficacy), the likelihood of the group staying in ball possession if the ball were passed into the zone (group-efficacy I), the likelihood of the ball being covered safely by a team member (pass control / group-efficacy II), and whether a pass would establish a better initial position to attack the opponents’ goal (offensive convenience). Answers were reported on visual analog scales ranging from 1 to 10. Data were analyzed specifying general linear models for binomially distributed data (Mplus). Maximum likelihood with non-normality robust standard errors was chosen to estimate parameters. Results: Analyses showed that zone- and domain-specific efficacy beliefs significantly affected passing decisions. Because of collinearity with self-efficacy and group-efficacy I, group-efficacy II was excluded from the models to ease interpretation of the results. Generally, zones with high values in the subjective ratings had a higher probability to be chosen as passing destination (βself-efficacy = 0.133, p < .001, OR = 1.142; βgroup-efficacy I = 0.128, p < .001, OR = 1.137; βoffensive convenience = 0.057, p < .01, OR = 1.059). There were, however, characteristic differences in the two score conditions. While group-efficacy I was the only significant predictor in condition 1 (βgroup-efficacy I = 0.379, p < .001), only self-efficacy and offensive convenience contributed to passing decisions in condition 2 (βself-efficacy = 0.135, p < .01; βoffensive convenience = 0.120, p < .001). Discussion: The results indicate that subjectively distinct attributes projected to playfield zones affect passing decisions. The study proposes a probabilistic alternative to Lewin’s (1951) hodological and deterministic field theory and enables insight into how dimensions of the psychological landscape afford passing behavior. Being part of a team, this psychological landscape is not only constituted by probabilities that refer to the potential and consequences of individual behavior, but also to that of the group system of which individuals are part of. Hence, in regulating action decisions in group settings, informers are extended to aspects referring to the group-level. References: Lewin, K. (1951). In D. Cartwright (Ed.), Field theory in social sciences: Selected theoretical papers by Kurt Lewin. New York: Harper & Brothers.