930 resultados para PHYLOGENETIC INFERENCE
Resumo:
Geostatistics involves the fitting of spatially continuous models to spatially discrete data (Chil`es and Delfiner, 1999). Preferential sampling arises when the process that determines the data-locations and the process being modelled are stochastically dependent. Conventional geostatistical methods assume, if only implicitly, that sampling is non-preferential. However, these methods are often used in situations where sampling is likely to be preferential. For example, in mineral exploration samples may be concentrated in areas thought likely to yield high-grade ore. We give a general expression for the likelihood function of preferentially sampled geostatistical data and describe how this can be evaluated approximately using Monte Carlo methods. We present a model for preferential sampling, and demonstrate through simulated examples that ignoring preferential sampling can lead to seriously misleading inferences. We describe an application of the model to a set of bio-monitoring data from Galicia, northern Spain, in which making allowance for preferential sampling materially changes the inferences.
Resumo:
In this paper, we consider estimation of the causal effect of a treatment on an outcome from observational data collected in two phases. In the first phase, a simple random sample of individuals are drawn from a population. On these individuals, information is obtained on treatment, outcome, and a few low-dimensional confounders. These individuals are then stratified according to these factors. In the second phase, a random sub-sample of individuals are drawn from each stratum, with known, stratum-specific selection probabilities. On these individuals, a rich set of confounding factors are collected. In this setting, we introduce four estimators: (1) simple inverse weighted, (2) locally efficient, (3) doubly robust and (4)enriched inverse weighted. We evaluate the finite-sample performance of these estimators in a simulation study. We also use our methodology to estimate the causal effect of trauma care on in-hospital mortality using data from the National Study of Cost and Outcomes of Trauma.
Resumo:
We analysed a 610-bp mitochondrial (mt)DNA D-loop fragment in a sample of German draught horse breeds and compared the polymorphic sites with sequences from Arabian, Hanoverian, Exmoor, Icelandic, Sorraia and Przewalski's Horses as well as with Suffolk, Shire and Belgian horses. In a total of 65 horses, 70 polymorphic sites representing 47 haplotypes were observed. The average percentage of polymorphic sites was 11.5% for the mtDNA fragment analysed. In the nine different draught horse breeds including South German, Mecklenburg, Saxon Thuringa coldblood, Rhenisch German, Schleswig Draught Horse, Black Forest Horse, Shire, Suffolk and Belgian, 61 polymorphic sites and 24 haplotypes were found. The phylogenetic analysis failed to show monophyletic groups for the draught horses. The analysis indicated that the draught horse populations investigated consist of diverse genetic groups with respect to their maternal lineage.
Resumo:
In this study, we present a novel genotyping scheme to classify German wild-type varicella-zoster virus (VZV) strains and to differentiate them from the Oka vaccine strain (genotype B). This approach is based on analysis of four loci in open reading frames (ORFs) 51 to 58, encompassing a total length of 1,990 bp. The new genotyping scheme produced identical clusters in phylogenetic analyses compared to full-genome sequences from well-characterized VZV strains. Based on genotype A, D, B, and C reference strains, a dichotomous identification key (DIK) was developed and applied for VZV strains obtained from vesicle fluid and liquor samples originating from 42 patients suffering from varicella or zoster between 2003 and 2006. Sequencing of regions in ORFs 51, 52, 53, 56, 57, and 58 identified 18 single-nucleotide polymorphisms (SNPs), including two novel ones, SNP 89727 and SNP 92792 in ORF51 and ORF52, respectively. The DIK as well as phylogenetic analysis by Bayesian inference showed that 14 VZV strains belonged to genotype A, and 28 VZV strains were classified as genotype D. Neither Japanese (vaccine)-like B strains nor recombinant-like C strains were found within the samples from Germany. The novel genotyping scheme and the DIK were demonstrated to be practical and simple and allow the highly efficient replication of phylogenetic patterns in VZV initially derived from full-genome DNA sequence analyses. Therefore, this approach may allow us to draw a more comprehensive picture of wild-type VZV strains circulating in Germany and Central Europe by high-throughput procedures in the future.
Resumo:
The present distribution of freshwater fish in the Alpine region has been strongly affected by colonization events occurring after the last glacial maximum (LGM), some 20,000 years ago. We use here a spatially explicit simulation framework to model and better understand their colonization dynamics in the Swiss Rhine basin. This approach is applied to the European bullhead (Cottus gobio), which is an ideal model organism to study fish past demographic processes since it has not been managed by humans. The molecular diversity of eight sampled populations is simulated and compared to observed data at six microsatellite loci under an approximate Bayesian computation framework to estimate the parameters of the colonization process. Our demographic estimates fit well with current knowledge about the biology of this species, but they suggest that the Swiss Rhine basin was colonized very recently, after the Younger Dryas some 6600 years ago. We discuss the implication of this result, as well as the strengths and limits of the spatially explicit approach coupled to the approximate Bayesian computation framework.
Resumo:
Wood formation is an economically and environmentally important process and has played a significant role in the evolution of terrestrial plants. Despite its significance, the molecular underpinnings of the process are still poorly understood. We have previously shown that four Lateral Boundary Domain (LBD) transcription factors have important roles in the regulation of wood formation with two (LBD1 and LBD4) involved in secondary phloem and ray cell development and two (LBD15 and LBD18) in secondary xylem formation. Here, we used comparative phylogenetic analyses to test potential roles of the four LBD genes in the evolution of woodiness. We studied the copy number and variation in DNA and amino acid sequences of the four LBDs in a wide range of woody and herbaceous plant taxa with fully sequenced and annotated genomes. LBD1 showed the highest gene copy number across the studied species, and LBD1 gene copy number was strongly and significantly correlated with the level of ray seriation. The lianas, cucumber and grape, with multiseriate ray cells showed the highest gene copy number (12 and 11, respectively). Because lianas’ growth habit requires significant twisting and bending, the less lignified ray parenchyma cells likely facilitate stem flexibility and maintenance of xylem conductivity. We further demonstrate conservation of amino acids in the LBD18 protein sequences that are specific to woody taxa. Neutrality tests showed evidence for strong purifying selection on these gene regions across various orders, indicating adaptive convergent evolution of LBD18. Structural modeling demonstrates that the conserved amino acids have a significant impact on the tertiary protein structure and thus are likely of significant functional importance.