929 results for simultaneous inference


Abstract:

Recent axiomatic derivations of the maximum entropy principle from consistency conditions are critically examined. We show that proper application of consistency conditions alone allows a wider class of functionals, essentially of the form $\int dx\, p(x)[p(x)/g(x)]^{s}$ for some real number $s$, to be used for inductive inference, and that the commonly used form $-\int dx\, p(x)\ln[p(x)/g(x)]$ is only a particular case. The role of the prior density $g(x)$ is clarified. It can be regarded as a geometric factor describing the coordinate system used; it does not represent information of the same kind as that obtained by measurements on the system in the form of expectation values.
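
As a hedged aside that is not part of the abstract: one standard way to see the logarithmic functional as a particular case of the power family is to take the limit $s \to 0$ after subtracting the constant term and dividing by $s$. This normalization is an assumption made here for illustration; the abstract does not say how the authors relate the two forms.

```latex
\[
\lim_{s \to 0} \frac{1}{s}\left( \int dx\, p(x)\left[\frac{p(x)}{g(x)}\right]^{s} - 1 \right)
= \int dx\, p(x)\,\ln\frac{p(x)}{g(x)},
\]
\[
\text{using } \int dx\, p(x) = 1 \quad \text{and} \quad a^{s} = 1 + s \ln a + O(s^{2}).
\]
```

The commonly used entropy form is the negative of the right-hand side.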

Abstract:

Evidence that complex traits are highly polygenic has been presented by population-based genome-wide association studies (GWASs) through the identification of many significant variants, as well as by family-based de novo sequencing studies indicating that several traits have a large mutational target size. Here, using a third study design, we show results consistent with extreme polygenicity for body mass index (BMI) and height. On a sample of 20,240 siblings (from 9,570 nuclear families), we used a within-family method to obtain narrow-sense heritability estimates of 0.42 (SE = 0.17, $p = 0.01$) and 0.69 (SE = 0.14, $p = 6 \times 10^{-7}$) for BMI and height, respectively, after adjusting for covariates. The genomic inflation factors from locus-specific linkage analysis were 1.69 (SE = 0.21, $p = 0.04$) for BMI and 2.18 (SE = 0.21, $p = 2 \times 10^{-10}$) for height. This inflation is free of confounding and congruent with polygenicity, consistent with observations of ever-increasing genomic-inflation factors from GWASs with large sample sizes, implying that those signals are due to true genetic signals across the genome rather than population stratification. We also demonstrate that the distribution of the observed test statistics is consistent with both rare and common variants underlying a polygenic architecture and that previous reports of linkage signals in complex traits are probably a consequence of polygenic architecture rather than the segregation of variants with large effects. The convergent empirical evidence from GWASs, de novo studies, and within-family segregation implies that family-based sequencing studies for complex traits require very large sample sizes because the effects of causal variants are small on average.
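
For readers unfamiliar with the genomic inflation factor mentioned above, the following is a minimal sketch of the definition commonly used in GWASs: the median of the observed 1-df chi-square test statistics divided by the median expected under the null (about 0.4549). This is an assumption about the general technique, not the study's code; the abstract's linkage-based computation may differ, and the function name and example values are hypothetical.

```python
from statistics import median

# Median of a chi-square distribution with 1 degree of freedom (approx. 0.4549).
CHI2_1DF_MEDIAN = 0.4549364


def genomic_inflation_factor(chi2_stats):
    """Return median(observed 1-df chi-square statistics) / expected null median."""
    if not chi2_stats:
        raise ValueError("need at least one test statistic")
    return median(chi2_stats) / CHI2_1DF_MEDIAN


# Hypothetical statistics for illustration only (not data from the study).
example_stats = [0.1, 0.3, 0.5, 0.9, 1.4, 2.2, 3.8]
print(round(genomic_inflation_factor(example_stats), 2))
```

A value near 1 indicates no inflation; values well above 1 indicate that test statistics are larger than expected under the null across the genome.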

Abstract:

Acidity in terms of pH and titratable acids influences the texture and flavour of fermented dairy products, such as Kefir. However, the methods for determining pH and titratable acidity (TA) are time consuming. Near infrared (NIR) spectroscopy is a non-destructive method, which simultaneously predicts multiple traits from a single scan and can be used to predict pH and TA. The best pH NIR calibration model was obtained with no spectral pre-treatment applied, whereas smoothing was found to be the best pre-treatment to develop the TA calibration model. Using cross-validation, the prediction results were found acceptable for both pH and TA. With external validation, similar results were found for pH and TA, and both models were found to be acceptable for screening purposes.

Abstract:

Al13 pillared montmorillonites (AlPMts) prepared with different Al/clay ratios were used to remove Cd(II) and phosphate from aqueous solution. The structure of AlPMts was characterized by X-ray diffraction (XRD), Thermogravimetric analysis (TG), and N2 adsorption–desorption. The basal spacing, intercalated amount of Al13 cations, and specific surface area of AlPMts increased with the increase of the Al/clay ratio. In the single adsorption system, with the increase of the Al/clay ratio, the adsorption of phosphate on AlPMts increased but that of Cd(II) decreased. Significantly enhanced adsorptions of Cd(II) and phosphate on AlPMts were observed in a simultaneous system. For both contaminants, the adsorption of one contaminant would increase with the increase of the initial concentration of the other one and increase in the Al/clay ratio. The enhancement of the adsorption of Cd(II) was much higher than that of phosphate on AlPMt. This suggests that the intercalated Al13 cations are the primary co-adsorption sites for phosphate and Cd(II). X-ray photoelectron spectroscopy (XPS) indicated comparable binding energy of P2p but a different binding energy of Cd3d in single and simultaneous systems. The adsorption and XPS results suggested that the formation of P-bridge ternary surface complexes was the possible adsorption mechanism for promoted uptake of Cd(II) and phosphate on AlPMt.

Abstract:

The simultaneous state and parameter estimation problem for a linear discrete-time system with unknown noise statistics is treated as a large-scale optimization problem. The a posteriori probability density function is maximized directly with respect to the states and parameters subject to the constraint of the system dynamics. The resulting optimization problem is too large for any of the standard non-linear programming techniques, and hence a hierarchical optimization approach is proposed. It turns out that the states can be computed at the first level for given noise and system parameters. These, in turn, are to be modified at the second level. The states are to be computed from a large system of linear equations, and two solution methods are considered for solving these equations, limiting the horizon to a suitable length. The resulting algorithm is a filter-smoother, suitable for off-line as well as on-line state estimation for given noise and system parameters. The second-level problem is split into two, one for modifying the noise statistics and the other for modifying the system parameters. An adaptive relaxation technique is proposed for modifying the noise statistics, and a modified Gauss-Newton technique is used to adjust the system parameters.

Abstract:

A very general and numerically quite robust algorithm has been proposed by Sastry and Gauvrit (1980) for system identification. The present paper takes it up and examines its performance on a real test example. The example considered is the lateral dynamics of an aircraft. This is used as a vehicle for demonstrating the performance of various aspects of the algorithm in several possible modes.

Abstract:

Coccidiosis is a costly worldwide enteric disease of chickens caused by parasites of the genus Eimeria. At present, there are seven described species that occur globally and a further three undescribed, operational taxonomic units (OTUs X, Y, and Z) that are known to infect chickens from Australia. Species of Eimeria have both overlapping morphology and pathology and frequently occur as mixed-species infections. This makes definitive diagnosis with currently available tests difficult and, to date, there is no test for the detection of the three OTUs. This paper describes the development of a PCR-based assay that is capable of detecting all ten species of Eimeria, including OTUs X, Y, and Z in field samples. The assay is based on a single set of generic primers that amplifies a single diagnostic fragment from the mitochondrial genome of each species. This one-tube assay is simple, low-cost, and has the capacity to be high throughput. It will therefore be of great benefit to the poultry industry for Eimeria detection and control, and the confirmation of identity and purity of vaccine strains.

Abstract:

Using a direct method that employs the Erdélyi-Kober operator and the modified Hankel operator, certain systems of two and three pairs of dual integral equations with Bessel kernels are solved in closed form. For certain classes of functions and orders of the Bessel functions, this approach is more appropriate and better suited than the existing methods.

Abstract:

The family of location and scale mixtures of Gaussians has the ability to generate a number of flexible distributional forms. The family nests as particular cases several important asymmetric distributions like the Generalized Hyperbolic distribution. The Generalized Hyperbolic distribution in turn nests many other well-known distributions such as the Normal Inverse Gaussian. In a multivariate setting, an extension of the standard location and scale mixture concept is proposed into a so-called multiple scaled framework, which has the advantage of allowing different tail and skewness behaviours in each dimension with arbitrary correlation between dimensions. Estimation of the parameters is provided via an EM algorithm and extended to cover the case of mixtures of such multiple scaled distributions for application to clustering. Assessments on simulated and real data confirm the gain in degrees of freedom and flexibility in modelling data of varying tail behaviour and directional shape.

Abstract:

Whether a statistician wants to complement a probability model for observed data with a prior distribution and carry out fully probabilistic inference, or base the inference only on the likelihood function, may be a fundamental question in theory, but in practice it may well be of less importance if the likelihood contains much more information than the prior. Maximum likelihood inference can be justified as a Gaussian approximation at the posterior mode, using flat priors. However, in situations where parametric assumptions in standard statistical models would be too rigid, more flexible model formulation, combined with fully probabilistic inference, can be achieved using hierarchical Bayesian parametrization. This work includes five articles, all of which apply probability modeling to various problems involving incomplete observation. Three of the papers apply maximum likelihood estimation and two of them hierarchical Bayesian modeling. Because maximum likelihood may be presented as a special case of Bayesian inference, but not the other way round, in the introductory part of this work we present a framework for probability-based inference using only Bayesian concepts. We also re-derive some results presented in the original articles using the toolbox developed herein, to show that they are also justifiable under this more general framework. Here the assumption of exchangeability and de Finetti's representation theorem are applied repeatedly to justify the use of standard parametric probability models with conditionally independent likelihood contributions. It is argued that this same reasoning can also be applied under sampling from a finite population. The main emphasis here is on probability-based inference under incomplete observation due to study design. This is illustrated using a generic two-phase cohort sampling design as an example.
The alternative approaches presented for analysis of such a design are full likelihood, which utilizes all observed information, and conditional likelihood, which is restricted to a completely observed set, conditioning on the rule that generated that set. Conditional likelihood inference is also applied for a joint analysis of prevalence and incidence data, a situation subject to both left censoring and left truncation. Other topics covered are model uncertainty and causal inference using posterior predictive distributions. We formulate a non-parametric monotonic regression model for one or more covariates and a Bayesian estimation procedure, and apply the model in the context of optimal sequential treatment regimes, demonstrating that inference based on posterior predictive distributions is feasible also in this case.

Abstract:

Genetics, the science of heredity and variation in living organisms, has a central role in medicine, in breeding crops and livestock, and in studying fundamental topics of biological sciences such as evolution and cell functioning. Currently the field of genetics is undergoing rapid development because of recent advances in the technologies by which molecular data can be obtained from living organisms. To extract the most information from such data, the analyses need to be carried out using statistical models that are tailored to take account of the particular genetic processes. In this thesis we formulate and analyze Bayesian models for genetic marker data of contemporary individuals. The major focus is on the modeling of the unobserved recent ancestry of the sampled individuals (say, for tens of generations or so), which is carried out by using explicit probabilistic reconstructions of the pedigree structures accompanied by the gene flows at the marker loci. For such a recent history, the recombination process is the major genetic force that shapes the genomes of the individuals, and it is included in the model by assuming that the recombination fractions between the adjacent markers are known. The posterior distribution of the unobserved history of the individuals is studied conditionally on the observed marker data by using a Markov chain Monte Carlo (MCMC) algorithm. The example analyses consider estimation of the population structure, relatedness structure (both at the level of whole genomes and at each marker separately), and haplotype configurations. For situations where the pedigree structure is partially known, an algorithm to create an initial state for the MCMC algorithm is given. Furthermore, the thesis includes an extension of the model for the recent genetic history to situations where a quantitative phenotype has also been measured from the contemporary individuals.
In that case the goal is to identify positions on the genome that affect the observed phenotypic values. This task is carried out within the Bayesian framework, where the number and the relative effects of the quantitative trait loci are treated as random variables whose posterior distribution is studied conditionally on the observed genetic and phenotypic data. In addition, the thesis contains an extension of a widely-used haplotyping method, the PHASE algorithm, to settings where genetic material from several individuals has been pooled together, and the allele frequencies of each pool are determined in a single genotyping.

Abstract:

This thesis, which consists of an introduction and four peer-reviewed original publications, studies the problems of haplotype inference (haplotyping) and local alignment significance. The problems studied here belong to the broad area of bioinformatics and computational biology. The presented solutions are computationally fast and accurate, which makes them practical in high-throughput sequence data analysis. Haplotype inference is a computational problem where the goal is to estimate haplotypes from a sample of genotypes as accurately as possible. This problem is important because the direct measurement of haplotypes is difficult, whereas the genotypes are easier to quantify. Haplotypes are the key players when studying, for example, the genetic causes of diseases. In this thesis, three methods are presented for the haplotype inference problem, referred to as HaploParser, HIT, and BACH. HaploParser is based on a combinatorial mosaic model and hierarchical parsing that together mimic recombinations and point-mutations in a biologically plausible way. In this mosaic model, the current population is assumed to have evolved from a small founder population. Thus, the haplotypes of the current population are recombinations of the (implicit) founder haplotypes with some point-mutations. HIT (Haplotype Inference Technique) uses a hidden Markov model for haplotypes, and efficient algorithms are presented to learn this model from genotype data. The model structure of HIT is analogous to the mosaic model of HaploParser with founder haplotypes. Therefore, it can be seen as a probabilistic model of recombinations and point-mutations. BACH (Bayesian Context-based Haplotyping) utilizes a context tree weighting algorithm to efficiently sum over all variable-length Markov chains to evaluate the posterior probability of a haplotype configuration. Algorithms are presented that find haplotype configurations with high posterior probability.
BACH is the most accurate method presented in this thesis and has comparable performance to the best available software for haplotype inference. Local alignment significance is a computational problem where one is interested in whether the local similarities in two sequences arise because the sequences are related or just by chance. Similarity of sequences is measured by their best local alignment score, and from that a p-value is computed. This p-value is the probability of picking two sequences from the null model whose best local alignment score is at least as good. Local alignment significance is used routinely, for example, in homology searches. In this thesis, a general framework is sketched that allows one to compute a tight upper bound for the p-value of a local pairwise alignment score. Unlike the previous methods, the presented framework is not affected by so-called edge effects and can handle gaps (deletions and insertions) without troublesome sampling and curve fitting.
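
To make the "best local alignment score" concrete, here is a minimal sketch using the classic Smith-Waterman dynamic program with a linear gap penalty. It is not the thesis's method: the thesis concerns bounding the p-value of such a score, which is not shown here. The function name and the scoring parameters (`match`, `mismatch`, `gap`) are assumptions chosen for illustration.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score of strings a and b (Smith-Waterman, linear gaps)."""
    prev = [0] * (len(b) + 1)  # previous row of the dynamic-programming table
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Extend the alignment diagonally with a match or a mismatch.
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor at 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


# Hypothetical sequences for illustration only.
print(local_alignment_score("TTACGTTT", "GGACGGG"))
```

A p-value for an observed score would then be the probability, under a chosen null model of random sequences, of obtaining a score at least this large.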

Abstract:

To develop and test a custom-built instrument to simultaneously assess tear film surface quality (TFSQ) and subjective vision score (SVS).

Abstract:

High Intensity Exercise (HIE) stimulates greater physiological remodeling when compared to workload matched low-moderate intensity exercise. This study utilized an untargeted metabolomics approach to examine the metabolic perturbations that occur following two workload matched supramaximal low volume HIE trials. In a randomized order, 7 untrained males completed two exercise protocols separated by one week; 1) HIE150%: 30 x 20s cycling at 150% VO2peak, 40s passive rest; 2) HIE300%: 30 x 10s cycling at 300% VO2peak, 50 s passive rest. Total exercise duration was 30 minutes for both trials. Blood samples were taken at rest, during and immediately following exercise and at 60 minutes post exercise. Gas chromatography-mass spectrometry (GC-MS) analysis of plasma identified 43 known metabolites of which 3 demonstrated significant fold changes (HIE300% compared to the HIE150% value) during exercise, 14 post exercise and 23 at the end of the recovery period. Significant changes in plasma metabolites relating to lipid metabolism [fatty acids: dodecanoate (p=0.042), hexadecanoate (p=0.001), octadecanoate (p=0.001)], total cholesterol (p=0.001), and glycolysis [lactate (p=0.018)] were observed following exercise and during the recovery period. The HIE300% protocol elicited greater metabolic changes relating to lipid metabolism and glycolysis when compared to HIE150% protocol. These changes were more pronounced throughout the recovery period rather than during the exercise bout itself. Data from the current study demonstrate the use of metabolomics to monitor intensity-dependent changes in multiple metabolic pathways following exercise. The small sample size indicates a need for further studies in a larger sample cohort to validate these findings.