935 results for Gibbs sampling
Abstract:
In tests where a considerable number of individuals do not have sufficient time to answer all items, we observe what is called the speededness effect. Using the unidimensional Item Response Theory (IRT) model in tests with speededness can lead to a series of erroneous interpretations, since this model assumes that respondents have sufficient time to answer all items. In this work, we develop a Bayesian analysis of the three-dimensional IRT model proposed by Wollack and Cohen (2005), considering a dependence structure between the prior distributions of the latent traits, which we model using copulas. We present an estimation procedure for the proposed model and carry out a simulation study comparing it with the analysis by Bazan et al. (2010), in which independent prior distributions were used for the latent traits. Finally, we perform a sensitivity analysis of the model under study and present an application to a real data set from an EGRA subtest called Nonsense Words, administered in Peru in 2007. In this subtest, students are assessed orally by reading, sequentially, 50 nonsense words in 60 seconds, which characterizes the presence of the speededness effect.
Abstract:
We investigate whether relative contributions of genetic and shared environmental factors are associated with an increased risk of melanoma. Data from the Queensland Familial Melanoma Project comprising 15,907 subjects arising from 1912 families were analyzed to estimate the additive genetic, common and unique environmental contributions to variation in the age at onset of melanoma. Two complementary approaches for analyzing correlated time-to-onset family data were considered: the generalized estimating equations (GEE) method, in which one can estimate relationship-specific dependence simultaneously with regression coefficients that describe the average population response to changing covariates; and a subject-specific Bayesian mixed model, in which heterogeneity in regression parameters is explicitly modeled and the different components of variation may be estimated directly. The proportional hazards and Weibull models were utilized, as both provide natural frameworks for estimating relative risks while adjusting for simultaneous effects of other covariates. A simple Markov chain Monte Carlo method for covariate imputation of missing data was used, and the actual implementation of the Bayesian model was based on Gibbs sampling using the freeware package BUGS. In addition, we used a Bayesian model to investigate the relative contribution of genetic and environmental effects on the expression of naevi and freckles, which are known risk factors for melanoma.
Abstract:
Two stochastic production frontier models are formulated within the generalized production function framework popularized by Zellner and Revankar (Rev. Econ. Stud. 36 (1969) 241) and Zellner and Ryu (J. Appl. Econometrics 13 (1998) 101). This framework is convenient for parsimonious modeling of a production function with returns to scale specified as a function of output. Two alternatives for introducing the stochastic inefficiency term and the stochastic error are considered. In the first the errors are added to an equation of the form h(log y, theta) = log f (x, beta) where y denotes output, x is a vector of inputs and (theta, beta) are parameters. In the second the equation h(log y,theta) = log f(x, beta) is solved for log y to yield a solution of the form log y = g[theta, log f(x, beta)] and the errors are added to this equation. The latter alternative is novel, but it is needed to preserve the usual definition of firm efficiency. The two alternative stochastic assumptions are considered in conjunction with two returns to scale functions, making a total of four models that are considered. A Bayesian framework for estimating all four models is described. The techniques are applied to USDA state-level data on agricultural output and four inputs. Posterior distributions for all parameters, for firm efficiencies and for the efficiency rankings of firms are obtained. The sensitivity of the results to the returns to scale specification and to the stochastic specification is examined. (c) 2004 Elsevier B.V. All rights reserved.
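The two stochastic specifications described in this abstract can be written out as follows. This is only a sketch: the composed error $v_i - u_i$ (symmetric noise $v_i$ minus a non-negative inefficiency term $u_i$) is the conventional stochastic-frontier form and is an assumption here, since the abstract does not state the error structure; the firm index $i$ is likewise added for illustration.

```latex
\begin{align}
  % Specification 1: errors added to the implicit equation
  h(\log y_i, \theta) &= \log f(x_i, \beta) + v_i - u_i, \\
  % Specification 2: solve for log y first, then add the errors
  \log y_i &= g\bigl[\theta, \log f(x_i, \beta)\bigr] + v_i - u_i,
  \qquad u_i \ge 0 .
\end{align}
```

Under the second form, and with this assumed error structure, $\exp(-u_i)$ is the ratio of observed output to frontier output, which is one way to read the abstract's remark that the second alternative preserves the usual definition of firm efficiency.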
Abstract:
We examine financial constraints and forms of finance used for investment by analysing survey data on 157 large privatised companies in Hungary and Poland for the period 1998-2000. Bayesian analysis using Gibbs sampling is carried out to obtain inferences about the sample companies' access to finance from a model for categorical outcomes. By applying alternative measures of financial constraints, we find that foreign companies, companies that are part of domestic industrial groups and enterprises with concentrated ownership are all less constrained in their access to finance. Moreover, we identify alternative modes of finance, since different corporate control and past performance characteristics influence the sample firms' choice of finance source. In particular, while being industry-specific, access to domestic credit is positively associated with company size and past profitability. Industrial group members tend to favour bond issues as well as sell-offs of assets as appropriate types of finance for their investment programmes. Preferences for raising finance in the form of equity are associated with share concentration in a non-monotonic way, being most prevalent in those companies where the dominant owner holds 25%-49% of shares. Close links with a leading bank not only increase the possibility of bond issues but also appear to facilitate access to non-banking sources of funds, in particular to finance supplied by industrial partners. Finally, reliance on state finance is less likely for the companies whose profiles resemble the case of unconstrained finance, namely companies with foreign partners, companies that are part of domestic industrial groups and companies with a strategic investor. The model also implies that the use of state funds is less likely for Polish than for Hungarian companies.
Abstract:
Network data have been investigated extensively, especially weighted networks in the form of symmetric matrices with discrete count entries. Motivated by statistical inference on multi-view weighted network structure, this paper proposes a Poisson-Gamma latent factor model that not only separates view-shared and view-specific spaces but also achieves reduced dimensionality. A multiplicative gamma process shrinkage prior is implemented to avoid overparameterization, and efficient conjugate full conditional posteriors are obtained for Gibbs sampling. By accommodating both view-shared and view-specific parameters, the model adapts flexibly to the extent of similarity across view-specific spaces. Accuracy and efficiency are tested in simulation experiments. An application to real soccer network data is also presented to illustrate the model.
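To illustrate what conjugate full conditionals buy in a Gibbs sampler, here is a minimal sketch for a toy hierarchical Poisson-Gamma model. It is not the multi-view latent factor model of the abstract: the model (counts `y_i ~ Poisson(lam_i)`, rates `lam_i ~ Gamma(a, rate=b)`, hyperparameter `b ~ Gamma(c, rate=d)`), the function names and all default values are assumptions chosen for illustration.

```python
import random


def gibbs_poisson_gamma(counts, a=2.0, c=1.0, d=1.0,
                        n_iter=2000, burn_in=500, seed=0):
    """Gibbs sampler for a toy model (an assumption, not the paper's model):
    y_i ~ Poisson(lam_i), lam_i ~ Gamma(a, rate=b), b ~ Gamma(c, rate=d).
    Both full conditionals are Gamma, so each step is a direct draw."""
    rng = random.Random(seed)
    n = len(counts)
    b = 1.0  # arbitrary starting value for the shared rate
    draws = []
    for it in range(n_iter):
        # lam_i | y_i, b ~ Gamma(a + y_i, rate = b + 1).
        # random.gammavariate takes (shape, scale), hence 1 / rate.
        lam = [rng.gammavariate(a + y, 1.0 / (b + 1.0)) for y in counts]
        # b | lam ~ Gamma(c + n * a, rate = d + sum(lam)).
        b = rng.gammavariate(c + n * a, 1.0 / (d + sum(lam)))
        if it >= burn_in:
            draws.append((lam, b))
    return draws


def posterior_mean_rates(draws):
    """Average the retained draws of each lam_i."""
    n = len(draws[0][0])
    return [sum(lam[i] for lam, _ in draws) / len(draws) for i in range(n)]
```

Calling `posterior_mean_rates(gibbs_poisson_gamma([0, 1, 3, 10, 25]))` returns one shrunken rate estimate per count. Because every conditional is a standard Gamma distribution, no accept/reject step or tuning is needed, which is the practical appeal of conjugacy.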
Abstract:
The Dirichlet process mixture model (DPMM) is a ubiquitous, flexible Bayesian nonparametric statistical model. However, full probabilistic inference in this model is analytically intractable, so that computationally intensive techniques such as Gibbs sampling are required. As a result, DPMM-based methods, which have considerable potential, are restricted to applications in which computational resources and time for inference are plentiful. For example, they would not be practical for digital signal processing on embedded hardware, where computational resources are at a serious premium. Here, we develop a simplified yet statistically rigorous approximate maximum a-posteriori (MAP) inference algorithm for DPMMs. This algorithm is as simple as DP-means clustering and solves the MAP problem as well as Gibbs sampling, while requiring only a fraction of the computational effort. (For freely available code that implements the MAP-DP algorithm for Gaussian mixtures see http://www.maxlittle.net/.) Unlike related small variance asymptotics (SVA), our method is non-degenerate and so inherits the “rich get richer” property of the Dirichlet process. It also retains a non-degenerate closed-form likelihood which enables out-of-sample calculations and the use of standard tools such as cross-validation. We illustrate the benefits of our algorithm on a range of examples and contrast it with variational, SVA and sampling approaches, both from a computational complexity perspective and in terms of clustering performance. We demonstrate the wide applicability of our approach by presenting an approximate MAP inference method for the infinite hidden Markov model whose performance contrasts favorably with a recently proposed hybrid SVA approach. Similarly, we show how our algorithm can be applied to a semiparametric mixed-effects regression model where the random effects distribution is modelled using an infinite mixture model, as used in longitudinal progression modelling in population health science. Finally, we propose directions for future research on approximate MAP inference in Bayesian nonparametrics.
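For readers unfamiliar with the DP-means baseline that the abstract uses as its yardstick for simplicity, the following is a minimal sketch of that clustering rule. It is DP-means, not the MAP-DP algorithm the paper proposes; the function name `dp_means`, the penalty parameter `lam` and the iteration cap are assumptions made for illustration.

```python
def dp_means(points, lam, max_iter=100):
    """DP-means sketch: like k-means, but a point whose squared distance
    to every centroid exceeds `lam` opens a new cluster."""
    dim = len(points[0])
    # Start with a single cluster at the global mean.
    centroids = [[sum(p[j] for p in points) / len(points) for j in range(dim)]]
    labels = [0] * len(points)
    for _ in range(max_iter):
        changed = False
        for i, p in enumerate(points):
            d2 = [sum((pj - cj) ** 2 for pj, cj in zip(p, c)) for c in centroids]
            k = min(range(len(d2)), key=d2.__getitem__)
            if d2[k] > lam:
                # Too far from every existing centroid: open a new cluster here.
                centroids.append(list(p))
                k = len(centroids) - 1
            if labels[i] != k:
                labels[i] = k
                changed = True
        # Recompute centroids as cluster means, dropping empty clusters
        # and renumbering the survivors 0, 1, 2, ...
        members = {}
        for i, k in enumerate(labels):
            members.setdefault(k, []).append(points[i])
        order = sorted(members)
        remap = {old: new for new, old in enumerate(order)}
        labels = [remap[k] for k in labels]
        centroids = [
            [sum(p[j] for p in members[old]) / len(members[old]) for j in range(dim)]
            for old in order
        ]
        if not changed:
            break
    return labels, centroids
```

The number of clusters is not fixed in advance; it is controlled indirectly by `lam`, with larger values producing fewer clusters.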
Abstract:
The generalized Gibbs sampler (GGS) is a recently developed Markov chain Monte Carlo (MCMC) technique that enables Gibbs-like sampling of state spaces that lack a convenient representation in terms of a fixed coordinate system. This paper describes a new sampler, called the tree sampler, which uses the GGS to sample from a state space consisting of phylogenetic trees. The tree sampler is useful for a wide range of phylogenetic applications, including Bayesian, maximum likelihood, and maximum parsimony methods. A fast new algorithm to search for a maximum parsimony phylogeny is presented, using the tree sampler in the context of simulated annealing. The mathematics underlying the algorithm is explained and its time complexity is analyzed. The method is tested on two large data sets consisting of 123 sequences and 500 sequences, respectively. The new algorithm is shown to compare very favorably in terms of speed and accuracy to the program DNAPARS from the PHYLIP package.
Abstract:
Despite extensive efforts to confirm a direct association between Chlamydia pneumoniae and atherosclerosis, different laboratories continue to report a large variability in detection rates. In this study, we analyzed multiple sections from atherosclerotic carotid arteries from 10 endarterectomy patients to determine the location of C. pneumoniae DNA and the number of sections of the plaque required for analysis to obtain a 95% confidence of detecting the bacterium. A sensitive nested PCR assay detected C. pneumoniae DNA in all patients at one or more locations within the plaque. On average, 42% (ranging from 5 to 91%) of the sections from any single patient had C. pneumoniae DNA present. A patchy distribution of C. pneumoniae in the atherosclerotic lesions was observed, with no area of the carotid having significantly more C. pneumoniae DNA present. If a single random 30-µm-thick section was tested, there was only a 35.6 to 41.6% (95% confidence interval) chance of detecting C. pneumoniae DNA in a patient with carotid artery disease. A minimum of 15 sections would therefore be required to obtain a 95% chance of detecting all true positives. The low concentration and patchy distribution of C. pneumoniae DNA in atherosclerotic plaque appear to be among the reasons for inconsistency between laboratories in the results reported.
Abstract:
The aim of this study was to determine how abiotic factors drive the phytoplankton community in a water supply reservoir within short sampling intervals. Samples were collected at the subsurface (0.1 m) and bottom of limnetic (8 m) and littoral (2 m) zones in both the dry and rainy seasons. The following abiotic variables were analyzed: water temperature, dissolved oxygen, electrical conductivity, total dissolved solids, turbidity, pH, total nitrogen, nitrite, nitrate, total phosphorus, total dissolved phosphorus and orthophosphate. Phytoplankton biomass was determined from biovolume values. The role abiotic variables play in the dynamics of phytoplankton species was determined by means of Canonical Correspondence Analysis. Algae biomass ranged from 1.17×10⁴ to 9.21×10⁴ µg L⁻¹; cyanobacteria had biomass values ranging from 1.07×10⁴ to 8.21×10⁴ µg L⁻¹. High availability of phosphorus, nitrogen limitation, alkaline pH and thermal stability all favored cyanobacteria blooms, particularly during the dry season. Temperature, pH, total phosphorus and turbidity were key factors in characterizing the phytoplankton community between sampling times and stations. Of the species studied, Cylindrospermopsis raciborskii populations were dominant in the phytoplankton in both the dry and rainy seasons. We conclude that the phytoplankton was strongly influenced by abiotic variables, particularly in relation to seasonal distribution patterns.
Abstract:
Some factors complicate comparisons between linkage maps from different studies. This problem can be resolved if measures of precision, such as confidence intervals and frequency distributions, are associated with markers. We examined the precision of distances and ordering of microsatellite markers in the consensus linkage maps of chromosomes 1, 3 and 4 from two reciprocal F2 Brazilian chicken populations, using bootstrap sampling. Single and consensus maps were constructed. The consensus map was compared with the International Consensus Linkage Map and with the whole genome sequence. Some loci showed segregation distortion and missing data, but this did not affect the analyses negatively. Several inversions and position shifts were detected, based on 95% confidence intervals and frequency distributions of loci. Some discrepancies in distances between loci and in ordering were due to chance, whereas others could be attributed to other effects, including reciprocal crosses, sampling error of the founder animals from the two populations, F2 population structure, number of and distance between microsatellite markers, number of informative meioses, loci segregation patterns, and sex. In the Brazilian consensus GGA1, locus LEI1038 was in a position closer to the true genome sequence than in the International Consensus Map, whereas for GGA3 and GGA4, no such differences were found. Extending these analyses to the remaining chromosomes should facilitate comparisons and the integration of several available genetic maps, allowing meta-analyses for map construction and quantitative trait loci (QTL) mapping. The precision of the estimates of QTL positions and their effects would be increased with such information.
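The idea of attaching a bootstrap confidence interval to an estimate can be sketched generically as below. This is a plain percentile bootstrap over a list of numbers, not the linkage-map resampling procedure used in the study; the function name `bootstrap_ci`, its defaults and the choice of the mean as the statistic are assumptions for illustration.

```python
import random
import statistics


def bootstrap_ci(sample, stat=statistics.mean, n_boot=2000, level=0.95, seed=0):
    """Percentile bootstrap interval for `stat` of `sample`.
    Resamples with replacement n_boot times and reads off the
    empirical quantiles of the replicated statistic."""
    rng = random.Random(seed)
    n = len(sample)
    reps = sorted(
        stat([rng.choice(sample) for _ in range(n)]) for _ in range(n_boot)
    )
    alpha = (1.0 - level) / 2.0
    lo = reps[int(alpha * (n_boot - 1))]
    hi = reps[int((1.0 - alpha) * (n_boot - 1))]
    return lo, hi
```

In a mapping context the resampled units would be individuals or meioses and the statistic would be a map distance or a marker position, re-estimated on every replicate; two maps can then be compared by checking whether their intervals overlap.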
Abstract:
Background: With nearly 1,100 species, the fish family Characidae represents more than half of the species of Characiformes, and is a key component of Neotropical freshwater ecosystems. The composition, phylogeny, and classification of Characidae are currently uncertain, despite significant efforts based on analysis of morphological and molecular data. No consensus about the monophyly of this group or its position within the order Characiformes has been reached, challenged by the fact that many key studies to date have non-overlapping taxonomic representation and focus only on subsets of this diversity. Results: In the present study we propose a new definition of the family Characidae and a hypothesis of relationships for the Characiformes based on phylogenetic analysis of DNA sequences of two mitochondrial and three nuclear genes (4,680 base pairs). The sequences were obtained from 211 samples representing 166 genera distributed among all 18 recognized families in the order Characiformes, all 14 recognized subfamilies in the Characidae, plus 56 of the genera so far considered incertae sedis in the Characidae. The phylogeny obtained is robust, with most lineages significantly supported by posterior probabilities in Bayesian analysis, and high bootstrap values from maximum likelihood and parsimony analyses. Conclusion: A monophyletic assemblage strongly supported in all our phylogenetic analyses is herein defined as the Characidae and includes the characiform species lacking a supraorbital bone and with a derived position of the emergence of the hyoid artery from the anterior ceratohyal. To recognize this and several other monophyletic groups within characiforms we propose changes in the limits of several families to facilitate future studies in the Characiformes and particularly the Characidae. This work presents a new phylogenetic framework for a speciose and morphologically diverse group of freshwater fishes of significant ecological and evolutionary importance across the Neotropics and portions of Africa.
Abstract:
Pollen counts from samples taken from storage pots throughout one year (from October to September) were adjusted by Tasei's volumetric correction coefficient for the determination of pollen sources exploited by two colonies of Nannotrigona testaceicornis in Sao Paulo, Brazil. The results obtained by this sampling technique for seven months (December to June) were compared with those from corbicula load samples taken within the same period. This species visited a large variety of plant species, but few of them were frequently used. As a rule, pollen sources that appeared at frequencies greater than 1% were found with both sampling methods and significant positive correlations (Spearman correlation coefficient) were found between their values. The pollen load sample data showed that N. testaceicornis gathered pollen throughout the external activity period.
Abstract:
Consider a discrete locally finite subset Γ of R^d and the complete graph (Γ, E), with vertices Γ and edges E. We consider Gibbs measures on the set of sub-graphs with vertices Γ and edges E′ ⊂ E. The Gibbs interaction acts between open edges having a vertex in common. We study percolation properties of the Gibbs distribution of the graph ensemble. The main results concern percolation properties of the open edges in two cases: (a) when Γ is sampled from a homogeneous Poisson process; and (b) for a fixed Γ with sufficiently sparse points. (c) 2010 American Institute of Physics. [doi:10.1063/1.3514605]
Abstract:
This paper describes methods for the direct determination of Cd and Pb in hair segments (ca. 5 mm, ~80 µg) by solid sampling graphite furnace atomic absorption spectrometry, making it possible to obtain longitudinal profiles in a single strand of hair. To distinguish endogenous from exogenous content, strands of hair were washed using two different procedures: the IAEA protocol (acetone + water + acetone) and the combination of the IAEA protocol with HCl washing (acetone + water + acetone + 0.1 mol L⁻¹ HCl). The concentrations of Cd and Pb increased from the root to the tip of hair washed according to the IAEA protocol. However, when the strand of hair was washed using the combination of the IAEA protocol and 0.1 mol L⁻¹ HCl, Cd concentrations decreased in all segments, and Pb concentrations decreased drastically near the root (5 to 12 mm) and were systematically higher at the end. The proposed method proved useful for assessing temporal variation in Cd and Pb exposure and can be used for toxicological and environmental investigations. The limits of detection were 2.8 ng g⁻¹ for Cd and 40 ng g⁻¹ for Pb. The characteristic masses based on integrated absorbance were 2.4 pg for Cd and 22 pg for Pb.
Abstract:
In this work a simple and reliable method is proposed for the simultaneous determination of Cr, Fe, Ni and V in crude oil, using emulsion sampling graphite furnace atomic absorption spectrometry. Under the best conditions, sample masses around 50 mg were weighed in polypropylene tubes and emulsified in a mixture of 0.5% (v v⁻¹) hexane + 6% (m v⁻¹) Triton X-100®. Under compromise conditions, the pyrolysis and atomization temperatures for the simultaneous determination of Cr, Fe, Ni and V were 1400 °C and 2500 °C, respectively. Aliquots of 20 µL of reference solution and sample emulsion were co-injected into the graphite tube with 10 µL of 1.0 g L⁻¹ Mg(NO₃)₂ as chemical modifier. The detection limits (n = 10, 3σ) and characteristic masses were, respectively: 0.07 µg g⁻¹ and 19 pg for Cr; 2.15 µg g⁻¹ and 31 pg for Fe; 1.25 µg g⁻¹ and 44 pg for Ni; and 1.15 µg g⁻¹ and 149 pg for V. The reliability of the proposed method was checked by analysis of a fuel oil Standard Reference Material (SRM 1634c, NIST). The concentrations found showed no statistical differences from the certified values at the 95% confidence level.