15 resultados para random regression
em Duke University
Resumo:
In regression analysis of counts, a lack of simple and efficient algorithms for posterior computation has made Bayesian approaches appear unattractive and thus underdeveloped. We propose a lognormal and gamma mixed negative binomial (NB) regression model for counts, and present efficient closed-form Bayesian inference; unlike conventional Poisson models, the proposed approach has two free parameters to include two different kinds of random effects, and allows the incorporation of prior information, such as sparsity in the regression coefficients. By placing a gamma distribution prior on the NB dispersion parameter r, and connecting a log-normal distribution prior with the logit of the NB probability parameter p, efficient Gibbs sampling and variational Bayes inference are both developed. The closed-form updates are obtained by exploiting conditional conjugacy via both a compound Poisson representation and a Polya-Gamma distribution based data augmentation approach. The proposed Bayesian inference can be implemented routinely, while being easily generalizable to more complex settings involving multivariate dependence structures. The algorithms are illustrated using real examples. Copyright 2012 by the author(s)/owner(s).
Resumo:
This paper provides a root-n consistent, asymptotically normal weighted least squares estimator of the coefficients in a truncated regression model. The distribution of the errors is unknown and permits general forms of unknown heteroskedasticity. Also provided is an instrumental variables based two-stage least squares estimator for this model, which can be used when some regressors are endogenous, mismeasured, or otherwise correlated with the errors. A simulation study indicates that the new estimators perform well in finite samples. Our limiting distribution theory includes a new asymptotic trimming result addressing the boundary bias in first-stage density estimation without knowledge of the support boundary. © 2007 Cambridge University Press.
A mathematical theory of stochastic microlensing. II. Random images, shear, and the Kac-Rice formula
Resumo:
Continuing our development of a mathematical theory of stochastic microlensing, we study the random shear and expected number of random lensed images of different types. In particular, we characterize the first three leading terms in the asymptotic expression of the joint probability density function (pdf) of the random shear tensor due to point masses in the limit of an infinite number of stars. Up to this order, the pdf depends on the magnitude of the shear tensor, the optical depth, and the mean number of stars through a combination of radial position and the star's mass. As a consequence, the pdf's of the shear components are seen to converge, in the limit of an infinite number of stars, to shifted Cauchy distributions, which shows that the shear components have heavy tails in that limit. The asymptotic pdf of the shear magnitude in the limit of an infinite number of stars is also presented. All the results on the random microlensing shear are given for a general point in the lens plane. Extending to the general random distributions (not necessarily uniform) of the lenses, we employ the Kac-Rice formula and Morse theory to deduce general formulas for the expected total number of images and the expected number of saddle images. We further generalize these results by considering random sources defined on a countable compact covering of the light source plane. This is done to introduce the notion of global expected number of positive parity images due to a general lensing map. Applying the result to microlensing, we calculate the asymptotic global expected number of minimum images in the limit of an infinite number of stars, where the stars are uniformly distributed. This global expectation is bounded, while the global expected number of images and the global expected number of saddle images diverge as the order of the number of stars. © 2009 American Institute of Physics.
Resumo:
Genome rearrangement often produces chromosomes with two centromeres (dicentrics) that are inherently unstable because of bridge formation and breakage during cell division. However, mammalian dicentrics, and particularly those in humans, can be quite stable, usually because one centromere is functionally silenced. Molecular mechanisms of centromere inactivation are poorly understood since there are few systems to experimentally create dicentric human chromosomes. Here, we describe a human cell culture model that enriches for de novo dicentrics. We demonstrate that transient disruption of human telomere structure non-randomly produces dicentric fusions involving acrocentric chromosomes. The induced dicentrics vary in structure near fusion breakpoints and like naturally-occurring dicentrics, exhibit various inter-centromeric distances. Many functional dicentrics persist for months after formation. Even those with distantly spaced centromeres remain functionally dicentric for 20 cell generations. Other dicentrics within the population reflect centromere inactivation. In some cases, centromere inactivation occurs by an apparently epigenetic mechanism. In other dicentrics, the size of the alpha-satellite DNA array associated with CENP-A is reduced compared to the same array before dicentric formation. Extra-chromosomal fragments that contained CENP-A often appear in the same cells as dicentrics. Some of these fragments are derived from the same alpha-satellite DNA array as inactivated centromeres. Our results indicate that dicentric human chromosomes undergo alternative fates after formation. Many retain two active centromeres and are stable through multiple cell divisions. Others undergo centromere inactivation. This event occurs within a broad temporal window and can involve deletion of chromatin that marks the locus as a site for CENP-A maintenance/replenishment.
Child work and labour among orphaned and abandoned children in five low and middle income countries.
Resumo:
BACKGROUND: The care and protection of the estimated 143,000,000 orphaned and abandoned children (OAC) worldwide is of great importance to global policy makers and child service providers in low and middle income countries (LMICs), yet little is known about rates of child labour among OAC, what child and caregiver characteristics predict child engagement in work and labour, or when such work infers with schooling. This study examines rates and correlates of child labour among OAC and associations of child labour with schooling in a cohort of OAC in 5 LMICs. METHODS: The Positive Outcomes for Orphans (POFO) study employed a two-stage random sampling survey methodology to identify 1480 single and double orphans and children abandoned by both parents ages 6-12 living in family settings in five LMICs: Cambodia, Ethiopia, India, Kenya, and Tanzania. Regression models examined child and caregiver associations with: any work versus no work; and with working <21, 21-27, and 28+ hours during the past week, and child labour (UNICEF definition). RESULTS: The majority of OAC (60.7%) engaged in work during the past week, and of those who worked, 17.8% (10.5% of the total sample) worked 28 or more hours. More than one-fifth (21.9%; 13% of the total sample) met UNICEF's child labour definition. Female OAC and those in good health had increased odds of working. OAC living in rural areas, lower household wealth and caregivers not earning an income were associated with increased child labour. Child labour, but not working fewer than 28 hours per week, was associated with decreased school attendance. CONCLUSIONS: One in seven OAC in this study were reported to be engaged in child labour. Policy makers and social service providers need to pay close attention to the demands being placed on female OAC, particularly in rural areas and poor households with limited income sources. Programs to promote OAC school attendance may need to focus on the needs of families as well as the OAC.
Resumo:
The authors of this study evaluated a structured 10-session psychosocial support group intervention for newly HIV-diagnosed pregnant South African women. Participants were expected to display increases in HIV disclosure, self-esteem, active coping and positive social support, and decreases in depression, avoidant coping, and negative social support. Three hundred sixty-one pregnant HIV-infected women were recruited from four antenatal clinics in Tshwane townships from April 2005 to September 2006. Using a quasi-experimental design, assessments were conducted at baseline and two and eight months post-intervention. A series of random effects regression analyses were conducted, with the three assessment points treated as a random effect of time. At both follow-ups, the rate of disclosure in the intervention group was significantly higher than that of the comparison group (p<0.001). Compared to the comparison group at the first follow-up, the intervention group displayed higher levels of active coping (t=2.68, p<0.05) and lower levels of avoidant coping (t=-2.02, p<0.05), and those who attended at least half of the intervention sessions exhibited improved self-esteem (t=2.11, p<0.05). Group interventions tailored for newly HIV positive pregnant women, implemented in resource-limited settings, may accelerate the process of adjusting to one's HIV status, but may not have sustainable benefits over time.
Resumo:
BACKGROUND: More than 153 million children worldwide have been orphaned by the loss of one or both parents, and millions more have been abandoned. We investigated relationships between the health of orphaned and abandoned children (OAC) and child, caregiver, and household characteristics among randomly selected OAC in five countries. METHODOLOGY: Using a two-stage random sampling strategy in 6 study areas in Cambodia, Ethiopia, India, Kenya, and Tanzania, the Positive Outcomes for Orphans (POFO) study identified 1,480 community-living OAC ages 6 to 12. Detailed interviews were conducted with 1,305 primary caregivers at baseline and after 6 and 12 months. Multivariable logistic regression models describe associations between the characteristics of children, caregivers, and households and child health outcomes: fair or poor child health; fever, cough, or diarrhea within the past two weeks; illness in the past 6 months; and fair or poor health on at least two assessments. PRINCIPAL FINDINGS: Across the six study areas, 23% of OAC were reported to be in fair or poor health; 19%, 18%, and 2% had fever, cough, or diarrhea, respectively, within the past two weeks; 55% had illnesses within the past 6 months; and 23% were in fair or poor health on at least two assessments. Female gender, suspected HIV infection, experiences of potentially traumatic events, including the loss of both parents, urban residence, eating fewer than 3 meals per day, and low caregiver involvement were associated with poorer child health outcomes. Particularly strong associations were observed between child health measures and the health of their primary caregivers. CONCLUSIONS: Poor caregiver health is a strong signal for poor health of OAC. Strategies to support OAC should target the caregiver-child dyad. Steps to ensure food security, foster gender equality, and prevent and treat traumatic events are needed.
Resumo:
Indoor residual spraying (IRS) has become an increasingly popular method of insecticide use for malaria control, and many recent studies have reported on its effectiveness in reducing malaria burden in a single community or region. There is a need for systematic review and integration of the published literature on IRS and the contextual determining factors of its success in controlling malaria. This study reports the findings of a meta-regression analysis based on 13 published studies, which were chosen from more than 400 articles through a systematic search and selection process. The summary relative risk for reducing malaria prevalence was 0.38 (95% confidence interval = 0.31-0.46), which indicated a risk reduction of 62%. However, an excessive degree of heterogeneity was found between the studies. The meta-regression analysis indicates that IRS is more effective with high initial prevalence, multiple rounds of spraying, use of DDT, and in regions with a combination of Plasmodium falciparum and P. vivax malaria.
Resumo:
Although many feature selection methods for classification have been developed, there is a need to identify genes in high-dimensional data with censored survival outcomes. Traditional methods for gene selection in classification problems have several drawbacks. First, the majority of the gene selection approaches for classification are single-gene based. Second, many of the gene selection procedures are not embedded within the algorithm itself. The technique of random forests has been found to perform well in high-dimensional data settings with survival outcomes. It also has an embedded feature to identify variables of importance. Therefore, it is an ideal candidate for gene selection in high-dimensional data with survival outcomes. In this paper, we develop a novel method based on the random forests to identify a set of prognostic genes. We compare our method with several machine learning methods and various node split criteria using several real data sets. Our method performed well in both simulations and real data analysis.Additionally, we have shown the advantages of our approach over single-gene-based approaches. Our method incorporates multivariate correlations in microarray data for survival outcomes. The described method allows us to better utilize the information available from microarray data with survival outcomes.
Resumo:
© 2015 IOP Publishing Ltd & London Mathematical Society.This is a detailed analysis of invariant measures for one-dimensional dynamical systems with random switching. In particular, we prove the smoothness of the invariant densities away from critical points and describe the asymptotics of the invariant densities at critical points.
Resumo:
© 2015 Society for Industrial and Applied Mathematics.We consider parabolic PDEs with randomly switching boundary conditions. In order to analyze these random PDEs, we consider more general stochastic hybrid systems and prove convergence to, and properties of, a stationary distribution. Applying these general results to the heat equation with randomly switching boundary conditions, we find explicit formulae for various statistics of the solution and obtain almost sure results about its regularity and structure. These results are of particular interest for biological applications as well as for their significant departure from behavior seen in PDEs forced by disparate Gaussian noise. Our general results also have applications to other types of stochastic hybrid systems, such as ODEs with randomly switching right-hand sides.
Resumo:
Estimation of the skeleton of a directed acyclic graph (DAG) is of great importance for understanding the underlying DAG and causal effects can be assessed from the skeleton when the DAG is not identifiable. We propose a novel method named PenPC to estimate the skeleton of a high-dimensional DAG by a two-step approach. We first estimate the nonzero entries of a concentration matrix using penalized regression, and then fix the difference between the concentration matrix and the skeleton by evaluating a set of conditional independence hypotheses. For high-dimensional problems where the number of vertices p is in polynomial or exponential scale of sample size n, we study the asymptotic property of PenPC on two types of graphs: traditional random graphs where all the vertices have the same expected number of neighbors, and scale-free graphs where a few vertices may have a large number of neighbors. As illustrated by extensive simulations and applications on gene expression data of cancer patients, PenPC has higher sensitivity and specificity than the state-of-the-art method, the PC-stable algorithm.