3 results for "Multiple scales method"
at Duke University
Abstract:
While the Stokes-Einstein (SE) equation predicts that the diffusion coefficient of a solute will be inversely proportional to the viscosity of the solvent, this relation is commonly known to fail for solutes that are the same size as or smaller than the solvent. Multiple researchers have reported that for small solutes the diffusion coefficient is inversely proportional to the viscosity raised to a fractional power, and that such solutes actually diffuse faster than SE predicts. In other solvent systems, attractive solute-solvent interactions, such as hydrogen bonding, are known to retard the diffusion of a solute. Some researchers have interpreted the slower diffusion due to hydrogen bonding as the effective diffusion of a larger complex of solute and solvent molecules. We have developed and used a novel micropipette technique that can form and hold a single microdroplet of water while it dissolves, in a diffusion-controlled environment, into the solvent. This method has been used to examine the diffusion of water in both n-alkanes and n-alcohols. We found that the polar solute water, diffusing in a solvent with which it cannot hydrogen bond, closely resembles small nonpolar solutes such as xenon and krypton diffusing in n-alkanes, with diffusion coefficients ranging from 12.5×10⁻⁵ cm²/s for water in n-pentane to 1.15×10⁻⁵ cm²/s for water in n-hexadecane. Diffusion coefficients were inversely proportional to viscosity raised to a fractional power, and were larger than SE predicts. For water diffusing in a solvent with which it can hydrogen bond (the n-alcohols), diffusion coefficients ranged from 1.75×10⁻⁵ cm²/s in methanol to 0.364×10⁻⁵ cm²/s in n-octanol, and diffusion was slower than in an n-alkane of corresponding viscosity. We find no evidence for solute-solvent complex diffusion. Rather, it is possible that the small solute water is retarded by relatively longer residence times (compared with non-hydrogen-bonding solvents) as it moves through the liquid.
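For concreteness, the SE relation D = k_B·T/(6πηr) and the fractional-power alternative D ∝ η⁻ᵖ can be compared directly against the two endpoint values reported above. The sketch below assumes approximate room-temperature viscosities for n-pentane and n-hexadecane and a hydrodynamic radius for water; these are illustrative assumptions, not values taken from the study.

```python
# A minimal sketch of the relations discussed above, not the authors' analysis.
import numpy as np

k_B = 1.380649e-23   # Boltzmann constant, J/K
T = 298.15           # temperature, K (assumed)

def stokes_einstein(eta_pa_s, radius_m):
    """Stokes-Einstein prediction D = k_B*T / (6*pi*eta*r), in m^2/s."""
    return k_B * T / (6 * np.pi * eta_pa_s * radius_m)

# Reported diffusion coefficients of water (cm^2/s) and assumed solvent
# viscosities (cP); 1 cP = 1e-3 Pa*s, 1 m^2/s = 1e4 cm^2/s.
eta_cp = np.array([0.22, 3.0])          # n-pentane, n-hexadecane (assumed values)
D_meas = np.array([12.5e-5, 1.15e-5])   # from the abstract

# Fractional-power law D = A * eta**(-p): fit p from the two endpoints.
slope = np.polyfit(np.log(eta_cp), np.log(D_meas), 1)[0]
print(f"fitted exponent p ~ {-slope:.2f}  (p = 1 would be Stokes-Einstein)")

# SE prediction for water (hydrodynamic radius ~1.4 Angstrom, assumed)
r_water = 1.4e-10
for eta, D in zip(eta_cp, D_meas):
    D_se = stokes_einstein(eta * 1e-3, r_water) * 1e4  # convert to cm^2/s
    print(f"eta = {eta:4.2f} cP: measured {D:.2e}, SE predicts {D_se:.2e} cm^2/s")
```

With these assumed viscosities, both measured values come out larger than the SE prediction, consistent with the abstract's claim that the small solute diffuses faster than SE predicts.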
Abstract:
BACKGROUND: In a time-course microarray experiment, the expression level for each gene is observed across a number of time-points in order to characterize the temporal trajectories of the gene-expression profiles. For many of these experiments, the scientific aim is the identification of genes for which the trajectories depend on an experimental or phenotypic factor. There is an extensive recent body of literature on statistical methodology for addressing this analytical problem. Most of the existing methods are based on estimating the time-course trajectories using parametric or non-parametric mean regression methods. The sensitivity of these regression methods to outliers, an issue that is well documented in the statistical literature, should be of concern when analyzing microarray data. RESULTS: In this paper, we propose a robust testing method for identifying genes whose expression time profiles depend on a factor. Furthermore, we propose a multiple testing procedure to adjust for multiplicity. CONCLUSIONS: Through an extensive simulation study, we illustrate the performance of our method. Finally, we report results from applying our method to a case study and discuss potential extensions.
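As a generic illustration of this analytical problem, and emphatically not the authors' proposed method, one could test each gene with an outlier-resistant rank statistic on a trajectory summary and then control the false discovery rate with Benjamini-Hochberg. The sketch below simulates heavy-tailed expression data to show that workflow; all names and parameters are hypothetical.

```python
# Generic per-gene rank test on a trajectory summary, then BH adjustment.
import numpy as np
from scipy.stats import mannwhitneyu

rng = np.random.default_rng(0)
n_genes, n_reps, n_times = 200, 5, 8
group = np.repeat([0, 1], n_reps)                       # two-level factor

# Simulated expression: genes x samples x time-points, heavy-tailed noise
expr = rng.standard_t(df=3, size=(n_genes, 2 * n_reps, n_times))
expr[:20, group == 1, :] += np.linspace(0, 2, n_times)  # 20 genes with a factor effect

pvals = np.empty(n_genes)
for g in range(n_genes):
    # Summarize each sample's trajectory by its slope over time, then
    # compare the two factor levels with a rank (outlier-resistant) test.
    slopes = np.polyfit(np.arange(n_times), expr[g].T, 1)[0]
    pvals[g] = mannwhitneyu(slopes[group == 0], slopes[group == 1],
                            alternative="two-sided").pvalue

# Benjamini-Hochberg adjustment for multiplicity
order = np.argsort(pvals)
ranked = pvals[order] * n_genes / np.arange(1, n_genes + 1)
adj = np.minimum.accumulate(ranked[::-1])[::-1]
print("genes flagged at FDR 0.05:", np.sum(adj <= 0.05))
```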
Abstract:
Surveys can collect important data that inform policy decisions and drive social science research. Large government surveys collect information from the U.S. population on a wide range of topics, including demographics, education, employment, and lifestyle. Analysis of survey data presents unique challenges; in particular, one needs to account for missing data, complex sampling designs, and measurement error. Conceptually, a survey organization could devote substantial resources to obtaining high-quality responses from a simple random sample, resulting in survey data that are easy to analyze. However, this scenario is often unrealistic. To address these practical issues, survey organizations can leverage information available from other sources of data. For example, in longitudinal studies that suffer from attrition, they can use information from refreshment samples to correct for potential attrition bias. They can use information from known marginal distributions or from the survey design to improve inferences. They can use information from gold standard sources to correct for measurement error.
This thesis presents novel approaches to combining information from multiple sources that address the three problems described above.
The first method addresses nonignorable unit nonresponse and attrition in a panel survey with a refreshment sample. Panel surveys typically suffer from attrition, which can lead to biased inference when the analysis is based only on cases that complete all waves of the panel. Unfortunately, the panel data alone cannot inform the extent of the bias due to attrition, so analysts must make strong and untestable assumptions about the missing data mechanism. Many panel studies also include refreshment samples, which are data collected from a random sample of new individuals during some later wave of the panel. Refreshment samples offer information that can be used to correct for biases induced by nonignorable attrition while reducing reliance on strong assumptions about the attrition process. To date, these bias correction methods have not dealt with two key practical issues in panel studies: unit nonresponse in the initial wave of the panel and in the refreshment sample itself. As we illustrate, nonignorable unit nonresponse can significantly compromise the analyst's ability to use the refreshment samples for attrition bias correction. Thus, it is crucial for analysts to assess how sensitive their attrition-corrected inferences are to different assumptions about the nature of the unit nonresponse. We present an approach that facilitates such sensitivity analyses, both for suspected nonignorable unit nonresponse in the initial wave and in the refreshment sample. We illustrate the approach using simulation studies and an analysis of data from the 2007-2008 Associated Press/Yahoo News election panel study.
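A toy simulation, not the model developed in the thesis, shows why a refreshment sample is informative about attrition bias: when dropout depends on the wave-2 outcome itself, the completers' mean is biased, while a fresh random sample drawn at wave 2 recovers the population quantity. All distributions and parameters below are invented for illustration.

```python
# Nonignorable attrition: dropout probability depends on the wave-2 outcome.
import numpy as np

rng = np.random.default_rng(1)
n = 100_000
y2 = rng.normal(0.0, 1.0, n)                     # wave-2 outcome in the panel
p_stay = 1 / (1 + np.exp(-(0.5 + 1.0 * y2)))     # higher y2 -> more likely to stay
stay = rng.random(n) < p_stay

refresh = rng.normal(0.0, 1.0, 5_000)            # refreshment sample at wave 2

print("true wave-2 mean:         0.000")
print(f"panel completers' mean:  {y2[stay].mean():+.3f}  (attrition-biased)")
print(f"refreshment-sample mean: {refresh.mean():+.3f}  (unbiased)")
```

The gap between the completers' mean and the refreshment-sample mean is exactly the kind of information the bias correction methods exploit; it vanishes only when attrition is ignorable.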
The second method incorporates informative prior beliefs about marginal probabilities into Bayesian latent class models for categorical data. The basic idea is to append synthetic observations to the original data such that (i) the empirical distributions of the desired margins match those of the prior beliefs, and (ii) the values of the remaining variables are left missing. The degree of prior uncertainty is controlled by the number of augmented records. Posterior inferences can be obtained via typical MCMC algorithms for latent class models, tailored to deal efficiently with the missing values in the concatenated data.
We illustrate the approach using a variety of simulations based on data from the American Community Survey, including an example of how augmented records can be used to fit latent class models to data from stratified samples.
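A minimal sketch of the data-augmentation step, using hypothetical binary variables X1 through X3 and an assumed prior margin, illustrates how the augmented records would be constructed; the variable names, prior value, and record count are all placeholders.

```python
# Append synthetic records whose X1 margin matches a prior belief about
# Pr(X1 = 1), leaving the remaining variables missing. Larger m expresses
# stronger prior certainty about the margin.
import numpy as np
import pandas as pd

rng = np.random.default_rng(2)
data = pd.DataFrame(rng.integers(0, 2, size=(1000, 3)),
                    columns=["X1", "X2", "X3"])   # original categorical data

prior_p_x1 = 0.60   # assumed prior belief about the marginal Pr(X1 = 1)
m = 500             # number of augmented records controls the prior weight

n_ones = round(m * prior_p_x1)
aug = pd.DataFrame({
    "X1": np.repeat([1, 0], [n_ones, m - n_ones]),  # margin matches the prior exactly
    "X2": np.nan,                                   # remaining variables left missing;
    "X3": np.nan,                                   # the latent class MCMC imputes them
})
combined = pd.concat([data, aug], ignore_index=True)
print("augmented Pr(X1 = 1):", aug["X1"].mean())
print(combined.tail(3))
```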
The third method leverages information from a gold standard survey to model reporting error. Survey data are subject to reporting error when respondents misunderstand a question or accidentally select the wrong response. Sometimes respondents knowingly select the wrong response, for example, by reporting a higher level of education than they have actually attained. We present an approach that allows analysts to model reporting error by incorporating information from a gold standard survey. Analysts can specify various reporting error models and assess how sensitive their conclusions are to different assumptions about the reporting error process. We illustrate the approach using simulations based on data from the 1993 National Survey of College Graduates. We then use the method to impute error-corrected educational attainments in the 2010 American Community Survey, using the 2010 National Survey of College Graduates as the gold standard survey.
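One simple reporting error model, chosen here for illustration rather than taken from the thesis, is an "over-report by one category" mechanism; a gold standard source that records true attainment is what identifies the misreporting rate. The category labels and error probability below are assumptions.

```python
# Toy reporting error model: with probability eps, a respondent reports one
# education category higher than the truth (capped at the top category).
import numpy as np

rng = np.random.default_rng(3)
levels = 4                                  # e.g., <HS, HS, BA, graduate (assumed)
E_true = rng.integers(0, levels, 50_000)    # true attainment in the large survey

eps = 0.10                                  # assumed over-reporting probability
inflate = (rng.random(E_true.size) < eps) & (E_true < levels - 1)
E_reported = E_true + inflate               # reported attainment with error

# A gold standard survey observing true attainment would let an analyst
# estimate eps; here we simply compare reported margins against the truth.
for k in range(levels):
    print(f"level {k}: true share {np.mean(E_true == k):.3f}, "
          f"reported share {np.mean(E_reported == k):.3f}")
```

Varying eps (or the direction of misreporting) and re-imputing corrected values is one way to carry out the kind of sensitivity analysis the abstract describes.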