10 resultados para Asymptotic Variance of Estimate
em DigitalCommons@The Texas Medical Center
Resumo:
Although many family-based genetic studies have collected dietary data, very few have used the dietary information in published findings. No single solution has been presented or discussed in the literature to deal with the problem of using factor analyses for the analyses of dietary data from several related individuals from a given household. The standard statistical approach of factor analysis cannot be applied to the VIVA LA FAMILIA Study diet data to ascertain dietary patterns since this population consists of three children from each family, thus the dietary patterns of the related children may be correlated and non-independent. Addressing this problem in this project will enable us to describe the dietary patterns in Hispanic families and to explore the relationships between dietary patterns and childhood obesity. ^ In the VIVA LA FAMILIA Study, an overweight child was first identified and then his/her siblings and parents were brought in for data collection which included 24 hour recalls and food frequency questionnaire (FFQ). Dietary intake data were collected using FFQ and 24 hour recalls on 1030 Hispanic children from 319 families. ^ The design of the VIVA LA FAMILIA Study has important and unique statistical considerations since its participants are related to each other, the majority form distinct nuclear families. Thus, the standard approach of factor analysis cannot be applied to these diet data to ascertain dietary patterns. In this project we propose to investigate whether the determinants of the correlation matrix of each family unit will allow us to adjust the original correlation matrix of the dietary intake data prior to ascertaining dietary intake patterns. If these methods are appropriate, then in the future the dietary patterns among related individuals could be assessed by standard orthogonal principal component factor analysis.^
Resumo:
Calcium levels in spines play a significant role in determining the sign and magnitude of synaptic plasticity. The magnitude of calcium influx into spines is highly dependent on influx through N-methyl D-aspartate (NMDA) receptors, and therefore depends on the number of postsynaptic NMDA receptors in each spine. We have calculated previously how the number of postsynaptic NMDA receptors determines the mean and variance of calcium transients in the postsynaptic density, and how this alters the shape of plasticity curves. However, the number of postsynaptic NMDA receptors in the postsynaptic density is not well known. Anatomical methods for estimating the number of NMDA receptors produce estimates that are very different than those produced by physiological techniques. The physiological techniques are based on the statistics of synaptic transmission and it is difficult to experimentally estimate their precision. In this paper we use stochastic simulations in order to test the validity of a physiological estimation technique based on failure analysis. We find that the method is likely to underestimate the number of postsynaptic NMDA receptors, explain the source of the error, and re-derive a more precise estimation technique. We also show that the original failure analysis as well as our improved formulas are not robust to small estimation errors in key parameters.
Resumo:
The distribution of the number of heterozygous loci in two randomly chosen gametes or in a random diploid zygote provides information regarding the nonrandom association of alleles among different genetic loci. Two alternative statistics may be employed for detection of nonrandom association of genes of different loci when observations are made on these distributions: observed variance of the number of heterozygous loci (s2k) and a goodness-of-fit criterion (X2) to contrast the observed distribution with that expected under the hypothesis of random association of genes. It is shown, by simulation, that s2k is statistically more efficient than X2 to detect a given extent of nonrandom association. Asymptotic normality of s2k is justified, and X2 is shown to follow a chi-square (chi 2) distribution with partial loss of degrees of freedom arising because of estimation of parameters from the marginal gene frequency data. Whenever direct evaluations of linkage disequilibrium values are possible, tests based on maximum likelihood estimators of linkage disequilibria require a smaller sample size (number of zygotes or gametes) to detect a given level of nonrandom association in comparison with that required if such tests are conducted on the basis of s2k. Summarization of multilocus genotype (or haplotype) data, into the different number of heterozygous loci classes, thus, amounts to appreciable loss of information.
Resumo:
Coronary heart disease remains the leading cause of death in the United States and increased blood cholesterol level has been found to be a major risk factor with roots in childhood. Tracking of cholesterol, i.e., the tendency to maintain a particular cholesterol level relative to the rest of the population, and variability in blood lipid levels with increase in age have implications for cholesterol screening and assessment of lipid levels in children for possible prevention of further rise to prevent adulthood heart disease. In this study the pattern of change in plasma lipids, over time, and their tracking were investigated. Also, within-person variance and retest reliability defined as the square root of within-person variance for plasma total cholesterol, HDL-cholesterol, LDL-cholesterol, and triglycerides and their relation to age, sex and body mass index among participants from age 8 to 18 years were investigated. ^ In Project HeartBeat!, 678 healthy children aged 8, 11 and 14 years at baseline were enrolled and examined at 4-monthly intervals for up to 4 years. We examined the relationship between repeated observations by Pearson's correlations. Age- and sex-specific quintiles were calculated and the probability of participants to remain in the uppermost quintile of their respective distribution was evaluated with life table methods. Plasma total cholesterol, HDL-C and LDL-C at baseline were strongly and significantly correlated with measurements at subsequent visits across the sex and age groups. Plasma triglyceride at baseline was also significantly correlated with subsequent measurements but less strongly than was the case for other plasma lipids. The probability to remain in the upper quintile was also high (60 to 70%) for plasma total cholesterol, HDL-C and LDL-C. ^ We used a mixed longitudinal, or synthetic cohort design with continuous observations from age 8 to 18 years to estimate within person variance of plasma total cholesterol, HDL-C, LDL-C and triglycerides. A total of 5809 measurements were available for both cholesterol and triglycerides. A multilevel linear model was used. Within-person variance among repeated measures over up to four years of follow-up was estimated for total cholesterol, HDL-C, LDL-C and triglycerides separately. The relationship of within-person and inter-individual variance with age, sex, and body mass index was evaluated. Likelihood ratio tests were conducted by calculating the deviation of −2log (likelihood) within the basic model and alternative models. The square root of within-person variance provided the retest reliability (within person standard deviation) for plasma total cholesterol, HDL-C, LDL-C and triglycerides. We found 13.6 percent retest reliability for plasma cholesterol, 6.1 percent for HDL-cholesterol, 11.9 percent for LDL-cholesterol and 32.4 percent for triglycerides. Retest reliability of plasma lipids was significantly related with age and body mass index. It increased with increase in body mass index and age. These findings have implications for screening guidelines, as participants in the uppermost quintile tended to maintain their status in each of the age groups during a four-year follow-up. The magnitude of within-person variability of plasma lipids influences the ability to classify children into risk categories recommended by the National Cholesterol Education Program. ^
Resumo:
Any functionally important mutation is embedded in an evolutionary matrix of other mutations. Cladistic analysis, based on this, is a method of investigating gene effects using a haplotype phylogeny to define a set of tests which localize causal mutations to branches of the phylogeny. Previous implementations of cladistic analysis have not addressed the issue of analyzing data from related individuals, though in human studies, family data are usually needed to obtain unambiguous haplotypes. In this study, a method of cladistic analysis is described in which haplotype effects are parameterized in a linear model which accounts for familial correlations. The method was used to study the effect of apolipoprotein (Apo) B gene variation on total-, LDL-, and HDL-cholesterol, triglyceride, and Apo B levels in 121 French families. Five polymorphisms defined Apo B haplotypes: the signal peptide Insertion/deletion, Bsp 1286I, XbaI, MspI, and EcoRI. Eleven haplotypes were found, and a haplotype phylogeny was constructed and used to define a set of tests of haplotype effects on lipid and apo B levels.^ This new method of cladistic analysis, the parametric method, found significant effects for single haplotypes for all variables. For HDL-cholesterol, 3 clusters of evolutionarily-related haplotypes affecting levels were found. Haplotype effects accounted for about 10% of the genetic variance of triglyceride and HDL-cholesterol levels. The results of the parametric method were compared to those of a method of cladistic analysis based on permutational testing. The permutational method detected fewer haplotype effects, even when modified to account for correlations within families. Simulation studies exploring these differences found evidence of systematic errors in the permutational method due to the process by which haplotype groups were selected for testing.^ The applicability of cladistic analysis to human data was shown. The parametric method is suggested as an improvement over the permutational method. This study has identified candidate haplotypes for sequence comparisons in order to locate the functional mutations in the Apo B gene which may influence plasma lipid levels. ^
Resumo:
The persistence of low birth weight and intrauterine growth retardation (IUGR) in the United States has puzzled researchers for decades. Much of the work that has been conducted on adverse birth outcomes has focused on low birth weight in general and not on IUGR. Studies that have examined IUGR specifically thus far have focused primarily on individual-level maternal risk factors. These risk factors have only been able to explain a small portion of the variance in IUGR. Therefore, recent work has begun to focus on community-level risk factors in addition to the individual-level maternal characteristics. This study uses Social Ecology to examine the relationship of individual and community-level risk factors and IUGR. Logistic regression was used to establish an individual-level model based on 155, 856 births recorded in Harris County, TX during 1999-2001. IUGR was characterized using a fetal growth ratio method with race/ethnic and sex specific mean birth weights calculated from national vital records. The spatial distributions of 114,460 birth records spatially located within the City of Houston were examined using choropleth, probability and density maps. Census tracts with higher than expected rates of IUGR and high levels of neighborhood disadvantage were highlighted. Neighborhood disadvantage was constructed using socioeconomic variables from the 2000 U.S. Census. Factor analysis was used to create a unified single measure. Lastly, a random coefficients model was used to examine the relationship between varying levels of community disadvantage, given the set of individual-level risk factors for 152,997 birth records spatially located within Harris County, TX. Neighborhood disadvantage was measured using three different indices adapted from previous work. The findings show that pregnancy-induced hypertension, previous preterm infant, tobacco use and insufficient weight gain have the highest association with IUGR. Neighborhood disadvantage only slightly further increases the risk of IUGR (OR 1.12 to 1.23). Although community level disadvantage only helped to explain a small proportion of the variance of IUGR, it did have a significant impact. This finding suggests that community level risk factors should be included in future work with IUGR and that more work needs to be conducted. ^
Resumo:
With the recognition of the importance of evidence-based medicine, there is an emerging need for methods to systematically synthesize available data. Specifically, methods to provide accurate estimates of test characteristics for diagnostic tests are needed to help physicians make better clinical decisions. To provide more flexible approaches for meta-analysis of diagnostic tests, we developed three Bayesian generalized linear models. Two of these models, a bivariate normal and a binomial model, analyzed pairs of sensitivity and specificity values while incorporating the correlation between these two outcome variables. Noninformative independent uniform priors were used for the variance of sensitivity, specificity and correlation. We also applied an inverse Wishart prior to check the sensitivity of the results. The third model was a multinomial model where the test results were modeled as multinomial random variables. All three models can include specific imaging techniques as covariates in order to compare performance. Vague normal priors were assigned to the coefficients of the covariates. The computations were carried out using the 'Bayesian inference using Gibbs sampling' implementation of Markov chain Monte Carlo techniques. We investigated the properties of the three proposed models through extensive simulation studies. We also applied these models to a previously published meta-analysis dataset on cervical cancer as well as to an unpublished melanoma dataset. In general, our findings show that the point estimates of sensitivity and specificity were consistent among Bayesian and frequentist bivariate normal and binomial models. However, in the simulation studies, the estimates of the correlation coefficient from Bayesian bivariate models are not as good as those obtained from frequentist estimation regardless of which prior distribution was used for the covariance matrix. The Bayesian multinomial model consistently underestimated the sensitivity and specificity regardless of the sample size and correlation coefficient. In conclusion, the Bayesian bivariate binomial model provides the most flexible framework for future applications because of its following strengths: (1) it facilitates direct comparison between different tests; (2) it captures the variability in both sensitivity and specificity simultaneously as well as the intercorrelation between the two; and (3) it can be directly applied to sparse data without ad hoc correction. ^
Resumo:
The nonresidual concentrations of five trace metals were determined for 322 sediments that were the product of a systematic sampling program of the entire Galveston Bay system. The nonresidual component of the trace metal concentration (e.g. that fraction of the metals that can be relatively easily removed from the sediments without complete destruction of the sediment particle) was considered to be more indicative of the anthropogenic metal pollution that has impacted the Galveston Bay ecosystem.^ For spatial analysis of the metal concentrations, the Galveston Bay system was divided into nine bay-areas, based on easily definable geological and geographical characteristics. Isopleth mapping analyses of these metal concentrations indicated a direct relationship with the $<$63$\mu$m fraction of the sediment (%FINE) in all of the bay areas. Covariate regression analyses indicated that position of the sediment within the Galveston Bay system (e.g. bay-area) was a better predictor of metal concentration than %FINE. Analysis of variance of the metals versus the bay-areas indicated that the five metals maintained a relatively constant order and magnitude of concentration for all the bay-areas.^ The major shipping channels of the Galveston Bay system, with their associated vessels and transported materials, are a likely source of metal pollution. However, these channels were not depositional corridors of high metal concentration. All metal concentration highs were found to be located away from the channels and associated with %FINE highs in the deeper portions of the bay-areas.^ Disturbance of the sediments, by the proposed widening and deepening of these channels, is not predicted to remobilize the trace metals. A more likely adverse effect on the health of the Galveston Bay ecosystem would come from the increase in turbidity of the water due to the dredging and in an extension of the salt water wedge farther north into the bay system. ^
Resumo:
In geographical epidemiology, maps of disease rates and disease risk provide a spatial perspective for researching disease etiology. For rare diseases or when the population base is small, the rate and risk estimates may be unstable. Empirical Bayesian (EB) methods have been used to spatially smooth the estimates by permitting an area estimate to "borrow strength" from its neighbors. Such EB methods include the use of a Gamma model, of a James-Stein estimator, and of a conditional autoregressive (CAR) process. A fully Bayesian analysis of the CAR process is proposed. One advantage of this fully Bayesian analysis is that it can be implemented simply by using repeated sampling from the posterior densities. Use of a Markov chain Monte Carlo technique such as Gibbs sampler was not necessary. Direct resampling from the posterior densities provides exact small sample inferences instead of the approximate asymptotic analyses of maximum likelihood methods (Clayton & Kaldor, 1987). Further, the proposed CAR model provides for covariates to be included in the model. A simulation demonstrates the effect of sample size on the fully Bayesian analysis of the CAR process. The methods are applied to lip cancer data from Scotland, and the results are compared. ^
Resumo:
InGen of Creative Production in the Health Sciences is a compendium of innovative thinking exercises for individuals and groups, derived from an eclectic array of practical guides for professionals in a variety of fields. Segmented into five subcategories across twenty two chapters, the effort seeks to make techniques for increasing innovative problem solving more accessible to a diverse audience of problem solvers. The chapters of Roberta Ness. Innovation Generation (2012, Oxford University Press) provide the themes for each of the chapters in the workbook. It is intended that those who read Ness. Innovation Generation will benefit from practicing the constructs of innovative thinking exemplified in each exercise.^ The methods used to gather data, in this case mostly innovative thinking exercises, included literature reviews of existing innovative thinking tools, classroom materials, and theory-driven exploration of exercises to fill in gaps in extant materials. Specifically, Google.com and Amazon.com searches were conducted using the terms “innovation,” “innovative,” “innovator,” “creative,” “novelty,” “thinking,” together with some variance of “book,” “workbook,” and “exercise.” The results were sorted thematically to show correspondence with the themes in Ness (2012) and compared to suggested best practices of 50 years of scientific research on innovative thinking. Where themes were suggested by Ness (2012) and peer-reviewed research on innovation but unavailable in published innovation thinking workbooks, new exercises were developed. The five type subcategories into which these results were organized are: individual direct, individual indirect, group direct, group indirect and probing question. It is anticipated that the five type subcategories and spectrum of themes will equip problem solvers in a variety of capacities.^