14 resultados para Illusory Biases

em DigitalCommons@The Texas Medical Center


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Treatment retention is of paramount importance in cocaine treatment research as treatment completion rates are often 50% or less. Failure to retain cocaine patients in treatment has both significant research and clinical implications. In this paper we qualitatively and quantitatively demonstrate the inconsistency found across analyses of retention predictors in order to highlight the problem. First, a qualitative review of the published literature was undertaken to identify the frequency of predictors studied and their relations to treatment retention. Second, an empirical demonstration of predictor stability was conducted by testing a common set of variables across three similar 12-week cocaine clinical trials conducted by the same investigators in the same research clinic within a five-year period. Results of the literature review indicated inconsistently selected variables of convenience, widely varying statistical procedures, and discrepant findings of significance. Further, quantitative analyses resulted in discrepancies in variables identified as significant predictors of retention among the three studies. Potential sources of heterogeneity affecting the consistency of findings across studies and recommendations to improve the validity and generalizability of predictor findings in future studies are proposed.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The interpretation of data on genetic variation with regard to the relative roles of different evolutionary factors that produce and maintain genetic variation depends critically on our assumptions concerning effective population size and the level of migration between neighboring populations. In humans, recent population growth and movements of specific ethnic groups across wide geographic areas mean that any theory based on assumptions of constant population size and absence of substructure is generally untenable. We examine the effects of population subdivision on the pattern of protein genetic variation in a total sample drawn from an artificial agglomerate of 12 tribal populations of Central and South America, analyzing the pooled sample as though it were a single population. Several striking findings emerge. (1) Mean heterozygosity is not sensitive to agglomeration, but the number of different alleles (allele count) is inflated, relative to neutral mutation/drift/equilibrium expectation. (2) The inflation is most serious for rare alleles, especially those which originally occurred as tribally restricted "private" polymorphisms. (3) The degree of inflation is an increasing function of both the number of populations encompassed by the sample and of the genetic divergence among them. (4) Treating an agglomerated population as though it were a panmictic unit of long standing can lead to serious biases in estimates of mutation rates, selection pressures, and effective population sizes. Current DNA studies indicate the presence of numerous genetic variants in human populations. The findings and conclusions of this paper are all fully applicable to the study of genetic variation at the DNA level as well.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Small bistratified cells (SBCs) in the primate retina carry a major blue-yellow opponent signal to the brain. We found that SBCs also carry signals from rod photoreceptors, with the same sign as S cone input. SBCs exhibited robust responses under low scotopic conditions. Physiological and anatomical experiments indicated that this rod input arose from the AII amacrine cell-mediated rod pathway. Rod and cone signals were both present in SBCs at mesopic light levels. These findings have three implications. First, more retinal circuits may multiplex rod and cone signals than were previously thought to, efficiently exploiting the limited number of optic nerve fibers. Second, signals from AII amacrine cells may diverge to most or all of the approximately 20 retinal ganglion cell types in the peripheral primate retina. Third, rod input to SBCs may be the substrate for behavioral biases toward perception of blue at mesopic light levels.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The place-specific activity of hippocampal cells provides downstream structures with information regarding an animal's position within an environment and, perhaps, the location of goals within that environment. In rodents, recent research has suggested that distal cues primarily set the orientation of the spatial representation, whereas the boundaries of the behavioral apparatus determine the locations of place activity. The current study was designed to address possible biases in some previous research that may have minimized the likelihood of observing place activity bound to distal cues. Hippocampal single-unit activity was recorded from six freely moving rats as they were trained to perform a tone-initiated place-preference task on an open-field platform. To investigate whether place activity was bound to the room- or platform-based coordinate frame (or both), the platform was translated within the room at an "early" and at a "late" phase of task acquisition (Shift 1 and Shift 2). At both time points, CA1 and CA3 place cells demonstrated room-associated and/or platform-associated activity, or remapped in response to the platform shift. Shift 1 revealed place activity that reflected an interaction between a dominant platform-based (proximal) coordinate frame and a weaker room-based (distal) frame because many CA1 and CA3 place fields shifted to a location intermediate to the two reference frames. Shift 2 resulted in place activity that became more strongly bound to either the platform- or room-based coordinate frame, suggesting the emergence of two independent spatial frames of reference (with many more cells participating in platform-based than in room-based representations).

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Models of DNA sequence evolution and methods for estimating evolutionary distances are needed for studying the rate and pattern of molecular evolution and for inferring the evolutionary relationships of organisms or genes. In this dissertation, several new models and methods are developed.^ The rate variation among nucleotide sites: To obtain unbiased estimates of evolutionary distances, the rate heterogeneity among nucleotide sites of a gene should be considered. Commonly, it is assumed that the substitution rate varies among sites according to a gamma distribution (gamma model) or, more generally, an invariant+gamma model which includes some invariable sites. A maximum likelihood (ML) approach was developed for estimating the shape parameter of the gamma distribution $(\alpha)$ and/or the proportion of invariable sites $(\theta).$ Computer simulation showed that (1) under the gamma model, $\alpha$ can be well estimated from 3 or 4 sequences if the sequence length is long; and (2) the distance estimate is unbiased and robust against violations of the assumptions of the invariant+gamma model.^ However, this ML method requires a huge amount of computational time and is useful only for less than 6 sequences. Therefore, I developed a fast method for estimating $\alpha,$ which is easy to implement and requires no knowledge of tree. A computer program was developed for estimating $\alpha$ and evolutionary distances, which can handle the number of sequences as large as 30.^ Evolutionary distances under the stationary, time-reversible (SR) model: The SR model is a general model of nucleotide substitution, which assumes (i) stationary nucleotide frequencies and (ii) time-reversibility. It can be extended to SRV model which allows rate variation among sites. I developed a method for estimating the distance under the SR or SRV model, as well as the variance-covariance matrix of distances. Computer simulation showed that the SR method is better than a simpler method when the sequence length $L>1,000$ bp and is robust against deviations from time-reversibility. As expected, when the rate varies among sites, the SRV method is much better than the SR method.^ The evolutionary distances under nonstationary nucleotide frequencies: The statistical properties of the paralinear and LogDet distances under nonstationary nucleotide frequencies were studied. First, I developed formulas for correcting the estimation biases of the paralinear and LogDet distances. The performances of these formulas and the formulas for sampling variances were examined by computer simulation. Second, I developed a method for estimating the variance-covariance matrix of the paralinear distance, so that statistical tests of phylogenies can be conducted when the nucleotide frequencies are nonstationary. Third, a new method for testing the molecular clock hypothesis was developed in the nonstationary case. ^

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The purposes of this study were to examine (1) the relationship between selected components of the content of prenatal care and spontaneous preterm birth; and (2) the degree of comparability between maternal and caregivers' responses regarding the number of prenatal care visits, selected components of the content of prenatal care, and gestational age, based on analyses of the 1988 National Maternal and Infant Health Survey conducted by the National Centers for Health Statistics. Spontaneous preterm birth was subcategorized into very preterm and moderately preterm births, with term birth as the controls. The study population was limited to non-Hispanic Anglo- and African-American mothers. The racial differences in terms of birth outcomes were also compared.^ This study concluded that: (1) there was not a high degree of comparability (less than 80%) between maternal and prenatal care provider's responses regarding the number of prenatal care visits and the content of prenatal care; (2) there was a low degree of comparability (less than 50%) between maternal and infant's hospital of delivery responses regarding gestational age at birth; (3) there were differences in selected components of the content of prenatal care between the cases and controls, overall and stratified by ethnicity (i.e., hemoglobin/hematocrit test, weight measurement, and breast-feeding counseling), but they were confounded with missing values and associated preterm delivery bias; (4) there were differences in selected components of the content of prenatal care between Anglo- and African-American cases (i.e., vitamin/mineral supplement advice, weight measurement, smoking cessation and drug abuse counseling), but they, too, were difficult to interpret definitively due to item nonresponse and preterm delivery biases; (5) no significant predictive association between selected components of the content of prenatal care and spontaneous preterm birth was found; and (6) inadequate/intermediate prenatal care and birth out of wedlock were found to be associated with moderately preterm birth.^ Future research is needed to examine the validity of maternal and prenatal care providers' responses and identify the sources of disagreement between their responses. In addition, further studies are needed to examine the relationship between the quality of prenatal care and preterm birth. Finally, the completeness and quality of patient and provider data on the utilization and content of prenatal care needs to be strengthened in subsequent studies. ^

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The use of group-randomized trials is particularly widespread in the evaluation of health care, educational, and screening strategies. Group-randomized trials represent a subset of a larger class of designs often labeled nested, hierarchical, or multilevel and are characterized by the randomization of intact social units or groups, rather than individuals. The application of random effects models to group-randomized trials requires the specification of fixed and random components of the model. The underlying assumption is usually that these random components are normally distributed. This research is intended to determine if the Type I error rate and power are affected when the assumption of normality for the random component representing the group effect is violated. ^ In this study, simulated data are used to examine the Type I error rate, power, bias and mean squared error of the estimates of the fixed effect and the observed intraclass correlation coefficient (ICC) when the random component representing the group effect possess distributions with non-normal characteristics, such as heavy tails or severe skewness. The simulated data are generated with various characteristics (e.g. number of schools per condition, number of students per school, and several within school ICCs) observed in most small, school-based, group-randomized trials. The analysis is carried out using SAS PROC MIXED, Version 6.12, with random effects specified in a random statement and restricted maximum likelihood (REML) estimation specified. The results from the non-normally distributed data are compared to the results obtained from the analysis of data with similar design characteristics but normally distributed random effects. ^ The results suggest that the violation of the normality assumption for the group component by a skewed or heavy-tailed distribution does not appear to influence the estimation of the fixed effect, Type I error, and power. Negative biases were detected when estimating the sample ICC and dramatically increased in magnitude as the true ICC increased. These biases were not as pronounced when the true ICC was within the range observed in most group-randomized trials (i.e. 0.00 to 0.05). The normally distributed group effect also resulted in bias ICC estimates when the true ICC was greater than 0.05. However, this may be a result of higher correlation within the data. ^

Relevância:

10.00% 10.00%

Publicador:

Resumo:

With the aim of understanding the mechanism of molecular evolution, mathematical problems on the evolutionary change of DNA sequences are studied. The problems studied and the results obtained are as follows: (1) Estimation of evolutionary distance between nucleotide sequences. Studying the pattern of nucleotide substitution for the case of unequal substitution rates, a new mathematical formula for estimating the average number of nucleotide substitutions per site between two homologous DNA sequences is developed. It is shown that this formula has a wider applicability than currently available formulae. A statistical method for estimating the number of nucleotide changes due to deletion and insertion is also developed. (2) Biases of the estimates of nucleotide substitutions obtained by the restriction enzyme method. The deviation of the estimate of nucleotide substitutions obtained by the restriction enzyme method from the true value is investigated theoretically. It is shown that the amount of the deviation depends on the nucleotides in the recognition sequence of the restriction enzyme used, unequal rates of substitution among different nucleotides, and nucleotide frequences, but the primary factor is the unequal rates of nucleotide substitution. When many different kinds of enzymes are used, however, the amount of average deviation is generally small. (3) Distribution of restriction fragment lengths. To see the effect of undetectable restriction fragments and fragment differences on the estimate of nucleotide differences, the theoretical distribution of fragment lengths is studied. This distribution depends on the type of restriction enzymes used as well as on the relative frequencies of four nucleotides. It is shown that undetectability of small fragments or fragment differences gives a serious underestimate of nucleotide substitutions when the length-difference method of estimation is used, but the extent of underestimation is small when the site-difference method is used. (4) Evolutionary relationships of DNA sequences in finite populations. A mathematical theory on the expected evolutionary relationships among DNA sequences (nucleons) randomly chosen from the same or different populations is developed under the assumption that the evolutionary change of nucleons is determined solely by mutation and random genetic drift. . . . (Author's abstract exceeds stipulated maximum length. Discontinued here with permission of author). UMI ^

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The purpose of this study was to assess the accuracy and precision of airborne volatile organic compound (VOC) concentrations measured using passive air samplers (3M 3500 organic vapor monitors) over extended sampling durations (9 and 15 days). A total of forty-five organic vapor monitor samples were collected at a State of Texas air monitoring site during two different sampling periods (July/August and November 2008). The results of this study indicate that for most of the tested compounds, there was no significant difference between long-term (9 or 15 days) sample concentrations and the means of parallel consecutive short-term (3 days) sample concentrations. Biases of 9 or 15-day measurements vs. consecutive 3-day measurements showed considerable variability. Those compounds that had percent bias values of <10% are suggested as acceptable for long-term sampling (9 and 15 days). Of the twenty-one compounds examined, 10 compounds are classified as acceptable for long-term sampling; these include m,p-xylene, 1,2,4-trimethylbenzene, n-hexane, ethylbenzene, benzene, toluene, o-xylene, d-limonene, dimethylpentane and methyl tertbutyl ether. The ratio of sampling procedure variability relative to variability within days was approximately 1.89 for both sampling periods for the 3-day vs. 9-day comparisons and approximately 2.19 for both sampling periods for the 3-day vs. 15-day comparisons. Considerably higher concentrations of most VOCs were measured during the November sampling period compared to the July/August period. These differences may be a result of varying meteorological conditions during these two time periods, e.g., the differences in wind direction, and wind speed. Further studies are suggested to further evaluate the accuracy and precision of 3M 3500 organic vapor monitors over extended sampling durations. ^

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Many public health agencies and researchers are interested in comparing hospital outcomes, for example, morbidity, mortality, and hospitalization across areas and hospitals. However, since there is variation of rates in clinical trials among hospitals because of several biases, we are interested in controlling for the bias and assessing real differences in clinical practices. In this study, we compared the variations between hospitals in rates of severe Intraventricular Haemorrhage (IVH) infant using Frequentist statistical approach vs. Bayesian hierarchical model through simulation study. The template data set for simulation study was included the number of severe IVH infants of 24 intensive care units in Australian and New Zealand Neonatal Network from 1995 to 1997 in severe IVH rate in preterm babies. We evaluated the rates of severe IVH for 24 hospitals with two hierarchical models in Bayesian approach comparing their performances with the shrunken rates in Frequentist method. Gamma-Poisson (BGP) and Beta-Binomial (BBB) were introduced into Bayesian model and the shrunken estimator of Gamma-Poisson (FGP) hierarchical model using maximum likelihood method were calculated as Frequentist approach. To simulate data, the total number of infants in each hospital was kept and we analyzed the simulated data for both Bayesian and Frequentist models with two true parameters for severe IVH rate. One was the observed rate and the other was the expected severe IVH rate by adjusting for five predictors variables for the template data. The bias in the rate of severe IVH infant estimated by both models showed that Bayesian models gave less variable estimates than Frequentist model. We also discussed and compared the results from three models to examine the variation in rate of severe IVH by 20th centile rates and avoidable number of severe IVH cases. ^

Relevância:

10.00% 10.00%

Publicador:

Resumo:

HIV/AIDS is a treatable although incurable disease that presents immense challenges to those infected including physical, social and psychological effects. As of 2009, an estimated 2.4 million people were living with HIV or AIDS in India, 0.3% of the country's population. In India, it is difficult to not only treat but also to track because it is associated with socio-economic factors such as illiteracy, social biases, poor sanitation, malnutrition and social class. Nevertheless, it is important to know the prevalence of HIV/AIDS for several reasons. At the individual level, the quality of life of people living with HIV/AIDS is markedly lower than their counterparts without the disease and is associated with challenges. At the community level, it is important to identify high risk groups, monitor prevention efforts, and allocate appropriate resources to target programs for the reduction of transmission of HIV. ^

Relevância:

10.00% 10.00%

Publicador:

Resumo:

My dissertation focuses on two aspects of RNA sequencing technology. The first is the methodology for modeling the overdispersion inherent in RNA-seq data for differential expression analysis. This aspect is addressed in three sections. The second aspect is the application of RNA-seq data to identify the CpG island methylator phenotype (CIMP) by integrating datasets of mRNA expression level and DNA methylation status. Section 1: The cost of DNA sequencing has reduced dramatically in the past decade. Consequently, genomic research increasingly depends on sequencing technology. However it remains elusive how the sequencing capacity influences the accuracy of mRNA expression measurement. We observe that accuracy improves along with the increasing sequencing depth. To model the overdispersion, we use the beta-binomial distribution with a new parameter indicating the dependency between overdispersion and sequencing depth. Our modified beta-binomial model performs better than the binomial or the pure beta-binomial model with a lower false discovery rate. Section 2: Although a number of methods have been proposed in order to accurately analyze differential RNA expression on the gene level, modeling on the base pair level is required. Here, we find that the overdispersion rate decreases as the sequencing depth increases on the base pair level. Also, we propose four models and compare them with each other. As expected, our beta binomial model with a dynamic overdispersion rate is shown to be superior. Section 3: We investigate biases in RNA-seq by exploring the measurement of the external control, spike-in RNA. This study is based on two datasets with spike-in controls obtained from a recent study. We observe an undiscovered bias in the measurement of the spike-in transcripts that arises from the influence of the sample transcripts in RNA-seq. Also, we find that this influence is related to the local sequence of the random hexamer that is used in priming. We suggest a model of the inequality between samples and to correct this type of bias. Section 4: The expression of a gene can be turned off when its promoter is highly methylated. Several studies have reported that a clear threshold effect exists in gene silencing that is mediated by DNA methylation. It is reasonable to assume the thresholds are specific for each gene. It is also intriguing to investigate genes that are largely controlled by DNA methylation. These genes are called “L-shaped” genes. We develop a method to determine the DNA methylation threshold and identify a new CIMP of BRCA. In conclusion, we provide a detailed understanding of the relationship between the overdispersion rate and sequencing depth. And we reveal a new bias in RNA-seq and provide a detailed understanding of the relationship between this new bias and the local sequence. Also we develop a powerful method to dichotomize methylation status and consequently we identify a new CIMP of breast cancer with a distinct classification of molecular characteristics and clinical features.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Staphylococcus aureus is a common microorganism in humans, typically colonizing the nasopharynx, skin and other mucosal surfaces. It is among the most frequent causes of clinically-significant bacterial infections accounting for increased morbidity and mortality among individuals with HIV/AIDS. Evidence of higher colonization rates among high-risk HIV populations have been observed however, prevalence estimates have varied. Additionally, behavioral, biological, and/or environmental factors that may account for these high colonization rates are not understood. Previous literature on clinic-based surveys were subject to considerable biases. Additionally, representative samples of high-risk HIV populations were difficult to obtain due in part to an underrepresentation of individuals who may not regularly obtain health care. ^ The main objective of this project is to determine the prevalence of methicillin-sensitive S. aureus (MSSA) and methicillin-resistant (MRSA) nasal colonization in two populations: 1) men who have sex with men (MSM) and 2) injection drug users (IDU). Both of these populations are included in the third round of the National HIV Behavioral Surveillance System (NHBS) in Houston, Texas. ^ In the NHBS-MSM3 study, logistic regression was used to report odds ratios and 95% confidence intervals (CI). For the NHBS-IDU3 study, to account for the lack of independence between samples, the method of generalized estimating equations was utilized to report adjusted odds ratios and 95% CI. The NHBS-MSM3 study enrolled 202 participants with a MSSA colonization rate of 26.7% and MRSA rate of 3%. In the NHBS-IDU3 study, 18.4% were nasally colonized with MSSA and 5.7% were nasally colonized with MRSA. Among the NHBS-MSM3 population, high-risk sexual practices were associated with colonization. For the NHBS-IDU3 population, age, marital status, employment status, and the presence of scabs, were associated with colonization status when controlling for size of recruitment network. In multivariate GEE analyses, the use of antiretroviral medications and age remained significantly associated with S. aureus nasal colonization when controlling for size of recruitment network and gender. In both studies, a significantly higher than expected S. aureus and MRSA colonization rate was observed as compared to colonization rates described for the general population. However, these estimates were moderate in comparison to reported clinic-based MSM and IDU S. aureus colonization findings. This study validates substantial prevalence differences and biases that may exist with data collected from clinic-based MSM and IDU. The prevalence of MSSA and MRSA nasal colonization did not differ significantly with respect to HIV status among NHBS-MSM3/NHBS-IDU3 participants. Continued examination on the effects of S. aureus colonization and infection should be examined longitudinally to confirm additional community-based determinants in populations that are disproportionately affected.^

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Many statistical studies feature data with both exact-time and interval-censored events. While a number of methods currently exist to handle interval-censored events and multivariate exact-time events separately, few techniques exist to deal with their combination. This thesis develops a theoretical framework for analyzing a multivariate endpoint comprised of a single interval-censored event plus an arbitrary number of exact-time events. The approach fuses the exact-time events, modeled using the marginal method of Wei, Lin, and Weissfeld, with a piecewise-exponential interval-censored component. The resulting model incorporates more of the information in the data and also removes some of the biases associated with the exclusion of interval-censored events. A simulation study demonstrates that our approach produces reliable estimates for the model parameters and their variance-covariance matrix. As a real-world data example, we apply this technique to the Systolic Hypertension in the Elderly Program (SHEP) clinical trial, which features three correlated events: clinical non-fatal myocardial infarction, fatal myocardial infarction (two exact-time events), and silent myocardial infarction (one interval-censored event). ^