13 resultados para RANDOM PERMUTATION MODEL
em DigitalCommons@The Texas Medical Center
Resumo:
Brain tumor is one of the most aggressive types of cancer in humans, with an estimated median survival time of 12 months and only 4% of the patients surviving more than 5 years after disease diagnosis. Until recently, brain tumor prognosis has been based only on clinical information such as tumor grade and patient age, but there are reports indicating that molecular profiling of gliomas can reveal subgroups of patients with distinct survival rates. We hypothesize that coupling molecular profiling of brain tumors with clinical information might improve predictions of patient survival time and, consequently, better guide future treatment decisions. In order to evaluate this hypothesis, the general goal of this research is to build models for survival prediction of glioma patients using DNA molecular profiles (U133 Affymetrix gene expression microarrays) along with clinical information. First, a predictive Random Forest model is built for binary outcomes (i.e. short vs. long-term survival) and a small subset of genes whose expression values can be used to predict survival time is selected. Following, a new statistical methodology is developed for predicting time-to-death outcomes using Bayesian ensemble trees. Due to a large heterogeneity observed within prognostic classes obtained by the Random Forest model, prediction can be improved by relating time-to-death with gene expression profile directly. We propose a Bayesian ensemble model for survival prediction which is appropriate for high-dimensional data such as gene expression data. Our approach is based on the ensemble "sum-of-trees" model which is flexible to incorporate additive and interaction effects between genes. We specify a fully Bayesian hierarchical approach and illustrate our methodology for the CPH, Weibull, and AFT survival models. We overcome the lack of conjugacy using a latent variable formulation to model the covariate effects which decreases computation time for model fitting. Also, our proposed models provides a model-free way to select important predictive prognostic markers based on controlling false discovery rates. We compare the performance of our methods with baseline reference survival methods and apply our methodology to an unpublished data set of brain tumor survival times and gene expression data, selecting genes potentially related to the development of the disease under study. A closing discussion compares results obtained by Random Forest and Bayesian ensemble methods under the biological/clinical perspectives and highlights the statistical advantages and disadvantages of the new methodology in the context of DNA microarray data analysis.
Resumo:
BACKGROUND: High cost, poor compliance, and systemic toxicity have limited the use of pentavalent antimony compounds (SbV), the treatment of choice for cutaneous leishmaniasis (CL). Paromomycin (PR) has been developed as an alternative to SbV, but existing data are conflicting. METHODOLOGY/PRINCIPAL FINDINGS: We searched PubMed, Scopus, and Cochrane Central Register of Controlled Trials, without language restriction, through August 2007, to identify randomized controlled trials that compared the efficacy or safety between PR and placebo or SbV. Primary outcome was clinical cure, defined as complete healing, disappearance, or reepithelialization of all lesions. Data were extracted independently by two investigators, and pooled using a random-effects model. Fourteen trials including 1,221 patients were included. In placebo-controlled trials, topical PR appeared to have therapeutic activity against the old world and new world CL, with increased local reactions, when used with methylbenzethonium chloride (MBCL) compared to when used alone (risk ratio [RR] for clinical cure, 2.58 versus 1.01: RR for local reactions, 1.60 versus 1.07). In SbV-controlled trials, the efficacy of topical PR was not significantly different from that of intralesional SbV in the old world CL (RR, 0.70; 95% confidence interval, 0.26-1.89), whereas topical PR was inferior to parenteral SbV in treating the new world CL (0.67; 0.54-0.82). No significant difference in efficacy was found between parenteral PR and parenteral SbV in the new world CL (0.88; 0.56-1.38). Systemic side effects were fewer with topical or parenteral PR than parenteral SbV. CONCLUSIONS/SIGNIFICANCE: Topical PR with MBCL could be a therapeutic alternative to SbV in selected cases of the old world CL. Development of new formulations with better efficacy and tolerability remains to be an area of future research.
Resumo:
The persistence of low birth weight and intrauterine growth retardation (IUGR) in the United States has puzzled researchers for decades. Much of the work that has been conducted on adverse birth outcomes has focused on low birth weight in general and not on IUGR. Studies that have examined IUGR specifically thus far have focused primarily on individual-level maternal risk factors. These risk factors have only been able to explain a small portion of the variance in IUGR. Therefore, recent work has begun to focus on community-level risk factors in addition to the individual-level maternal characteristics. This study uses Social Ecology to examine the relationship of individual and community-level risk factors and IUGR. Logistic regression was used to establish an individual-level model based on 155, 856 births recorded in Harris County, TX during 1999-2001. IUGR was characterized using a fetal growth ratio method with race/ethnic and sex specific mean birth weights calculated from national vital records. The spatial distributions of 114,460 birth records spatially located within the City of Houston were examined using choropleth, probability and density maps. Census tracts with higher than expected rates of IUGR and high levels of neighborhood disadvantage were highlighted. Neighborhood disadvantage was constructed using socioeconomic variables from the 2000 U.S. Census. Factor analysis was used to create a unified single measure. Lastly, a random coefficients model was used to examine the relationship between varying levels of community disadvantage, given the set of individual-level risk factors for 152,997 birth records spatially located within Harris County, TX. Neighborhood disadvantage was measured using three different indices adapted from previous work. The findings show that pregnancy-induced hypertension, previous preterm infant, tobacco use and insufficient weight gain have the highest association with IUGR. Neighborhood disadvantage only slightly further increases the risk of IUGR (OR 1.12 to 1.23). Although community level disadvantage only helped to explain a small proportion of the variance of IUGR, it did have a significant impact. This finding suggests that community level risk factors should be included in future work with IUGR and that more work needs to be conducted. ^
Resumo:
Of cancer death, colorectal cancer death ranks second in the United States. Obesity is an important risk factor for colorectal cancer (1). Early detection of colorectal cancer when it is localized can effectively reduce mortality of colorectal cancer and increase survival time of patients if they are treated. Also, previous studies showed that obese women were more likely to delay breast cancer screening and cervical cancer screening than normal weight women (2-5). However, results from prior studies demonstrating the relationship between obesity and colorectal cancer screening are not consistent. This research was done to conduct a meta-analysis of previous cross-sectional studies selected from the Medline database and to evaluate the association between obesity and colorectal cancer screening. While the odds ratio was not statistically different from one, the results from this meta-analysis under the random effects model showed that obese people are slightly less likely to have colorectal cancer screening compared to normal weight individuals (OR,0.93;95% CI 0.75-1.15). This meta-analysis was particularly sensitive to one individual study (6) and the effect of obesity on colorectal cancer screening was statistically significant (OR, 0.87; 95% CI, 0.81-0.92) after removing Heo's study. Further systematic studies focused on whether the effect of obesity on colorectal cancer screening is limited to women only are suggested. ^
The determinants of improvements in health outcomes and of cost reduction in hospital inpatient care
Resumo:
This study aims to address two research questions. First, ‘Can we identify factors that are determinants both of improved health outcomes and of reduced costs for hospitalized patients with one of six common diagnoses?’ Second, ‘Can we identify other factors that are determinants of improved health outcomes for such hospitalized patients but which are not associated with costs?’ The Healthcare Cost and Utilization Project (HCUP) Nationwide Inpatient Sample (NIS) database from 2003 to 2006 was employed in this study. The total study sample consisted of hospitals which had at least 30 patients each year for the given diagnosis: 954 hospitals for acute myocardial infarction (AMI), 1552 hospitals for congestive heart failure (CHF), 1120 hospitals for stroke (STR), 1283 hospitals for gastrointestinal hemorrhage (GIH), 979 hospitals for hip fracture (HIP), and 1716 hospitals for pneumonia (PNE). This study used simultaneous equations models to investigate the determinants of improvement in health outcomes and of cost reduction in hospital inpatient care for these six common diagnoses. In addition, the study used instrumental variables and two-stage least squares random effect model for unbalanced panel data estimation. The study concluded that a few factors were determinants of high quality and low cost. Specifically, high specialty was the determinant of high quality and low costs for CHF patients; small hospital size was the determinant of high quality and low costs for AMI patients. Furthermore, CHF patients who were treated in Midwest, South, and West region hospitals had better health outcomes and lower hospital costs than patients who were treated in Northeast region hospitals. Gastrointestinal hemorrhage and pneumonia patients who were treated in South region hospitals also had better health outcomes and lower hospital costs than patients who were treated in Northeast region hospitals. This study found that six non-cost factors were related to health outcomes for a few diagnoses: hospital volume, percentage emergency room admissions for a given diagnosis, hospital competition, specialty, bed size, and hospital region.^
A systematic review of clostridium difficile infection in patients with iatrogenic immunesuppression
Resumo:
Background: Incidence of C. difficile infection (CDI) has increased dramatically in the past decade and is the most frequent cause of nosocomial infectious diarrhea. The outcome of infection may range from mild diarrhea to life-threatening pseudomembranous colitis depending on the immunological response of the host, which is highly compromised in this special population that includes bone marrow transplant (BMT), solid organ transplant (SOT) and cancer patients on cytotoxic chemotherapy. ^ Objectives: We conducted a meta-analysis to assess the incidence rates of CDI and the time to onset of infection in patients with iatrogenic immune suppression. ^ Methods: Original studies were identified through an extensive search of electronic databases including PubMed, Ovid Medline (R), RefWorks and Biological Abstracts and their references. The overall incidence rate of CDI in the immune suppressed population was calculated using random effects model and their 95% confidence interval was derived. Differences in the incidence of CDI and time to onset of infection were calculated between the groups and within the groups. Publication bias was assessed using a funnel plot. Results: Twenty nine published articles involving 7,424 patients met the eligibility requirements. The overall incidence of CDI in the immune suppressed population is 11.1% (95% Confidence Interval (CI): 9.2–13.4%). The incidence of CDI was higher in SOT patients (14.2%, 95% CI: 6.8–21.5%); (p-value-0.022) and in cancer patients on cytotoxic chemotherapy (11.4%, 95% CI: 8.4–15.4%); (p = 0.042) than in BMT patients (10.5%, 95% CI: 7.9–13.1%). In a subgroup analysis of BMT population, the incidence of CDI is significantly higher in patients who received allogeneic BMT (15.1%, 95% CI: 11.2–20.0%; p value <0.0001). Similarly, in the SOT population, the incidence of CDI was higher in patients who underwent liver transplantation (11.0%, 95% CI: 5.6–20.3%); (p= 0.0672). The median time to onset of infection was shorter in BMT patients (p=0.0025). ^ Conclusions: It is evident from the combined analysis of these 29 published studies that the incidence of CDI in the immune suppressed population is higher. However, early diagnosis and treatment of CDI will help reduce the morbidity and mortality due to CDI in this special population.^
Resumo:
Background: An increased understanding of the pathogenesis of cancer at the molecular level has led to the development of personalized cancer therapy based on the mutation status of the tumor. Tailoring treatments to genetic signatures has improved treatment outcomes in patients with advanced cancer. We conducted a meta-analysis to provide a quantitative summary of the response to treatment on a phase I clinical trial matched to molecular aberration in patients with advanced solid tumors. ^ Methods: Original studies that reported the results of phase I clinical trials in patients with advanced cancer treated with matched anti-cancer therapies between January 2006 and November 2011 were identified through an extensive search of Medline, Embase, Web of Science and Cochrane Library databases. Odds Ratio (OR) with 95% confidence interval (CI) was estimated for each study to assess the strength of an association between objective response rate (ORR) and mutation status. Random effects model was used to estimate the pooled OR and their 95% CI was derived. Funnel plot was used to assess publication bias. ^ Results: Thirteen studies published between January 2006 and November 2011that reported on responses to matched phase I clinical trials in patients with advanced cancer were included in the meta-analysis. Nine studies reported on the responses seen in 538 of the 835 patients with driver mutations responsive to therapy and seven studies on the responses observed in 234 of the 306 patients with mutation predictive for negative response. Random effects model was used to estimate pooled OR, which was 7.767(95% CI = 4.199 − 14.366; p-value=0.000) in patients with activating mutations that were responsive to therapy and 0.287 (95% CI = 0.119 − 0.694; p-value=0.009) in patients with mutation predictive of negative response. ^ Conclusion: It is evident from the meta-analysis that somatic mutations present in tumor tissue of patients are predictive of responses to therapy in patients with advanced cancer in phase I setting. Plethora of research and growing evidence base indicate that selection of patients based on mutation analysis of the tumor and personalizing therapy is a step forward in the war against cancer.^
Resumo:
Perceptual learning is a training induced improvement in performance. Mechanisms underlying the perceptual learning of depth discrimination in dynamic random dot stereograms were examined by assessing stereothresholds as a function of decorrelation. The inflection point of the decorrelation function was defined as the level of decorrelation corresponding to 1.4 times the threshold when decorrelation is 0%. In general, stereothresholds increased with increasing decorrelation. Following training, stereothresholds and standard errors of measurement decreased systematically for all tested decorrelation values. Post training decorrelation functions were reduced by a multiplicative constant (approximately 5), exhibiting changes in stereothresholds without changes in the inflection points. Disparity energy model simulations indicate that a post-training reduction in neuronal noise can sufficiently account for the perceptual learning effects. In two subjects, learning effects were retained over a period of six months, which may have application for training stereo deficient subjects.
Resumo:
The use of group-randomized trials is particularly widespread in the evaluation of health care, educational, and screening strategies. Group-randomized trials represent a subset of a larger class of designs often labeled nested, hierarchical, or multilevel and are characterized by the randomization of intact social units or groups, rather than individuals. The application of random effects models to group-randomized trials requires the specification of fixed and random components of the model. The underlying assumption is usually that these random components are normally distributed. This research is intended to determine if the Type I error rate and power are affected when the assumption of normality for the random component representing the group effect is violated. ^ In this study, simulated data are used to examine the Type I error rate, power, bias and mean squared error of the estimates of the fixed effect and the observed intraclass correlation coefficient (ICC) when the random component representing the group effect possess distributions with non-normal characteristics, such as heavy tails or severe skewness. The simulated data are generated with various characteristics (e.g. number of schools per condition, number of students per school, and several within school ICCs) observed in most small, school-based, group-randomized trials. The analysis is carried out using SAS PROC MIXED, Version 6.12, with random effects specified in a random statement and restricted maximum likelihood (REML) estimation specified. The results from the non-normally distributed data are compared to the results obtained from the analysis of data with similar design characteristics but normally distributed random effects. ^ The results suggest that the violation of the normality assumption for the group component by a skewed or heavy-tailed distribution does not appear to influence the estimation of the fixed effect, Type I error, and power. Negative biases were detected when estimating the sample ICC and dramatically increased in magnitude as the true ICC increased. These biases were not as pronounced when the true ICC was within the range observed in most group-randomized trials (i.e. 0.00 to 0.05). The normally distributed group effect also resulted in bias ICC estimates when the true ICC was greater than 0.05. However, this may be a result of higher correlation within the data. ^
Resumo:
With hundreds of single nucleotide polymorphisms (SNPs) in a candidate gene and millions of SNPs across the genome, selecting an informative subset of SNPs to maximize the ability to detect genotype-phenotype association is of great interest and importance. In addition, with a large number of SNPs, analytic methods are needed that allow investigators to control the false positive rate resulting from large numbers of SNP genotype-phenotype analyses. This dissertation uses simulated data to explore methods for selecting SNPs for genotype-phenotype association studies. I examined the pattern of linkage disequilibrium (LD) across a candidate gene region and used this pattern to aid in localizing a disease-influencing mutation. The results indicate that the r2 measure of linkage disequilibrium is preferred over the common D′ measure for use in genotype-phenotype association studies. Using step-wise linear regression, the best predictor of the quantitative trait was not usually the single functional mutation. Rather it was a SNP that was in high linkage disequilibrium with the functional mutation. Next, I compared three strategies for selecting SNPs for application to phenotype association studies: based on measures of linkage disequilibrium, based on a measure of haplotype diversity, and random selection. The results demonstrate that SNPs selected based on maximum haplotype diversity are more informative and yield higher power than randomly selected SNPs or SNPs selected based on low pair-wise LD. The data also indicate that for genes with small contribution to the phenotype, it is more prudent for investigators to increase their sample size than to continuously increase the number of SNPs in order to improve statistical power. When typing large numbers of SNPs, researchers are faced with the challenge of utilizing an appropriate statistical method that controls the type I error rate while maintaining adequate power. We show that an empirical genotype based multi-locus global test that uses permutation testing to investigate the null distribution of the maximum test statistic maintains a desired overall type I error rate while not overly sacrificing statistical power. The results also show that when the penetrance model is simple the multi-locus global test does as well or better than the haplotype analysis. However, for more complex models, haplotype analyses offer advantages. The results of this dissertation will be of utility to human geneticists designing large-scale multi-locus genotype-phenotype association studies. ^
Resumo:
This study investigates a theoretical model where a longitudinal process, that is a stationary Markov-Chain, and a Weibull survival process share a bivariate random effect. Furthermore, a Quality-of-Life adjusted survival is calculated as the weighted sum of survival time. Theoretical values of population mean adjusted survival of the described model are computed numerically. The parameters of the bivariate random effect do significantly affect theoretical values of population mean. Maximum-Likelihood and Bayesian methods are applied on simulated data to estimate the model parameters. Based on the parameter estimates, predicated population mean adjusted survival can then be calculated numerically and compared with the theoretical values. Bayesian method and Maximum-Likelihood method provide parameter estimations and population mean prediction with comparable accuracy; however Bayesian method suffers from poor convergence due to autocorrelation and inter-variable correlation. ^
Resumo:
Objective. To measure the demand for primary care and its associated factors by building and estimating a demand model of primary care in urban settings.^ Data source. Secondary data from 2005 California Health Interview Survey (CHIS 2005), a population-based random-digit dial telephone survey, conducted by the UCLA Center for Health Policy Research in collaboration with the California Department of Health Services, and the Public Health Institute between July 2005 and April 2006.^ Study design. A literature review was done to specify the demand model by identifying relevant predictors and indicators. CHIS 2005 data was utilized for demand estimation.^ Analytical methods. The probit regression was used to estimate the use/non-use equation and the negative binomial regression was applied to the utilization equation with the non-negative integer dependent variable.^ Results. The model included two equations in which the use/non-use equation explained the probability of making a doctor visit in the past twelve months, and the utilization equation estimated the demand for primary conditional on at least one visit. Among independent variables, wage rate and income did not affect the primary care demand whereas age had a negative effect on demand. People with college and graduate educational level were associated with 1.03 (p < 0.05) and 1.58 (p < 0.01) more visits, respectively, compared to those with no formal education. Insurance was significantly and positively related to the demand for primary care (p < 0.01). Need for care variables exhibited positive effects on demand (p < 0.01). Existence of chronic disease was associated with 0.63 more visits, disability status was associated with 1.05 more visits, and people with poor health status had 4.24 more visits than those with excellent health status. ^ Conclusions. The average probability of visiting doctors in the past twelve months was 85% and the average number of visits was 3.45. The study emphasized the importance of need variables in explaining healthcare utilization, as well as the impact of insurance, employment and education on demand. The two-equation model of decision-making, and the probit and negative binomial regression methods, was a useful approach to demand estimation for primary care in urban settings.^
Resumo:
A Bayesian approach to estimation of the regression coefficients of a multinominal logit model with ordinal scale response categories is presented. A Monte Carlo method is used to construct the posterior distribution of the link function. The link function is treated as an arbitrary scalar function. Then the Gauss-Markov theorem is used to determine a function of the link which produces a random vector of coefficients. The posterior distribution of the random vector of coefficients is used to estimate the regression coefficients. The method described is referred to as a Bayesian generalized least square (BGLS) analysis. Two cases involving multinominal logit models are described. Case I involves a cumulative logit model and Case II involves a proportional-odds model. All inferences about the coefficients for both cases are described in terms of the posterior distribution of the regression coefficients. The results from the BGLS method are compared to maximum likelihood estimates of the regression coefficients. The BGLS method avoids the nonlinear problems encountered when estimating the regression coefficients of a generalized linear model. The method is not complex or computationally intensive. The BGLS method offers several advantages over Bayesian approaches. ^