8 resultados para Correlation based analysis
em DigitalCommons@The Texas Medical Center
Resumo:
The genomic era brought by recent advances in the next-generation sequencing technology makes the genome-wide scans of natural selection a reality. Currently, almost all the statistical tests and analytical methods for identifying genes under selection was performed on the individual gene basis. Although these methods have the power of identifying gene subject to strong selection, they have limited power in discovering genes targeted by moderate or weak selection forces, which are crucial for understanding the molecular mechanisms of complex phenotypes and diseases. Recent availability and rapid completeness of many gene network and protein-protein interaction databases accompanying the genomic era open the avenues of exploring the possibility of enhancing the power of discovering genes under natural selection. The aim of the thesis is to explore and develop normal mixture model based methods for leveraging gene network information to enhance the power of natural selection target gene discovery. The results show that the developed statistical method, which combines the posterior log odds of the standard normal mixture model and the Guilt-By-Association score of the gene network in a naïve Bayes framework, has the power to discover moderate/weak selection gene which bridges the genes under strong selection and it helps our understanding the biology under complex diseases and related natural selection phenotypes.^
Resumo:
Genome-wide association studies (GWAS) have rapidly become a standard method for disease gene discovery. Many recent GWAS indicate that for most disorders, only a few common variants are implicated and the associated SNPs explain only a small fraction of the genetic risk. The current study incorporated gene network information into gene-based analysis of GWAS data for Crohn's disease (CD). The purpose was to develop statistical models to boost the power of identifying disease-associated genes and gene subnetworks by maximizing the use of existing biological knowledge from multiple sources. The results revealed that Markov random field (MRF) based mixture model incorporating direct neighborhood information from a single gene network is not efficient in identifying CD-related genes based on the GWAS data. The incorporation of solely direct neighborhood information might lead to the low efficiency of these models. Alternative MRF models looking beyond direct neighboring information are necessary to be developed in the future for the purpose of this study.^
Resumo:
Schizophrenia (SZ) is a complex disorder with high heritability and variable phenotypes that has limited success in finding causal genes associated with the disease development. Pathway-based analysis is an effective approach in investigating the molecular mechanism of susceptible genes associated with complex diseases. The etiology of complex diseases could be a network of genetic factors and within the genes, interaction may occur. In this work we argue that some genes might be of small effect that by itself are neither sufficient nor necessary to cause the disease however, their effect may induce slight changes to the gene expression or affect the protein function, therefore, analyzing the gene-gene interaction mechanism within the disease pathway would play crucial role in dissecting the genetic architecture of complex diseases, making the pathway-based analysis a complementary approach to GWAS technique. ^ In this study, we implemented three novel linkage disequilibrium based statistics, the linear combination, the quadratic, and the decorrelation test statistics, to investigate the interaction between linked and unlinked genes in two independent case-control GWAS datasets for SZ including participants of European (EA) and African (AA) ancestries. The EA population included 1,173 cases and 1,378 controls with 729,454 genotyped SNPs, while the AA population included 219 cases and 288 controls with 845,814 genotyped SNPs. We identified 17,186 interacting gene-sets at significant level in EA dataset, and 12,691 gene-sets in AA dataset using the gene-gene interaction method. We also identified 18,846 genes in EA dataset and 19,431 genes in AA dataset that were in the disease pathways. However, few genes were reported of significant association to SZ. ^ Our research determined the pathways characteristics for schizophrenia through the gene-gene interaction and gene-pathway based approaches. Our findings suggest insightful inferences of our methods in studying the molecular mechanisms of common complex diseases.^
Resumo:
Magnetic resonance temperature imaging (MRTI) is recognized as a noninvasive means to provide temperature imaging for guidance in thermal therapies. The most common method of estimating temperature changes in the body using MR is by measuring the water proton resonant frequency (PRF) shift. Calculation of the complex phase difference (CPD) is the method of choice for measuring the PRF indirectly since it facilitates temperature mapping with high spatiotemporal resolution. Chemical shift imaging (CSI) techniques can provide the PRF directly with high sensitivity to temperature changes while minimizing artifacts commonly seen in CPD techniques. However, CSI techniques are currently limited by poor spatiotemporal resolution. This research intends to develop and validate a CSI-based MRTI technique with intentional spectral undersampling which allows relaxed parameters to improve spatiotemporal resolution. An algorithm based on autoregressive moving average (ARMA) modeling is developed and validated to help overcome limitations of Fourier-based analysis allowing highly accurate and precise PRF estimates. From the determined acquisition parameters and ARMA modeling, robust maps of temperature using the k-means algorithm are generated and validated in laser treatments in ex vivo tissue. The use of non-PRF based measurements provided by the technique is also investigated to aid in the validation of thermal damage predicted by an Arrhenius rate dose model.
Resumo:
Little is known about epidemiological markers that are associated with survival of patients with myelodysplastic syndromes (MDS). We conducted a secondary case-based analysis of 465 de novo MDS patients from the University of Texas MD Anderson Cancer Center (UTMDACC). We investigated the association between demographic as well as occupational exposure markers and survival while incorporating known clinical markers of prognosis. In our patient population, 60.6% were men and the majority were white (93.1%). The distribution of MDS subtypes by the French–American–British (FAB) classification was 81 (19%) refractory anemia (RA), 46 (9.9%) refractory anemia with ringed sideroblasts (RARS), 57 (12.3%) chronic myelomonocytic leukemia (CMML), 173 (37.2%) RA with excess blasts (RAEB), and 86 (18.5%) RAEB in transformation (RAEBT). We found that those older at diagnosis (> 60 years of age) (HR = 1.68, CI = 1.26-2.25) were at a higher risk of dying compared to younger patients. Similarly, high pack years of smoking (>= 30 pack years of smoking) (HR = 1.34, CI = 1.02-1.74), and agricultural chemical exposure (HR = 1.61, CI = 1.05-2.46) were significantly associated with overall lower survival when compared to patients with none or medium exposures. Among clinical markers, greater than 5% bone marrow blasts (HR = 1.81 CI = 1.27-2.56), poor cytogenetics (HR = 3.20, CI = 2.37-4.33)), and platelet cytopenias (<100000/ul) (HR = 1.46, CI = 1.11-1.92) were also significantly associated with overall MDS survival.^ The identification of epidemiological markers could help physicians stratify patients and customize treatment strategies to improve the outcome of MDS based on patient lifestyle information such as smoking exposure and agrochemical exposure. We hope that this study highlights the impact of these exposures in MDS prognosis.^
Resumo:
Mechanisms that allow pathogens to colonize the host are not the product of isolated genes, but instead emerge from the concerted operation of regulatory networks. Therefore, identifying components and the systemic behavior of networks is necessary to a better understanding of gene regulation and pathogenesis. To this end, I have developed systems biology approaches to study transcriptional and post-transcriptional gene regulation in bacteria, with an emphasis in the human pathogen Mycobacterium tuberculosis (Mtb). First, I developed a network response method to identify parts of the Mtb global transcriptional regulatory network utilized by the pathogen to counteract phagosomal stresses and survive within resting macrophages. As a result, the method unveiled transcriptional regulators and associated regulons utilized by Mtb to establish a successful infection of macrophages throughout the first 14 days of infection. Additionally, this network-based analysis identified the production of Fe-S proteins coupled to lipid metabolism through the alkane hydroxylase complex as a possible strategy employed by Mtb to survive in the host. Second, I developed a network inference method to infer the small non-coding RNA (sRNA) regulatory network in Mtb. The method identifies sRNA-mRNA interactions by integrating a priori knowledge of possible binding sites with structure-driven identification of binding sites. The reconstructed network was useful to predict functional roles for the multitude of sRNAs recently discovered in the pathogen, being that several sRNAs were postulated to be involved in virulence-related processes. Finally, I applied a combined experimental and computational approach to study post-transcriptional repression mediated by small non-coding RNAs in bacteria. Specifically, a probabilistic ranking methodology termed rank-conciliation was developed to infer sRNA-mRNA interactions based on multiple types of data. The method was shown to improve target prediction in Escherichia coli, and therefore is useful to prioritize candidate targets for experimental validation.
Resumo:
Pancreatic cancer is the 4th most common cause for cancer death in the United States, accompanied by less than 5% five-year survival rate based on current treatments, particularly because it is usually detected at a late stage. Identifying a high-risk population to launch an effective preventive strategy and intervention to control this highly lethal disease is desperately needed. The genetic etiology of pancreatic cancer has not been well profiled. We hypothesized that unidentified genetic variants by previous genome-wide association study (GWAS) for pancreatic cancer, due to stringent statistical threshold or missing interaction analysis, may be unveiled using alternative approaches. To achieve this aim, we explored genetic susceptibility to pancreatic cancer in terms of marginal associations of pathway and genes, as well as their interactions with risk factors. We conducted pathway- and gene-based analysis using GWAS data from 3141 pancreatic cancer patients and 3367 controls with European ancestry. Using the gene set ridge regression in association studies (GRASS) method, we analyzed 197 pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Using the logistic kernel machine (LKM) test, we analyzed 17906 genes defined by University of California Santa Cruz (UCSC) database. Using the likelihood ratio test (LRT) in a logistic regression model, we analyzed 177 pathways and 17906 genes for interactions with risk factors in 2028 pancreatic cancer patients and 2109 controls with European ancestry. After adjusting for multiple comparisons, six pathways were marginally associated with risk of pancreatic cancer ( P < 0.00025): Fc epsilon RI signaling, maturity onset diabetes of the young, neuroactive ligand-receptor interaction, long-term depression (Ps < 0.0002), and the olfactory transduction and vascular smooth muscle contraction pathways (P = 0.0002; Nine genes were marginally associated with pancreatic cancer risk (P < 2.62 × 10−5), including five reported genes (ABO, HNF1A, CLPTM1L, SHH and MYC), as well as four novel genes (OR13C4, OR 13C3, KCNA6 and HNF4 G); three pathways significantly interacted with risk factors on modifying the risk of pancreatic cancer (P < 2.82 × 10−4): chemokine signaling pathway with obesity ( P < 1.43 × 10−4), calcium signaling pathway (P < 2.27 × 10−4) and MAPK signaling pathway with diabetes (P < 2.77 × 10−4). However, none of the 17906 genes tested for interactions survived the multiple comparisons corrections. In summary, our current GWAS study unveiled unidentified genetic susceptibility to pancreatic cancer using alternative methods. These novel findings provide new perspectives on genetic susceptibility to and molecular mechanisms of pancreatic cancer, once confirmed, will shed promising light on the prevention and treatment of this disease. ^
Resumo:
Background: Futile medical treatments are interventions that are not associated with a benefit to the patient. The definition and concept of medical futility are controversial. The Texas Advance Directives Act (TADA) was passed in 1999 to address medically inappropriate interventions by allowing providers to withdraw inappropriate interventions against a surrogate decision maker's wishes following a review, attempt to transfer the patient, and 10-day waiting period. The original legislation was a negotiated compromise by players across the political spectrum. However, in recent years there has been increasing controversy regarding TADA and attempts to alter its applicability in Texas. ^ Purpose: The purpose of this project was to apply Paul Sabatier's advocacy coalition framework (ACF) to gain understanding into the historical, ethical, and political basis of the initial compromise, and determine the sources of conflict that have led to increased opposition to TADA. ^ Methods: Using the ACF model, key actors within the medical futility policy debate in Texas were aggregated into coalitions based on shared beliefs. A narrative summary based analysis identified the core elements of the policy subsystem, as well as the constraints and resources of the subsystem actors. Externalities that promoted adjustments to coalition beliefs and tactics used by coalition participants were analyzed. Data sources included review of the published literature regarding medical futility, as well as analysis of published newspaper accounts and editorials regarding the medical futility issue in Texas, legislative testimony, and review of weblogs and online commentaries dealing with the issue. ^ Results: Primary coalition participants in developing compromise legislation in 1999 were the Providers and Vitalists, with Autonomists gaining a prominent role starting in 2006. Internal factors associated with the breakdown of consensus included changes to the makeup of the governing coalition and changes in individual case information available to the Vitalist coalition. Externalities related to the intertwining of the Sun Hudson case and the Terri Schiavo case generated negative publicity for the TADA from progressive and conservative viewpoints. Dissemination of information in various venues regarding contentious cases was associated with more polarization of viewpoints, and realignment of coalition alliances. ^ Conclusions: The ACF provided an outline for the initial compromise over the creation of the Texas Advance Directives Act as well as the eventual loss of consensus. The debate between the Provider, Vitalist, and Autonomist coalitions has been affected by internal policy evolution, changes in the governing coalition, and important externalities. The debate over medical futility in Texas has had much broader implications in the dispute over Health Care Reform.^