31 results for gene interaction

in DigitalCommons@The Texas Medical Center


Relevance:

100.00%

Publisher:

Abstract:

Numerous studies have been carried out to better understand the genetic predisposition to cardiovascular disease. Although it is widely believed that multifactorial diseases such as cardiovascular disease result from the effects of many genes that work alone or interact with other genes, most genetic studies have focused on identifying cardiovascular disease susceptibility genes and usually ignore the effects of gene-gene interactions in the analysis. The current study applies a novel linkage disequilibrium based statistic for testing interactions between two linked loci using data from a genome-wide study of cardiovascular disease. A total of 53,394 single nucleotide polymorphisms (SNPs) are tested for pair-wise interactions, and 8,644 interactions are found to be significant with p-values less than 3.5×10^-11. Results indicate that known cardiovascular disease susceptibility genes tend not to have many significant interactions. One SNP in the CACNG1 (calcium channel, voltage-dependent, gamma subunit 1) gene and one SNP in the IL3RA (interleukin 3 receptor, alpha) gene are found to have the most significant pair-wise interactions. Findings from the current study should be replicated in other independent cohorts to eliminate potential false positive results.
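The abstract does not say how the significance threshold was chosen. As a rough check, the short sketch below counts the SNP pairs and computes a Bonferroni-corrected threshold. The family-wise error rate of 0.05 and the use of a Bonferroni correction are assumptions made for illustration, not details taken from the study.

```python
from math import comb

# Number of SNPs tested, as reported in the abstract.
n_snps = 53_394

# Every unordered pair of distinct SNPs is one interaction test.
n_pairs = comb(n_snps, 2)

# Assumed family-wise error rate (not stated in the abstract).
alpha = 0.05

# Bonferroni correction: divide alpha by the number of tests.
threshold = alpha / n_pairs

print(f"pairs tested: {n_pairs:,}")
print(f"Bonferroni threshold: {threshold:.2e}")
```

Under these assumptions the threshold comes out at roughly 3.5×10^-11, which is consistent with the cutoff quoted in the abstract.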

Relevance:

100.00%

Publisher:

Abstract:

Schizophrenia (SZ) is a complex disorder with high heritability and variable phenotypes, for which efforts to find causal genes associated with disease development have had limited success. Pathway-based analysis is an effective approach for investigating the molecular mechanisms of susceptibility genes associated with complex diseases. The etiology of complex diseases could involve a network of genetic factors, and interactions may occur among the genes. In this work we argue that some genes may have small effects that by themselves are neither sufficient nor necessary to cause the disease; however, their effects may induce slight changes in gene expression or affect protein function. Therefore, analyzing the gene-gene interaction mechanism within the disease pathway would play a crucial role in dissecting the genetic architecture of complex diseases, making pathway-based analysis a complementary approach to the GWAS technique. In this study, we implemented three novel linkage disequilibrium based statistics, the linear combination, the quadratic, and the decorrelation test statistics, to investigate the interaction between linked and unlinked genes in two independent case-control GWAS datasets for SZ, including participants of European (EA) and African (AA) ancestries. The EA population included 1,173 cases and 1,378 controls with 729,454 genotyped SNPs, while the AA population included 219 cases and 288 controls with 845,814 genotyped SNPs. We identified 17,186 interacting gene-sets at a significant level in the EA dataset and 12,691 gene-sets in the AA dataset using the gene-gene interaction method. We also identified 18,846 genes in the EA dataset and 19,431 genes in the AA dataset that were in the disease pathways. However, few genes were reported to be significantly associated with SZ. Our research determined the pathway characteristics for schizophrenia through the gene-gene interaction and gene-pathway based approaches. 
Our findings suggest that our methods can offer insight into the molecular mechanisms of common complex diseases.

Relevance:

100.00%

Publisher:

Abstract:

Complex diseases such as cancer result from multiple genetic changes and environmental exposures. Due to the rapid development of genotyping and sequencing technologies, we are now able to more accurately assess the causal effects of many genetic and environmental factors. Genome-wide association studies have been able to localize many causal genetic variants predisposing to certain diseases. However, these studies explain only a small portion of the heritability of diseases. More advanced statistical models are urgently needed to identify and characterize additional genetic and environmental factors and their interactions, which will enable us to better understand the causes of complex diseases. In the past decade, thanks to increasing computational capabilities and novel statistical developments, Bayesian methods have been widely applied in genetics/genomics research and have demonstrated superiority over some conventional approaches in certain research areas. Gene-environment and gene-gene interaction studies are among the areas where Bayesian methods may fully exert their functionality and advantages. This dissertation focuses on developing new Bayesian statistical methods for data analysis with complex gene-environment and gene-gene interactions, as well as extending some existing methods for gene-environment interactions to other related areas. It includes three sections: (1) deriving the Bayesian variable selection framework for hierarchical gene-environment and gene-gene interactions; (2) developing the Bayesian Natural and Orthogonal Interaction (NOIA) models for gene-environment interactions; and (3) extending the applications of two Bayesian statistical methods, which were developed for gene-environment interaction studies, to other related types of studies such as adaptive borrowing of historical data. 
We propose a Bayesian hierarchical mixture model framework that allows us to investigate genetic and environmental effects, gene-by-gene interactions (epistasis) and gene-by-environment interactions in the same model. It is well known that, in many practical situations, there exists a natural hierarchical structure between the main effects and interactions in the linear model. Here we propose a model that incorporates this hierarchical structure into the Bayesian mixture model, such that irrelevant interaction effects can be removed more efficiently, resulting in more robust, parsimonious and powerful models. We evaluate both the 'strong hierarchical' and 'weak hierarchical' models, which specify that both (strong) or at least one (weak) of the main effects of the interacting factors must be present for the interaction to be included in the model. The extensive simulation results show that the proposed strong and weak hierarchical mixture models control the proportion of false positive discoveries and yield a powerful approach to identifying the predisposing main effects and interactions in studies with complex gene-environment and gene-gene interactions. We also compare these two models with the 'independent' model, which does not impose this hierarchical constraint, and observe their superior performance in most of the considered situations. The proposed models are applied in real data analyses of gene and environment interactions in lung cancer and cutaneous melanoma case-control studies. The Bayesian statistical models have the advantage of being able to incorporate useful prior information in the modeling process. Moreover, the Bayesian mixture model outperforms the multivariate logistic model in terms of parameter estimation and variable selection in most cases. 
Our proposed models impose the hierarchical constraints, which further improve the Bayesian mixture model by reducing the proportion of false positive findings among the identified interactions while successfully identifying the reported associations. This is practically appealing for studies investigating causal factors from a moderate number of candidate genetic and environmental factors along with a relatively large number of interactions. The natural and orthogonal interaction (NOIA) models of genetic effects have previously been developed to provide an analysis framework in which the estimates of effects for a quantitative trait are statistically orthogonal regardless of whether Hardy-Weinberg Equilibrium (HWE) holds within loci. Ma et al. (2012) recently developed a NOIA model for gene-environment interaction studies and showed the advantages of using the model for detecting the true main effects and interactions, compared with the usual functional model. In this project, we propose a novel Bayesian statistical model that combines the Bayesian hierarchical mixture model with the NOIA statistical model and the usual functional model. The proposed Bayesian NOIA model demonstrates more power at detecting the non-null effects with higher marginal posterior probabilities. Also, we review two Bayesian statistical models (Bayesian empirical shrinkage-type estimator and Bayesian model averaging), which were developed for gene-environment interaction studies. Inspired by these Bayesian models, we develop two novel statistical methods that are able to handle related problems such as borrowing data from historical studies. The proposed methods are analogous to the methods for gene-environment interactions in that they succeed in balancing statistical efficiency and bias in a unified model. 
Through extensive simulation studies, we compare the operating characteristics of the proposed models with those of existing models, including the hierarchical meta-analysis model. The results show that the proposed approaches adaptively borrow the historical data in a data-driven way. These novel models may have a broad range of statistical applications in both genetic/genomic and clinical studies.
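The strong, weak and independent inclusion rules described in this abstract can be illustrated with a small sketch. It only encodes the inclusion constraint on an interaction term given which main effects are in the model; it is not the dissertation's Bayesian mixture model, and the function name and string labels are hypothetical.

```python
def interaction_allowed(main_a: bool, main_b: bool, hierarchy: str) -> bool:
    """Return True if an A-by-B interaction may enter the model.

    main_a, main_b: whether each main effect is currently included.
    hierarchy: "strong", "weak" or "independent" (illustrative labels).
    """
    if hierarchy == "strong":
        # Strong hierarchy: both main effects must be present.
        return main_a and main_b
    if hierarchy == "weak":
        # Weak hierarchy: at least one main effect must be present.
        return main_a or main_b
    if hierarchy == "independent":
        # No hierarchical constraint on the interaction.
        return True
    raise ValueError(f"unknown hierarchy: {hierarchy!r}")


# Example: only one of the two main effects is in the model.
for rule in ("strong", "weak", "independent"):
    print(rule, interaction_allowed(True, False, rule))
```

With only one main effect present, the interaction is excluded under the strong rule and admitted under the weak and independent rules, which is why the strong model prunes interactions most aggressively.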

Relevance:

70.00%

Publisher:

Abstract:

BACKGROUND: Variants in the complement cascade genes and in LOC387715/HTRA1 have been widely reported to be associated with age-related macular degeneration (AMD), the most common cause of visual impairment in industrialized countries. METHODS/PRINCIPAL FINDINGS: We investigated the association of the LOC387715 A69S and complement component C3 R102G risk alleles with AMD in the Finnish case-control material and found a significant association with both variants (LOC387715 A69S: OR 2.98, p = 3.75 × 10^-9 against non-AMD controls and OR 2.79, p = 2.78 × 10^-19 against blood donor controls; C3 R102G: OR 1.83, p = 0.008 against non-AMD controls and OR 1.39, p = 0.039 against blood donor controls). Previously, we have shown a strong association between complement factor H (CFH) Y402H and AMD in the Finnish population. A carrier of at least one risk allele in each of the three susceptibility loci (LOC387715, C3, CFH) had an 18-fold risk of AMD compared with a non-carrier homozygote at all three loci. A tentative gene-gene interaction between the two major AMD-associated loci, LOC387715 and CFH, was found in this study using a multiplicative (logistic regression) model, a synergy index (departure-from-additivity model) and the mutual information (MI) method, suggesting that a common causative pathway may exist for these genes. Smoking (ever vs. never) conferred an additional risk of AMD but, somewhat surprisingly, only in connection with other factors such as sex and the C3 genotype. Population attributable risks (PAR) for the CFH, LOC387715 and C3 variants were 58.2%, 51.4% and 5.8%, respectively, with a summary PAR for the three variants of 65.4%. CONCLUSIONS/SIGNIFICANCE: Evidence for gene-gene interaction between the two major AMD-associated loci, CFH and LOC387715, was obtained using three methods: logistic regression, a synergy index and the mutual information (MI) index.
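The multiplicative and departure-from-additivity measures named in this abstract can be sketched as follows. The formulas are the standard textbook definitions (Rothman's synergy index and the ratio of the joint odds ratio to the product of the single-factor odds ratios); whether the study computed them in exactly this way is an assumption, and the odds ratios in the example are hypothetical, not results from the paper.

```python
def synergy_index(or_both: float, or_a_only: float, or_b_only: float) -> float:
    """Rothman's synergy index S; S > 1 suggests more-than-additive joint effect.

    Odds ratios are used as approximations to relative risks.
    """
    return (or_both - 1.0) / ((or_a_only - 1.0) + (or_b_only - 1.0))


def multiplicative_ratio(or_both: float, or_a_only: float, or_b_only: float) -> float:
    """Ratio of the joint OR to the product of single-factor ORs.

    A ratio of 1 means no interaction on the multiplicative scale.
    """
    return or_both / (or_a_only * or_b_only)


# Hypothetical odds ratios relative to carriers of neither risk factor.
or_a, or_b, or_ab = 2.0, 3.0, 6.0
print("synergy index:", synergy_index(or_ab, or_a, or_b))
print("multiplicative ratio:", multiplicative_ratio(or_ab, or_a, or_b))
```

The example is chosen so that the two scales disagree: the joint effect is exactly multiplicative (ratio of 1) yet more than additive (synergy index above 1), which is one reason studies often report both.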

Relevance:

70.00%

Publisher:

Abstract:

C-Reactive Protein (CRP) is a biomarker indicating tissue damage, inflammation, and infection. High-sensitivity CRP (hsCRP) is an emerging biomarker often used to estimate an individual’s risk for future coronary heart disease (CHD). hsCRP levels below 1.00 mg/l indicate a low risk of developing CHD, levels between 1.00 mg/l and 3.00 mg/l indicate an elevated risk, and levels exceeding 3.00 mg/l indicate a high risk. Multiple genome-wide association studies (GWAS) have identified a number of genetic polymorphisms that influence CRP levels. SNPs implicated in such studies have been found in or near genes of interest including CRP, APOE, APOC, IL-6, HNF1A, LEPR, and GCKR. A strong positive correlation has also been found between CRP levels and BMI, a known risk factor for CHD and a state of chronic inflammation. We conducted a series of analyses designed to identify loci that interact with BMI to influence CRP levels in a subsample of European-Americans in the ARIC cohort. In a stratified GWA analysis, 15 genetic regions were identified as having significantly (p-value < 2.00×10^-3) distinct effects on hsCRP levels between the two obesity strata: lean (18.50 kg/m² < BMI < 24.99 kg/m²) and obese (BMI ≥ 30.00 kg/m²). A GWA analysis performed on all individuals combined (i.e., not stratified a priori for obesity status), with the inclusion of an additional parameter for BMI-by-gene interaction, identified 11 regions that interact with BMI to influence hsCRP levels. Two regions containing the genes GJA5 and GJA8 (on chromosome 1) and FBXO11 (on chromosome 2) were identified by both methods of analysis, suggesting that these genes possibly interact with BMI to influence hsCRP levels. We speculate that atrial fibrillation (AF), age-related cataracts and the TGF-β pathway may be the biological processes influenced by the interaction of GJA5, GJA8 and FBXO11, respectively, with BMI to cause changes in hsCRP levels. 
Future studies should focus on the influence of gene × BMI interaction on AF, age-related cataracts and TGF-β.
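The hsCRP risk categories and the two obesity strata quoted in this abstract can be written down as a small sketch. The cutoffs are taken from the abstract; the function names and the choice to return None for BMI values outside both strata are illustrative assumptions.

```python
from typing import Optional


def hscrp_risk(level_mg_per_l: float) -> str:
    """Classify CHD risk from an hsCRP level using the quoted cutoffs."""
    if level_mg_per_l < 1.00:
        return "low"
    if level_mg_per_l <= 3.00:
        return "elevated"
    return "high"


def obesity_stratum(bmi: float) -> Optional[str]:
    """Assign a BMI (kg/m^2) to the lean or obese stratum, else None."""
    if 18.50 < bmi < 24.99:
        return "lean"
    if bmi >= 30.00:
        return "obese"
    # Values outside both strata are not used in the stratified analysis.
    return None


for crp, bmi in [(0.6, 22.0), (2.1, 27.5), (4.8, 33.2)]:
    print(crp, hscrp_risk(crp), bmi, obesity_stratum(bmi))
```

Note that individuals with a BMI between the two strata fall into neither group, so the stratified analysis contrasts the two ends of the BMI range rather than the whole sample.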

Relevance:

70.00%

Publisher:

Abstract:

My dissertation focuses on developing methods for detecting gene-gene/environment interactions and imprinting effects in human complex diseases and quantitative traits. It includes three sections: (1) generalizing the Natural and Orthogonal Interaction (NOIA) model for the coding technique originally developed for gene-gene (GxG) interaction and also to reduced models; (2) developing a novel statistical approach that allows for modeling gene-environment (GxE) interactions influencing disease risk; and (3) developing a statistical approach for modeling genetic variants displaying parent-of-origin effects (POEs), such as imprinting. In the past decade, genetic researchers have identified a large number of causal variants for human genetic diseases and traits by single-locus analysis, and interaction has now become a hot topic in the effort to search for the complex network between multiple genes or environmental exposures contributing to the outcome. Epistasis, also known as gene-gene interaction, is the departure from additivity of the genetic effects of several genes on a trait, which means that the same alleles of one gene could display different genetic effects under different genetic backgrounds. In this study, we propose to implement the NOIA model for association studies along with interaction for human complex traits and diseases. We compare the performance of the new statistical models we developed with that of the usual functional model through both simulation studies and real data analysis. Both simulation and real data analysis revealed higher power of the NOIA GxG interaction model for detecting both main genetic effects and interaction effects. Through application to a melanoma dataset, we confirmed the previously identified significant regions for melanoma risk at 15q13.1, 16q24.3 and 9p21.3. We also identified potential interactions with these significant regions that contribute to melanoma risk. 
Based on the NOIA model, we developed a novel statistical approach that allows us to model the effects of a genetic factor and a binary environmental exposure that jointly influence disease risk. Both simulation and real data analyses revealed higher power of the NOIA model for detecting both main genetic effects and interaction effects for both quantitative and binary traits. We also found that estimates of the parameters from logistic regression for binary traits are no longer statistically uncorrelated under the alternative model when there is an association. Applying our novel approach to a lung cancer dataset, we confirmed four SNPs in the 5p15 and 15q25 regions to be significantly associated with lung cancer risk in the Caucasian population: rs2736100, rs402710, rs16969968 and rs8034191. We also validated that rs16969968 and rs8034191 in the 15q25 region interact significantly with smoking in the Caucasian population. Our approach identified potential interactions of SNP rs2256543 in 6p21 with smoking in contributing to lung cancer risk. Genetic imprinting is the best-known cause of parent-of-origin effects (POEs), whereby a gene is differentially expressed depending on the parental origin of the same alleles. Genetic imprinting affects several human disorders, including diabetes, breast cancer, alcoholism, and obesity. This phenomenon has been shown to be important for normal embryonic development in mammals. Traditional association approaches ignore this important genetic phenomenon. In this study, we propose a NOIA framework for a single-locus association study that estimates both main allelic effects and POEs. We develop statistical (Stat-POE) and functional (Func-POE) models, and demonstrate conditions for orthogonality of the Stat-POE model. We conducted simulations for both quantitative and qualitative traits to evaluate the performance of the statistical and functional models with different levels of POEs. 
Our results showed that the newly proposed Stat-POE model, which ensures orthogonality of variance components if Hardy-Weinberg Equilibrium (HWE) holds or the minor and major allele frequencies are equal, had greater power for detecting the main allelic additive effect than the Func-POE model, which codes according to allelic substitutions, for both quantitative and qualitative traits. The power for detecting the POE was the same for the Stat-POE and Func-POE models under HWE for quantitative traits.
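To make the contrast between functional and statistical (orthogonal) coding concrete, the sketch below builds single-locus additive and dominance codes for a biallelic marker and checks their frequency-weighted orthogonality. The coding shown is the one usually attributed to the original NOIA formulation for one locus; it is reproduced here as an assumption, it is not the Stat-POE model of this dissertation, and the genotype frequencies are hypothetical.

```python
def noia_statistical_coding(p_aa: float, p_ab: float, p_bb: float):
    """Additive and dominance codes for genotypes (aa, ab, bb).

    Assumed single-locus NOIA statistical coding; frequencies need not
    satisfy Hardy-Weinberg Equilibrium.
    """
    mean_count = p_ab + 2.0 * p_bb              # mean number of b alleles
    additive = (0.0 - mean_count, 1.0 - mean_count, 2.0 - mean_count)
    v = p_aa + p_bb - (p_aa - p_bb) ** 2        # variance of the allele count
    dominance = (-2.0 * p_ab * p_bb / v,
                 4.0 * p_aa * p_bb / v,
                 -2.0 * p_aa * p_ab / v)
    return additive, dominance


def weighted_inner(freqs, x, y):
    """Genotype-frequency-weighted inner product of two codings."""
    return sum(f * a * b for f, a, b in zip(freqs, x, y))


# Hypothetical genotype frequencies that deviate from HWE.
freqs = (0.5, 0.3, 0.2)
stat_add, stat_dom = noia_statistical_coding(*freqs)

# Functional coding: allele count and heterozygote indicator.
func_add, func_dom = (0.0, 1.0, 2.0), (0.0, 1.0, 0.0)

print("statistical:", weighted_inner(freqs, stat_add, stat_dom))
print("functional: ", weighted_inner(freqs, func_add, func_dom))
```

Under this coding the weighted inner product of the statistical additive and dominance columns is zero up to rounding even away from HWE, whereas the functional columns remain correlated.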

Relevance:

60.00%

Publisher:

Abstract:

Idiopathic or isolated clubfoot is a common orthopedic birth defect that affects approximately 135,000 children worldwide. It is characterized by equinus, varus and adductus deformities of the ankle and foot. Correction of clubfoot involves months of serial manipulations, castings and bracing, with surgical correction needed in forty percent of cases. A multifactorial etiology has been suggested in numerous studies, with both environmental and genetic factors playing an etiologic role. Maternal smoking during pregnancy is the only common environmental factor that has consistently been shown to increase the risk of clubfoot. Moreover, a positive family history of clubfoot combined with maternal smoking increases the risk of clubfoot twentyfold. These findings suggest that genetic variation in smoking metabolism genes may increase susceptibility to clubfoot. Based on this reasoning, we interrogated eight candidate genes, chosen for their involvement in phase 1 and 2 cigarette smoke metabolism. Twenty-two SNPs and two null alleles in eight genes (CYP1A1, CYP1A2, CYP1B1, CYP2A6, EPHX1, NAT2, GSTM1 and GSTT1) were genotyped in a dataset composed of non-Hispanic white (NHW) and Hispanic multiplex and simplex families. Only one SNP in CYP1A1, rs1048943, had significantly altered transmission in the aggregate and multiplex NHW datasets (p=0.003 and p=0.009). Perturbation of CYP1A1 by the rs1048943 polymorphism causes an increase in the amount of harmful, adduct-forming metabolic intermediates. A significant gene interaction between EPHX1 and NAT2 was also found (p=0.007). This interaction may affect the metabolism of harmful metabolic intermediates. Additionally, marginal interactions were found for other xenobiotic genes, and these interactions may play a contributory role in clubfoot. 
Importantly, for CYP1A2, significant maternal (p=0.03; RR=1.24; 95% CI: 1.04-1.44) and fetal (p=0.01; RR=1.33; 95% CI: 1.13-1.54) genotypic effects were identified, suggesting that both maternal and fetal genotypes impact normal limb development. No association was found between maternal smoking status and tobacco metabolism genes. Together, these results suggest that xenobiotic metabolism genes may play a contributory role in the etiology of clubfoot regardless of maternal smoking status and may impact foot development through perturbation of tobacco metabolic pathways.

Relevance:

60.00%

Publisher:

Abstract:

Lynch syndrome is caused by inherited germ-line mutations in the DNA mismatch repair genes, resulting in cancers at an early age, predominantly colorectal (CRC) and endometrial cancers. Though the median age at onset for CRC is about 45 years, disease penetrance varies, suggesting that cancer susceptibility may be modified by environmental factors or other low-penetrance genes. Genetic variation due to polymorphisms in genes encoding metabolic enzymes can influence carcinogenesis by altering the expression and activity level of the enzymes. Variation in MTHFR, an important folate-metabolizing enzyme, can affect DNA methylation and DNA synthesis, and variation in xenobiotic-metabolizing enzymes can affect the metabolism and clearance of carcinogens, thus modifying cancer risk. This study examined a retrospective cohort of 257 individuals with Lynch syndrome for polymorphisms in genes encoding the xenobiotic-metabolizing enzymes CYP1A1 (I462V and MspI), EPHX1 (H139R and Y113H), GSTP1 (I105V and A114V), GSTM1 and GSTT1 (deletions), and the folate-metabolizing enzyme MTHFR (C677T and A1298C). In addition, a series of 786 cases of sporadic CRC were genotyped for CYP1A1 I462V and EPHX1 Y113H to assess gene-gene interaction and gene-environment interaction with smoking in a case-only analysis. Prominent findings of this study were that the presence of an MTHFR C677T variant allele was associated with an age at onset for CRC that was 4 years later on average and a reduced age-associated risk of developing CRC (hazard ratio: 0.55; 95% confidence interval: 0.36–0.85) compared with the absence of any variant allele in individuals with Lynch syndrome. Similarly, Lynch syndrome individuals heterozygous for the CYP1A1 I462V A>G polymorphism developed CRC an average of 4 years earlier and were at a 78% increased age-associated risk (hazard ratio for AG relative to AA: 1.78; 95% confidence interval: 1.16–2.74) compared with those with the homozygous wild-type genotype. 
Therefore, these two polymorphisms may be additional susceptibility factors for CRC in Lynch syndrome. In the case-only analysis, evidence of interaction was seen between CYP1A1 I462V and EPHX1 Y113H and between EPHX1 Y113H and smoking, suggesting that genetic and environmental factors may interact to increase sporadic CRC risk. These findings may make it possible to identify subsets of high-risk individuals for targeted prevention and intervention.
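A case-only analysis estimates interaction from the association between two factors among cases alone. The sketch below computes the usual case-only interaction odds ratio from a 2×2 table; it relies on the standard assumption that the two factors are independent in the source population, and the counts are hypothetical, not data from the study.

```python
def case_only_interaction_or(both: int, a_only: int, b_only: int, neither: int) -> float:
    """Case-only interaction odds ratio from counts among cases.

    both:    cases carrying factor A and factor B
    a_only:  cases carrying only factor A
    b_only:  cases carrying only factor B
    neither: cases carrying neither factor
    """
    if a_only == 0 or b_only == 0:
        raise ValueError("odds ratio undefined when an off-diagonal cell is zero")
    return (both * neither) / (a_only * b_only)


# Hypothetical counts among cases (e.g. a variant genotype and ever-smoking).
print(case_only_interaction_or(both=40, a_only=20, b_only=30, neither=60))
```

A value near 1 indicates no departure from multiplicative joint effects; values above 1 suggest that the two factors occur together in cases more often than independence would predict.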

Relevance:

40.00%

Publisher:

Abstract:

Despite current enthusiasm for investigating gene-gene and gene-environment interactions, the essential issue of how to define and detect gene-environment interactions remains unresolved. In this report, we define gene-environment interaction as a stochastic dependence in the context of the effects of the genetic and environmental risk factors on the cause of phenotypic variation among individuals. We use mutual information, which is widely used in communication and complex system analysis, to measure gene-environment interactions. We investigate how gene-environment interactions generate a large difference in the information measure of gene-environment interactions between the general population and a diseased population, which motivates us to develop mutual information-based statistics for testing gene-environment interactions. We validated the null distribution and calculated the type 1 error rates for the mutual information-based statistics for testing gene-environment interactions using extensive simulation studies. We found that the new test statistics were more powerful than traditional logistic regression under several disease models. Finally, to further evaluate the performance of our new method, we applied the mutual information-based statistics to three real examples. Our results showed that P-values for the mutual information-based statistics were much smaller than those obtained by other approaches, including logistic regression models.
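Mutual information between a genotype and an exposure can be estimated from a contingency table of counts. The sketch below uses the standard plug-in estimator in natural-log units; it illustrates the information measure only, not the test statistics developed in the report, and the tables are hypothetical.

```python
from math import log


def mutual_information(table):
    """Plug-in mutual information (in nats) from a table of counts.

    table: list of rows, e.g. rows = genotype classes, columns = exposure levels.
    """
    total = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    mi = 0.0
    for i, row in enumerate(table):
        for j, n in enumerate(row):
            if n > 0:  # empty cells contribute nothing
                # p_ij * log(p_ij / (p_i. * p_.j)) written in terms of counts
                mi += (n / total) * log(n * total / (row_totals[i] * col_totals[j]))
    return mi


# Hypothetical tables: independent factors vs. perfectly dependent factors.
independent = [[20, 30], [40, 60]]
dependent = [[50, 0], [0, 50]]
print(mutual_information(independent))
print(mutual_information(dependent))
```

The independent table gives a value of zero and the perfectly dependent 2×2 table gives ln 2, the maximum for two balanced binary variables. Comparing such a measure between a diseased population and the general population is the intuition the abstract describes.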

Relevance:

40.00%

Publisher:

Abstract:

The BCR gene is involved in the pathogenesis of Philadelphia chromosome-positive (Ph¹) leukemias. Typically, the 5′ portion of BCR on chromosome 22 becomes fused to a 5′-truncated ABL gene from chromosome 9, resulting in a chimeric BCR-ABL gene. To investigate the role of the BCR gene product, a number of BCR peptide sequences were used to generate anti-BCR antibodies for detection of BCR and BCR-ABL proteins. Since both BCR and ABL proteins have kinase activity, the anti-BCR antibodies were tested for their ability to immunoprecipitate BCR and BCR-ABL proteins from cellular lysates by use of an immunokinase assay. Antisera directed towards the C-terminal portions of P160 BCR, sequences not present in BCR-ABL proteins, were capable of co-immunoprecipitating P210 BCR-ABL from the Ph¹-positive cell line K562. Re-immunoprecipitation studies following complete denaturation showed that C-terminal BCR antisera specifically recognized P160 BCR but not P210 BCR-ABL. These and other results indicated the presence of a P160 BCR/P210 BCR-ABL protein complex in K562 cells. Experiments performed with Ph¹-positive ALL cells and uncultured Ph¹-positive patient white blood cells established the general presence of BCR/BCR-ABL protein complexes in BCR-ABL expressing cells. However, two cell lines derived from Ph¹-positive patients lacked P160 BCR/P210 BCR-ABL complexes. Lysates from one of these cell lines mixed with lysates from a cell line that expresses only P160 BCR failed to generate BCR/BCR-ABL protein complexes in vitro, indicating that P160 BCR and P210 BCR-ABL do not simply oligomerize. Two-dimensional tryptic maps were performed on both BCR and BCR-ABL proteins labeled in vitro with ³²P. 
These maps indicate that the autophosphorylation sites in BCR-ABL proteins are primarily located within BCR exon 1 sequences in both P210 and P185 BCR-ABL, and that P160 BCR is phosphorylated in trans in similar sites by the activated ABL kinase of both BCR-ABL proteins. These results provide strong evidence that P160 BCR serves as a target for the BCR-ABL oncoprotein. K562 cells, induced to terminally differentiate with the tumor promoter TPA, show a loss of P210 BCR-ABL kinase activity 12-18 hours after addition of TPA. This loss coincides with the loss of activity in P160 BCR/P210 BCR-ABL complexes but not with the loss of the P210 BCR-ABL, suggesting the existence of an inactive form of P210 BCR-ABL. However, a degraded BCR-ABL protein served as the kinase active form preferentially sequestered within the remaining BCR/BCR-ABL protein complex. The results described in this thesis form the basis for a model for BCR-ABL induced leukemias which is presented and discussed.

Relevance:

40.00%

Publisher:

Abstract:

Hypertension (HT) is mediated by the interaction of many genetic and environmental factors. Previous genome-wide linkage analysis studies have found many loci that show linkage to HT or blood pressure (BP) regulation, but the results were generally inconsistent. Gene-by-environment interaction is among the reasons that potentially explain these inconsistencies between studies. Here we investigate the influence of gene-by-smoking (GxS) interaction on HT and BP in European American (EA), African American (AA) and Mexican American (MA) families from the GENOA study. A variance component-based method was used to perform genome-wide linkage analysis of systolic blood pressure (SBP), diastolic blood pressure (DBP), and HT status, as well as bivariate analysis of SBP and DBP, for smokers, non-smokers, and combined groups. The most significant results were found for SBP in MA. The strongest signal was on chromosome 17q24 (LOD = 4.2), which increased to LOD = 4.7 in bivariate analysis, but there was no evidence of GxS interaction at this locus (p = 0.48). Two signals were identified in only one group: on chromosome 15q26.2 (LOD = 3.37) in non-smokers and on chromosome 7q21.11 (LOD = 1.4) in smokers, both of which had strong evidence for GxS interaction (p = 0.00039 and 0.009, respectively). There were also two other signals, one on chromosome 20q12 (LOD = 2.45) in smokers, which became much higher in the combined sample (LOD = 3.53), and one on chromosome 6p22.2 (LOD = 2.06) in non-smokers. Neither peak had very strong evidence for GxS interaction (p = 0.08 and 0.06, respectively). A fine-mapping association study was performed using 200 SNPs in 30 genes located under the linkage signals on chromosomes 15 and 17. Under the chromosome 15 peak, the association analysis identified 6 SNPs accounting for a 7 mmHg increase in SBP in MA non-smokers. For the chromosome 17 linkage peak, the association analysis identified 3 SNPs accounting for a 6 mmHg increase in SBP in MA. 
However, none of these SNPs was significant after correcting for multiple testing, and accounting for them in the linkage analysis produced very small reductions in the linkage signal. The linkage analysis of BP traits that took smoking status into account produced very interesting signals for SBP in the MA population. The fine-mapping association analysis gave some insight into the contribution of some SNPs to two of the identified signals, but since these SNPs did not remain significant after multiple testing correction and did not explain the linkage peaks, more work is needed to confirm these exploratory results and identify the culprit variants under these linkage peaks.

Relevance:

30.00%

Publisher:

Abstract:

Brain tumors are among the most aggressive types of cancer in humans, with an estimated median survival time of 12 months and only 4% of patients surviving more than 5 years after diagnosis. Until recently, brain tumor prognosis has been based only on clinical information such as tumor grade and patient age, but there are reports indicating that molecular profiling of gliomas can reveal subgroups of patients with distinct survival rates. We hypothesize that coupling molecular profiling of brain tumors with clinical information might improve predictions of patient survival time and, consequently, better guide future treatment decisions. To evaluate this hypothesis, the general goal of this research is to build models for survival prediction of glioma patients using DNA molecular profiles (U133 Affymetrix gene expression microarrays) along with clinical information. First, a predictive Random Forest model is built for binary outcomes (i.e., short- vs. long-term survival), and a small subset of genes whose expression values can be used to predict survival time is selected. Next, a new statistical methodology is developed for predicting time-to-death outcomes using Bayesian ensemble trees. Because of the large heterogeneity observed within the prognostic classes obtained by the Random Forest model, prediction can be improved by relating time-to-death to the gene expression profile directly. We propose a Bayesian ensemble model for survival prediction that is appropriate for high-dimensional data such as gene expression data. Our approach is based on the ensemble "sum-of-trees" model, which is flexible enough to incorporate additive and interaction effects between genes. We specify a fully Bayesian hierarchical approach and illustrate our methodology for the CPH, Weibull, and AFT survival models. We overcome the lack of conjugacy by using a latent variable formulation to model the covariate effects, which decreases computation time for model fitting. 
Also, our proposed models provides a model-free way to select important predictive prognostic markers based on controlling false discovery rates. We compare the performance of our methods with baseline reference survival methods and apply our methodology to an unpublished data set of brain tumor survival times and gene expression data, selecting genes potentially related to the development of the disease under study. A closing discussion compares results obtained by Random Forest and Bayesian ensemble methods under the biological/clinical perspectives and highlights the statistical advantages and disadvantages of the new methodology in the context of DNA microarray data analysis.
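The "sum-of-trees" idea in the abstract above can be illustrated with a minimal sketch. It assumes an AFT-style setup in which the log survival time is the sum of the outputs of several small trees; the gene names, thresholds, and leaf values are hypothetical and hand-picked, not fitted, and the sketch leaves out the Bayesian prior and posterior sampling that the abstract describes.

```python
import math


def stump(feature, threshold, left_value, right_value):
    """One-split tree: contributes an additive effect of a single gene."""
    def predict(x):
        return left_value if x[feature] <= threshold else right_value
    return predict


def interaction_tree(f1, t1, f2, t2, leaf_values):
    """Two-split tree: the leaf depends on both genes, so it can encode
    an interaction that no single-gene stump could represent."""
    def predict(x):
        i = 0 if x[f1] <= t1 else 1
        j = 0 if x[f2] <= t2 else 1
        return leaf_values[i][j]
    return predict


def predict_log_time(trees, x):
    """Sum-of-trees prediction on the log-time scale."""
    return sum(tree(x) for tree in trees)


def predict_median_survival(trees, x):
    """Assumes log T = f(x) + noise with median-zero noise (an AFT-style
    assumption), so the median survival time is exp(f(x))."""
    return math.exp(predict_log_time(trees, x))


# Hypothetical ensemble: one additive effect plus one gene-gene interaction.
trees = [
    stump("gene_a", 0.0, 2.0, 1.0),
    interaction_tree("gene_a", 0.0, "gene_b", 0.0, [[0.0, 0.0], [0.0, -0.5]]),
]
patient = {"gene_a": 1.0, "gene_b": 1.0}
log_time = predict_log_time(trees, patient)
```

In a Bayesian fit, a prediction like this would typically be averaged over many posterior draws of the trees rather than computed from one fixed ensemble.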

Relevance:

30.00% 30.00%

Publisher:

Abstract:

The mammalian adaptor protein Alix [ALG-2 (apoptosis-linked-gene-2 product)-interacting protein X] belongs to a conserved family of proteins that have in common an N-terminal Bro1 domain and a C-terminal PRD (proline-rich domain), both of which mediate partner protein interactions. Following our previous finding that Xp95, the Xenopus orthologue of Alix, undergoes a phosphorylation-dependent gel mobility shift during progesterone-induced oocyte meiotic maturation, we explored potential regulation of Xp95/Alix by protein phosphorylation in hormone-induced cell cycle re-entry or M-phase induction. MALDI-TOF (matrix-assisted laser-desorption ionization-time-of-flight) MS analyses and gel mobility-shift assays show that Xp95 is phosphorylated at multiple sites within the N-terminal half of the PRD during Xenopus oocyte maturation, and that a similar region in Alix is phosphorylated in mitotically arrested but not serum-stimulated mammalian cells. Tandem MS identifies Thr745 within this region, which lies in a conserved binding site for the adaptor protein SETA [SH3 (Src homology 3) domain-containing, expressed in tumorigenic astrocytes]/CIN85 (Cbl-interacting protein of 85 kDa)/SH3KBP1 (SH3-domain kinase-binding protein 1), as one of the phosphorylation sites in Xp95. Results from GST (glutathione S-transferase) pull-down and peptide binding/competition assays further demonstrate that Thr745 phosphorylation inhibits the interaction of Xp95 with the second SH3 domain of SETA. However, immunoprecipitates of Xp95 from extracts of M-phase-arrested mature oocytes contained additional partner proteins as compared with immunoprecipitates from extracts of G2-arrested immature oocytes. The deubiquitinase AMSH (associated molecule with the SH3 domain of signal transducing adaptor molecule) specifically interacts with phosphorylated Xp95 in M-phase cell lysates.
These findings establish that Xp95/Alix is phosphorylated within the PRD during M-phase induction, and indicate that the phosphorylation may both positively and negatively modulate its interaction with partner proteins.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

High-throughput assays, such as the yeast two-hybrid system, have generated a huge amount of protein-protein interaction (PPI) data in the past decade. This greatly increases the need for reliable methods that systematically and automatically suggest protein functions and the relationships between them. With the available PPI data, it is now possible to study these functions and relationships in the context of a large-scale network. To date, several network-based schemes have been proposed to annotate protein functions effectively on a large scale. However, because of the noise inherent in high-throughput data generation, new methods and algorithms are needed to increase the reliability of functional annotations. Previous work on a yeast PPI network (Samanta and Liang, 2003) has shown that the local connection topology, particularly two proteins sharing an unusually large number of neighbors, can predict functional associations between proteins and hence suggest their functions. One advantage of that work is that its algorithm is not sensitive to noise (false positives) in high-throughput PPI data. In this study, we improved their prediction scheme by developing a new algorithm and new methods, which we applied to a human PPI network to make a genome-wide functional inference. We used the new algorithm to measure and reduce the influence of hub proteins on the detection of functionally associated proteins. We used the annotations of the Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) as independent and unbiased benchmarks to evaluate our algorithms and methods within the human PPI network. We showed that, compared with the previous work of Samanta and Liang, the algorithm and methods developed in this study improved the overall quality of functional inferences for human proteins. By applying the algorithms to the human PPI network, we obtained 4,233 significant functional associations among 1,754 proteins.
Further comparisons of their KEGG and GO annotations allowed us to assign 466 KEGG pathway annotations to 274 proteins and 123 GO annotations to 114 proteins, with estimated false discovery rates of <21% for KEGG and <30% for GO. We clustered 1,729 proteins by their functional associations and performed pathway analysis to identify several subclusters that are highly enriched in certain signaling pathways. In particular, we performed a detailed analysis of a subcluster enriched in the transforming growth factor β signaling pathway (P < 10^-50), which is important in cell proliferation and tumorigenesis. Analysis of another four subclusters also suggested potential new players in six signaling pathways that merit further experimental investigation. Our study gives clear insight into the common neighbor-based prediction scheme and provides a reliable method for large-scale functional annotation in this post-genomic era.
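The common-neighbor idea in the abstract above can be sketched as a small calculation. The sketch assumes that "an unusually large number of shared neighbors" is scored with a hypergeometric tail probability; the exact statistic used by Samanta and Liang or by this study, and its correction for hub proteins, may differ. The function names and the toy network are hypothetical.

```python
from itertools import combinations
from math import comb


def build_adjacency(edges):
    """Turn an undirected edge list into a dict of neighbor sets."""
    adjacency = {}
    for a, b in edges:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
    return adjacency


def shared_neighbor_pvalue(n_total, n1, n2, m):
    """Probability of at least m shared neighbors by chance, treating the
    n2 neighbors of one protein as a random draw from n_total proteins,
    n1 of which are neighbors of the other protein (hypergeometric null)."""
    total = comb(n_total, n2)
    tail = sum(
        comb(n1, k) * comb(n_total - n1, n2 - k)
        for k in range(m, min(n1, n2) + 1)
    )
    return tail / total


def rank_pairs(edges):
    """Score every protein pair that shares a neighbor; smallest p first."""
    adjacency = build_adjacency(edges)
    n_total = len(adjacency)  # simplification: the pair itself is not excluded
    results = []
    for a, b in combinations(sorted(adjacency), 2):
        shared = len(adjacency[a] & adjacency[b])
        if shared == 0:
            continue
        p = shared_neighbor_pvalue(
            n_total, len(adjacency[a]), len(adjacency[b]), shared
        )
        results.append((p, a, b, shared))
    return sorted(results)


# Toy network: A and B share all three of their partners; F to J form a chain.
toy_edges = [
    ("A", "C"), ("A", "D"), ("A", "E"),
    ("B", "C"), ("B", "D"), ("B", "E"),
    ("F", "G"), ("G", "H"), ("H", "I"), ("I", "J"),
]
ranking = rank_pairs(toy_edges)
```

In the toy network the pair A and B ranks first, because two proteins with three partners each are unlikely to share all three by chance among ten proteins. On a real network the p-values would also need a multiple-testing correction across all pairs.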

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Inflammation is a key process in cardiovascular diseases. The extracellular matrix (ECM) of the vasculature is a major target of inflammatory cytokines, and TNFalpha regulates ECM metabolism by affecting collagen production. In this study, we examined the pathways mediating TNFalpha-induced suppression of prolyl-4 hydroxylase alpha1 (P4Halpha1), the rate-limiting isoform of P4H responsible for procollagen hydroxylation, maturation, and organization. Using human aortic smooth muscle cells, we found that TNFalpha activated the MKK4-JNK1 pathway, which induced histone (H) 4 lysine 12 acetylation within the TNFalpha response element in the P4Halpha1 promoter. The acetylated H4 then recruited a transcription factor, NonO, which, in turn, recruited HDACs and induced H3 lysine 9 deacetylation, thereby inhibiting transcription from the P4Halpha1 promoter. Furthermore, we found that TNFalpha oxidized DJ-1, which may be essential for the NonO-P4Halpha1 interaction, because treatment with gene-specific siRNA to knock down DJ-1 eliminated the TNFalpha-induced NonO-P4Halpha1 interaction and the resulting suppression of P4Halpha1. Our findings may be relevant to aortic aneurysm and dissection and to the stability of the fibrous cap of atherosclerotic plaque, in which collagen metabolism is important for arterial remodeling. Defining this cytokine-mediated regulatory pathway may provide novel molecular targets for therapeutic intervention to prevent plaque rupture and acute coronary occlusion.