993 results for repressor gene
Abstract:
Exponential growth of genomic data in the last two decades has made manual analyses impractical for all but trial studies. As genomic analyses have become more sophisticated, and move toward comparisons across large datasets, computational approaches have become essential. One of the most important biological questions is to understand the mechanisms underlying gene regulation. Genetic regulation is commonly investigated and modelled through the use of transcriptional regulatory network (TRN) structures. These model the regulatory interactions between two key components: transcription factors (TFs) and the target genes (TGs) they regulate. Transcriptional regulatory networks have proven to be invaluable scientific tools in Bioinformatics. When used in conjunction with comparative genomics, they have provided substantial insights into the evolution of regulatory interactions. Current approaches to regulatory network inference, however, omit two additional key entities: promoters and transcription factor binding sites (TFBSs). In this study, we attempted to explore the relationships among these regulatory components in bacteria. Our primary goal was to identify relationships that can assist in reducing the high false positive rates associated with transcription factor binding site predictions and thereupon enhance the reliability of the inferred transcription regulatory networks. In our preliminary exploration of relationships between the key regulatory components in Escherichia coli transcription, we discovered a number of potentially useful features. The combination of location score and sequence dissimilarity scores increased de novo binding site prediction accuracy by 13.6%. Another important observation made was with regards to the relationship between transcription factors grouped by their regulatory role and corresponding promoter strength. 
Our study of E. coli σ70 promoters found support at the 0.1 significance level for our hypothesis that weak promoters are preferentially associated with activator binding sites to enhance gene expression, whilst strong promoters have more repressor binding sites to repress or inhibit gene transcription. Although the observations were specific to σ70, they nevertheless strongly encourage additional investigations when more experimentally confirmed data are available. In our preliminary exploration of relationships between the key regulatory components in E. coli transcription, we discovered a number of potentially useful features, some of which proved successful in reducing the number of false positives when applied to re-evaluate binding site predictions. Of chief interest was the relationship observed between promoter strength and TFs with respect to their regulatory role. Based on the common assumption that promoter homology positively correlates with transcription rate, we hypothesised that weak promoters would have more transcription factors that enhance gene expression, whilst strong promoters would have more repressor binding sites. The t-tests performed for E. coli σ70 promoters returned a p-value of 0.072, which at the 0.1 significance level suggested support for our (alternative) hypothesis, although this trend may only be present for promoters where the corresponding TFBSs are either all repressors or all activators. Nevertheless, such suggestive results strongly encourage additional investigations when more experimentally confirmed data become available. Much of the remainder of the thesis concerns a machine learning study of binding site prediction, using the SVM and kernel methods, principally the spectrum kernel. 
Spectrum kernels have been successfully applied in previous studies of protein classification [91, 92], as well as the related problem of promoter predictions [59], and we have here successfully applied the technique to refining TFBS predictions. The advantages provided by the SVM classifier were best seen in 'moderately'-conserved transcription factor binding sites as represented by our E. coli CRP case study. Inclusion of additional position feature attributes further increased accuracy by 9.1%, but more notable was the considerable decrease in false positive rate from 0.8 to 0.5 while retaining 0.9 sensitivity. Improved prediction of transcription factor binding sites is in turn extremely valuable in improving inference of regulatory relationships, a problem notoriously prone to false positive predictions. Here, the number of false regulatory interactions inferred using the conventional two-component model was substantially reduced when we integrated de novo transcription factor binding site predictions as an additional criterion for acceptance in a case study of inference in the Fur regulon. This initial work was extended to a comparative study of the iron regulatory system across 20 Yersinia strains. This work revealed interesting, strain-specific differences, especially between pathogenic and non-pathogenic strains. Such differences were made clear through interactive visualisations using the TRNDiff software developed as part of this work, and would have remained undetected using conventional methods. This approach led to the nomination of the Yfe iron-uptake system as a candidate for further wet-lab experimentation due to its potential active functionality in non-pathogens and its known participation in full virulence of the bubonic plague strain. Building on this work, we introduced novel structures we have labelled as 'regulatory trees', inspired by the phylogenetic tree concept. 
Instead of using gene or protein sequence similarity, the regulatory trees were constructed based on the number of similar regulatory interactions. While the common phylogenetic trees convey information regarding changes in gene repertoire, which we might regard as analogous to 'hardware', the regulatory tree informs us of the changes in regulatory circuitry, in some respects analogous to 'software'. In this context, we explored the 'pan-regulatory network' for the Fur system, the entire set of regulatory interactions found for the Fur transcription factor across a group of genomes. In the pan-regulatory network, emphasis is placed on how the regulatory network for each target genome is inferred from multiple sources instead of a single source, as is the common approach. The benefit of using multiple reference networks is a more comprehensive survey of the relationships, and increased confidence in the regulatory interactions predicted. In the present study, we distinguish between relationships found across the full set of genomes as the 'core-regulatory-set', and interactions found only in a subset of genomes explored as the 'sub-regulatory-set'. We found nine Fur target gene clusters present across the four genomes studied; this core set potentially identifies basic regulatory processes essential for survival. Species-level differences are seen at the sub-regulatory-set level; for example, the known virulence factors YbtA and PchR were found in Y. pestis and P. aeruginosa respectively, but were not present in either E. coli or B. subtilis. Such factors, and the iron-uptake systems they regulate, are ideal candidates for wet-lab investigation to determine whether or not they are pathogen-specific. In this study, we employed a broad range of approaches to address our goals and assessed these methods using the Fur regulon as our initial case study. 
We identified a set of promising feature attributes; demonstrated their success in increasing transcription factor binding site prediction specificity while retaining sensitivity, and showed the importance of binding site predictions in enhancing the reliability of regulatory interaction inferences. Most importantly, these outcomes led to the introduction of a range of visualisations and techniques, which are applicable across the entire bacterial spectrum and can be utilised in studies beyond the understanding of transcriptional regulatory networks.
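As an illustration of the spectrum kernel mentioned in the abstract above, the following is a minimal sketch and not the thesis's implementation. It assumes the standard definition, in which two sequences are compared by the inner product of their k-mer count vectors; the function names and the default k are hypothetical choices made for this example.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping substring of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(a, b, k=3):
    """Inner product of the k-mer count vectors of sequences a and b.

    Assumption: plain, unnormalised spectrum kernel; k=3 is an
    arbitrary illustrative default, not a value from the abstract.
    """
    counts_a = kmer_counts(a, k)
    counts_b = kmer_counts(b, k)
    # Counter returns 0 for k-mers absent from b, so only shared k-mers contribute.
    return sum(n * counts_b[kmer] for kmer, n in counts_a.items())
```

A kernel of this form can be handed to an SVM as a precomputed Gram matrix over candidate binding-site sequences; the position features described in the abstract would be additional inputs that this sketch does not cover.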
Abstract:
Paraffin sections (n = 168, 27 benign, 16 low malignant potential [LMP] and 125 malignant tumours) from epithelial ovarian tumours were evaluated immunohistochemically for expression of retinoblastoma gene product (pRB) and p53 protein, and the relationship among pRB, p53 and cyclin-dependent kinase inhibitor 2 (CDKN2) gene product p16INK4A (p16) was analysed, following our previous study of p16. Forty-one percent of the benign, 50% of the LMP and most (71%) of the malignant tumours showed high pRB expression. High expression of pRB (>50% pRB-positive cells) significantly correlated with non-mucinous histological subtypes. Reduced pRB expression, substage and residual disease were significant predictors for poor prognosis in stage I patients. All the benign and most of the LMP (81%) tumours were in either the p53-negative or low p53-positive category, but nearly half of the malignant tumours had high p53 expression. High p53 accumulation was found in non-mucinous, high grade and late stage tumours. For well-differentiated carcinomas, high p53 expression was a predictor of poor prognosis. However, even though high p53 expression was not associated with histological subtype, stage or the presence of residual disease, high p53 expression was not an independent predictor when all clinical parameters were combined. For all ovarian cancers, a close correlation was found between high p53 and high p16 expression. The relationship between the expression of pRB and p16 depended on tumour stage. In stage I tumours, high pRB was associated with low p16 reactivity. On the other hand, most advanced tumours showed both high pRB and high p16 reactivity. Int. J. Cancer 74:407–415, 1997. © 1997 Wiley-Liss, Inc.
Abstract:
Paraffin sections from 190 epithelial ovarian tumours, including 159 malignant and 31 benign epithelial tumours, were analysed immunohistochemically for expression of cyclin-dependent kinase inhibitor 2 (CDKN2A) gene product p16INK4A (p16). Most benign tumours showed no p16 expression in the tumour cells, whereas only 11% of malignant cancers were p16 negative. A high proportion of p16-positive tumour cells was associated with advanced stage and grade, and with poor prognosis in cancer patients. For FIGO stage 1 tumours, a high proportion of p16-positive tumour cells was associated with poorer survival, suggesting that accumulation of p16 is an early event of ovarian tumorigenesis. In contrast to tumour cells, high expression of p16 in the surrounding stromal cells was not associated with the stage and grade, but was associated with longer survival. When all parameters were combined in multivariate analysis, high p16 expression in stromal cells was not an independent predictor for survival, indicating that low p16 expression in stromal cells is associated with other markers of tumour progression. High expression of p16 in the stromal cells of tumours from long-term survivors suggests that tumour growth is limited to some extent by factors associated with p16 expression in the matrix.
Abstract:
BACKGROUND: Stromal signalling increases the lateral cell adhesions of prostate epithelial cells grown in 3D culture. The aim of this study was to use microarray analysis to identify significant epithelial signalling pathways and genes in this process. METHODS: Microarray analysis was used to identify genes that were differentially expressed when epithelial cells were grown in 3D Matrigel culture with stromal co-culture compared to without stroma. Two culture models were employed: primary epithelial cells (ten samples) and an epithelial cell line (three experiments). A separate microarray analysis was performed on each model system and then compared to identify tissue-relevant genes in a cell line model. RESULTS: TGF beta signalling was significantly ranked for both model systems and in both models the TGF beta signalling gene SOX4 was significantly down regulated. Analysis of all differentially expressed genes to identify genes that were common to both models found several morphology related gene clusters; actin binding (DIAPH2, FHOD3, ABLIM1, TMOD4, MYH10), GTPase activator activity (BCR, MYH10), cytoskeleton (MAP2, MYH10, TMOD4, FHOD3), protein binding (ITGA6, CD44), proteinaceous extracellular matrix (NID2, CILP2), ion channel/ ion transporter activity (CACNA1C, CACNB2, KCNH2, SLC8A1, SLC39A9) and genes associated with developmental pathways (POFUT1, FZD2, HOXA5, IRX2, FGF11, SOX4, SMARCC1). CONCLUSIONS: In 3D prostate cultures, stromal cells increase lateral epithelial cell adhesions. We show that this morphological effect is associated with gene expression changes to TGF beta signalling, cytoskeleton and anion activity.
Abstract:
BACKGROUND: Broccoli consumption has been associated with a reduced risk of prostate cancer. Isothiocyanates (ITCs) derived from glucosinolates that accumulate in broccoli are dietary compounds that may mediate these health effects. Sulforaphane (SF, 4-methylsulphinylbutyl ITC) derives from heading broccoli (calabrese) and iberin (IB, 3-methylsulphinylpropyl ITC) from sprouting broccoli. While there are many studies regarding the biological activity of SF, mainly undertaken with cancerous cells, there are few studies associated with IB. METHODS: Primary epithelial and stromal cells were derived from benign prostatic hyperplasia tissue. Affymetrix U133 Plus 2.0 whole genome arrays were used to compare global gene expression between these cells, and to quantify changes in gene expression following exposure to physiologically appropriate concentrations of SF and IB. Ontology and pathway analyses were used to interpret results. Changes in expression of a subset of genes were confirmed by real-time RT-PCR. RESULTS: Global gene expression profiling identified epithelial and stromal-specific gene expression profiles. SF induced more changes in epithelial cells, whereas IB was more effective in stromal cells. Although IB and SF induced different changes in gene expression in both epithelial and stromal cells, these were associated with similar pathways, such as cell cycle and detoxification. Both ITCs increased expression of PLAGL1, a tumor suppressor gene, in stromal cells and suppressed expression of the putative tumor promoting genes IFITM1, CSPG2, and VIM in epithelial cells. CONCLUSION: These data suggest that IB and SF both alter genes associated with cancer prevention, and IB should be investigated further as a potential chemopreventative agent.
Abstract:
Destruction of cancer cells by genetically modified viral and nonviral vectors has been the aim of many research programs. The ability to target cytotoxic gene therapies to the cells of interest is an essential prerequisite, and the treatment has always had the potential to provide better and more long-lasting therapy than existing chemotherapies. However, the potency of these infectious agents requires effective testing systems, in which hypotheses can be explored both in vitro and in vivo before the establishment of clinical trials in humans. The real prospect of off-target effects should be eliminated in the preclinical stage, if current prejudices against such therapies are to be overcome. In this review we have set out, using adenoviral vectors as a commonly used example, to discuss some of the key parameters required to develop more effective testing, and to critically assess the current cellular models for the development and testing of prostate cancer biotherapy. Only by developing models that more closely mirror human tissues will we be able to translate literature publications into clinical trials and hence into acceptable alternative treatments for the most commonly diagnosed cancer in humans.
Abstract:
The DNA damage response encompasses a complex series of signaling pathways that function to regulate and facilitate the repair of damaged DNA. Recent studies have shown that the repair of transcriptionally inactive chromatin, named heterochromatin, is dependent upon the phosphorylation of the co-repressor, Krüppel-associated box (KRAB) domain-associated protein (KAP-1), by the ataxia telangiectasia-mutated (ATM) kinase. Co-repressors, such as KAP-1, function to regulate the rigid structure of heterochromatin by recruiting histone-modifying enzymes, such as HDAC1/2 and SETDB1, and nucleosome-remodeling complexes such as CHD3. Here, we have characterized a phosphorylation site in the HP1-binding domain of KAP-1, Ser-473, which is phosphorylated by the cell cycle checkpoint kinase Chk2. Expression of a nonphosphorylatable S473A mutant conferred cellular sensitivity to DNA-damaging agents and led to defective repair of DNA double-strand breaks in heterochromatin. In addition, cells expressing S473A also displayed defective mobilization of the HP1-β chromodomain protein. The DNA repair defect observed in cells expressing S473A was alleviated by depletion of HP1-β, suggesting that phosphorylation of KAP-1 on Ser-473 promotes the mobilization of HP1-β from heterochromatin and subsequent DNA repair. These results suggest a novel mechanism of KAP-1-mediated chromatin restructuring via Chk2-regulated HP1-β exchange from heterochromatin, promoting DNA repair.
Abstract:
Sequencing of mba gene fragments of reference strains of Ureaplasma urealyticum serovars 1, 3, 6, 14, in addition to 33 clinical U. urealyticum isolates is reported. A phylogenetic tree deduced from an alignment of these sequences clearly demonstrates two major clusters (confidence limit 100%), which equate to the parvo and T960 biovars, and five types which we have designated mba 1, 3, 6, 8 and X. These relationships are supported by bootstrap analysis. Polymorphisms within the mba fragment of types mba 1, 3, and 6 were used to define nine subtypes (mba 1a, 1b, 3a, 3b, 3c, 3d, 3e, 6a, and 6b) thus facilitating high resolution typing of U. urealyticum. Inclusion of the reference strains for serovars 1, 3, 6, and 8 in the mba typing scheme showed that the results of this analysis are broadly consistent with currently accepted serotyping. In addition a ure gene fragment from nine of the clinical isolates was amplified and sequenced. Comparisons of the sequences clearly distinguished the two biovars of U. urealyticum; however this fragment was invariant within the parvo biovar. This study has shown that the sequence of the mba can reveal the fine details of the relationships between U. urealyticum isolates and also supports the significant evolutionary gap between the two biovars.
Abstract:
Background Accumulated biological research outcomes show that biological functions do not depend on individual genes, but on complex gene networks. Microarray data are widely used to cluster genes according to their expression levels across experimental conditions. However, functionally related genes generally do not show coherent expression across all conditions since any given cellular process is active only under a subset of conditions. Biclustering finds gene clusters that have similar expression levels across a subset of conditions. This paper proposes a seed-based algorithm that identifies coherent genes in an exhaustive, but efficient manner. Methods In order to find the biclusters in a gene expression dataset, we exhaustively select combinations of genes and conditions as seeds to create candidate bicluster tables. The tables have two columns: (a) a gene set, and (b) the conditions on which the gene set has dissimilar expression levels to the seed. First, the genes with fewer than the maximum number of dissimilar conditions are identified and a table of these genes is created. Second, the rows that have the same dissimilar conditions are grouped together. Third, the table is sorted in ascending order based on the number of dissimilar conditions. Finally, beginning with the first row of the table, a test is run repeatedly to determine whether the cardinality of the gene set in the row is greater than the minimum threshold number of genes in a bicluster. If so, a bicluster is output and the corresponding row is removed from the table. Repeating this process, all biclusters in the table are systematically identified until the table becomes empty. Conclusions This paper presents a novel biclustering algorithm for the identification of additive biclusters. Since it involves exhaustively testing combinations of genes and conditions, the additive biclusters can be found more readily.
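The table-building steps described in the abstract above can be sketched for a single seed as follows. This is a simplified illustration, not the authors' algorithm: the additive similarity test, the tolerance `tol`, the function names, and the reading of the threshold as "at least `min_genes` genes" are all assumptions introduced here, and the sketch handles one (gene, condition) seed rather than the exhaustive set of seeds.

```python
from collections import defaultdict


def candidate_table(data, seed_gene, seed_cond, tol, max_dissimilar):
    """Steps 1-3: build, group and sort the candidate table for one seed.

    data maps gene name -> list of expression values (one per condition).
    Assumption: a gene matches the seed on condition c when its difference
    from the seed equals its difference at seed_cond, within tol
    (an additive pattern).
    """
    seed_row = data[seed_gene]
    groups = defaultdict(set)
    for gene, row in data.items():
        offset = row[seed_cond] - seed_row[seed_cond]
        dissimilar = frozenset(
            c for c in range(len(seed_row))
            if abs((row[c] - seed_row[c]) - offset) > tol
        )
        # Step 1: keep genes with fewer than max_dissimilar dissimilar conditions.
        if len(dissimilar) < max_dissimilar:
            # Step 2: genes sharing the same dissimilar conditions share a row.
            groups[dissimilar].add(gene)
    # Step 3: sort rows by ascending number of dissimilar conditions.
    return sorted(groups.items(), key=lambda item: (len(item[0]), sorted(item[0])))


def biclusters_for_seed(data, seed_gene, seed_cond,
                        tol=0.1, max_dissimilar=2, min_genes=2):
    """Final step: walk the table from the top, emitting rows that are large enough."""
    n_conds = len(data[seed_gene])
    table = candidate_table(data, seed_gene, seed_cond, tol, max_dissimilar)
    found = []
    while table:
        dissimilar, genes = table.pop(0)  # each row is removed once tested
        if len(genes) >= min_genes:       # assumption: "at least min_genes"
            conditions = [c for c in range(n_conds) if c not in dissimilar]
            found.append((sorted(genes), conditions))
    return found
```

Running `biclusters_for_seed` once per (gene, condition) pair and merging duplicates would approximate the exhaustive seeding the abstract describes; how the original method combines results across seeds is not stated there and is left out of this sketch.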
Abstract:
A novel multiple regression method (RM) is developed to predict identity-by-descent probabilities at a locus L (IBDL), among individuals without pedigree, given information on surrounding markers and population history. These IBDL probabilities are a function of the increase in linkage disequilibrium (LD) generated by drift in a homogeneous population over generations. Three parameters are sufficient to describe population history: effective population size (Ne), number of generations since foundation (T), and marker allele frequencies among founders (p). IBDL probabilities are used in a simulation study to map a quantitative trait locus (QTL) via variance component estimation. RM is compared to a coalescent method (CM) in terms of power and robustness of QTL detection. Differences between RM and CM are small but significant. For example, RM is more powerful than CM in dioecious populations, but not in monoecious populations. Moreover, RM is more robust than CM when marker phases are unknown, when there is complete LD among founders, or when Ne is wrong, and less robust when p is wrong. CM utilises all marker haplotype information, whereas RM utilises information contained in each individual marker and all possible marker pairs but not in higher order interactions. RM consists of a family of models encompassing four different population structures, and two ways of using marker information, which contrasts with the single model that must cater for all possible evolutionary scenarios in CM.
Abstract:
Population-wide associations between loci due to linkage disequilibrium can be used to map quantitative trait loci (QTL) with high resolution. However, spurious associations between markers and QTL can also arise as a consequence of population stratification. Statistical methods that cannot differentiate between loci associations due to linkage disequilibrium and those caused in other ways can render false-positive results. The transmission-disequilibrium test (TDT) is a robust test for detecting QTL. The TDT exploits within-family associations that are not affected by population stratification. However, some TDTs are formulated in a rigid form, with reduced potential applications. In this study we generalize the TDT using mixed linear models to allow greater statistical flexibility. Allelic effects are estimated with two independent parameters: one exploiting the robust within-family information and the other the potentially biased between-family information. A significant difference between these two parameters can be used as evidence for spurious association. This methodology was then used to test the effects of the fourth melanocortin receptor (MC4R) on production traits in the pig. The new analyses supported the previously reported results; i.e., the studied polymorphism is either causal or in very strong linkage disequilibrium with the causal mutation, and provided no evidence for spurious association.
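One common way to write a model with separate within-family and between-family allelic effects is the orthogonal decomposition below. It is offered only as a hedged sketch of the general idea; the abstract does not give the authors' exact parameterisation, and every symbol here is an assumption introduced for illustration.

```latex
% Assumed notation (not from the abstract):
%   y_{ij}  phenotype of individual j in family i
%   g_{ij}  genotype score (e.g. count of a reference allele)
%   b_i     between-family component (e.g. mean genotype score of family i)
%   w_{ij}  within-family deviation
%   u_{ij}  random polygenic effect, e_{ij} residual
\begin{aligned}
  y_{ij} &= \mu + \beta_b\, b_i + \beta_w\, w_{ij} + u_{ij} + e_{ij},
  \qquad w_{ij} = g_{ij} - b_i, \\
  H_0 &: \beta_b = \beta_w .
\end{aligned}
```

Under this reading, the within-family effect \(\beta_w\) is the estimate that is robust to stratification, and rejecting \(H_0\) corresponds to the "significant difference between these two parameters" that the abstract treats as evidence for spurious association.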
Abstract:
The sheep (Ovis aries) is commonly used as a large animal model in skeletal research. Although the sheep genome has been sequenced there are still only a limited number of annotated mRNA sequences in public databases. A complementary DNA (cDNA) library was constructed to provide a generic resource for further exploration of genes that are actively expressed in bone cells in sheep. It was anticipated that the cDNA library would provide molecular tools for further research into the process of fracture repair and bone homeostasis, and add to the existing body of knowledge. One of the hallmarks of cDNA libraries has been the identification of novel genes and in this library the full open reading frame of the gene C12orf29 was cloned and characterised. This gene codes for a protein of unknown function with a molecular weight of 37 kDa. A literature search showed that no previous studies had been conducted into the biological role of C12orf29, except for some bioinformatics studies that suggested a possible link with cancer. Phylogenetic analyses revealed that C12orf29 had an ancient pedigree with a homologous gene found in some bacterial taxa. This implied that the gene was present in the last common eukaryotic ancestor, thought to have existed more than 2 billion years ago. This notion was further supported by the fact that the gene is found in taxa belonging to the two major eukaryotic branches, bikonts and unikonts. In the bikont supergroup a C12orf29-like gene was found in the single celled protist Naegleria gruberi, whereas in the unikont supergroup, encompassing the metazoa, the gene is universal to all chordate and, therefore, vertebrate species. It appears to have been lost to the majority of cnidaria and protostomes taxa; however, C12orf29-like genes have been found in the cnidarian freshwater hydra and the protostome Pacific oyster. 
The experimental data indicate that C12orf29 has a structural role in skeletal development and tissue homeostasis, whereas in silico analysis of the human C12orf29 promoter region suggests that its expression is potentially under the control of the NOTCH, WNT and TGF-β developmental pathways, as well as SOX9 and BAPX1; pathways that are all heavily involved in skeletogenesis. Taken together, this investigation provides strong evidence that C12orf29 has a very important role in the chordate body plan, in early skeletal development, cartilage homeostasis, and also a possible link with spina bifida in humans.
Abstract:
Currently, mass spectrometry-based metabolomics studies extend beyond conventional chemical categorization and metabolic phenotype analysis to understanding gene function in various biological contexts (e.g., mammalian, plant, and microbial). These novel utilities have led to many innovative discoveries in the following areas: disease pathogenesis, therapeutic pathway or target identification, the biochemistry of animal and plant physiological and pathological activities in response to diverse stimuli, and molecular signatures of host-pathogen interactions during microbial infection. In this review, we critically evaluate the representative applications of mass spectrometry-based metabolomics to better understand gene function in diverse biological contexts, with special emphasis on working principles, study protocols, and possible future development of this technique. Collectively, this review raises awareness within the biomedical community of the scientific value and applicability of mass spectrometry-based metabolomics strategies to better understand gene function, thus advancing this application's utility in a broad range of biological fields.