940 resultados para Transcription factor binding site motifs
Resumo:
Protein-DNA interactions are involved in many fundamental biological processes essential for cellular function. Most of the existing computational approaches employed only the sequence context of the target residue for its prediction. In the present study, for each target residue, we applied both the spatial context and the sequence context to construct the feature space. Subsequently, Latent Semantic Analysis (LSA) was applied to remove the redundancies in the feature space. Finally, a predictor (PDNAsite) was developed through the integration of the support vector machines (SVM) classifier and ensemble learning. Results on the PDNA-62 and the PDNA-224 datasets demonstrate that features extracted from spatial context provide more information than those from sequence context and the combination of them gives more performance gain. An analysis of the number of binding sites in the spatial context of the target site indicates that the interactions between binding sites next to each other are important for protein-DNA recognition and their binding ability. The comparison between our proposed PDNAsite method and the existing methods indicate that PDNAsite outperforms most of the existing methods and is a useful tool for DNA-binding site identification. A web-server of our predictor (http://hlt.hitsz.edu.cn:8080/PDNAsite/) is made available for free public accessible to the biological research community.
Resumo:
Males and age group 1 to 5 years show a much higher risk for childhood acute lymphoblastic leukemia (ALL). We performed a case-only genome-wide association study (GWAS), using the Illumina Infinium HumanCoreExome Chip, to unmask gender- and age-specific risk variants in 240 non-Hispanic white children with ALL recruited at Texas Children’s Cancer Center, Houston, Texas. Besides statistically most significant results, we also considered results that yielded the highest effect sizes. Existing experimental data and bioinformatic predictions were used to complement results, and to examine the biological significance of statistical results. Our study identified novel risk variants for childhood ALL. The SNP, rs4813720 (RASSF2), showed the statistically most significant gender-specific associations (P < 2 x 10-6). Likewise, rs10505918 (SOX5) yielded the lowest P value (P < 1 x 10-5) for age-specific associations, and also showed the statistically most significant association with age-at-onset (P < 1 x 10-4). Two SNPs, rs12722042 and 12722039, from the HLA-DQA1 region yielded the highest effect sizes (odds ratio (OR) = 15.7; P = 0.002) for gender-specific results, and the SNP, rs17109582 (OR = 12.5; P = 0.006), showed the highest effect size for age-specific results. Sex chromosome variants did not appear to be involved in gender-specific associations. The HLA-DQA1 SNPs belong to DQA1*01:07and confirmed previously reported male-specific association with DQA1*01:07. Twenty one of the SNPs identified as risk markers for gender- or age-specific associations were located in the transcription factor binding sites and 56 SNPs were non-synonymous variants, likely to alter protein function. Although bioinformatic analysis did not implicate a particular mechanism for gender- and age-specific associations, RASSF2 has an estrogen receptor-alpha binding site in its promoter. The unknown mechanisms may be due to lack of interest in gender- and age-specificity in associations. These results provide a foundation for further studies to examine the gender- and age-differential in childhood ALL risk. Following replication and mechanistic studies, risk factors for one gender or age group may have a potential to be used as biomarkers for targeted intervention for prevention and maybe also for treatment.
Resumo:
Date of Acceptance: 12/07/2015 © 2015 John Wiley & Sons Ltd. Acknowledgements This study was supported by funding from the Encompass kick start and SMART:Scotland award schemes of Scottish Enterprise and Friends of Anchor. The Grampian Biorepository assisted with the immunohistochemical investigations.
Resumo:
Date of Acceptance: 12/07/2015 © 2015 John Wiley & Sons Ltd. Acknowledgements This study was supported by funding from the Encompass kick start and SMART:Scotland award schemes of Scottish Enterprise and Friends of Anchor. The Grampian Biorepository assisted with the immunohistochemical investigations.
Resumo:
Mémoire numérisé par la Direction des bibliothèques de l'Université de Montréal.
Resumo:
Expression Quantitative Trait Loci (eQTL) analysis allows for the identification of genetic variation associated with variation in gene expression. It is often unclear however, which of the associated variants are causal, and by what mechanism. Integrating functional genomic data with eQTL data can provide insight into the impact of natural variation in the population, and the nature of the transcriptional machinery itself. In this thesis, I integrate functional genomic data with eQTL data derived from both 5’ CAGE and 3’ TagXseq expression assays, in developing embryos. I first use both datasets to analyse the transcription landscape in embryonic D., melanogaster, and then carry out an analysis of sequence motifs associated with transcription factor binding sites, promoters, and 3’ polyadenylation sites. Finally, I integrate functional genomic data, including these novel sequence motifs, to shed light on the mechanisms of gene expression variation in D.,melanogaster. I am able to demonstrate that some variants effecting gene regulation in Drosophila are found within haplotypes which buffer their effects.
Resumo:
Mémoire numérisé par la Direction des bibliothèques de l'Université de Montréal.
Resumo:
Le syndrome de détresse respiratoire du nouveau-né (SDR) est l’une des pathologies les plus fréquentes dont souffrent les bébés prématurés. Le SDR est causé par un déficit dans la synthèse du surfactant pulmonaire en raison de l’immaturité du poumon lors d’une naissance prématurée. Plusieurs éléments régulent le développement pulmonaire notamment les stéroïdes sexuels et les corticostéroïdes. Le sexe est aussi un élément régulateur du développement pulmonaire. En effet, les garçons sont plus atteints que les filles par le SDR. Ce dimorphisme sexuel est attribué aux androgènes. Le traitement anténatal aux glucocorticoïdes est prescrit aux femmes qui sont à risque d’accoucher prématurément. En effet, les corticostéroïdes favorisent la maturation pulmonaire anténatale. Également, il a été démontré que les microARNs sont primordiaux pour le développement pulmonaire. Ceci nous a conduit à étudier l’impact des androgènes sur le profil d’expression des microARNs lors de la transition du stade canaliculaire au stade sacculaire (jour gestationnel (JG)17.0 au JG18.0), période qui coïncide avec la montée de la synthèse et de la sécrétion du surfactant chez la souris. Tout d’abord, nous avons étudié la stabilité des gènes de normalisation (snoRNAs) afin de quantifier les microARNs par qPCR. Cette analyse a été effectuée avec 3 logiciels différents et sur plusieurs stades du développement notamment de la période pseudoglandulaire jusqu’au stade alvéolaire chez les deux sexes. On a identifié les meilleures combinaisons de gènes de normalisation les plus stables pour chaque stade du développement étudié ainsi que pour la période couvrant tous les stades étudiés. Ensuite nous avons analysé à GD17.0 et GD18.0 le profil d’expression des microARNs chez des fœtus mâles dont les mères ont été traitées au flutamide (anti-androgènes pure). Les résultats ont montré que 43 microARNs matures sont modulés par les androgènes à GD17.0 et 35 microARNs à GD18.0. Pour certains microARNs, nous avons identifié des cibles potentielles qui sont inversement modulées par les androgènes par rapport aux microARNs. Ces cibles sont impliquées dans plusieurs processus biologiques tels que le métabolisme des lipides et la prolifération cellulaire ainsi que dans des fonctions moléculaires tels que la liaison des facteurs de transcription. Des expériences de validation ont été effectuées par qPCR. Nos résultats ont montré que les androgènes régulent des processus qui peuvent être impliqués dans la maturation pulmonaire via la régulation des microARNs. En plus de l’intérêt porté aux androgènes dans la maturation pulmonaire, nous avons analysé l’expression d’enzymes de synthèse des corticostéroïdes dans le poumon fœtal humain. L’expression de l’enzyme 21-hydroxylase a été étudiée par qPCR et par immunobuvardage. Également la localisation de l’ARNm de cette enzyme clé de la synthèse des glucocorticoïdes, a été effectuée par hybridation in situ. L’ARNm de CYP21A2 a été détecté par qPCR dans les 34 échantillons analysés et dont les âges variaient entre 17 et 40 semaines de grossesse. Aucune corrélation, avec l’âge gestationnel ou le sexe, n’a été observée. Des niveaux significatifs de la protéine 21-hydroxylase ont été détectés dans nos échantillons. Nous avons investigué l’expression d’autres enzymes impliquées dans la voie de synthèse des glucocorticoïdes notamment CYP11B1, CYP11B2 et CYP17A1. Les ARNm des gènes CYP11B1, CYP11B2 n’ont pas été détectés dans nos échantillons, contrairement à CYP17A1 dont l’ARNm a été détecté dans tous nos tissus fœtaux analysés. La protéine de la 17α-hydroxylase a été détectée à de faibles niveaux. Nos résultats d’hybridation in situ ont montré que l’expression de CYP21A2 est localisée presqu’exclusivement dans l’épithélium pulmonaire distal. Nos résultats suggèrent que les produits de la 21-hydroxylase agiront via une action intracrine sur l’épithélium distal en activant le récepteur des glucocorticoïdes (GR). L’activation du récepteur des minéralocorticoïdes (MR) ne semble pas dépendre de produits de la 21-hydroxylase en raison des quantités importantes d’aldostérone circulante.
Resumo:
The cytokine hormone leptin is a key signalling molecule in many pathways that control physiological functions. Although leptin demonstrates structural conservation in mammals, there is evidence of positive selection in primates, lagomorphs and chiropterans. We previously reported that the leptin genes of the grey and harbour seals (phocids) have significantly diverged from other mammals. Therefore we further investigated the diversification of leptin in phocids, other marine mammals and terrestrial taxa by sequencing the leptin genes of representative species. Phylogenetic reconstruction revealed that leptin diversification was pronounced within the phocid seals with a high dN/dS ratio of 2.8, indicating positive selection. We found significant evidence of positive selection along the branch leading to the phocids, within the phocid clade, but not over the dataset as a whole. Structural predictions indicate that the individual residues under selection are away from the leptin receptor (LEPR) binding site. Predictions of the surface electrostatic potential indicate that phocid seal leptin is notably different to other mammalian leptins, including the otariids. Cloning the grey seal leptin binding domain of LEPR confirmed that this was structurally conserved. These data, viewed in toto, support a hypothesis that phocid leptin divergence is unlikely to have arisen by random mutation. Based upon these phylogenetic and structural assessments, and considering the comparative physiology and varying life histories among species, we postulate that the unique phocid diving behaviour has produced this selection pressure. The Phocidae includes some of the deepest diving species, yet have the least modified lung structure to cope with pressure and volume changes experienced at depth. Therefore, greater surfactant production is required to facilitate rapid lung re-inflation upon surfacing, while maintaining patent airways. We suggest that this additional surfactant requirement is met by the leptin pulmonary surfactant production pathway which normally appears only to function in the mammalian foetus.
Does BFR1, a component of the transcription factor (TFIIIB), have a role in prostate carcinogenesis?
Resumo:
No abstract available.
Resumo:
The Bacillus subtilis DnaI, DnaB and DnaD proteins load the replicative ring helicase DnaC onto DNA during priming of DNA replication. Here we show that DnaI consists of a C-terminal domain (Cd) with ATPase and DNA-binding activities and an N-terminal domain (Nd) that interacts with the replicative ring helicase. A Zn2+-binding module mediates the interaction with the helicase and C67, C70 and H84 are involved in the coordination of the Zn2+. DnaI binds ATP and exhibits ATPase activity that is not stimulated by ssDNA, because the DNA-binding site on Cd is masked by Nd. The ATPase activity resides on the Cd domain and when detached from the Nd domain, it becomes sensitive to stimulation by ssDNA because its cryptic DNA-binding site is exposed. Therefore, Nd acts as a molecular 'switch' regulating access to the ssDNA binding site on Cd, in response to binding of the helicase. DnaI is sufficient to load the replicative helicase from a complex with six DnaI molecules, so there is no requirement for a dual helicase loader system.
Resumo:
Males and age group 1 to 5 years show a much higher risk for childhood acute lymphoblastic leukemia (ALL). We performed a case-only genome-wide association study (GWAS), using the Illumina Infinium HumanCoreExome Chip, to unmask gender- and age-specific risk variants in 240 non-Hispanic white children with ALL recruited at Texas Children’s Cancer Center, Houston, Texas. Besides statistically most significant results, we also considered results that yielded the highest effect sizes. Existing experimental data and bioinformatic predictions were used to complement results, and to examine the biological significance of statistical results. ^ Our study identified novel risk variants for childhood ALL. The SNP, rs4813720 (RASSF2), showed the statistically most significant gender-specific associations (P < 2 x 10-6). Likewise, rs10505918 (SOX5) yielded the lowest P value (P < 1 x 10-5 ) for age-specific associations, and also showed the statistically most significant association with age-at-onset (P < 1 x 10-4). Two SNPs, rs12722042 and 12722039, from the HLA-DQA1 region yielded the highest effect sizes (odds ratio (OR) = 15.7; P = 0.002) for gender-specific results, and the SNP, rs17109582 (OR = 12.5; P = 0.006), showed the highest effect size for age-specific results. Sex chromosome variants did not appear to be involved in gender-specific associations. ^ The HLA-DQA1 SNPs belong to DQA1*01:07and confirmed previously reported male-specific association with DQA1*01:07. Twenty one of the SNPs identified as risk markers for gender- or age-specific associations were located in the transcription factor binding sites and 56 SNPs were non-synonymous variants, likely to alter protein function. Although bioinformatic analysis did not implicate a particular mechanism for gender- and age-specific associations, RASSF2 has an estrogen receptor-alpha binding site in its promoter. The unknown mechanisms may be due to lack of interest in gender- and age-specificity in associations. These results provide a foundation for further studies to examine the gender- and age-differential in childhood ALL risk. Following replication and mechanistic studies, risk factors for one gender or age group may have a potential to be used as biomarkers for targeted intervention for prevention and maybe also for treatment.^
Resumo:
Endometrial cancer is one of the most common female diseases in developed nations and is the most commonly diagnosed gynaecological cancer in Australia. The disease is commonly classified by histology: endometrioid or non-endometrioid endometrial cancer. While non-endometrioid endometrial cancers are accepted to be high-grade, aggressive cancers, endometrioid cancers (comprising 80% of all endometrial cancers diagnosed) generally carry a favourable patient prognosis. However, endometrioid endometrial cancer patients endure significant morbidity due to surgery and radiotherapy used for disease treatment, and patients with recurrent disease have a 5-year survival rate of less than 50%. Genetic analysis of women with endometrial cancer could uncover novel markers associated with disease risk and/or prognosis, which could then be used to identify women at high risk and for the use of specialised treatments. Proteases are widely accepted to play an important role in the development and progression of cancer. This PhD project hypothesised that SNPs from two protease gene families, the matrix metalloproteases (MMPs, including their tissue inhibitors, TIMPs) and the tissue kallikrein-related peptidases (KLKs) would be associated with endometrial cancer susceptibility and/or prognosis. In the first part of this study, optimisation of the genotyping techniques was performed. Results from previously published endometrial cancer genetic association studies were attempted to be validated in a large, multicentre replication set (maximum cases n = 2,888, controls n = 4,483, 3 studies). The rs11224561 progesterone receptor SNP (PGR, A/G) was observed to be associated with increased endometrial cancer risk (per A allele OR 1.31, 95% CI 1.12-1.53; p-trend = 0.001), a result which was initially reported among a Chinese sample set. Previously reported associations for the remaining 8 SNPs investigated for this section of the PhD study were not confirmed, thereby reinforcing the importance of validation of genetic association studies. To examine the effect of SNPs from the MMP and KLK families on endometrial cancer risk, we selected the most significantly associated MMP and KLK SNPs from genome-wide association study analysis (GWAS) to be genotyped in the GWAS replication set (cases n = 4,725, controls n = 9,803, 13 studies). The significance of the MMP24 rs932562 SNP was unchanged after incorporation of the stage 2 samples (Stage 1 per allele OR 1.18, p = 0.002; Combined Stage 1 and 2 OR 1.09, p = 0.002). The rs10426 SNP, located 3' to KLK10 was predicted by bioinformatic analysis to effect miRNA binding. This SNP was observed in the GWAS stage 1 result to exhibit a recessive effect on endometrial cancer risk, a result which was not validated in the stage 2 sample set (Stage 1 OR 1.44, p = 0.007; Combined Stage 1 and 2 OR 1.14, p = 0.08). Investigation of the regions imputed surrounding the MMP, TIMP and KLK genes did not reveal any significant targets for further analysis. Analysis of the case data from the endometrial cancer GWAS to identify genetic variation associated with cancer grade did not reveal SNPs from the MMP, TIMP or KLK genes to be statistically significant. However, the representation of SNPs from the MMP, TIMP and KLK families by the GWAS genotyping platform used in this PhD project was examined and observed to be very low, with the genetic variation of four genes (MMP23A, MMP23B, MMP28 and TIMP1) not captured at all by this technique. This suggests that comprehensive candidate gene association studies will be required to assess the role of SNPs from these genes with endometrial cancer risk and prognosis. Meta-analysis of gene expression microarray datasets curated as part of this PhD study identified a number of MMP, TIMP and KLK genes to display differential expression by endometrial cancer status (MMP2, MMP10, MMP11, MMP13, MMP19, MMP25 and KLK1) and histology (MMP2, MMP11, MMP12, MMP26, MMP28, TIMP2, TIMP3, KLK6, KLK7, KLK11 and KLK12). In light of these findings these genes should be prioritised for future targeted genetic association studies. Two SNPs located 43.5 Mb apart on chromosome 15 were observed from the GWAS analysis to be associated with increased endometrial cancer grade, results that were validated in silico in two independent datasets. One of these SNPs, rs8035725 is located in the 5' untranslated region of a MYC promoter binding protein DENND4A (Stage 1 OR 1.15, p = 9.85 x 10P -5 P, combined Stage 1 and in silico validation OR 1.13, p = 5.24 x 10P -6 P). This SNP has previously been reported to alter the expression of PTPLAD1, a gene involved in the synthesis of very long fatty acid chains and in the Rac1 signaling pathway. Meta-analysis of gene expression microarray data found PTPLAD1 to display increased expression in the aggressive non-endometrioid histology compared with endometrioid endometrial cancer, suggesting that the causal SNP underlying the observed genetic association may influence expression of this gene. Neither rs8035725 nor significant SNPs identified by imputation were predicted bioinformatically to affect transcription factor binding sites, indicating that further studies are required to assess their potential effect on other regulatory elements. The other grade- associated SNP, rs6606792, is located upstream of an inferred pseudogene, ELMO2P1 (Stage 1 OR 1.12, p = 5 x 10P -5 P; combined Stage 1 and in silico validation OR 1.09, p = 3.56 x 10P -5 P). Imputation of the ±1 Mb region surrounding this SNP revealed a cluster of significantly associated variants which are predicted to abolish various transcription factor binding sites, and would be expected to decrease gene expression. ELMO2P1 was not included on the microarray platforms collected for this PhD, and so its expression could not be investigated. However, the high sequence homology of ELMO2P1 with ELMO2, a gene important to cell motility, indicates that ELMO2 could be the parent gene for ELMO2P1 and as such, ELMO2P1 could function to regulate the expression of ELMO2. Increased expression of ELMO2 was seen to be associated with increasing endometrial cancer grade, as well as with aggressive endometrial cancer histological subtypes by microarray meta-analysis. Thus, it is hypothesised that SNPs in linkage disequilibrium with rs6606792 decrease the transcription of ELMO2P1, reducing the regulatory effect of ELMO2P1 on ELMO2 expression. Consequently, ELMO2 expression is increased, cell motility is enhanced leading to an aggressive endometrial cancer phenotype. In summary, these findings have identified several areas of research for further study. The results presented in this thesis provide evidence that a SNP in PGR is associated with risk of developing endometrial cancer. This PhD study also reports two independent loci on chromosome 15 to be associated with increased endometrial cancer grade, and furthermore, genes associated with these SNPs to be differentially expressed according in aggressive subtypes and/or by grade. The studies reported in this thesis support the need for comprehensive SNP association studies on prioritised MMP, TIMP and KLK genes in large sample sets. Until these studies are performed, the role of MMP, TIMP and KLK genetic variation remains unclear. Overall, this PhD study has contributed to the understanding of genetic variation involvement in endometrial cancer susceptibility and prognosis. Importantly, the genetic regions highlighted in this study could lead to the identification of novel gene targets to better understand the biology of endometrial cancer and also aid in the development of therapeutics directed at treating this disease.
Resumo:
Sepsid flies (Diptera: Sepsidae) are important model insects for sexual selection research. In order to develop mitochondrial (mt) genome data for this significant group, we sequenced the first complete mt genome of the sepsid fly Nemopoda mamaevi Ozerov, 1997. The circular 15,878 bp mt genome is typical of Diptera, containing all 37 genes usually present in bilaterian animals. We discovered inaccurate annotations of fly mt genomes previously deposited on GenBank and thus re-annotated all published mt genomes of Cyclorrhapha. These re-annotations were based on comparative analysis of homologous genes, and provide a statistical analysis of start and stop codon positions. We further detected two 18 bp of conserved intergenic sequences from tRNAGlu-tRNAPhe and ND1-tRNASer(UCN) across Cyclorrhapha, which are the mtTERM binding site motifs. Additionally, we compared automated annotation software MITOS with hand annotation method. Phylogenetic trees based on the mt genome data from Cyclorrhapha were inferred by Maximum-likelihood and Bayesian methods, strongly supported a close relationship between Sepsidae and the Tephritoidea.