308 resultados para nucleotide binding


Relevância:

30.00% 30.00%

Publicador:

Resumo:

SUMMARY : Eukaryotic DNA interacts with the nuclear proteins using non-covalent ionic interactions. Proteins can recognize specific nucleotide sequences based on the sterical interactions with the DNA and these specific protein-DNA interactions are the basis for many nuclear processes, e.g. gene transcription, chromosomal replication, and recombination. New technology termed ChIP-Seq has been recently developed for the analysis of protein-DNA interactions on a whole genome scale and it is based on immunoprecipitation of chromatin and high-throughput DNA sequencing procedure. ChIP-Seq is a novel technique with a great potential to replace older techniques for mapping of protein-DNA interactions. In this thesis, we bring some new insights into the ChIP-Seq data analysis. First, we point out to some common and so far unknown artifacts of the method. Sequence tag distribution in the genome does not follow uniform distribution and we have found extreme hot-spots of tag accumulation over specific loci in the human and mouse genomes. These artifactual sequence tags accumulations will create false peaks in every ChIP-Seq dataset and we propose different filtering methods to reduce the number of false positives. Next, we propose random sampling as a powerful analytical tool in the ChIP-Seq data analysis that could be used to infer biological knowledge from the massive ChIP-Seq datasets. We created unbiased random sampling algorithm and we used this methodology to reveal some of the important biological properties of Nuclear Factor I DNA binding proteins. Finally, by analyzing the ChIP-Seq data in detail, we revealed that Nuclear Factor I transcription factors mainly act as activators of transcription, and that they are associated with specific chromatin modifications that are markers of open chromatin. We speculate that NFI factors only interact with the DNA wrapped around the nucleosome. We also found multiple loci that indicate possible chromatin barrier activity of NFI proteins, which could suggest the use of NFI binding sequences as chromatin insulators in biotechnology applications. RESUME : L'ADN des eucaryotes interagit avec les protéines nucléaires par des interactions noncovalentes ioniques. Les protéines peuvent reconnaître les séquences nucléotidiques spécifiques basées sur l'interaction stérique avec l'ADN, et des interactions spécifiques contrôlent de nombreux processus nucléaire, p.ex. transcription du gène, la réplication chromosomique, et la recombinaison. Une nouvelle technologie appelée ChIP-Seq a été récemment développée pour l'analyse des interactions protéine-ADN à l'échelle du génome entier et cette approche est basée sur l'immuno-précipitation de la chromatine et sur la procédure de séquençage de l'ADN à haut débit. La nouvelle approche ChIP-Seq a donc un fort potentiel pour remplacer les anciennes techniques de cartographie des interactions protéine-ADN. Dans cette thèse, nous apportons de nouvelles perspectives dans l'analyse des données ChIP-Seq. Tout d'abord, nous avons identifié des artefacts très communs associés à cette méthode qui étaient jusqu'à présent insoupçonnés. La distribution des séquences dans le génome ne suit pas une distribution uniforme et nous avons constaté des positions extrêmes d'accumulation de séquence à des régions spécifiques, des génomes humains et de la souris. Ces accumulations des séquences artéfactuelles créera de faux pics dans toutes les données ChIP-Seq, et nous proposons différentes méthodes de filtrage pour réduire le nombre de faux positifs. Ensuite, nous proposons un nouvel échantillonnage aléatoire comme un outil puissant d'analyse des données ChIP-Seq, ce qui pourraient augmenter l'acquisition de connaissances biologiques à partir des données ChIP-Seq. Nous avons créé un algorithme d'échantillonnage aléatoire et nous avons utilisé cette méthode pour révéler certaines des propriétés biologiques importantes de protéines liant à l'ADN nommés Facteur Nucléaire I (NFI). Enfin, en analysant en détail les données de ChIP-Seq pour la famille de facteurs de transcription nommés Facteur Nucléaire I, nous avons révélé que ces protéines agissent principalement comme des activateurs de transcription, et qu'elles sont associées à des modifications de la chromatine spécifiques qui sont des marqueurs de la chromatine ouverte. Nous pensons que lés facteurs NFI interagir uniquement avec l'ADN enroulé autour du nucléosome. Nous avons également constaté plusieurs régions génomiques qui indiquent une éventuelle activité de barrière chromatinienne des protéines NFI, ce qui pourrait suggérer l'utilisation de séquences de liaison NFI comme séquences isolatrices dans des applications de la biotechnologie.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

mRNAs specifying immunoglobulin mu and delta heavy chains are encoded by a single large, complex transcription unit (mu + delta gene). The transcriptional activity of delta gene segments in terminally differentiated, IgM-secreting B lymphocytes is 10-20 times lower than in earlier B-lineage cells expressing delta mRNA. We find that transcription of the mu + delta gene in IgM-secreting murine myeloma cells terminates within a region of 500-1000 nucleotides immediately following the mu membrane (mu m) polyadenylylation site. Transcription decreases only minimally through this region in murine cell lines representative of earlier stages in B-cell development. A DNA fragment containing the mu m polyadenylylation signal gives protein-DNA complexes with different mobilities in gel retardation assays with nuclear extracts from myeloma cells than with nuclear extracts from earlier B-lineage cells. However, using a recently developed "footprinting" procedure in which protein-DNA complexes resolved in gel retardation assays are subjected to nucleolytic cleavage while still in the polyacrylamide gel, we find that the DNA sequences protected by factors from the two cell types are indistinguishable. The factor-binding site on the DNA is located 5' of the mu m polyadenylylation signal AATAAA and includes the 15-nucleotide-long A + T-rich palindrome CTGTAAACAAATGTC. This type of palindromic binding site exhibits orientation-dependent activity consistent with the reported properties of polymerase II termination signals. This binding site is followed by two sets of directly repeated DNA sequences with different helical conformation as revealed by their reactivity with the chemical nuclease 1,10-phenanthroline-copper. The close proximity of these features to the signals for mu m mRNA processing may reflect a linkage of the processes of developmentally regulated mu m polyadenylylation and transcription termination.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

OBJECTIVE: Inflammatory bowel diseases (IBDs), Crohn's disease, and ulcerative colitis (UC), are multifactorial disorders, characterized by chronic inflammation of the intestine. A number of genetic components have been proposed to contribute to IBD pathogenesis. In this case-control study, we investigated the association between two common vitamin D-binding protein (DBP) genetic variants and IBD susceptibility. These two single nucleotide polymorphisms (SNPs) in exon 11 of the DBP gene, at codons 416 (GAT>GAG; Asp>Glu) and 420 (ACG>AAG; Thr>Lys), have been previously suggested to play roles in the etiology of other autoimmune diseases. METHODS: Using TaqMan SNP technology, we have genotyped 884 individuals (636 IBD cases and 248 non-IBD controls) for the two DBP variants. RESULTS: On statistical analysis, we observed that the DBP 420 variant Lys is less frequent in IBD cases than in non-IBD controls (allele frequencies, P=0.034; homozygous carrier genotype frequencies, P=0.006). This inverse association between the DBP 420 Lys and the disease remained significant, when non-IBD participants were compared with UC (homozygous carrier genotype frequencies, P=0.022) or Crohn's disease (homozygous carrier genotype frequencies, P=0.016) patients separately. Although the DBP position 416 alone was not found to be significantly associated with IBD, the haplotype DBP_2, consisting of 416 Asp and 420 Lys, was more frequent in the non-IBD population, particularly notably when compared with the UC group (Odds ratio, 4.390). CONCLUSION: Our study adds DBP to the list of potential genes that contribute to the complex genetic etiology of IBD, and further emphasizes the association between vitamin D homeostasis and intestinal inflammation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Essential hypertension is a multifactorial disorder and is the main risk factor for renal and cardiovascular complications. The research on the genetics of hypertension has been frustrated by the small predictive value of the discovered genetic variants. The HYPERGENES Project investigated associations between genetic variants and essential hypertension pursuing a 2-stage study by recruiting cases and controls from extensively characterized cohorts recruited over many years in different European regions. The discovery phase consisted of 1865 cases and 1750 controls genotyped with 1M Illumina array. Best hits were followed up in a validation panel of 1385 cases and 1246 controls that were genotyped with a custom array of 14 055 markers. We identified a new hypertension susceptibility locus (rs3918226) in the promoter region of the endothelial NO synthase gene (odds ratio: 1.54 [95% CI: 1.37-1.73]; combined P=2.58 · 10(-13)). A meta-analysis, using other in silico/de novo genotyping data for a total of 21 714 subjects, resulted in an overall odds ratio of 1.34 (95% CI: 1.25-1.44; P=1.032 · 10(-14)). The quantitative analysis on a population-based sample revealed an effect size of 1.91 (95% CI: 0.16-3.66) for systolic and 1.40 (95% CI: 0.25-2.55) for diastolic blood pressure. We identified in silico a potential binding site for ETS transcription factors directly next to rs3918226, suggesting a potential modulation of endothelial NO synthase expression. Biological evidence links endothelial NO synthase with hypertension, because it is a critical mediator of cardiovascular homeostasis and blood pressure control via vascular tone regulation. This finding supports the hypothesis that there may be a causal genetic variation at this locus.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The three subtypes of the peroxisome proliferator-activated receptors (PPARalpha, beta/delta, and gamma) form heterodimers with the 9-cis-retinoic acid receptor (RXR) and bind to a common consensus response element, which consists of a direct repeat of two hexanucleotides spaced by one nucleotide (DR1). As a first step toward understanding the molecular mechanisms determining PPAR subtype specificity, we evaluated by electrophoretic mobility shift assays the binding properties of the three PPAR subtypes, in association with either RXRalpha or RXRgamma, on 16 natural PPAR response elements (PPREs). The main results are as follows. (i) PPARgamma in combination with either RXRalpha or RXRgamma binds more strongly than PPARalpha or PPARbeta to all natural PPREs tested. (ii) The binding of PPAR to strong elements is reinforced if the heterodimerization partner is RXRgamma. In contrast, weak elements favor RXRalpha as heterodimerization partner. (iii) The ordering of the 16 natural PPREs from strong to weak elements does not depend on the core DR1 sequence, which has a relatively uniform degree of conservation, but correlates with the number of identities of the 5'-flanking nucleotides with respect to a consensus element. This 5'-flanking sequence is essential for PPARalpha binding and thus contributes to subtype specificity. As a demonstration of this, the PPARgamma-specific element ARE6 PPRE is able to bind PPARalpha only if its 5'-flanking region is exchanged with that of the more promiscuous HMG PPRE.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

[(11)C]PBR28 binds the 18-kDa Translocator Protein (TSPO) and is used in positron emission tomography (PET) to detect microglial activation. However, quantitative interpretations of signal are confounded by large interindividual variability in binding affinity, which displays a trimodal distribution compatible with a codominant genetic trait. Here, we tested directly for an underlying genetic mechanism to explain this. Binding affinity of PBR28 was measured in platelets isolated from 41 human subjects and tested for association with polymorphisms in TSPO and genes encoding other proteins in the TSPO complex. Complete agreement was observed between the TSPO Ala147Thr genotype and PBR28 binding affinity phenotype (P value=3.1 x 10(-13)). The TSPO Ala147Thr polymorphism predicts PBR28 binding affinity in human platelets. As all second-generation TSPO PET radioligands tested hitherto display a trimodal distribution in binding affinity analogous to PBR28, testing for this polymorphism may allow quantitative interpretation of TSPO PET studies with these radioligands.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The ability to determine the location and relative strength of all transcription-factor binding sites in a genome is important both for a comprehensive understanding of gene regulation and for effective promoter engineering in biotechnological applications. Here we present a bioinformatically driven experimental method to accurately define the DNA-binding sequence specificity of transcription factors. A generalized profile was used as a predictive quantitative model for binding sites, and its parameters were estimated from in vitro-selected ligands using standard hidden Markov model training algorithms. Computer simulations showed that several thousand low- to medium-affinity sequences are required to generate a profile of desired accuracy. To produce data on this scale, we applied high-throughput genomics methods to the biochemical problem addressed here. A method combining systematic evolution of ligands by exponential enrichment (SELEX) and serial analysis of gene expression (SAGE) protocols was coupled to an automated quality-controlled sequence extraction procedure based on Phred quality scores. This allowed the sequencing of a database of more than 10,000 potential DNA ligands for the CTF/NFI transcription factor. The resulting binding-site model defines the sequence specificity of this protein with a high degree of accuracy not achieved earlier and thereby makes it possible to identify previously unknown regulatory sequences in genomic DNA. A covariance analysis of the selected sites revealed non-independent base preferences at different nucleotide positions, providing insight into the binding mechanism.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Previous studies in Caenorhabditis elegans showed that RPM-1 (Regulator of Presynaptic Morphology-1) regulates axon termination and synapse formation. To understand the mechanism of how rpm-1 functions, we have used mass spectrometry to identify RPM-1 binding proteins, and have identified RAE-1 (RNA Export protein-1) as an evolutionarily conserved binding partner. We define a RAE-1 binding region in RPM-1, and show that this binding interaction is conserved and also occurs between Rae1 and the human ortholog of RPM-1 called Pam (protein associated with Myc). rae-1 loss of function causes similar axon and synapse defects, and synergizes genetically with two other RPM-1 binding proteins, GLO-4 and FSN-1. Further, we show that RAE-1 colocalizes with RPM-1 in neurons, and that rae-1 functions downstream of rpm-1. These studies establish a novel postmitotic function for rae-1 in neuronal development.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The malic enzyme (ME) gene is a target for both thyroid hormone receptors and peroxisome proliferator-activated receptors (PPAR). Within the ME promoter, two direct repeat (DR)-1-like elements, MEp and MEd, have been identified as putative PPAR response elements (PPRE). We demonstrate that only MEp and not MEd is able to bind PPAR/retinoid X receptor (RXR) heterodimers and mediate peroxisome proliferator signaling. Taking advantage of the close sequence resemblance of MEp and MEd, we have identified crucial determinants of a PPRE. Using reciprocal mutation analyses of these two elements, we show the preference for adenine as the spacing nucleotide between the two half-sites of the PPRE and demonstrate the importance of the two first bases flanking the core DR1 in 5'. This latter feature of the PPRE lead us to consider the polarity of the PPAR/RXR heterodimer bound to its cognate element. We demonstrate that, in contrast to the polarity of RXR/TR and RXR/RAR bound to DR4 and DR5 elements respectively, PPAR binds to the 5' extended half-site of the response element, while RXR occupies the 3' half-site. Consistent with this polarity is our finding that formation and binding of the PPAR/RXR heterodimer requires an intact hinge T region in RXR while its integrity is not required for binding of the RXR/TR heterodimer to a DR4.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Nucleotide excision repair (NER) is an evolutionary conserved DNA repair system that is essential for the removal of UV-induced DNA damage. In this study we investigated how NER is compartmentalized in the interphase nucleus of human cells at the ultrastructural level by using electron microscopy in combination with immunogold labeling. We analyzed the role of two nuclear compartments: condensed chromatin domains and the perichromatin region. The latter contains transcriptionally active and partly decondensed chromatin at the surface of condensed chromatin domains. We studied the distribution of the damage-recognition protein XPC and of XPA, which is a central component of the chromatin-associated NER complex. Both XPC and XPA rapidly accumulate in the perichromatin region after UV irradiation, whereas only XPC is also moderately enriched in condensed chromatin domains. These observations suggest that DNA damage is detected by XPC throughout condensed chromatin domains, whereas DNA-repair complexes seem preferentially assembled in the perichromatin region. We propose that UV-damaged DNA inside condensed chromatin domains is relocated to the perichromatin region, similar to what has been shown for DNA replication. In support of this, we provide evidence that UV-damaged chromatin domains undergo expansion, which might facilitate the translocation process. Our results offer novel insight into the dynamic spatial organization of DNA repair in the human cell nucleus.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Adrenoceptors are prototypic members of the superfamily of seven transmembrane domain, G protein-coupled receptors. Study of the properties of several mutationally activated adrenoceptors is deepening understanding of the normal functioning of this ubiquitous class of receptors. The new findings suggest an expansion of the classical ternary complex model of receptor action to include an explicit isomerization of the receptors from an inactive to an active state which couples to the G protein ('allosteric ternary complex model'). This isomerization involves conformational changes which may occur spontaneously, or be induced by agonists or appropriate mutations which abrogate the normal 'constraining' function of the receptor, allowing it to 'relax' into the active conformation. Robert Lefkowitz and colleagues discuss the physiological and pathophysiological implications of these new insights into regulation of receptor activity.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The IncP alpha promiscuous plasmid (R18, R68, RK2, RP1 and RP4) comprises 60,099 bp of nucleotide sequence, encoding at least 74 genes. About 40 kb of the genome, designated the IncP core and including all essential replication and transfer functions, can be aligned with equivalent sequences in the IncP beta plasmid R751. The compiled IncP alpha sequence revealed several previously unidentified reading frames that are potential genes. IncP alpha plasmids carry genetic information very efficiently: the coding sequences of the genes are closely packed but rarely overlap, and occupy almost 86% of the genome's nucleotide sequence. All of the 74 genes should be expressed, although there is as yet experimental evidence for expression of only 60 of them. Six examples of tandem-in-frame initiation sites specifying two gene products each are known. Two overlapping gene arrangements occupy different reading frames of the same region. Intergenic regions include most of the 25 promoters; transcripts are usually polycistronic. Translation of most of the open reading frames seems to be initiated independently, each from its own ribosomal binding and initiation site, although, a few cases of coupled translation have been reported. The most frequently used initiation codon is AUG but translation for a few open reading frames begins at GUG or UUG. The most common stop-codon is UGA followed by UAA and then UAG. Regulatory circuits are complex and largely dependent on two components of the central control operon. KorA and KorB are transcriptional repressors controlling at least seven operons. KorA and KorB act synergistically in several cases by recognizing and binding to conserved nucleotide sequences. Twelve KorB binding sites were found around the IncP alpha sequence and these are conserved in R751 (IncP beta) with respect to both sequence and location. Replication of IncP alpha plasmids requires oriV and the plasmid-encoded initiator protein TrfA in combination with the host-encoded replication machinery. Conjugative plasmid transfer depends on two separate regions occupying about half of the genome. The primary segregational stability system designated Par/Mrs consists of a putative site-specific recombinase, a possible partitioning apparatus and a post-segregational lethality mechanism, all encoded in two divergent operons. Proteins related to the products of F sop and P1 par partitioning genes are separately encoded in the central control operon.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Xenopus laevis 68-kd and 74-kd albumin amino acid sequences are examined with respect to their relationship to the other known members of the albumin/alpha-fetoprotein/vitamin D-binding protein gene family. Each of the three members of this family presents a unique pattern of conserved regions indicating a differential selective pressure related to specific functional characteristics. Furthermore, an evolutionary tree of these genes was deduced from the divergence times calculated from direct nucleotide sequence comparisons of individual gene pairs. These calculations indicate that the vitamin D-binding protein/albumin separation occurred 560-600 million years (Myr) ago and the albumin/alpha-fetoprotein divergence 280 Myr ago. This observation leads to the hypothesis according to which the albumin/alpha-fetoprotein gene duplication occurred shortly after the amphibian/reptile separation. Consequently, and unlike mammals, amphibians and fishes should lack an alpha-fetoprotein in their serum at larval stages, which is consistent with a recent analysis of serum proteins in Xenopus laevis larvae. This hypothesis now will have to be tested further in additional lower vertebrates.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Although experimental studies have suggested that insulin-like growth factor I (IGF-I) and its binding protein IGFBP-3 might have a role in the aetiology of coronary artery disease (CAD), the relevance of circulating IGFs and their binding proteins in the development of CAD in human populations is unclear. We conducted a nested case-control study, with a mean follow-up of six years, within the EPIC-Norfolk cohort to assess the association between circulating levels of IGF-I and IGFBP-3 and risk of CAD in up to 1,013 cases and 2,055 controls matched for age, sex and study enrolment date. After adjustment for cardiovascular risk factors, we found no association between circulating levels of IGF-I or IGFBP-3 and risk of CAD (odds ratio: 0.98 (95% Cl 0.90-1.06) per 1 SD increase in circulating IGF-I; odds ratio: 1.02 (95% Cl 0.94-1.12) for IGFBP-3). We examined associations between tagging single nucleotide polymorphisms (tSNPs) at the IGF1 and IGFBP3 loci and circulating IGF-I and IGFBP-3 levels in up to 1,133 cases and 2,223 controls and identified three tSNPs (rs1520220, rs3730204, rs2132571) that showed independent association with either circulating IGF-I or IGFBP-3 levels. In an assessment of 31 SNPs spanning the IGF1 or IGFBP3 loci, none were associated with risk of CAD in a meta-analysis that included EPIC-Norfolk and eight additional studies comprising up to 9,319 cases and 19,964 controls. Our results indicate that IGF-I and IGFBP-3 are unlikely to be importantly involved in the aetiology of CAD in human populations.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abstract One of the most important issues in molecular biology is to understand regulatory mechanisms that control gene expression. Gene expression is often regulated by proteins, called transcription factors which bind to short (5 to 20 base pairs),degenerate segments of DNA. Experimental efforts towards understanding the sequence specificity of transcription factors is laborious and expensive, but can be substantially accelerated with the use of computational predictions. This thesis describes the use of algorithms and resources for transcriptionfactor binding site analysis in addressing quantitative modelling, where probabilitic models are built to represent binding properties of a transcription factor and can be used to find new functional binding sites in genomes. Initially, an open-access database(HTPSELEX) was created, holding high quality binding sequences for two eukaryotic families of transcription factors namely CTF/NF1 and LEFT/TCF. The binding sequences were elucidated using a recently described experimental procedure called HTP-SELEX, that allows generation of large number (> 1000) of binding sites using mass sequencing technology. For each HTP-SELEX experiments we also provide accurate primary experimental information about the protein material used, details of the wet lab protocol, an archive of sequencing trace files, and assembled clone sequences of binding sequences. The database also offers reasonably large SELEX libraries obtained with conventional low-throughput protocols.The database is available at http://wwwisrec.isb-sib.ch/htpselex/ and and ftp://ftp.isrec.isb-sib.ch/pub/databases/htpselex. The Expectation-Maximisation(EM) algorithm is one the frequently used methods to estimate probabilistic models to represent the sequence specificity of transcription factors. We present computer simulations in order to estimate the precision of EM estimated models as a function of data set parameters(like length of initial sequences, number of initial sequences, percentage of nonbinding sequences). We observed a remarkable robustness of the EM algorithm with regard to length of training sequences and the degree of contamination. The HTPSELEX database and the benchmarked results of the EM algorithm formed part of the foundation for the subsequent project, where a statistical framework called hidden Markov model has been developed to represent sequence specificity of the transcription factors CTF/NF1 and LEF1/TCF using the HTP-SELEX experiment data. The hidden Markov model framework is capable of both predicting and classifying CTF/NF1 and LEF1/TCF binding sites. A covariance analysis of the binding sites revealed non-independent base preferences at different nucleotide positions, providing insight into the binding mechanism. We next tested the LEF1/TCF model by computing binding scores for a set of LEF1/TCF binding sequences for which relative affinities were determined experimentally using non-linear regression. The predicted and experimentally determined binding affinities were in good correlation.