918 resultados para Genome duplication


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Given that retroposed copies of genes are presumed to lack the regulatory elements required for their expression, retroposition has long been considered a mechanism without functional relevance. However, through an in silico assay for transcriptional activity, we identify here >1,000 transcribed retrocopies in the human genome, of which at least approximately 120 have evolved into bona fide genes. Among these, approximately 50 retrogenes have evolved functions in testes, more than half of which were recruited as functional autosomal counterparts of X-linked genes during spermatogenesis. Generally, retrogenes emerge "out of the testis," because they are often initially transcribed in testis and later evolve stronger and sometimes more diverse spatial expression patterns. We find a significant excess of transcribed retrocopies close to other genes or within introns, suggesting that retrocopies can exploit the regulatory elements and/or open chromatin of neighboring genes to become transcribed. In direct support of this hypothesis, we identify 36 retrocopy-host gene fusions, including primate-specific chimeric genes. Strikingly, 27 intergenic retrogenes have acquired untranslated exons de novo during evolution to achieve high expression levels. Notably, our screen for highly transcribed retrocopies also uncovered a retrogene linked to a human recessive disorder, gelatinous drop-like corneal dystrophy, a form of blindness. These functional implications for retroposition notwithstanding, we find that the insertion of retrocopies into genes is generally deleterious, because it may interfere with the transcription of host genes. Our results demonstrate that natural selection has been fundamental in shaping the retrocopy repertoire of the human genome.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

RÉSUMÉ: Le génome de toute cellule est susceptible d'être attaqué par des agents endogènes et exogènes. Afin de préserver l'intégrité génomique, les cellules ont développé des multitudes de mécanismes. La réplication de l'ADN, une étape importante durant le cycle cellulaire, constitue un stress et présente un danger important pour l'intégrité du génome. L'anémie de Fanconi est une maladie héréditaire rare dont les protéines impliquées semblent jouer un rôle crucial dans la réponse au stress réplicatif. La maladie est associée à une instabilité chromosomique ainsi qu'à une forte probabilité de développer des cancers. Les cellules des patients souffrant de l'anémie de Fanconi sont sensibles à des agents interférant avec la réplication de l'ADN, et plus particulièrement àdes agents qui fient les deux brins d'ADN d'une manière covalente. L'anémie de Fanconi est une maladie génétiquement hétérogène. Treize protéines ont pu être identifiées. Elles semblent figurer dans une même voie de signalisation qui est aussi connue sous le nom de « FA/BRCA pathway », car un des gènes est identique au gène BRCA2 (breast cancer susceptibility gene 2). Huit protéines forment un complexe nucléaire dont l'intégrité est nécessaire à la monoubiquitination de deux autres protéines, FANCD2 et FANCI, en réponse à un stress réplicatif. A ce jour, la fonction moléculaire des protéines du « FA/BRCA pathway »reste encore mal décrite. Au début de mon travail de thèse, nous avons donc décidé de purifier les protéines du complexe nucléaire et d'étudier leurs propriétés biochimiques. Nous avons tout d'abord étudié les cinq protéines connues à l'époque qui sont FANCA, FANCC, FANCE, FANCF et FANCG. Par la suite, nous avons étendu notre étude à des protéines découvertes plus récemment, FANCL, FANCM et FAAP24, en concentrant finalement notre travail sur la caractérisation de FANCM. FANCM, contrairement aux autres protéines du complexe, est constituée de deux domaines conservés suggérant un rôle important dans le métabolisme de l'ADN. Il s'agit d'un domaine « DEAH box hélicase »situé dans la partie N-terminale et d'un domaine « ERCC4 nuclease »situé dans la partie C-terminale de la protéine. Dans cette étude, nous avons purifié avec succès la protéine FANCM entière à partir d'un système hétérologue. Nous montrons que FANCM s'attache de manière spécifique à des jonctions de Holliday et des fourches de réplication. De plus, nous démontrons que FANCM peut déplacer le point de jonction de ces structures via son domaine hélicase de manière dépendante de l'ATP. FANCM est aussi capable de dissocier de grands intermédiaires de la recombinaison, via la migration de jonctions de Holliday à travers une région d'homologie de 2.6 kb. Tous ces résultats suggèrent que FANCM peut s'attacher spécifiquement à des fourches de réplication et à des jonctions de Holliday in vitro et que son domaine hélicase est associé à une activité migratoire efficace. Nous pensons que FANCM peut avoir un rôle direct sur les intermédiaires de réplication. Ceci est en accord avec l'idée que les protéines de l'anémie de Fanconi coordonnent la réparation de l'ADN au niveau des fourches de réplication arrêtées. Nos résultats donnent une première indication quant au rôle de FANCM dans la cellule et peuvent contribuer à élucider la fonction de cette voie de signalisation peu comprise jusqu'à présent. SUMMARY: The genome of every cell is subject to a constant offence by endogenous and exogenous agents. Not surprisingly; cells have evolved a multitude of mechanisms which aim at preserving genomic integrity. A key step during the life cycle of a cell, DNA replication itself, constitutes a special danger to the integrity of the genome. The proteins defective in the rare hereditary disease Fanconi anemia (FA) are suspected to play a crucial role in the cellular response to DNA replication stress. The disease is associated with chromosomal instability and pronounced cancer susceptibility. Cells from Fanconi anemia patients are sensitive to a variety of agents which interfere with DNA replication, DNA interstrand cross-linking agents being particularly threatening to their survival. Fanconi anemia is a genetically heterogeneous disease with 13 different proteins identified, which seem to work together in a common pathway. Since one of the FA genes is identical to the breast cancer susceptibility gene BRCA2, it is also referred to as the FA/BRCA pathway. Eight proteins form a nuclear complex, whose integriry is required for the monoubiquitination of two other FA proteins, FANCD2 and FANCI, in response to DNA replication stress. Despite intensive research, the function of the FA/BRCA pathway at a molecular level has remained largely elusive so far. At the beginning of my thesis, we therefore decided to purify the proteins of the FA core complex and to investigate their biochemical properties. We started with the five proteins which were known at that time, FANCA, FANCC, FANCE, FANCF, and FACG. Later on, we extended our studies to the newly discovered proteins FANCL, FANCM, and FAAP24, and eventually focused our work on the characterisation of FANCM. In contrast to the other core complex proteins, FANCM contains two conserved domains, which point to a role in DNA metabolism: an N-terminal DEAH box helicase domain and a C-terminal ERCC4 nuclease domain. In this study, we have successfully purified full-length FANCM from a recombinant source. We show that purified FANCM binds to branched DNA molecules, such as Holliday junctions and replication forks, with high specificity and affinity. In addition, we demonstrate that FANCM can translocate the junction point of branched DNA molecules due to its helicase domain in an ATPase-dependent manner. FANCM can even dissociate large recombination intermediates, via branch migration of Holliday junctions through a 2.6 kb region of homology. Taken together, our data suggest that FANCM can specifically bind to replication forks and Holliday junctions in vitro, and that its DEAH box helicase domain is associated with a potent branch migration activity. We propose that FANCM might have a direct role in the processing of DNA replication intermediates. This is consistent with the current view that FA proteins coordinate DNA repair at stalled replication forks. Our findings provide a first hint as to the context in which FANCM might play a role in the cell. We are optimistic that they might be key to further elucidate the function of a pathway which is far from being understood.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Numerous genetic loci have been associated with systolic blood pressure (SBP) and diastolic blood pressure (DBP) in Europeans. We now report genome-wide association studies of pulse pressure (PP) and mean arterial pressure (MAP). In discovery (N = 74,064) and follow-up studies (N = 48,607), we identified at genome-wide significance (P = 2.7 × 10(-8) to P = 2.3 × 10(-13)) four new PP loci (at 4q12 near CHIC2, 7q22.3 near PIK3CG, 8q24.12 in NOV and 11q24.3 near ADAMTS8), two new MAP loci (3p21.31 in MAP4 and 10q25.3 near ADRB1) and one locus associated with both of these traits (2q24.3 near FIGN) that has also recently been associated with SBP in east Asians. For three of the new PP loci, the estimated effect for SBP was opposite of that for DBP, in contrast to the majority of common SBP- and DBP-associated variants, which show concordant effects on both traits. These findings suggest new genetic pathways underlying blood pressure variation, some of which may differentially influence SBP and DBP.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: Efavirenz and abacavir are components of recommended first-line regimens for HIV-1 infection. We used genome-wide genotyping and clinical data to explore genetic associations with virologic failure among patients randomized to efavirenz-containing or abacavir-containing regimens in AIDS Clinical Trials Group (ACTG) protocols. PARTICIPANTS AND METHODS: Virologic response and genome-wide genotype data were available from treatment-naive patients randomized to efavirenz-containing (n=1596) or abacavir-containing (n=786) regimens in ACTG protocols 384, A5142, A5095, and A5202. RESULTS: Meta-analysis of association results across race/ethnic groups showed no genome-wide significant associations (P<5×10) with virologic response for either efavirenz or abacavir. Our sample size provided 80% power to detect a genotype relative risk of 1.8 for efavirenz and 2.4 for abacavir. Analyses focused on CYP2B genotypes that define the lowest plasma efavirenz exposure stratum did not show associations nor did analysis limited to gene sets predicted to be relevant to efavirenz and abacavir disposition. CONCLUSION: No single polymorphism is associated strongly with virologic failure with efavirenz-containing or abacavir-containing regimens. Analyses to better consider context, and that minimize confounding by nongenetic factors, may show associations not apparent here.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Searching for matches between large collections of short (14-30 nucleotides) words and sequence databases comprising full genomes or transcriptomes is a common task in biological sequence analysis. We investigated the performance of simple indexing strategies for handling such tasks and developed two programs, fetchGWI and tagger, that index either the database or the query set. Either strategy outperforms megablast for searches with more than 10,000 probes. FetchGWI is shown to be a versatile tool for rapidly searching multiple genomes, whose performance is limited in most cases by the speed of access to the filesystem. We have made publicly available a Web interface for searching the human, mouse, and several other genomes and transcriptomes with oligonucleotide queries.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The Caulobacter DNA methyltransferase CcrM is one of five master cell-cycle regulators. CcrM is transiently present near the end of DNA replication when it rapidly methylates the adenine in hemimethylated GANTC sequences. The timing of transcription of two master regulator genes and two cell division genes is controlled by the methylation state of GANTC sites in their promoters. To explore the global extent of this regulatory mechanism, we determined the methylation state of the entire chromosome at every base pair at five time points in the cell cycle using single-molecule, real-time sequencing. The methylation state of 4,515 GANTC sites, preferentially positioned in intergenic regions, changed progressively from full to hemimethylation as the replication forks advanced. However, 27 GANTC sites remained unmethylated throughout the cell cycle, suggesting that these protected sites could participate in epigenetic regulatory functions. An analysis of the time of activation of every cell-cycle regulatory transcription start site, coupled to both the position of a GANTC site in their promoter regions and the time in the cell cycle when the GANTC site transitions from full to hemimethylation, allowed the identification of 59 genes as candidates for epigenetic regulation. In addition, we identified two previously unidentified N(6)-methyladenine motifs and showed that they maintained a constant methylation state throughout the cell cycle. The cognate methyltransferase was identified for one of these motifs as well as for one of two 5-methylcytosine motifs.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Genomic clones containing the Xenopus laevis vitellogenin gene B1 have been isolated from DNA libraries and characterized by heteroduplex mapping in the electron microscope, restriction endonuclease analysis, and in vitro transcription in a HeLa whole-cell extract. Sequences from the 3'-flanking region of the previously isolated A1 vitellogenin gene were found in the 5'-flanking region of this B1 gene. Thus, the two genes are linked, with 15.5 kilobase pairs of DNA between them. Their length is about 22 kilobase pairs (A1 gene) and 16.5 kilobase pairs (B1 gene) and they have the following arrangement: 5'-A1 gene-spacer-B1 gene-3'. The analysis of heteroduplexes formed between the two genes revealed several regions of homology. Both genes are in the same orientation and, therefore, are transcribed from the same DNA strand. The possible events by which the vitellogenin gene family arose in Xenopus laevis are discussed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Within the ENCODE Consortium, GENCODE aimed to accurately annotate all protein-coding genes, pseudogenes, and noncoding transcribed loci in the human genome through manual curation and computational methods. Annotated transcript structures were assessed, and less well-supported loci were systematically, experimentally validated. Predicted exon-exon junctions were evaluated by RT-PCR amplification followed by highly multiplexed sequencing readout, a method we called RT-PCR-seq. Seventy-nine percent of all assessed junctions are confirmed by this evaluation procedure, demonstrating the high quality of the GENCODE gene set. RT-PCR-seq was also efficient to screen gene models predicted using the Human Body Map (HBM) RNA-seq data. We validated 73% of these predictions, thus confirming 1168 novel genes, mostly noncoding, which will further complement the GENCODE annotation. Our novel experimental validation pipeline is extremely sensitive, far more than unbiased transcriptome profiling through RNA sequencing, which is becoming the norm. For example, exon-exon junctions unique to GENCODE annotated transcripts are five times more likely to be corroborated with our targeted approach than with extensive large human transcriptome profiling. Data sets such as the HBM and ENCODE RNA-seq data fail sampling of low-expressed transcripts. Our RT-PCR-seq targeted approach also has the advantage of identifying novel exons of known genes, as we discovered unannotated exons in ~11% of assessed introns. We thus estimate that at least 18% of known loci have yet-unannotated exons. Our work demonstrates that the cataloging of all of the genic elements encoded in the human genome will necessitate a coordinated effort between unbiased and targeted approaches, like RNA-seq and RT-PCR-seq.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Context : It is now clearly shown that genetic factors in association with environment play a key role in obesity and eating disorders. This project studies the clinical symptoms and molecular abnormalities in patients carrying a strong hereditary predisposition to obesity and eating behavior disorders. We have previously published the association between the 16:29.5-30.1 deletion and a very penetrant form of morbid obesity and macrocephaly. We have also demonstrated the association between the reciprocal 16:29.5-30.1 duplication and underweight and small head circumference. These 2 studies demonstrate that gene dosage of one or several genes in this region regulates BMI as well as brain growth. At present, there are no data pointing towards particular candidate genes. We are currently investigating a second non-overlapping recurrent CNV encompassing SH2B1, upstream of the aforementioned rearrangement. SNPs in this gene have been associated with BMI in GWAS studies and mice models confirmed this association. Bokuchova et al have reported an association between deletions encompassing this gene and severe early onset obesity, as well as insulin resistance. We are currently collecting and analyzing data to fully characterize the phenotype and the transcriptional patterns associated with this rearrangement. Aims : 1. Identify carriers of any CNVs in the greater 16p11.2 region (between 16:28MB and 32MB) in the EGG consortium. 2. Perform association studies between SNPs in the greater 16p11.2 region (16:28-32MB) and anthropometric measures with adjusted "locus-wide significance", to identify or prioritize candidate genes potentially driving the association observed in patients with the CNVs (and thus worthy of further validation and sequencing). 3. Explore associations between GSV genome-wide and brain volume. 4. Explore relationship between brain volumes (whole brain and regional for those who underwent brain MRI), head circumference and BMI. 5. Extrapolate this procedure to other regions covered by the Metabochip. Methods : - Examine and collect clinical informations, as well as molecular informations in these patients. - Analysis of MRI data in children and adults with BMI > 2SD. Compare changes to MRI data obtained in patients with monogenic forms of obesity (data from Lausanne study) and to underweight (BMI<-2SD) individuals from EGG. - Test whether opposite extremes of the phenotypic distribution may be highly informative Expected results : This is a highly focused study, pertaining to approximately 1 0/00 of the human genome. Yet it is clear that if successful, the lessons learned from this study could be extrapolated to other segments of the genome and would need validation and replication by additional studies. Altogether they will contribute to further explore the missing heritability and point to etiologic genes and pathways underlying these important health burdens.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We report on two patients with de novo subtelomeric terminal deletion of chromosome 6p. Patient 1 is an 8-month-old female born with normal growth parameters, typical facial features of 6pter deletion, bilateral corectopia, and protruding tongue. She has severe developmental delay, profound bilateral neurosensory deafness, poor visual contact, and hypsarrhythmia since the age of 6 months. Patient 2 is a 5-year-old male born with normal growth parameters and unilateral hip dysplasia; he has a characteristic facial phenotype, bilateral embryotoxon, and moderate mental retardation. Further characterization of the deletion, using high-resolution array comparative genomic hybridization (array-CGH; Agilent Human Genome kit 244 K), revealed that Patient 1 has a 8.1 Mb 6pter-6p24.3 deletion associated with a contiguous 5.8 Mb 6p24.3-6p24.1 duplication and Patient 2 a 5.7 Mb 6pter-6p25.1 deletion partially overlapping with that of Patient 1. Complementary FISH and array analysis showed that the inv del dup(6) in Patient 1 originated de novo. Our results demonstrate that simple rearrangements are often more complex than defined by standard techniques. We also discuss genotype-phenotype correlations including previously reported cases of deletion 6p.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The "one-gene, one-protein" rule, coined by Beadle and Tatum, has been fundamental to molecular biology. The rule implies that the genetic complexity of an organism depends essentially on its gene number. The discovery, however, that alternative gene splicing and transcription are widespread phenomena dramatically altered our understanding of the genetic complexity of higher eukaryotic organisms; in these, a limited number of genes may potentially encode a much larger number of proteins. Here we investigate yet another phenomenon that may contribute to generate additional protein diversity. Indeed, by relying on both computational and experimental analysis, we estimate that at least 4%-5% of the tandem gene pairs in the human genome can be eventually transcribed into a single RNA sequence encoding a putative chimeric protein. While the functional significance of most of these chimeric transcripts remains to be determined, we provide strong evidence that this phenomenon does not correspond to mere technical artifacts and that it is a common mechanism with the potential of generating hundreds of additional proteins in the human genome.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Plants have evolved exquisite ways to detect their enemies and are able to induce defenses responses tailored to their specific aggressors. Insect eggs deposited on a leaf represent a future threat as larvae hatching from the egg will ultimately feed on the plant. Although direct and indirect defenses towards oviposition have been documented, our knowledge of the molecular changes triggered by egg deposition is limited. Using a whole-genome microarray, we recently analyzed the expression profile of Arabidopsis thaliana leaves after oviposition by two pierid butterflies. Eggs laid by the large white Pieris brassicae modified the expression of hundreds of genes. The transcript signature included defense and stress-related genes that were also induced in plants experiencing localized cell death. Further analyses revealed that cellular changes associated with a hypersensitive response occur at the site of egg deposition and that they are triggered by egg-derived elicitors. Our study brings molecular evidence for previous observations of oviposition-induced necrosis in other plant species and might illustrate a direct defense of the plant against the egg. In this addendum, we discuss the relevance of the oviposition-induced gene expression changes and the possibility that plants use eggs as cues to anticipate their enemies.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Recent genome-wide association (GWA) studies described 95 loci controlling serum lipid levels. These common variants explain ∼25% of the heritability of the phenotypes. To date, no unbiased screen for gene-environment interactions for circulating lipids has been reported. We screened for variants that modify the relationship between known epidemiological risk factors and circulating lipid levels in a meta-analysis of genome-wide association (GWA) data from 18 population-based cohorts with European ancestry (maximum N = 32,225). We collected 8 further cohorts (N = 17,102) for replication, and rs6448771 on 4p15 demonstrated genome-wide significant interaction with waist-to-hip-ratio (WHR) on total cholesterol (TC) with a combined P-value of 4.79×10(-9). There were two potential candidate genes in the region, PCDH7 and CCKAR, with differential expression levels for rs6448771 genotypes in adipose tissue. The effect of WHR on TC was strongest for individuals carrying two copies of G allele, for whom a one standard deviation (sd) difference in WHR corresponds to 0.19 sd difference in TC concentration, while for A allele homozygous the difference was 0.12 sd. Our findings may open up possibilities for targeted intervention strategies for people characterized by specific genomic profiles. However, more refined measures of both body-fat distribution and metabolic measures are needed to understand how their joint dynamics are modified by the newly found locus.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The RNA genome of the human T-cell leukemia virus type 1 (HTLV-1) codes for proteins involved in infectivity, replication, and transformation. We report in this study the characterization of a novel viral protein encoded by the complementary strand of the HTLV-1 RNA genome. This protein, designated HBZ (for HTLV-1 bZIP factor), contains a N-terminal transcriptional activation domain and a leucine zipper motif in its C terminus. We show here that HBZ is able to interact with the bZIP transcription factor CREB-2 (also called ATF-4), known to activate the HTLV-1 transcription by recruiting the viral trans-activator Tax on the Tax-responsive elements (TxREs). However, we demonstrate that the HBZ/CREB-2 heterodimers are no more able to bind to the TxRE and cyclic AMP response element sites. Taking these findings together, the functional inactivation of CREB-2 by HBZ is suggested to contribute to regulation of the HTLV-1 transcription. Moreover, the characterization of a minus-strand gene protein encoded by HTLV-1 has never been reported until now.