26 resultados para REPETITIVE DNA
em National Center for Biotechnology Information - NCBI
Resumo:
Although integration of viral DNA into host chromosomes occurs regularly in bacteria and animals, there are few reported cases in plants, and these involve insertion at only one or a few sites. Here, we report that pararetrovirus-like sequences have integrated repeatedly into tobacco chromosomes, attaining a copy number of ≈103. Insertion apparently occurred by illegitimate recombination. From the sequences of 22 independent insertions recovered from a healthy plant, an 8-kilobase genome encoding a previously uncharacterized pararetrovirus that does not contain an integrase function could be assembled. Preferred boundaries of the viral inserts may correspond to recombinogenic gaps in open circular viral DNA. An unusual feature of the integrated viral sequences is a variable tandem repeat cluster, which might reflect defective genomes that preferentially recombine into plant DNA. The recurrent invasion of pararetroviral DNA into tobacco chromosomes demonstrates that viral sequences can contribute significantly to plant genome evolution.
Resumo:
The positions of ≈4,800 individual miniature inverted-repeat transposable element (MITE)-like repeats from four families were mapped on the Caenorhabditis elegans chromosomes. These families represent 1–2% of the total sequence of the organism. The four MITE families (Cele1, Cele2, Cele14, and Cele42) displayed distinct chromosomal distribution profiles. For example, the Cele14 MITEs were observed clustering near the ends of the autosomes. In contrast, the Cele2 MITEs displayed an even distribution through the central autosome domains, with no evidence for clustering at the ends. Both the number of elements and the distribution patterns of each family were conserved on all five C. elegans autosomes. The distribution profiles indicate chromosomal polarity and suggest that the current genetic and physical maps of chromosomes II, III, and X are inverted with respect to the other chromosomes. The degree of conservation of both the number and distribution of these elements on the five autosomes suggests a role in defining specific chromosomal domains.
Resumo:
Rearrangements between tandem sequence homologies of various lengths are a major source of genomic change and can be deleterious to the organism. These rearrangements can result in either deletion or duplication of genetic material flanked by direct sequence repeats. Molecular genetic analysis of repetitive sequence instability in Escherichia coli has provided several clues to the underlying mechanisms of these rearrangements. We present evidence for three mechanisms of RecA-independent sequence rearrangements: simple replication slippage, sister-chromosome exchange-associated slippage, and single-strand annealing. We discuss the constraints of these mechanisms and contrast their properties with RecA-dependent homologous recombination. Replication plays a critical role in the two slipped misalignment mechanisms, and difficulties in replication appear to trigger rearrangements via all these mechanisms.
Resumo:
We have characterized a family of repetitive DNA elements with homology to the MgPa cellular adhesion operon of Mycoplasma genitalium, a bacterium that has the smallest known genome of any free-living organism. One element, 2272 bp in length and flanked by DNA with no homology to MgPa, was completely sequenced. At least four others were partially sequenced. The complete element is a composite of six regions. Five of these regions show sequence similarity with nonadjacent segments of genes of the MgPa operon. The sixth region, located near the center of the element, is an A+T-rich sequence that has only been found in this repeat family. Open reading frames are present within the five individual regions showing sequence homology to MgPa and the adjacent open reading frame 3 (ORF3) gene. However, termination codons are found between adjacent regions of homology to the MgPa operon and in the A+T-rich sequence. Thus, these repetitive elements do not appear to be directly expressible protein coding sequences. The sequence of one region from five different repetitive elements was compared with the homologous region of the MgPa gene from the type strain G37 and four newly isolated M. genitalium strains. Recombination between repetitive elements of strain G37 and the MgPa operon can explain the majority of polymorphisms within our partial sequences of the MgPa genes of the new isolates. Therefore, we propose that the repetitive elements of M. genitalium provide a reservoir of sequence that contributes to antigenic variation in proteins of the MgPa cellular adhesion operon.
Resumo:
Eukaryotic genomes contain tracts of DNA in which a single base or a small number of bases are repeated (microsatellites). Mutations in the yeast DNA mismatch repair genes MSH2, PMS1, and MLH1 increase the frequency of mutations for normal DNA sequences and destabilize microsatellites. Mutations of human homologs of MSH2, PMS1, and MLH1 also cause microsatellite instability and result in certain types of cancer. We find that a mutation in the yeast gene MSH3 that does not substantially affect the rate of spontaneous mutations at several loci increases microsatellite instability about 40-fold, preferentially causing deletions. We suggest that MSH3 has different substrate specificities than the other mismatch repair proteins and that the human MSH3 homolog (MRP1) may be mutated in some tumors with microsatellite instability.
Resumo:
In the nuclear genome of Saccharomyces cerevisiae, simple, repetitive DNA sequences (microsatellites) mutate at rates much higher than nonrepetitive sequences. Most of these mutations are deletions or additions of repeat units. The yeast mitochondrial genome also contains many microsatellites. To examine the stability of these sequences, we constructed a reporter gene (arg8m) containing out-of-frame insertions of either poly(AT) or poly(GT) tracts within the coding sequence. Yeast strains with this reporter gene inserted within the mitochondrial genome were constructed. Using these strains, we showed that poly(GT) tracts were considerably less stable than poly(AT) tracts and that alterations usually involved deletions rather than additions of repeat units. In contrast, in the nuclear genome, poly(GT) and poly(AT) tracts had similar stabilities, and alterations usually involved additions rather than deletions. Poly(GT) tracts were more stable in the mitochondria of diploid cells than in haploids. In addition, an msh1 mutation destabilized poly(GT) tracts in the mitochondrial genome.
Resumo:
Current evidence on the long-term evolutionary effect of insertion of sequence elements into gene regions is reviewed, restricted to cases where a sequence derived from a past insertion participates in the regulation of expression of a useful gene. Ten such examples in eukaryotes demonstrate that segments of repetitive DNA or mobile elements have been inserted in the past in gene regions, have been preserved, sometimes modified by selection, and now affect control of transcription of the adjacent gene. Included are only examples in which transcription control was modified by the insert. Several cases in which merely transcription initiation occurred in the insert were set aside. Two of the examples involved the long terminal repeats of mammalian endogenous retroviruses. Another two examples were control of transcription by repeated sequence inserts in sea urchin genomes. There are now six published examples in which Alu sequences were inserted long ago into human gene regions, were modified, and now are central in control/enhancement of transcription. The number of published examples of Alu sequences affecting gene control has grown threefold in the last year and is likely to continue growing. Taken together, all of these examples show that the insertion of sequence elements in the genome has been a significant source of regulatory variation in evolution.
Resumo:
Recent experiments have exposed significant discrepancies between experimental data and predictive models for DNA structure. These results strongly suggest that DNA structural parameters incorporated in the models are not always sufficient to account for the influence of sequence context and of specific ion effects. In an attempt to evaluate these two effects, we have investigated repetitive DNA sequences with the sequence motif GAGAG.CTCTC located in different helical phasing arrangements with respect to poly(A) tracts and GGGCCC.GGGCCC sequence motifs. Methods used are ligase-mediated cyclization and gel mobility experiments along with DNase I cutting and chemical probe studies. The results provide new evidence for curvature in poly(A) tracts. They also show that the sequence context in which bending and flexible sequence elements are found is an important aspect of sequence-dependent DNA conformation. Although dinucleotide models generally have good predictive power, this work demonstrates that in some instances sequence elements larger than the dinucleotide must be taken into account, and hence it provides a starting point for the appropriate modification and refinement of existing structural models for DNA.
Resumo:
ATRX is a member of the SNF2 family of helicase/ATPases that is thought to regulate gene expression via an effect on chromatin structure and/or function. Mutations in the hATRX gene cause severe syndromal mental retardation associated with α-thalassemia. Using indirect immunofluorescence and confocal microscopy we have shown that ATRX protein is associated with pericentromeric heterochromatin during interphase and mitosis. By coimmunofluorescence, ATRX localizes with a mouse homologue of the Drosophila heterochromatic protein HP1 in vivo, consistent with a previous two-hybrid screen identifying this interaction. From the analysis of a trap assay for nuclear proteins, we have shown that the localization of ATRX to heterochromatin is encoded by its N-terminal region, which contains a conserved plant homeodomain-like finger and a coiled-coil domain. In addition to its association with heterochromatin, at metaphase ATRX clearly binds to the short arms of human acrocentric chromosomes, where the arrays of ribosomal DNA are located. The unexpected association of a putative transcriptional regulator with highly repetitive DNA provides a potential explanation for the variability in phenotype of patients with identical mutations in the ATRX gene.
Resumo:
The human adult α-globin locus consists of three pairs of homology blocks (X, Y, and Z) interspersed with three nonhomology blocks (I, II, and III), and three Alu family repeats, Alu1, Alu2, and Alu3. It has been suggested that an ancient primate α-globin-containing unit was ancestral to the X, Y, and Z and the Alu1/Alu2 repeats. However, the evolutionary origin of the three nonhomologous blocks has remained obscure. We have now analyzed the sequence organization of the entire adult α-globin locus of gibbon (Hylobates lar). DNA segments homologous to human block I occur in both duplication units of the gibbon α-globin locus. Detailed interspecies sequence comparisons suggest that nonhomologous blocks I and II, as well as another sequence, IV, were all part of the ancestral α-globin-containing unit prior to its tandem duplication. However, sometime thereafter, block I was deleted from the human α1-globin-containing unit, and block II was also deleted from the α2-globin-containing unit in both human and gibbon. These were probably independent events both mediated by independent illegitimate recombination processes. Interestingly, the end points of these deletions coincide with potential insertion sites of Alu family repeats. These results suggest that the shaping of DNA segments in eukaryotic genomes involved the retroposition of repetitive DNA elements in conjunction with simple DNA recombination processes.
Resumo:
A physical map of the 31-megabase Aspergillus nidulans genome is reported, in which 94% of 5,134 cosmids are assigned to 49 contiguous segments. The physical map is the result of a two-way ordering process, in which clones and probes were ordered simultaneously on a binary DNA/DNA hybridization matrix. Compression by elimination of redundant clones resulted in a minimal map, which is a chromosome walk. Repetitive DNA is nonrandomly dispersed in the A. nidulans genome, reminiscent of heterochromatic banding patterns of higher eukaryotes. We hypothesize gene clusters may arise by horizontal transfer and spread by transposition to explain the nonrandom pattern of repeats along chromosomes.
Resumo:
The maize genome is replete with chromosomal duplications and repetitive DNA. The duplications resulted from an ancient polyploid event that occurred over 11 million years ago. Based on DNA sequence data, the polyploid event occurred after the divergence between sorghum and maize, and hence the polyploid event explains some of the difference in DNA content between these two species. Genomic rearrangement and diploidization followed the polyploid event. Most of the repetitive DNA in the maize genome is retrotransposable elements, and they comprise 50% of the genome. Retrotransposon multiplication has been relatively recent—within the last 5–6 million years—suggesting that the proliferation of retrotransposons has also contributed to differences in DNA content between sorghum and maize. There are still unanswered questions about repetitive DNA, including the distribution of repetitive DNA throughout the genome, the relative impacts of retrotransposons and chromosomal duplication in plant genome evolution, and the hypothesized correlation of duplication events with transposition. Population genetic processes also affect the evolution of genomes. We discuss how centromeric genes should, in theory, contain less genetic diversity than noncentromeric genes. In addition, studies of diversity in the wild relatives of maize indicate that different genes have different histories and also show that domestication and intensive breeding have had heterogeneous effects on genetic diversity across genes.
Resumo:
The RAD27 gene of Saccharomyces cerevisiae encodes a 5′-3′ flap exo/endonuclease, which plays an important role during DNA replication for Okazaki fragment maturation. Genetic studies have shown that RAD27 is not essential for growth, although rad27Δ mutants are temperature sensitive. Moreover, they exhibit increased sensitivity to alkylating agents, enhanced spontaneous recombination, and repetitive DNA instability. The conditional lethality conferred by the rad27Δ mutation indicates that other nuclease(s) can compensate for the absence of Rad27. Indeed, biochemical and genetical analyses indicate that Okazaki fragment processing can be assured by other enzymatic activities or by alternative pathways such as homologous recombination. Here we present the results of a screen that makes use of a synthetic lethality assay to identify functions required for the survival of rad27Δ strains. Altogether, we confirm that all genes of the Rad52 recombinational repair pathway are required for the survival of rad27Δ strains at both permissive (23°C) and semipermissive (30°C) temperatures for growth. We also find that several point mutations that confer weaker phenotypes in mitotic than in meiotic cells (rad50S, mre11s) and additional gene deletions (com1/sae2, srs2) exhibit synthetic lethality with rad27Δ and that rad59Δ exhibits synergistic effects with rad27Δ. This and previous studies indicate that homologous recombination is the primary, but not only, pathway that functions to bypass the replication defects that arise in the absence of the Rad27 protein.
Resumo:
The nucleotide sequence of the human alpha-albumin gene, including 887 bp of the 5'-flanking region and 1311 bp of the 3-flanking region (24,454 in total), was determined from three overlapping lambda phage clones. The sequence spans 22,256 bp from the cap site to the polyadenylylation site, revealing a gene structure of 15 exons separated by 14 introns. The methionine initiation codon ATG is within exon 1; the termination codon TGA is within exon 14. Exon 15 is entirely untranslated and contains the polyadenylylation signal AATAAA. The deduced polypeptide chain is composed of a 21-amino-acid leader peptide, followed by 578 amino acids of the mature protein. There are seven repetitive DNA elements (Alu and Kpn) in the introns and 3-flanking region. The sizes of the 15 alpha-albumin exons match closely those of the albumin, alpha-fetoprotein, and vitamin D-binding protein genes. The exons are symmetrically placed within the three domains of the individual proteins, and they share a characteristic codon splitting pattern that is conserved among members of the gene family. The results provide strong evidence that alpha-albumin belongs to, and most likely completes with, the serum albumin gene family. Based on structural similarity, alpha-albumin appears to be most closely related to alpha-fetoprotein. The complete structure of this family of four tandemly linked genes provides a well-characterized approximately 200 kb locus in the 4q subcentromeric region of the human genome.
Resumo:
Arabidopsis thaliana is a small flowering plant that is a member of the family cruciferae. It has many characteristics--diploid genetics, rapid growth cycle, relatively low repetitive DNA content, and small genome size--that recommend it as the model for a plant genome project. The current status of the genetic and physical maps, as well as efforts to sequence the genome, are presented. Examples are given of genes isolated by using map-based cloning. The importance of the Arabidopsis project for plant biology in general is discussed.