126 resultados para Single sequence repeat
Resumo:
The root hair is a specialized cell type involved in water and nutrient uptake in plants. In legumes the root hair is also the primary site of recognition and infection by symbiotic nitrogen-fixing Rhizobium bacteria. We have studied the root hairs of Medicago truncatula, which is emerging as an increasingly important model legume for studies of symbiotic nodulation. However, only 27 genes from M. truncatula were represented in GenBank/EMBL as of October, 1997. We report here the construction of a root-hair-enriched cDNA library and single-pass sequencing of randomly selected clones. Expressed sequence tags (899 total, 603 of which have homology to known genes) were generated and made available on the Internet. We believe that the database and the associated DNA materials will provide a useful resource to the community of scientists studying the biology of roots, root tips, root hairs, and nodulation.
Resumo:
Genetic instability can be induced by unusual DNA structures and sequence repeats. We have previously demonstrated that a large palindrome in the mouse germ line derived from transgene integration is extremely unstable and undergoes stabilizing rearrangements at high frequency, often through deletions that produce asymmetry. We have now characterized other palindrome rearrangements that arise from complex homologous recombination events. The structure of the recombinants is consistent with homologous recombination occurring by a noncrossover gene conversion mechanism in which a break induced in the palindrome promotes homologous strand invasion and repair synthesis, similar to mitotic break repair events reported in mammalian cells. Some of the homologous recombination events led to expansion in the size of the palindromic locus, which in the extreme case more than doubled the number of repeats. These results may have implications for instability observed at naturally occurring palindromic or quasipalindromic sequences.
Resumo:
Most eukaryotic telomeres contain a repeating motif with stretches of guanine residues that form a 3′-terminal overhang extending beyond the telomeric duplex region. The telomeric repeat of hypotrichous ciliates, d(T4G4), forms a 16-nucleotide 3′-overhang. Such sequences can adopt parallel-stranded as well as antiparallel-stranded quadruplex conformations in vitro. Although it has been proposed that guanine-quadruplex conformations may have important cellular roles including telomere function, recombination, and transcription, evidence for the existence of this DNA structure in vivo has been elusive to date. We have generated high-affinity single-chain antibody fragment (scFv) probes for the guanine-quadruplex formed by the Stylonychia telomeric repeat, by ribosome display from the Human Combinatorial Antibody Library. Of the scFvs selected, one (Sty3) had an affinity of Kd = 125 pM for the parallel-stranded guanine-quadruplex and could discriminate with at least 1,000-fold specificity between parallel or antiparallel quadruplex conformations formed by the same sequence motif. A second scFv (Sty49) bound both the parallel and antiparallel quadruplex with similar (Kd = 3–5 nM) affinity. Indirect immunofluorescence studies show that Sty49 reacts specifically with the macronucleus but not the micronucleus of Stylonychia lemnae. The replication band, the region where replication and telomere elongation take place, was also not stained, suggesting that the guanine-quadruplex is resolved during replication. Our results provide experimental evidence that the telomeres of Stylonychia macronuclei adopt in vivo a guanine-quadruplex structure, indicating that this structure may have an important role for telomere functioning.
Resumo:
In this paper, a new way to think about, and to construct, pairwise as well as multiple alignments of DNA and protein sequences is proposed. Rather than forcing alignments to either align single residues or to introduce gaps by defining an alignment as a path running right from the source up to the sink in the associated dot-matrix diagram, we propose to consider alignments as consistent equivalence relations defined on the set of all positions occurring in all sequences under consideration. We also propose constructing alignments from whole segments exhibiting highly significant overall similarity rather than by aligning individual residues. Consequently, we present an alignment algorithm that (i) is based on segment-to-segment comparison instead of the commonly used residue-to-residue comparison and which (ii) avoids the well-known difficulties concerning the choice of appropriate gap penalties: gaps are not treated explicity, but remain as those parts of the sequences that do not belong to any of the aligned segments. Finally, we discuss the application of our algorithm to two test examples and compare it with commonly used alignment methods. As a first example, we aligned a set of 11 DNA sequences coding for functional helix-loop-helix proteins. Though the sequences show only low overall similarity, our program correctly aligned all of the 11 functional sites, which was a unique result among the methods tested. As a by-product, the reading frames of the sequences were identified. Next, we aligned a set of ribonuclease H proteins and compared our results with alignments produced by other programs as reported by McClure et al. [McClure, M. A., Vasi, T. K. & Fitch, W. M. (1994) Mol. Biol. Evol. 11, 571-592]. Our program was one of the best scoring programs. However, in contrast to other methods, our protein alignments are independent of user-defined parameters.
Resumo:
Aminoacyl-tRNA synthetases (tRNA synthetases) of higher eukaryotes form a multiprotein complex. Sequence elements that are responsible for the protein assembly were searched by using a yeast two-hybrid system. Human cytoplasmic isoleucyl-tRNA synthetase is a component of the multi-tRNA synthetase complex and it contains a unique C-terminal appendix. This part of the protein was used as bait to identify an interacting protein from a HeLa cDNA library. The selected sequence represented the internal 317 amino acids of human bifunctional (glutamyl- and prolyl-) tRNA synthetase, which is also known to be a component of the complex. Both the C-terminal appendix of the isoleucyl-tRNA synthetase and the internal region of bifunctional tRNA synthetase comprise repeating sequence units, two repeats of about 90 amino acids, and three repeats of 57 amino acids, respectively. Each repeated motif of the two proteins was responsible for the interaction, but the stronger interaction was shown by the native structures containing multiple motifs. Interestingly, the N-terminal extension of human glycyl-tRNA synthetase containing a single motif homologous to those in the bifunctional tRNA synthetase also interacted with the C-terminal motif of the isoleucyl-tRNA synthetase although the enzyme is not a component of the complex. The data indicate that the multiplicity of the binding motif in the tRNA synthetases is necessary for enhancing the interaction strength and may be one of the determining factors for the tRNA synthetases to be involved in the formation of the multi-tRNA synthetase complex.
Resumo:
Tissue-specific transcription is regulated in part by cell type-restricted proteins that bind to defined sequences in target genes. The DNA-binding domain of these proteins is often evolutionarily conserved. On this basis, liver-enriched transcription factors were classified into five families. We describe here the mammalian prototype of a sixth family, which we therefore call hepatocyte nuclear factor 6 (HNF-6). It activates the promoter of a gene involved in the control of glucose metabolism. HNF-6 contains two different DNA-binding domains. One of these corresponds to a novel type of homeodomain. The other is homologous to the Drosophila cut domain. A similar bipartite sequence is coded by the genome of Caenorhabditis elegans.
Resumo:
Nucleosomes, the basic structural elements of chromosomes, consist of 146 bp of DNA coiled around an octamer of histone proteins, and their presence can strongly influence gene expression. Considerations of the anisotropic flexibility of nucleotide triplets containing 3 cytosines or guanines suggested that a [5'(G/C)3 NN3']n motif might resist wrapping around a histone octamer. To test this, DNAs were constructed containing a 5'-CCGNN-3' pentanucleotide repeat with the Ns varied. Using in vitro nucleosome reconstitution and electron microscopy, a plasmid with 48 contiguous CCGNN repeats strongly excluded nucleosomes in the repeat region. Competitive reconstitution gel retardation experiments using DNA fragments containing 12, 24, or 48 CCGNN repeats showed that the propensity to exclude nucleosomes increased with the length of the repeat. Analysis showed that a 268-bp DNA containing a (CCGNN)48 block is 4.9 +/- 0.6-fold less efficient in nucleosome assembly than a similar length pUC19 fragment and approximately 78-fold less efficient than a similar length (CTG)n sequence, based on results from previous studies. Computer searches against the GenBank database for matches with a [(G/C)3NN]48 sequence revealed numerous examples that frequently were present in the control regions of "TATA-less" genes, including the human ETS-2 and human dihydrofolate reductase genes. In both cases the (G/C)3NN repeat, present in the promoter region, co-maps with loci previously shown to be nuclease hypersensitive sites.
Resumo:
Hairpin polyamides are synthetic ligands for sequence-specific recognition in the minor groove of double-helical DNA. A thermodynamic characterization of the DNA-binding properties exhibited by a six-ring hairpin polyamide, ImPyPy-gamma-PyPyPy-beta-Dp (where Im = imidazole, Py = pyrrole, gamma = gamma-aminobutyric acid, beta = beta-alanine, and Dp = dimethylaminopropylamide), reveals an approximately 1-2 kcal/mol greater affinity for the designated match site, 5'-TGTTA-3', relative to the single base pair mismatch sites, 5'-TGGTA-3' and 5'-TATTA-3'. The enthalpy and entropy data at 20 degrees C reveal this sequence specificity to be entirely enthalpic in origin. Correlations between the thermodynamic driving forces underlying the sequence specificity exhibited by ImPyPy-gamma-PyPyPy-beta-Dp and the structural properties of the heterodimeric complex of PyPyPy and ImPyPy bound to the minor groove of DNA provide insight into the molecular forces that govern the affinity and specificity of pyrrole-imidazole polyamides.
Resumo:
Several recent reports indicate that mobile elements are frequently found in and flanking many wild-type plant genes. To determine the extent of this association, we performed computer-based systematic searches to identify mobile elements in the genes of two "model" plants, Oryza sativa (domesticated rice) and Arabidopsis thaliana. Whereas 32 common sequences belonging to nine putative mobile element families were found in the noncoding regions of rice genes, none were found in Arabidopsis genes. Five of the nine families (Gaijin, Castaway, Ditto, Wanderer, and Explorer) are first described in this report, while the other four were described previously (Tourist, Stowaway, p-SINE1, and Amy/LTP). Sequence similarity, structural similarity, and documentation of past mobility strongly suggests that many of the rice common sequences are bona fide mobile elements. Members of four of the new rice mobile element families are similar in some respects to members of the previously identified inverted-repeat element families, Tourist and Stowaway. Together these elements are the most prevalent type of transposons found in the rice genes surveyed and form a unique collection of inverted-repeat transposons we refer to as miniature inverted-repeat transposable elements or MITEs. The sequence and structure of MITEs are clearly distinct from short or long interspersed nuclear elements (SINEs or LINEs), the most common transposable elements associated with mammalian nuclear genes. Mobile elements, therefore, are associated with both animal and plant genes, but the identity of these elements is strikingly different.
Resumo:
Proliferation of dispersed plant cells in culture is strictly dependent on cell density, and cells in a low-density culture can only grow in the presence of conditioned medium (CM). No known plant hormones have been able to substitute for CM. To quantify the mitogenic activity of CM, we examined conditions for the assay system using mechanically dispersed mesophyll cells of Asparagus officinalis L. and established a highly sensitive bioassay method. By use of this method, the mitogenic activity of CM prepared from asparagus cells was characterized: it was heat-stable, susceptible to pronase digestion, and resistant to glycosidase treatment. On the basis of these results, the mitogenic activity in CM was purified 10(7)-fold by column chromatography, and two factors named phytosulfokine-alpha and -beta (PSK-alpha and PSK-beta) were obtained. By amino acid sequence analysis and mass spectrometry, the structures of these two factors were determined to be sulfated pentapeptide (H-Tyr(SO3H)-Ile-Tyr(SO3H)-Thr-Gln-OH) and sulfated tetrapeptide (H-Tyr(SO3H)-Ile-Tyr(SO3H)-Thr-OH). PSK-alpha and PSK-beta were prepared by chemical synthesis and enzymatic sulfation. The synthetic peptides exhibited the same activity as the natural factors, confirming the structure for PSK-alpha and PSK-beta mentioned above. This is the first elucidation of the structure of a conditioned medium factor required for the growth of low-density plant cell cultures.
Resumo:
Retroviruses undergo a high frequency of genetic alterations during the process of copying their RNA genomes. However, little is known about the replication fidelity of other elements that transpose via reverse transcription of an RNA intermediate. The complete sequence of 29 independently integrated copies of the yeast retrotransposon Ty1 (173,043 nt) was determined, and the mutation rate during a single cycle of replication was calculated. The observed base substitution rate of 2.5 x 10(-5) bp per replication cycle suggests that this intracellular element can mutate as rapidly as retroviruses. The pattern and distribution of errors in the Ty1 genome is nonrandom and provides clues to potential in vivo molecular mechanisms of reverse transcriptase-mediated error generation, including heterogeneous RNase H cleavage of Ty1 RNA, addition of terminal nontemplated bases, and transient dislocation and realignment of primer-templates. Overall, analysis of errors generated during Ty1 replication underscores the utility of a genetically tractable model system for the study of reverse transcriptase fidelity.
Resumo:
The effect of histone H1 binding on the cleavage of superhelical plasmids by single-strand-specific nucleases was investigated. Mapping of P1 cleavage sites in pBR322, achieved by EcoRI digestion after the original P1 attack, showed an intriguing phenomenon: preexisting susceptible sites became "protected," whereas some new sites appeared at high levels of H1. Similar results were obtained with another single-strand-specific nuclease, S1. Disappearance of cutting at preexisting sites and appearance of new sites was also observed in a derivative plasmid that contains a 36-bp stretch of alternating d(AT) sequence that is known to adopt an altered P1-sensitive conformation. On the other hand, H1 titration of a dimerized version of the d(AT)18-containing plasmid led to protection of all preexisting sites except the d(AT)18 inserts, which were still cut even at high H1 levels; in this plasmid no new sites appeared. The protection of preexisting sites is best explained by long-range effects of histone H1 binding on the superhelical torsion of the plasmid. The appearance of new sites, on the other hand, probably also involves a local effect of stabilization of specific sequences in Pl-sensitive conformation, due to direct H1 binding to such sequences. That such binding involves linker histone N- and/or C-terminal tails is indicated by the fact that titration with the globular domain of H5, while causing disappearance of preexisting sites, does not lead to the appearance of any new sites.
Resumo:
Previously, we reported that a 61-bp subgenomic HBV DNA sequence (designated as 15AB, nt 1855-1915) is a hot spot for genomic recombination and that a cellular protein binding to 15AB may be the putative recombinogenic protein. In the present study, we established the existence of a 15AB-like sequence in human and rat chromosomal DNA by Southern blot analysis. The 15AB-like sequence isolated from the rat chromosome demonstrated a 80.9% identity with 5'-CCAAGCTGTGCCTTGGGTGGC-3', at 1872-1892 of the hepatitis B virus genome, thought to be the essential region for recombination. Interestingly, this 15AB-like sequence also contained the pentanucleotide motifs GCTGG and CCAGC as an inverted repeat, part of the chi known hot spot for recombination in Escherichia coli. Importantly, a portion of the 15AB-like sequence is homologous (82.1%, 23/28 bp) to break point clusters of the human promyelocytic leukemia (PML) gene, characterized by a translocation [t(15;17)], and to rearranged mouse DNA for the immunoglobulin kappa light chain. Moreover, 15AB and 15AB-like sequences have striking homologies (12/15 = 80.0% and 13/15 = 86.7%, respectively) to the consensus sequence for topoisomerase II. Our present results suggest that this 15AB-like sequence in the rat genome might be a recombinogenic candidate triggering genomic instability in carcinogenesis.
Resumo:
Formation of deletions by recombination between short direct repeats is thought to involve either a break-join or a copy-choice process. The key step of the latter is slippage of the replication machinery between the repeats. We report that the main replicase of Escherichia coli, DNA polymerase III holoenzyme, slips between two direct repeats of 27 bp that flank an inverted repeat of approximately equal 300bp. Slippage was detected in vitro, on a single-stranded DNA template, in a primer extension assay. It requires the presence of a short (8 bp) G+C-rich sequence at the base of a hairpin that can form by annealing of the inverted repeats. It is stimulated by (i) high salt concentration, which might stabilize the hairpin, and (ii) two proteins that ensure the processivity of the DNA polymerase III holoenzyme: the single-stranded DNA binding protein and the beta subunit of the polymerase. Slippage is rather efficient under optimal reaction conditions because it can take place on >50% of template molecules. This observation supports the copy-choice model for recombination between short direct repeats.
Resumo:
Phenomena that can be observed for a large number of molecules may not be understood if it is not possible to observe the events on the single-molecule level. We measured the fluorescence lifetimes of individual tetramethylrhodamine molecules, linked to an 18-mer deoxyribonucleotide sequence specific for M13 DNA, by time-resolved, single-photon counting in a confocal fluorescence microscope during Brownian motion in solution. When many molecules were observed, a biexponential fluorescence decay was observed with equal amplitudes. However, on the single-molecule level, the fraction of one of the amplitudes spanned from 0 to unity for a collection of single-molecule detections. Further analysis by fluorescence correlation spectroscopy made on many molecules revealed a process that obeys a stretched exponential relaxation law. These facts, combined with previous evidence of the quenching effect of guanosine on rhodamines, indicate that the tetramethylrhodamine molecule senses conformational transitions as it associates and dissociates to a guanosine-rich area. Thus, our results reveal conformational transitions in a single molecule in solution under conditions that are relevant for biological processes.