989 resultados para Simple Sequence Repeats


Relevância:

30.00% 30.00%

Publicador:

Resumo:

The key requirements for high-throughput single-nucleotide polymorphism (SNP) typing of DNA samples in large-scale disease case-control studies are automatability, simplicity, and robustness, coupled with minimal cost. In this paper we describe a fluorescence technique for the detection of SNPs that have been amplified by using the amplification refractory mutation system (ARMS)-PCR procedure. Its performance was evaluated using 32 sequence-specific primer mixes to assign the HLA-DRB alleles to 80 lymphoblastoid cell line DNAs chosen from our database for their diversity. All had been typed previously by alternative methods, either direct sequencing or gel electrophoresis. We believe the detection system that we call AMDI (alkaline-mediated differential interaction) satisfies the above criteria and is suitable for general high-throughput SNP typing.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

SF3b155 is an essential spliceosomal protein, highly conserved during evolution. It has been identified as a subunit of splicing factor SF3b, which, together with a second multimeric complex termed SF3a, interacts specifically with the 12S U2 snRNP and converts it into the active 17S form. The protein displays a characteristic intranuclear localization. It is diffusely distributed in the nucleoplasm but highly concentrated in defined intranuclear structures termed “speckles,” a subnuclear compartment enriched in small ribonucleoprotein particles and various splicing factors. The primary sequence of SF3b155 suggests a multidomain structure, different from those of other nuclear speckles components. To identify which part of SF3b155 determines its specific intranuclear localization, we have constructed expression vectors encoding a series of epitope-tagged SF3b155 deletion mutants as well as chimeric combinations of SF3b155 sequences with the soluble cytoplasmic protein pyruvate kinase. Following transfection of cultured mammalian cells, we have identified (i) a functional nuclear localization signal of the monopartite type (KRKRR, amino acids 196–200) and (ii) a molecular segment with multiple threonine-proline repeats (amino acids 208–513), which is essential and sufficient to confer a specific accumulation in nuclear speckles. This latter sequence element, in particular amino acids 208–440, is required for correct subcellular localization of SF3b155 and is also sufficient to target a reporter protein to nuclear speckles. Moreover, this “speckle-targeting sequence” transfers the capacity for interaction with other U2 snRNP components.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Previously conducted sequence analysis of Arabidopsis thaliana (ecotype Columbia-0) reported an insertion of 270-kb mtDNA into the pericentric region on the short arm of chromosome 2. DNA fiber-based fluorescence in situ hybridization analyses reveal that the mtDNA insert is 618 ± 42 kb, ≈2.3 times greater than that determined by contig assembly and sequencing analysis. Portions of the mitochondrial genome previously believed to be absent were identified within the insert. Sections of the mtDNA are repeated throughout the insert. The cytological data illustrate that DNA contig assembly by using bacterial artificial chromosomes tends to produce a minimal clone path by skipping over duplicated regions, thereby resulting in sequencing errors. We demonstrate that fiber-fluorescence in situ hybridization is a powerful technique to analyze large repetitive regions in the higher eukaryotic genomes and is a valuable complement to ongoing large genome sequencing projects.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We present an approach for assessing the significance of sequence and structure comparisons by using nearly identical statistical formalisms for both sequence and structure. Doing so involves an all-vs.-all comparison of protein domains [taken here from the Structural Classification of Proteins (scop) database] and then fitting a simple distribution function to the observed scores. By using this distribution, we can attach a statistical significance to each comparison score in the form of a P value, the probability that a better score would occur by chance. As expected, we find that the scores for sequence matching follow an extreme-value distribution. The agreement, moreover, between the P values that we derive from this distribution and those reported by standard programs (e.g., blast and fasta validates our approach. Structure comparison scores also follow an extreme-value distribution when the statistics are expressed in terms of a structural alignment score (essentially the sum of reciprocated distances between aligned atoms minus gap penalties). We find that the traditional metric of structural similarity, the rms deviation in atom positions after fitting aligned atoms, follows a different distribution of scores and does not perform as well as the structural alignment score. Comparison of the sequence and structure statistics for pairs of proteins known to be related distantly shows that structural comparison is able to detect approximately twice as many distant relationships as sequence comparison at the same error rate. The comparison also indicates that there are very few pairs with significant similarity in terms of sequence but not structure whereas many pairs have significant similarity in terms of structure but not sequence.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This computer simulation is based on a model of the origin of life proposed by H. Kuhn and J. Waser, where the evolution of short molecular strands is assumed to take place in a distinct spatiotemporal structured environment. In their model, the prebiotic situation is strongly simplified to grasp essential features of the evolution of the genetic apparatus without attempts to trace the historic path. With the tool of computer implementation confining to principle aspects and focused on critical features of the model, a deeper understanding of the model's premises is achieved. Each generation consists of three steps: (i) construction of devices (entities exposed to selection) presently available; (ii) selection; and (iii) multiplication of the isolated strands (R oligomers) by complementary copying with occasional variation by copying mismatch. In the beginning, the devices are single strands with random sequences; later, increasingly complex aggregates of strands form devices such as a hairpin-assembler device which develop in favorable cases. A monomers interlink by binding to the hairpin-assembler device, and a translation machinery, called the hairpin-assembler-enzyme device, emerges, which translates the sequence of R1 and R2 monomers in the assembler strand to the sequence of A1 and A2 monomers in the A oligomer, working as an enzyme.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The whole genome sequence (1.83 Mbp) of Haemophilus influenzae strain Rd was searched to identify tandem oligonucleotide repeat sequences. Loss or gain of one or more nucleotide repeats through a recombination-independent slippage mechanism is known to mediate phase variation of surface molecules of pathogenic bacteria, including H. influenzae. This facilitates evasion of host defenses and adaptation to the varying microenvironments of the host. We reasoned that iterative nucleotides could identify novel genes relevant to microbe-host interactions. Our search of the Rd genome sequence identified 9 novel loci with multiple (range 6-36, mean 22) tandem tetranucleotide repeats. All were found to be located within putative open reading frames and included homologues of hemoglobin-binding proteins of Neisseria, a glycosyltransferase (IgtC gene product) of Neisseria, and an adhesin of Yersinia. These tetranucleotide repeat sequences were also shown to be present in two other epidemiologically different H. influenzae type b strains, although the number and distribution of repeats was different. Further characterization of the IgtC gene showed that it was involved in phenotypic switching of a lipopolysaccharide epitope and that this variable expression was associated with changes in the number of tetranucleotide repeats. Mutation of IgtC resulted in attenuated virulence of H. influenzae in an infant rat model of invasive infection. These data indicate the rapidity, economy, and completeness with which whole genome sequences can be used to investigate the biology of pathogenic bacteria.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Aminoacyl-tRNA synthetases (tRNA synthetases) of higher eukaryotes form a multiprotein complex. Sequence elements that are responsible for the protein assembly were searched by using a yeast two-hybrid system. Human cytoplasmic isoleucyl-tRNA synthetase is a component of the multi-tRNA synthetase complex and it contains a unique C-terminal appendix. This part of the protein was used as bait to identify an interacting protein from a HeLa cDNA library. The selected sequence represented the internal 317 amino acids of human bifunctional (glutamyl- and prolyl-) tRNA synthetase, which is also known to be a component of the complex. Both the C-terminal appendix of the isoleucyl-tRNA synthetase and the internal region of bifunctional tRNA synthetase comprise repeating sequence units, two repeats of about 90 amino acids, and three repeats of 57 amino acids, respectively. Each repeated motif of the two proteins was responsible for the interaction, but the stronger interaction was shown by the native structures containing multiple motifs. Interestingly, the N-terminal extension of human glycyl-tRNA synthetase containing a single motif homologous to those in the bifunctional tRNA synthetase also interacted with the C-terminal motif of the isoleucyl-tRNA synthetase although the enzyme is not a component of the complex. The data indicate that the multiplicity of the binding motif in the tRNA synthetases is necessary for enhancing the interaction strength and may be one of the determining factors for the tRNA synthetases to be involved in the formation of the multi-tRNA synthetase complex.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Current evidence on the long-term evolutionary effect of insertion of sequence elements into gene regions is reviewed, restricted to cases where a sequence derived from a past insertion participates in the regulation of expression of a useful gene. Ten such examples in eukaryotes demonstrate that segments of repetitive DNA or mobile elements have been inserted in the past in gene regions, have been preserved, sometimes modified by selection, and now affect control of transcription of the adjacent gene. Included are only examples in which transcription control was modified by the insert. Several cases in which merely transcription initiation occurred in the insert were set aside. Two of the examples involved the long terminal repeats of mammalian endogenous retroviruses. Another two examples were control of transcription by repeated sequence inserts in sea urchin genomes. There are now six published examples in which Alu sequences were inserted long ago into human gene regions, were modified, and now are central in control/enhancement of transcription. The number of published examples of Alu sequences affecting gene control has grown threefold in the last year and is likely to continue growing. Taken together, all of these examples show that the insertion of sequence elements in the genome has been a significant source of regulatory variation in evolution.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Telomeres are specialized structures located at the ends of linear eukaryotic chromosomes that ensure their complete replication and protect them from fusion and degradation. We report here the characterization of the telomeres of the nematode Caenorhabditis elegans. We show that the chromosomes terminate in 4-9 kb of tandem repeats of the sequence TTAGGC. Furthermore, we have isolated clones corresponding to 11 of the 12 C. elegans telomeres. Their subtelomeric sequences are all different from each other, demonstrating that the terminal TTAGGC repeats are sufficient for general chromosomal capping functions. Finally, we demonstrate that the me8 meiotic mutant, which is defective in X chromosome crossing over and segregation, bears a terminal deficiency, that was healed by the addition of telomeric repeats, presumably by the activity of a telomerase enzyme. The 11 cloned telomeres represent an important advance for the completion of the physical map and for the determination of the entire sequence of the C. elegans genome.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Megalin (gp330), an epithelial endocytic receptor, is a major target antigen of Heymann nephritis (HN), an autoimmune disease in rats. To elucidate the mechanisms of HN, we have mapped a pathogenic epitope in megalin that binds anti-megalin antibodies. We focused our attention on four clusters of cysteine-rich, low density lipoprotein receptor (LDLR) ligand binding repeats in the extracellular domain of megalin because they represent putative ligand binding regions and therefore would be expected to be exposed in vivo and to be able to bind circulating antibodies. Rat megalin cDNA fragments I through IV encoding the first through fourth clusters of ligand-binding repeats, respectively, were expressed in a baculovirus system. All four expression products were detected by immunoblotting with two antisera capable of inducing passive HN (pHN). When antibodies eluted from glomeruli of rats with pHN were used for immunoblotting, only the expression product encoded by fragment II was detected. This indicates that the second cluster of LDLR ligand binding repeats is directly involved in binding anti-megalin antibodies and in the induction of pHN. To narrow the major epitope in this domain, fragment II was used to prepare proteins sequentially truncated from the C- and N-terminal ends by in vitro translation. Analysis of the truncated translation products by immunoprecipitation with anti-megalin IgG revealed that the fifth ligand-binding repeat (amino acids 1160-1205) contains the major epitope recognized. This suggests that a 46-amino acid sequence in the second cluster of LDLR ligand binding repeats contains a major pathogenic epitope that plays a key role in pHN. Identification of this epitope will facilitate studies on the pathogenesis of HN.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

A functional methyl-directed mismatch repair pathway in Escherichia coli prevents the formation of deletions between 101-bp tandem repeats with 4% sequence divergence. Deletions between perfectly homologous repeats are unaffected. Deletion in both cases occurs independently of the homologous recombination gene, recA. Because the methyl-directed mismatch repair pathway detects and excises one strand of a mispaired duplex, an intermediate for RecA-independent deletion of tandem repeats must therefore be a heteroduplex formed between strands of each repeat. We find that MutH endonuclease, which in vivo incises specifically the newly replicated strand of DNA, and the Dam methylase, the source of this strand-discrimination, are required absolutely for the exclusion of "homeologous" (imperfectly homologous) tandem deletion. This supports the idea that the heteroduplex intermediate for deletion occurs during or shortly after DNA replication in the context of hemi-methylation. Our findings confirm a "replication slippage" model for deletion formation whereby the displacement and misalignment of the nascent strand relative to the repeated sequence in the template strand accomplishes the deletion.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Several human neurological disorders are associated with proteins containing abnormally long runs of glutamine residues. Strikingly, most of these proteins contain two or more additional long runs of amino acids other than glutamine. We screened the current human, mouse, Drosophila, yeast, and Escherichia coli protein sequence data bases and identified all proteins containing multiple long homopeptides. This search found multiple long homopeptides in about 12% of Drosophila proteins but in only about 1.7% of human, mouse, and yeast proteins and none among E. coli proteins. Most of these sequences show other unusual sequence features, including multiple charge clusters and excessive counts of homopeptides of length > or = two amino acid residues. Intriguingly, a large majority of the identified Drosophila proteins are essential developmental proteins and, in particular, most play a role in central nervous system development. Almost half of the human and mouse proteins identified are homeotic homologs. The role of long homopeptides in fine-tuning protein conformation for multiple functional activities is discussed. The relative contributions of strand slippage and of dynamic mutation are also addressed. Several new experiments are proposed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Integration of viral DNA into the host nuclear genome, although not unusual in bacterial and animal systems, has surprisingly not been reported for plants. We have discovered geminvirus-related DNA (GRD) sequences, in the form of distinct sets of multiple direct repeats comprising three related repeat classes, situated in a unique locus in the Nicotiana tabacum (tobacco) nuclear genome. The organization of these sequences is similar or identical in eight different tobacco cultivars we have examined. DNA sequence analysis reveals that each repeat has sequences most resembling those of the New World geminiviral DNA replication origin plus the adjacent AL1 gene, encoding the viral replication protein. We believe these GRD sequences originated quite recently in Nicotiana evolution through integration of geminiviral DNA by some combination of the processes of illegitimate recombination, amplification, deletions, and rearrangements. These events must have occurred in plant tissue that was subsequently able to contribute to meristematic tissue yielding gametes. GRD may have been retained in tobacco by selection or by random fixation in a small evolving population. Although we cannot detect transcription of these sequences, this does not exclude the possibility that they may originally have been expressed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Inverted repeats of DNA are widespread in the genomes of eukaryotes and prokaryotes and can mediate genome rearrangement. We studied rearrangement mediated by plasmid-borne inverted repeats in Escherichia coli. We show that inverted repeats can mediate an efficient and recA-independent recombination event. Surprisingly, the product of this recombination is not that of simple inversion between the inverted repeats, but almost exclusively an unusual head-to-head dimer with complex DNA rearrangement. Moreover, this recombination is dramatically reduced by increasing the distance separating the repeats. These results can be readily explained by a model involving reciprocal switching of the leading and lagging strands of DNA replication within the inverted repeats, which leads to the formation of a Holliday junction. Reciprocal strand switching during DNA replication might be a common mechanism for genome rearrangement associated with inverted duplication.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Long CTG triplet repeats which are associated with several human hereditary neuromuscular disease genes are stabilized in ColE1-derived plasmids in Escherichia coli containing mutations in the methyl-directed mismatch repair genes (mutS, mutL, or mutH). When plasmids containing (CTG)180 were grown for about 100 generations in mutS, mutL, or mutH strains, 60-85% of the plasmids contained a full-length repeat, whereas in the parent strain only about 20% of the plasmids contained the full-length repeat. The deletions occur only in the (CTG)180 insert, not in DNA flanking the repeat. While many products of the deletions are heterogeneous in length, preferential deletion products of about 140, 100, 60, and 20 repeats were observed. We propose that the E. coli mismatch repair proteins recognize three-base loops formed during replication and then generate long single-stranded gaps where stable hairpin structures may form which can be bypassed by DNA polymerase during the resynthesis of duplex DNA. Similar studies were conducted with plasmids containing CGG repeats; no stabilization of these triplets was found in the mismatch repair mutants. Since prokaryotic and human mismatch repair proteins are similar, and since several carcinoma cell lines which are defective in mismatch repair show instability of simple DNA microsatellites, these mechanistic investigations in a bacterial cell may provide insights into the molecular basis for some human genetic diseases.