74 results for tandem repeat
in National Center for Biotechnology Information - NCBI
Abstract:
To test a different approach to understanding the relationship between the sequence of part of a protein and its conformation in the overall folded structure, the amino acid sequence corresponding to an α-helix of T4 lysozyme was duplicated in tandem. The presence of such a sequence repeat provides the protein with “choices” during folding. The mutant protein folds with almost wild-type stability, is active, and crystallizes in two different space groups, one isomorphous with wild type and the other with two molecules in the asymmetric unit. The fold of the mutant is essentially the same in all cases, showing that the inserted segment has a well-defined structure. More than half of the inserted residues are themselves helical and extend the helix present in the wild-type protein. Participation of additional duplicated residues in this helix would have required major disruption of the parent structure. The results clearly show that the residues within the duplicated sequence tend to maintain a helical conformation even though the packing interactions with the remainder of the protein are different from those of the original helix. This supports the hypothesis that the structures of individual α-helices are determined predominantly by the nature of the amino acids within the helix, rather than the structural environment provided by the rest of the protein.
Abstract:
A class of tandemly repeated DNA sequences (TR-1) of 350-bp unit length was isolated from the knob DNA of chromosome 9 of Zea mays L. Comparative fluorescence in situ hybridization revealed that TR-1 elements are also present in cytologically detectable knobs on other maize chromosomes in different proportions relative to the previously described 180-bp repeats. At least one knob on chromosome 4 is composed predominantly of the TR-1 repeat. In addition, several small clusters of the TR-1 and 180-bp repeats have been found in different chromosomes, some not located in obvious knob heterochromatin. Variation in restriction fragment fingerprints and copy number of the TR-1 elements was found among maize lines and among maize chromosomes. TR-1 tandem arrays up to 70 kilobases in length can be interspersed with stretches of 180-bp tandem repeat arrays. DNA sequence analysis and restriction mapping of one particular stretch of tandemly arranged TR-1 units indicate that these elements may be organized in the form of fold-back DNA segments. The TR-1 repeat shares two short segments of homology with the 180-bp repeat. The longest of these segments (31 bp; 64% identity) corresponds to the conserved region among 180-bp repeats. The polymorphism and complex structure of knob DNA suggest that, similar to the fold-back DNA-containing giant transposons in Drosophila, maize knob DNA may have some properties of transposable elements.
Abstract:
The National Institute of Standards and Technology (NIST) has compiled and maintained a Short Tandem Repeat DNA Internet Database (http://www.cstl.nist.gov/biotech/strbase/), commonly referred to as STRBase, since 1997. This database is an information resource for the forensic DNA typing community with details on commonly used short tandem repeat (STR) DNA markers. STRBase consolidates and organizes the abundant literature on this subject to facilitate ongoing efforts in DNA typing. Observed alleles and annotated sequence for each STR locus are described along with a review of STR analysis technologies. Additionally, commercially available STR multiplex kits are described, published polymerase chain reaction (PCR) primer sequences are reported, and validation studies conducted by a number of forensic laboratories are listed. To supplement the technical information, addresses for scientists and hyperlinks to organizations working in this area are available, along with a comprehensive reference list of over 1300 publications on STRs used for DNA typing purposes.
Abstract:
Cross-contamination between cell lines is a longstanding and frequent cause of scientific misrepresentation. Estimates from national testing services indicate that up to 36% of cell lines are of a different origin or species to that claimed. To test a standard method of cell line authentication, 253 human cell lines from banks and research institutes worldwide were analyzed by short tandem repeat profiling. The short tandem repeat profile is a simple numerical code that is reproducible between laboratories, is inexpensive, and can provide an international reference standard for every cell line. If DNA profiling of cell lines is accepted and demanded internationally, scientific misrepresentation because of cross-contamination can be largely eliminated.
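The idea of an STR profile as a "simple numerical code" can be illustrated with a minimal Python sketch. It assumes a profile is a mapping from locus name to a pair of repeat counts, and it scores similarity as the fraction of shared alleles over the loci both profiles report. The scoring rule, the function name, and the allele values are illustrative assumptions; the abstract does not specify how profiles were compared.

```python
def shared_allele_fraction(profile_a, profile_b):
    """Fraction of alleles shared by two STR profiles over their common loci.

    Each profile maps a locus name to a tuple of repeat counts.
    This Dice-style score is an assumed metric, not one taken from the abstract.
    """
    shared = 0
    total = 0
    for locus in profile_a.keys() & profile_b.keys():
        alleles_a = set(profile_a[locus])
        alleles_b = set(profile_b[locus])
        # Each shared allele is counted once for each profile.
        shared += 2 * len(alleles_a & alleles_b)
        total += len(alleles_a) + len(alleles_b)
    return shared / total if total else 0.0


# Hypothetical profiles: repeat counts are made up for illustration.
reference = {"D5S818": (11, 12), "TH01": (7, 9.3), "TPOX": (8, 8)}
query_same = {"D5S818": (11, 12), "TH01": (7, 9.3), "TPOX": (8, 8)}
query_other = {"D5S818": (10, 13), "TH01": (6, 9.3), "TPOX": (8, 11)}

same_score = shared_allele_fraction(reference, query_same)
other_score = shared_allele_fraction(reference, query_other)
```

Under this assumed rule, an identical profile scores 1.0, while an unrelated or cross-contaminated line would be expected to score much lower; any threshold for declaring a match would need to come from a validated authentication standard.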
Abstract:
Group B streptococci (GBS) are the most common cause of neonatal sepsis, pneumonia, and meningitis. The alpha C protein is a surface-associated antigen; the gene (bca) for this protein contains a series of tandem repeats (each encoding 82 aa) that are identical at the nucleotide level and express a protective epitope. We previously reported that GBS isolates from two of 14 human maternal and neonatal pairs differed in the number of repeats contained in their alpha C protein; in both pairs, the alpha C protein of the neonatal isolate was smaller in molecular size. We now demonstrate by PCR that the neonatal isolates contain fewer tandem repeats. Maternal isolates were susceptible to opsonophagocytic killing in the presence of alpha C protein-specific antiserum, whereas the discrepant neonatal isolates proliferated. An animal model was developed to further study this phenomenon. Adult mice passively immunized with antiserum to the alpha C protein were challenged with an alpha C protein-expressing strain of GBS. Splenic isolates of GBS from these mice showed a high frequency of mutation in bca, most commonly a decrease in repeat number. Isolates from non-immune mice were not altered. Spontaneous deletions in the repeat region were observed at a much lower frequency (6 × 10⁻⁴); thus, deletions in that region are selected for under specific antibody pressure and appear to lower the organism's susceptibility to killing by antibody specific to the alpha C protein. This mechanism of antigenic variation may provide a means whereby GBS evade host immunity.
Abstract:
The mouse insulin-like growth factor 2 (Igf2) locus is a complex genomic region that produces multiple transcripts from alternative promoters. Expression at this locus is regulated by parental imprinting. However, despite the existence of putative imprinting control elements in the Igf2 upstream region, imprinted transcriptional repression is abolished by null mutations at the linked H19 locus. To clarify the extent to which the Igf2 upstream region contains autonomous imprinting control elements, we have performed functional and comparative analyses of the region in the mouse and human. Here we report the existence of multiple, overlapping imprinted (maternally repressed) sense and antisense transcripts that are associated with a tandem repeat in the mouse Igf2 upstream region. Regions flanking the repeat exhibit tissue-specific parental allelic methylation patterns, suggesting the existence of tissue-specific control elements in the upstream region. Studies in H19 null mice indicate that both parental allelic methylation and monoallelic expression of the upstream transcripts depend on an intact H19 gene acting in cis. The homologous region in human IGF2 is structurally conserved, with the significant exception that it does not contain a tandem repeat. Our results support the proposal that tandem repeats act to target methylation to imprinted genetic loci.
Abstract:
The human genome contains many repeated DNA sequences that vary in complexity of repeating unit from a single nucleotide to a whole gene. The repeat sequences can be widely dispersed or in simple tandem arrays. Arrays of up to 5 or 6 nt are known as simple tandem repeats, and these are widely dispersed and highly polymorphic. Members of one group of the simple tandem repeats, the trinucleotide repeats, can undergo an increase in copy number by a process of dynamic mutation. Dynamic mutations of the CCG trinucleotide give rise to one group of fragile sites on human chromosomes, the rare folate-sensitive group. One member of this group, the fragile X (FRAXA), is responsible for the most common familial form of mental retardation. Another member of the group, FRAXE, is responsible for a rarer mild form of mental retardation. Similar mutations of AGC repeats give rise to a number of neurological disorders. The expanded repeats are unstable between generations and somatically. The intergenerational instability gives rise to unusual patterns of inheritance, particularly anticipation, the increasing severity and/or earlier age of onset of the disorder in successive generations. Dynamic mutations have been found only in the human species, and possible reasons for this are considered. The mechanism of dynamic mutation is discussed, and a number of observations of simple tandem repeat mutation that could assist in understanding this phenomenon are commented on.
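As a rough illustration of what "simple tandem repeat" and "copy number" mean at the sequence level, the following Python sketch scans a DNA string for units of 1 to 6 nt repeated back to back. The three-copy minimum, the function name, and the example sequence (including its flanking bases) are assumptions made for illustration; the abstract does not describe any detection method.

```python
import re

# A unit of 1-6 nt (shortest unit preferred) followed by at least two
# more identical copies, i.e. three or more copies in total.
# The three-copy minimum is an arbitrary choice for this sketch.
STR_PATTERN = re.compile(r"([ACGT]{1,6}?)\1{2,}")


def find_simple_tandem_repeats(seq):
    """Return (start, unit, copies) for each simple tandem repeat found."""
    hits = []
    for match in STR_PATTERN.finditer(seq):
        unit = match.group(1)
        copies = len(match.group(0)) // len(unit)
        hits.append((match.start(), unit, copies))
    return hits


# Hypothetical sequence: five CCG copies with made-up flanking bases.
example_seq = "TTGA" + "CCG" * 5 + "ATGC"
example_hits = find_simple_tandem_repeats(example_seq)
```

In this toy picture, a dynamic mutation corresponds to the `copies` value for a trinucleotide unit such as CCG increasing between generations. Note that the reported unit depends on the reading frame in which the scan first meets the array, so CCG, CGC, and GCC can all describe the same repeat.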
Abstract:
Although integration of viral DNA into host chromosomes occurs regularly in bacteria and animals, there are few reported cases in plants, and these involve insertion at only one or a few sites. Here, we report that pararetrovirus-like sequences have integrated repeatedly into tobacco chromosomes, attaining a copy number of ≈10³. Insertion apparently occurred by illegitimate recombination. From the sequences of 22 independent insertions recovered from a healthy plant, an 8-kilobase genome encoding a previously uncharacterized pararetrovirus that does not contain an integrase function could be assembled. Preferred boundaries of the viral inserts may correspond to recombinogenic gaps in open circular viral DNA. An unusual feature of the integrated viral sequences is a variable tandem repeat cluster, which might reflect defective genomes that preferentially recombine into plant DNA. The recurrent invasion of pararetroviral DNA into tobacco chromosomes demonstrates that viral sequences can contribute significantly to plant genome evolution.
Abstract:
The syndecans are transmembrane proteoglycans that place structurally heterogeneous heparan sulfate chains at the cell surface and a highly conserved polypeptide in the cytoplasm. Their versatile heparan sulfate moieties support various processes of molecular recognition, signaling, and trafficking. Here we report the identification of a protein that binds to the cytoplasmic domains of the syndecans in yeast two-hybrid screens, surface plasmon resonance experiments, and ligand-overlay assays. This protein, syntenin, contains a tandem repeat of PDZ domains that reacts with the FYA C-terminal amino acid sequence of the syndecans. Recombinant enhanced green fluorescent protein (eGFP)–syntenin fusion proteins decorate the plasma membrane and intracellular vesicles, where they colocalize and cosegregate with syndecans. Cells that overexpress eGFP–syntenin show numerous cell surface extensions, suggesting effects of syntenin on cytoskeleton–membrane organization. We propose that syntenin may function as an adaptor that couples syndecans to cytoskeletal proteins or cytosolic downstream signal-effectors.
Somatic mosaicism in Wiskott–Aldrich syndrome suggests in vivo reversion by a DNA slippage mechanism
Abstract:
Somatic mosaicism caused by in vivo reversion of inherited mutations has been described in several human genetic disorders. Back mutations resulting in restoration of wild-type sequences and second-site mutations leading to compensatory changes have been shown in mosaic individuals. In most cases, however, the precise genetic mechanisms underlying the reversion events have remained unclear, except for the few instances where crossing over or gene conversion have been demonstrated. Here, we report a patient affected with Wiskott–Aldrich syndrome (WAS) caused by a 6-bp insertion (ACGAGG) in the WAS protein gene, which abrogates protein expression. Somatic mosaicism was documented in this patient whose majority of T lymphocytes expressed nearly normal levels of WAS protein. These lymphocytes were found to lack the deleterious mutation and showed a selective growth advantage in vivo. Analysis of the sequence surrounding the mutation site showed that the 6-bp insertion followed a tandem repeat of the same six nucleotides. These findings strongly suggest that DNA polymerase slippage was the cause of the original germ-line insertion mutation in this family and that the same mechanism was responsible for its deletion in one of the propositus T cell progenitors, thus leading to reversion mosaicism.
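The slippage model proposed above can be pictured as gaining or losing one copy of a repeat unit. The Python sketch below counts back-to-back copies of the ACGAGG motif and applies a one-copy insertion followed by a one-copy deletion. The flanking bases, the assumption that the wild-type allele carries exactly two copies, and the function names are hypothetical; only the 6-bp motif itself comes from the abstract.

```python
MOTIF = "ACGAGG"  # the 6-bp unit reported in the abstract


def count_tandem_copies(seq, motif):
    """Return the largest number of back-to-back copies of motif in seq."""
    best = 0
    start = seq.find(motif)
    while start != -1:
        copies = 0
        pos = start
        while seq.startswith(motif, pos):
            copies += 1
            pos += len(motif)
        best = max(best, copies)
        start = seq.find(motif, start + 1)
    return best


def slip(seq, motif, delta):
    """Add (delta=+1) or remove (delta=-1) one motif copy at its first occurrence."""
    index = seq.find(motif)
    if index == -1:
        raise ValueError("motif not present in sequence")
    if delta == 1:
        return seq[:index] + motif + seq[index:]
    if delta == -1:
        return seq[:index] + seq[index + len(motif):]
    raise ValueError("delta must be +1 or -1")


# Hypothetical alleles: flanks and the two-copy wild type are assumptions.
wild_type = "GGTCA" + MOTIF * 2 + "TTCAG"
mutant = slip(wild_type, MOTIF, +1)    # germ-line insertion of one copy
revertant = slip(mutant, MOTIF, -1)    # somatic loss of one copy
```

In this simplified picture the revertant is identical to the wild-type sequence, which is consistent with the abstract's observation that the revertant lymphocytes lacked the deleterious mutation.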
Abstract:
Microsatellites are tandem repeat sequences abundant in the genomes of higher eukaryotes and hitherto considered as "junk DNA." Analysis of a human genome representative data base (2.84 Mb) reveals a distinct juxtaposition of A-rich microsatellites and retroposons and suggests their coevolution. The analysis implies that most microsatellites were generated by a 3'-extension of retrotranscripts, similar to mRNA polyadenylylation, and that they serve in turn as "retroposition navigators," directing the retroposons via homology-driven integration into defined sites. Thus, they became instrumental in the preservation and extension of primordial genomic patterns. A role is assigned to these reiterating A-rich loci in the higher-order organization of the chromatin. The disease-associated triplet repeats are mostly found in coding regions and do not show an association with retroposons, constituting a unique set within the family of microsatellite sequences.
Abstract:
We have generated a physical map of human chromosome bands 20q11.2-20q13.1, a region containing a gene involved in the development of one form of early-onset, non-insulin-dependent diabetes mellitus, MODY1, as well as a putative myeloid tumor suppressor gene. The yeast artificial chromosome contig consists of 71 clones onto which 71 markers, including 20 genes, 5 expressed sequence tags, 32 simple tandem repeat DNA polymorphisms, and 14 sequence-tagged sites have been ordered. This region spans about 18 Mb, which represents about 40% of the physical length of 20q. Using this physical map, we have refined the location of MODY1 to a 13-centimorgan interval (approximately equal to 7 Mb) between D20S169 and D20S176. The myeloid tumor suppressor gene was localized to an 18-centimorgan interval (approximately equal to 13 Mb) between RPN2 and D20S17. This physical map will facilitate the isolation of MODY1 and the myeloid tumor suppressor gene.
Abstract:
Aberrant glycosylation of the mucin molecule (encoded by the gene MUC-1) on human epithelial cell tumors leads to the exposure of tumor-associated epitopes recognized by patients' antibodies and cytotoxic T cells. Consequently, these epitopes could be considered targets for immunotherapy. We designed a cellular vaccine, employing, instead of tumor cells, autologous Epstein-Barr virus (EBV)-immortalized B cells as carriers of tumor-associated mucin, to take advantage of their costimulatory molecules for T-cell activation. The vaccine was tested in chimpanzees because of the identity of the human and chimpanzee MUC-1 tandem repeat sequence. EBV-immortalized B cells derived from two chimpanzees were transfected with MUC-1 cDNA, treated with the glycosylation inhibitor phenyl-N-acetyl-alpha-D-galactosaminide to expose tumor-associated epitopes, irradiated, and injected subcutaneously four times at 3-week intervals. One vaccine preparation also contained cells transduced with the interleukin 2 (IL-2) cDNA and producing low levels of IL-2. Already after the first injection, we found a measurable frequency of cytotoxic T-cell precursors specific for underglycosylated mucin in the peripheral blood. The highest frequency observed was after the last boost, in the lymph node draining the vaccination site. Delayed-type hypersensitivity reaction to the injected immunogens was also induced, whereas no appearance of mucin-specific antibodies was seen. Long-term observation of the animals yielded no signs of adverse effects of this immunization. Autologous antigen-presenting cells, like EBV-immortalized B cells, expressing tumor-associated antigens are potentially useful immunogens for induction of cellular anti-tumor responses in vivo.
Abstract:
Li and Chakravarti [Li, C.C. & Chakravarti, A. (1994) Hum. Hered. 44, 100-109] compared the probability (MO) of a random match between the two DNA profiles of a pair of individuals drawn from a random-mating population to the probability (MF) of the match between a pair of random individuals drawn from a subdivided population. The level of heterogeneity in this subdivided population is measured by the parameter F, where there is no subdivision when F = 0 and increasing values of F indicate increasing subdivision. Li and Chakravarti concluded that it is conservative to use the match probability MO, which is derived under the assumption that the two individuals are drawn from a homogeneous random-mating population without subdivision. However, MO may not always be greater than MF, even for biologically reasonable values of F. We explore here those mathematical conditions under which MO is less than MF, and we find that MO is not conservative mainly when there is an allele with a much higher frequency than all the other alleles. When empirical data for both variable number of tandem repeat (VNTR) and short tandem repeat (STR) systems are evaluated, we find that in the majority of cases MO represents a conservative probability of a match, and so the subdivision of human populations may usually be ignored for a random match, although not, of course, for relatives. Loci for which MO is not conservative should be avoided for forensic inference.
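The comparison between MO and MF can be explored numerically with a small Python sketch. It assumes the textbook Wright model for genotype frequencies in a subdivided population (homozygotes p² + F·p·(1 − p), heterozygotes 2·p·q·(1 − F)) and takes the single-locus match probability as the sum of squared genotype frequencies. The abstract does not state the exact formulas used, so the model, the function name, and the allele frequencies are assumptions.

```python
from itertools import combinations


def match_probability(freqs, f=0.0):
    """Single-locus probability that two random individuals share a genotype.

    Genotype frequencies follow the assumed Wright model with parameter f;
    f = 0 gives the random-mating value (MO), f > 0 gives MF.
    """
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygotes = sum((p * p + f * p * (1 - p)) ** 2 for p in freqs)
    heterozygotes = sum(
        (2 * p * q * (1 - f)) ** 2 for p, q in combinations(freqs, 2)
    )
    return homozygotes + heterozygotes


# Hypothetical loci: ten equally common alleles versus one dominant allele.
uniform = [0.1] * 10
skewed = [0.9, 0.1]

mo_uniform = match_probability(uniform)           # random mating
mf_uniform = match_probability(uniform, f=0.05)   # subdivided
mo_skewed = match_probability(skewed)
mf_skewed = match_probability(skewed, f=0.05)
```

Under these assumptions the evenly distributed locus gives MO greater than MF (conservative), whereas the locus dominated by one allele gives MO less than MF, which matches the qualitative condition described in the abstract.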
Abstract:
By using an expression cloning strategy, we isolated a single positive clone encoding a tilapia prolactin (PRL) receptor. Tilapia PRL188 was used to screen a freshwater tilapia kidney expression library transfected in COS cells. The tilapia PRL receptor is a mature protein of 606 amino acids. The extracellular domain is devoid of the tandem repeat units present in birds and has two pairs of cysteine residues, a Trp-Ser-Xaa-Trp-Ser motif, and two potential N-glycosylation sites. The cytoplasmic domain contains 372 amino acids, including box 1, a sequence previously shown to be important for signal transduction in mammalian species. Thus, the general structure is similar to the long form of mammalian PRL receptors; however, amino acid comparisons reveal a rather low identity (approximately 37%). Northern blot analysis shows the existence of a single transcript in osmoregulatory tissues and reproductive organs. This localization is in agreement with known functions of PRL in teleosts.