981 resultados para Tandem Repeats
Resumo:
We sequenced across all of the gene boundaries in the mitochondrial genome of the cattle tick, Boophilus microplus, to determine the arrangement of its genes. The mtDNA of B. microplus has a coding region, composed of tRNA(Glu) and 60 bp of the 3' end of ND1, that is repeated five times. Boophilus microplus is the first coelomate animal known to have more than two copies of a coding sequence. The mitochondrial genome of B, microplus has other unusual features, including (1) reduced T arms in tRNAs, (2) an AT bias in codon use, (3) two control regions that have evolved in concert, (4) three gene rearrangements, and (5) a stem-loop between tRNA(Gln) and tRNA(Phe). The short T arms and small control regions (CRs) of B. microplus and other ticks suggest strong selection for small genomes. Imprecise termination of replication beyond its origin, which can account for the evolution of tandem repeats of coding regions in other mitochondrial genomes, cannot explain the evolution of the fivefold repeated sequence in the mitochondrial genome of B. microplus. Instead, slipped-strand mispairing or recombination are the most plausible explanations for the evolution of these tandem repeats.
Resumo:
Microtubule-associated protein 2 (MAP2) exists in both high- and low-molecular mass isoforms, each of which has a tubulin-binding domain consisting of 3 imperfect tandem repeats of 31 amino acids containing a more highly conserved 18 amino acid 'core' sequence. We describe here a novel form of low molecular mass MAP2 (MAP2c) that contains an additional 4th repeat of this tubulin-binding motif. Like the 3 previously known repeat sequences, this 4th copy is highly conserved between MAP2 and the two other known members of the same gene family, tau and MAP4. In each of these three genes the additional 4th repeat is inserted between the 1st and 2nd repeats of the 3-repeat form of the molecule. Experiments with brain cell cultures, in which the relative proportions of neurons and glia had been manipulated by drug treatment, showed that 4-repeat MAP2c is associated with glial cells whereas 3-repeat MAP2c is expressed in neurons. Whereas 3-repeat MAP2c is expressed early in development and then declines, the level of 4-repeat MAP2c increases later in development, corresponding to the relatively late differentiation of glial cells compared to neurons. When transfected into non-neuronal cells, the 4-repeat version of MAP2c behaved indistinguishably from the 3-repeat form in stabilising and rearranging cellular microtubules. The presence of an additional 4th repeat of the tubulin-binding motif in all three members of the MAP2 gene family suggests that this variant arose prior to their differentiation from an ancestral gene.
Resumo:
We performed spoligotyping and 12-mycobacterial interspersed repetitive unit-variable number tandem repeats (MIRU-VNTRs) typing to characterise Mycobacterium bovis isolates collected from tissue samples of bovines with lesions suggestive for tuberculosis during slaughter inspection procedures in abattoirs in Brazil. High-quality genotypes were obtained with both procedures for 61 isolates that were obtained from 185 bovine tissue samples and all of these isolates were identified as M. bovis by conventional identification procedures. On the basis of the spoligotyping, 53 isolates were grouped into nine clusters and the remaining eight isolates were unique types, resulting in 17 spoligotypes. The majority of the Brazilian M. bovis isolates displayed spoligotype patterns that have been previously observed in strains isolated from cattle in other countries. MIRU-VNTR typing produced 16 distinct genotypes, with 53 isolates forming eight of the groups, and individual isolates with unique VNTR profiles forming the remaining eight groups. The allelic diversity of each VNTR locus was calculated and only two of the 12-MIRU-VNTR loci presented scores with either a moderate (0.4, MIRU16) or high (0.6, MIRU26) discriminatory index (h). Both typing methods produced similar discriminatory indexes (spoligotyping h = 0.85; MIRU-VNTR h = 0.86) and the combination of the two methods increased the h value to 0.94, resulting in 29 distinct patterns. These results confirm that spoligotyping and VNTR analysis are valuable tools for studying the molecular epidemiology of M. bovis infections in Brazil.
Resumo:
P-selectin glycoprotein ligand-1 (PSGL-1) interacts with selectins to support leukocyte rolling along vascular wall. L- and P-selectin bind to N-terminal tyrosine sulfate residues and to core-2 O-glycans attached to Thr-57, whereas tyrosine sulfation is not required for E-selectin binding. PSGL-1 extracellular domain contains decameric repeats, which extend L- and P-selectin binding sites far above the plasma membrane. We hypothesized that decamers may play a role in regulating PSGL-1 interactions with selectins. Chinese hamster ovary cells expressing wild-type PSGL-1 or PSGL-1 molecules exhibiting deletion or substitution of decamers with the tandem repeats of platelet glycoprotein Ibalpha were compared in their ability to roll on selectins and to bind soluble L- or P-selectin. Deletion of decamers abrogated soluble L-selectin binding and cell rolling on L-selectin, whereas their substitution partially reversed these diminutions. P-selectin-dependent interactions with PSGL-1 were less affected by decamer deletion. Videomicroscopy analysis showed that decamers are required to stabilize L-selectin-dependent rolling. Importantly, adhesion assays performed on recombinant decamers demonstrated that they directly bind to E-selectin and promote slow rolling. Our results indicate that the role of decamers is to extend PSGL-1 N terminus far above the cell surface to support and stabilize leukocyte rolling on L- or P-selectin. In addition, they function as a cell adhesion receptor, which supports approximately 80% of E-selectin-dependent rolling.
Resumo:
Association studies have revealed expression quantitative trait loci (eQTLs) for a large number of genes. However, the causative variants that regulate gene expression levels are generally unknown. We hypothesized that copy-number variation of sequence repeats contribute to the expression variation of some genes. Our laboratory has previously identified that the rare expansion of a repeat c.-174CGGGGCGGGGCG in the promoter region of the CSTB gene causes a silencing of the gene, resulting in progressive myoclonus epilepsy. Here, we genotyped the repeat length and quantified CSTB expression by quantitative real-time polymerase chain reaction in 173 lymphoblastoid cell lines (LCLs) and fibroblast samples from the GenCord collection. The majority of alleles contain either two or three copies of this repeat. Independent analysis revealed that the c.-174CGGGGCGGGGCG repeat length is strongly associated with CSTB expression (P = 3.14 × 10(-11)) in LCLs only. Examination of both genotyped and imputed single-nucleotide polymorphisms (SNPs) within 2 Mb of CSTB revealed that the dodecamer repeat represents the strongest cis-eQTL for CSTB in LCLs. We conclude that the common two or three copy variation is likely the causative cis-eQTL for CSTB expression variation. More broadly, we propose that polymorphic tandem repeats may represent the causative variation of a fraction of cis-eQTLs in the genome.
Resumo:
We describe an improved multiple-locus variable-number tandem-repeat (VNTR) analysis (MLVA) scheme for genotyping Staphylococcus aureus. We compare its performance to those of multilocus sequence typing (MLST) and spa typing in a survey of 309 strains. This collection includes 87 epidemic methicillin-resistant S. aureus (MRSA) strains of the Harmony collection, 75 clinical strains representing the major MLST clonal complexes (CCs) (50 methicillin-sensitive S. aureus [MSSA] and 25 MRSA), 135 nasal carriage strains (133 MSSA and 2 MRSA), and 13 published S. aureus genome sequences. The results show excellent concordance between the techniques' results and demonstrate that the discriminatory power of MLVA is higher than those of both MLST and spa typing. Two hundred forty-two genotypes are discriminated with 14 VNTR loci (diversity index, 0.9965; 95% confidence interval, 0.9947 to 0.9984). Using a cutoff value of 45%, 21 clusters are observed, corresponding to the CCs previously defined by MLST. The variability of the different tandem repeats allows epidemiological studies, as well as follow-up of the evolution of CCs and the identification of potential ancestors. The 14 loci can conveniently be analyzed in two steps, based upon a first-line simplified assay comprising a subset of 10 loci (panel 1) and a second subset of 4 loci (panel 2) that provides higher resolution when needed. In conclusion, the MLVA scheme proposed here, in combination with available on-line genotyping databases (including http://mlva.u-psud.fr/), multiplexing, and automatic sizing, can provide a basis for almost-real-time large-scale population monitoring of S. aureus.
Resumo:
Amino acid tandem repeats, also called homopolymeric tracts, are extremely abundant in eukaryotic proteins. To gain insight into the genome-wide evolution of these regions in mammals, we analyzed the repeat content in a large data set of rat-mouse-human orthologs. Our results show that human proteins contain more amino acid repeats than rodent proteins and that trinucleotide repeats are also more abundant in human coding sequences. Using the human species as an outgroup, we were able to address differences in repeat loss and repeat gain in the rat and mouse lineages. In this data set, mouse proteins contain substantially more repeats than rat proteins, which can be at least partly attributed to a higher repeat loss in the rat lineage. The data are consistent with a role for trinucleotide slippage in the generation of novel amino acid repeats. We confirm the previously observed functional bias of proteins with repeats, with overrepresentation of transcription factors and DNA-binding proteins. We show that genes encoding amino acid repeats tend to have an unusually high GC content, and that differences in coding GC content among orthologs are directly related to the presence/absence of repeats. We propose that the different GC content isochore structure in rodents and humans may result in an increased amino acid repeat prevalence in the human lineage.
Resumo:
Background: Multi-drug resistance and severe/ complicated cases are the emerging phenotypes of vivax malaria, which may deteriorate current anti-malarial control measures. The emergence of these phenotypes could be associated with either of the two Plasmodium vivax lineages. The two lineages had been categorized as Old World and New World, based on geographical sub-division and genetic and phenotypical markers. This study revisited the lineage hypothesis of P. vivax by typing the distribution of lineages among global isolates and evaluated their genetic relatedness using a panel of new mini-satellite markers. Methods: 18S SSU rRNA S-type gene was amplified from 420 Plasmodium vivax field isolates collected from different geographical regions of India, Thailand and Colombia as well as four strains each of P. vivax originating from Nicaragua, Panama, Thailand (Pak Chang), and Vietnam (ONG). A mini-satellite marker panel was then developed to understand the population genetic parameters and tested on a sample subset of both lineages. Results: 18S SSU rRNA S-type gene typing revealed the distribution of both lineages (Old World and New World) in all geographical regions. However, distribution of Plasmodium vivax lineages was highly variable in every geographical region. The lack of geographical sub-division between lineages suggests that both lineages are globally distributed. Ten mini-satellites were scanned from the P. vivax genome sequence; these tandem repeats were located in eight of the chromosomes. Mini-satellites revealed substantial allelic diversity (7-21, AE = 14.6 +/- 2.0) and heterozygosity (He = 0.697-0.924, AE = 0.857 +/- 0.033) per locus. Mini-satellite comparison between the two lineages revealed high but similar pattern of genetic diversity, allele frequency, and high degree of allele sharing. A Neighbour-Joining phylogenetic tree derived from genetic distance data obtained from ten mini-satellites also placed both lineages together in every cluster. Conclusions: The global lineage distribution, lack of genetic distance, similar pattern of genetic diversity, and allele sharing strongly suggested that both lineages are a single species and thus new emerging phenotypes associated with vivax malaria could not be clearly classified as belonging to a particular lineage on basis of their geographical origin.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Using computer programs developed for this purpose, we searched for various repeated sequences including inverted, direct tandem, and homopurine–homopyrimidine mirror repeats in various prokaryotes, eukaryotes, and an archaebacterium. Comparison of observed frequencies with expectations revealed that in bacterial genomes and organelles the frequency of different repeats is either random or enriched for inverted and/or direct tandem repeats. By contrast, in all eukaryotic genomes studied, we observed an overrepresentation of all repeats, especially homopurine–homopyrimidine mirror repeats. Analysis of the genomic distribution of all abundant repeats showed that they are virtually excluded from coding sequences. Unexpectedly, the frequencies of abundant repeats normalized for their expectations were almost perfect exponential functions of their size, and for a given repeat this function was indistinguishable between different genomes.
Resumo:
Recently, we established that satellite III (TGGAA)n tandem repeats, which occur at the centromeres of human chromosomes, pair with themselves to form an unusual "self-complementary" antiparallel duplex containing (GGA)2 motifs in which two unpaired guanines from opposite strands intercalate between sheared G.A base pairs. In separate studies, we have also established that the GCA triplet does not form bimolecular (GCA)2 motifs but instead promotes the formation of hairpins containing a GCA-turn motif in which the loop contains a single cytidine closed by a sheared G.A pair. Since TGCAA is the most frequent variant of TGGAA found in satellite III repeats, we reasoned that the potential of this variant to form GCA-turn miniloop fold-back structures might be an important factor in modulating the local structure in natural (TGGAA)n repeats. We report here the NMR-derived solution structure of the heptadecadeoxynucleotide (G)TGGAATGCAATGGAA(C) in which a central TGCAA pentamer is flanked by two TGGAA pentamers. This 17-mer forms a rather unusual and very stable hairpin structure containing eight base pairs in the stem, only four of which are Watson-Crick pairs, and a loop consisting of a single cytidine residue. The stem contains a (GGA)2 motif with intercalative 14G/4G stacking between two sheared G.A base pairs; the loop end of the stem consists of a sheared 8G.10A closing pair with the cytosine base of the 9C loop stacked on 8G. The remarkable stability of this unusual hairpin structure (Tm = 63 degrees C) suggests that it probably plays an important role in modulating the folding of satellite III (TGGAA)n repeats at the centromere.
Resumo:
Telomeres are specialized structures located at the ends of linear eukaryotic chromosomes that ensure their complete replication and protect them from fusion and degradation. We report here the characterization of the telomeres of the nematode Caenorhabditis elegans. We show that the chromosomes terminate in 4-9 kb of tandem repeats of the sequence TTAGGC. Furthermore, we have isolated clones corresponding to 11 of the 12 C. elegans telomeres. Their subtelomeric sequences are all different from each other, demonstrating that the terminal TTAGGC repeats are sufficient for general chromosomal capping functions. Finally, we demonstrate that the me8 meiotic mutant, which is defective in X chromosome crossing over and segregation, bears a terminal deficiency, that was healed by the addition of telomeric repeats, presumably by the activity of a telomerase enzyme. The 11 cloned telomeres represent an important advance for the completion of the physical map and for the determination of the entire sequence of the C. elegans genome.
Resumo:
Group B streptococci (GBS) are the most common cause of neonatal sepsis, pneumonia, and meningitis. The alpha C protein is a surface-associated antigen; the gene (bca) for this protein contains a series of tandem repeats (each encoding 82 aa) that are identical at the nucleotide level and express a protective epitope. We previously reported that GBS isolates from two of 14 human maternal and neonatal pairs differed in the number of repeats contained in their alpha C protein; in both pairs, the alpha C protein of the neonatal isolate was smaller in molecular size. We now demonstrate by PCR that the neonatal isolates contain fewer tandem repeats. Maternal isolates were susceptible to opsonophagocytic killing in the presence of alpha C protein-specific antiserum, whereas the discrepant neonatal isolates proliferated. An animal model was developed to further study this phenomenon. Adult mice passively immunized with antiserum to the alpha C protein were challenged with an alpha C protein-expressing strain of GBS. Splenic isolates of GBS from these mice showed a high frequency of mutation in bca--most commonly a decrease in repeat number. Isolates from non-immune mice were not altered. Spontaneous deletions in the repeat region were observed at a much lower frequency (6 x 10(-4)); thus, deletions in that region are selected for under specific antibody pressure and appear to lower the organism's susceptibility to killing by antibody specific to the alpha C protein. This mechanism of antigenic variation may provide a means whereby GBS evade host immunity.
Resumo:
Cannabis sativa is the most frequently used of all illicit drugs in the United States. Cannabis has been used throughout history for its stems in the production of hemp fiber, for its seed for oil and food, and for its buds and leaves as a psychoactive drug. Short tandem repeats (STRs), were chosen as molecular markers because of their distinct advantages over other genetic methods. STRs are co-dominant, can be standardized such that reproducibility between laboratories can be easily achieved, have a high discrimination power and can be multiplexed. ^ In this study, six STR markers previously described for Cannabis were multiplexed into one reaction. The multiplex reaction was able to individualize 98 Cannabis samples (14 hemp and 84 marijuana, authenticated as originating from 33 of the 50 United States) and detect 29 alleles averaging 4.8 alleles per loci. The data did not relate the samples from the same state to each other. This is the first study to report a single reaction six-plex and apply it to the analysis of almost 100 Cannabis samples of known geographic collection site. ^
Resumo:
The genomes of many strains of baker’s yeast, Saccharomyces cerevisiae, contain multiple repeats of the copper-binding protein Cup1. Cup1 is a member of the metallothionein family, and is found in a tandem array on chromosome VIII. In this thesis, I describe studies that characterized these tandem arrays and their mechanism of formation across diverse strains of yeast. I show that CUP1 arrays are an illuminating model system for observing recombination in eukaryotes, and describe insights derived from these observations.
In our first study, we analyzed 101 natural isolates of S. cerevisiae in order to examine the diversity of CUP1-containing repeats across different strains. We identified five distinct classes of repeats that contain CUP1. We also showed that some strains have only a single copy of CUP1. By comparing the sequences of all the strains, we were able to elucidate the mechanism of formation of the CUP1 tandem arrays, which involved unequal non-homologous recombination events starting from a strain that had only a single CUP1 gene. Our observation of CUP1 repeat formation allows more general insights about the formation of tandem repeats from single-copy genes in eukaryotes, which is one of the most important mechanisms by which organisms evolve.
In our second study, we delved deeper into our mechanistic investigations by measuring the relative rates of inter-homolog and intra-/inter-sister chromatid recombination in CUP1 tandem arrays. We used a diploid strain that is heterozygous both for insertion of a selectable marker (URA3) inside the tandem array, and also for markers at either end of the array. The intra-/inter-sister chromatid recombination rate turned out to be more than ten-fold greater than the inter-homolog rate. Moreover, we found that loss of the proteins Rad51 and Rad52, which are required for most inter-homolog recombination, did not greatly reduce recombination in the CUP1 tandem repeats. Additionally, we investigated the effects of elevated copper levels on the rate of each type of recombination at the CUP1 locus. Both types of recombination are increased at high concentrations of copper (as is known to be the case for CUP1 transcription). Furthermore, the inter-homolog recombination rate at the CUP1 locus is higher than the average over the genome during mitosis, but is lower than the average during meiosis.
The research described in Chapter 2 is published in 2014.