980 resultados para Short Tandem Repeats
Resumo:
Using computer programs developed for this purpose, we searched for various repeated sequences including inverted, direct tandem, and homopurine–homopyrimidine mirror repeats in various prokaryotes, eukaryotes, and an archaebacterium. Comparison of observed frequencies with expectations revealed that in bacterial genomes and organelles the frequency of different repeats is either random or enriched for inverted and/or direct tandem repeats. By contrast, in all eukaryotic genomes studied, we observed an overrepresentation of all repeats, especially homopurine–homopyrimidine mirror repeats. Analysis of the genomic distribution of all abundant repeats showed that they are virtually excluded from coding sequences. Unexpectedly, the frequencies of abundant repeats normalized for their expectations were almost perfect exponential functions of their size, and for a given repeat this function was indistinguishable between different genomes.
Resumo:
Recently, we established that satellite III (TGGAA)n tandem repeats, which occur at the centromeres of human chromosomes, pair with themselves to form an unusual "self-complementary" antiparallel duplex containing (GGA)2 motifs in which two unpaired guanines from opposite strands intercalate between sheared G.A base pairs. In separate studies, we have also established that the GCA triplet does not form bimolecular (GCA)2 motifs but instead promotes the formation of hairpins containing a GCA-turn motif in which the loop contains a single cytidine closed by a sheared G.A pair. Since TGCAA is the most frequent variant of TGGAA found in satellite III repeats, we reasoned that the potential of this variant to form GCA-turn miniloop fold-back structures might be an important factor in modulating the local structure in natural (TGGAA)n repeats. We report here the NMR-derived solution structure of the heptadecadeoxynucleotide (G)TGGAATGCAATGGAA(C) in which a central TGCAA pentamer is flanked by two TGGAA pentamers. This 17-mer forms a rather unusual and very stable hairpin structure containing eight base pairs in the stem, only four of which are Watson-Crick pairs, and a loop consisting of a single cytidine residue. The stem contains a (GGA)2 motif with intercalative 14G/4G stacking between two sheared G.A base pairs; the loop end of the stem consists of a sheared 8G.10A closing pair with the cytosine base of the 9C loop stacked on 8G. The remarkable stability of this unusual hairpin structure (Tm = 63 degrees C) suggests that it probably plays an important role in modulating the folding of satellite III (TGGAA)n repeats at the centromere.
Resumo:
Telomeres are specialized structures located at the ends of linear eukaryotic chromosomes that ensure their complete replication and protect them from fusion and degradation. We report here the characterization of the telomeres of the nematode Caenorhabditis elegans. We show that the chromosomes terminate in 4-9 kb of tandem repeats of the sequence TTAGGC. Furthermore, we have isolated clones corresponding to 11 of the 12 C. elegans telomeres. Their subtelomeric sequences are all different from each other, demonstrating that the terminal TTAGGC repeats are sufficient for general chromosomal capping functions. Finally, we demonstrate that the me8 meiotic mutant, which is defective in X chromosome crossing over and segregation, bears a terminal deficiency, that was healed by the addition of telomeric repeats, presumably by the activity of a telomerase enzyme. The 11 cloned telomeres represent an important advance for the completion of the physical map and for the determination of the entire sequence of the C. elegans genome.
Resumo:
Group B streptococci (GBS) are the most common cause of neonatal sepsis, pneumonia, and meningitis. The alpha C protein is a surface-associated antigen; the gene (bca) for this protein contains a series of tandem repeats (each encoding 82 aa) that are identical at the nucleotide level and express a protective epitope. We previously reported that GBS isolates from two of 14 human maternal and neonatal pairs differed in the number of repeats contained in their alpha C protein; in both pairs, the alpha C protein of the neonatal isolate was smaller in molecular size. We now demonstrate by PCR that the neonatal isolates contain fewer tandem repeats. Maternal isolates were susceptible to opsonophagocytic killing in the presence of alpha C protein-specific antiserum, whereas the discrepant neonatal isolates proliferated. An animal model was developed to further study this phenomenon. Adult mice passively immunized with antiserum to the alpha C protein were challenged with an alpha C protein-expressing strain of GBS. Splenic isolates of GBS from these mice showed a high frequency of mutation in bca--most commonly a decrease in repeat number. Isolates from non-immune mice were not altered. Spontaneous deletions in the repeat region were observed at a much lower frequency (6 x 10(-4)); thus, deletions in that region are selected for under specific antibody pressure and appear to lower the organism's susceptibility to killing by antibody specific to the alpha C protein. This mechanism of antigenic variation may provide a means whereby GBS evade host immunity.
Resumo:
An analysis of the historic H1 subtype, H1-1, in eight legumes belonging to four genera of the tribe Vicieae (Pisum, Lathyrus, Lens, and Vicia), revealed an extended region consisting of the tandemly repeated AKPAAK motifs. We named this region the Regular zone (RZ). The AKPAAK motifs are organized into two blocks separated by a short (two or six amino acids) intervening sequence (IS). The distal block contains six AKPAAK motifs, while the number of repeats in the proximal block varies from six in V. faba to seven in the other species. In V. hirsuta, the first two repeated units of the proximal block are octapeptides AKAKPAAK. The apparent rate of synonymous substitutions in the blocks of RZ is much higher than in the rest of the gene. This can be explained by repeat shuffling within each block. In the C-domain of the orthologous H1 subtype froth Medicago truncatula (tribe Trifolieae), a region corresponding to the RZ of Vicieae species was found. It also consists of two blocks of AKPAAK motifs (four and three repeats in the proximal and distal blocks, respectively). These blocks are separated by a 20-amino acid IS. The first 20 amino acids of the Medicago RZ are not part of AKPAAK repeats. We hypothesise that the RZ has most probably evolved as a result of an expansion of AKPAAK repeats from two separate sites in the C-domain. This process started tens of millions of years ago and was most likely directed by positive selection.
Resumo:
The genomes of many strains of baker’s yeast, Saccharomyces cerevisiae, contain multiple repeats of the copper-binding protein Cup1. Cup1 is a member of the metallothionein family, and is found in a tandem array on chromosome VIII. In this thesis, I describe studies that characterized these tandem arrays and their mechanism of formation across diverse strains of yeast. I show that CUP1 arrays are an illuminating model system for observing recombination in eukaryotes, and describe insights derived from these observations.
In our first study, we analyzed 101 natural isolates of S. cerevisiae in order to examine the diversity of CUP1-containing repeats across different strains. We identified five distinct classes of repeats that contain CUP1. We also showed that some strains have only a single copy of CUP1. By comparing the sequences of all the strains, we were able to elucidate the mechanism of formation of the CUP1 tandem arrays, which involved unequal non-homologous recombination events starting from a strain that had only a single CUP1 gene. Our observation of CUP1 repeat formation allows more general insights about the formation of tandem repeats from single-copy genes in eukaryotes, which is one of the most important mechanisms by which organisms evolve.
In our second study, we delved deeper into our mechanistic investigations by measuring the relative rates of inter-homolog and intra-/inter-sister chromatid recombination in CUP1 tandem arrays. We used a diploid strain that is heterozygous both for insertion of a selectable marker (URA3) inside the tandem array, and also for markers at either end of the array. The intra-/inter-sister chromatid recombination rate turned out to be more than ten-fold greater than the inter-homolog rate. Moreover, we found that loss of the proteins Rad51 and Rad52, which are required for most inter-homolog recombination, did not greatly reduce recombination in the CUP1 tandem repeats. Additionally, we investigated the effects of elevated copper levels on the rate of each type of recombination at the CUP1 locus. Both types of recombination are increased at high concentrations of copper (as is known to be the case for CUP1 transcription). Furthermore, the inter-homolog recombination rate at the CUP1 locus is higher than the average over the genome during mitosis, but is lower than the average during meiosis.
The research described in Chapter 2 is published in 2014.
Resumo:
Campylobacter jejuni followed by Campylobacter coli contribute substantially to the economic and public health burden attributed to food-borne infections in Australia. Genotypic characterisation of isolates has provided new insights into the epidemiology and pathogenesis of C. jejuni and C. coli. However, currently available methods are not conducive to large scale epidemiological investigations that are necessary to elucidate the global epidemiology of these common food-borne pathogens. This research aims to develop high resolution C. jejuni and C. coli genotyping schemes that are convenient for high throughput applications. Real-time PCR and High Resolution Melt (HRM) analysis are fundamental to the genotyping schemes developed in this study and enable rapid, cost effective, interrogation of a range of different polymorphic sites within the Campylobacter genome. While the sources and routes of transmission of campylobacters are unclear, handling and consumption of poultry meat is frequently associated with human campylobacteriosis in Australia. Therefore, chicken derived C. jejuni and C. coli isolates were used to develop and verify the methods described in this study. The first aim of this study describes the application of MLST-SNP (Multi Locus Sequence Typing Single Nucleotide Polymorphisms) + binary typing to 87 chicken C. jejuni isolates using real-time PCR analysis. These typing schemes were developed previously by our research group using isolates from campylobacteriosis patients. This present study showed that SNP + binary typing alone or in combination are effective at detecting epidemiological linkage between chicken derived Campylobacter isolates and enable data comparisons with other MLST based investigations. SNP + binary types obtained from chicken isolates in this study were compared with a previously SNP + binary and MLST typed set of human isolates. Common genotypes between the two collections of isolates were identified and ST-524 represented a clone that could be worth monitoring in the chicken meat industry. In contrast, ST-48, mainly associated with bovine hosts, was abundant in the human isolates. This genotype was, however, absent in the chicken isolates, indicating the role of non-poultry sources in causing human Campylobacter infections. This demonstrates the potential application of SNP + binary typing for epidemiological investigations and source tracing. While MLST SNPs and binary genes comprise the more stable backbone of the Campylobacter genome and are indicative of long term epidemiological linkage of the isolates, the development of a High Resolution Melt (HRM) based curve analysis method to interrogate the hypervariable Campylobacter flagellin encoding gene (flaA) is described in Aim 2 of this study. The flaA gene product appears to be an important pathogenicity determinant of campylobacters and is therefore a popular target for genotyping, especially for short term epidemiological studies such as outbreak investigations. HRM curve analysis based flaA interrogation is a single-step closed-tube method that provides portable data that can be easily shared and accessed. Critical to the development of flaA HRM was the use of flaA specific primers that did not amplify the flaB gene. HRM curve analysis flaA interrogation was successful at discriminating the 47 sequence variants identified within the 87 C. jejuni and 15 C. coli isolates and correlated to the epidemiological background of the isolates. In the combinatorial format, the resolving power of flaA was additive to that of SNP + binary typing and CRISPR (Clustered regularly spaced short Palindromic repeats) HRM and fits the PHRANA (Progressive hierarchical resolving assays using nucleic acids) approach for genotyping. The use of statistical methods to analyse the HRM data enhanced sophistication of the method. Therefore, flaA HRM is a rapid and cost effective alternative to gel- or sequence-based flaA typing schemes. Aim 3 of this study describes the development of a novel bioinformatics driven method to interrogate Campylobacter MLST gene fragments using HRM, and is called ‘SNP Nucleated Minim MLST’ or ‘Minim typing’. The method involves HRM interrogation of MLST fragments that encompass highly informative “Nucleating SNPS” to ensure high resolution. Selection of fragments potentially suited to HRM analysis was conducted in silico using i) “Minimum SNPs” and ii) the new ’HRMtype’ software packages. Species specific sets of six “Nucleating SNPs” and six HRM fragments were identified for both C. jejuni and C. coli to ensure high typeability and resolution relevant to the MLST database. ‘Minim typing’ was tested empirically by typing 15 C. jejuni and five C. coli isolates. The association of clonal complexes (CC) to each isolate by ‘Minim typing’ and SNP + binary typing were used to compare the two MLST interrogation schemes. The CCs linked with each C. jejuni isolate were consistent for both methods. Thus, ‘Minim typing’ is an efficient and cost effective method to interrogate MLST genes. However, it is not expected to be independent, or meet the resolution of, sequence based MLST gene interrogation. ‘Minim typing’ in combination with flaA HRM is envisaged to comprise a highly resolving combinatorial typing scheme developed around the HRM platform and is amenable to automation and multiplexing. The genotyping techniques described in this thesis involve the combinatorial interrogation of differentially evolving genetic markers on the unified real-time PCR and HRM platform. They provide high resolution and are simple, cost effective and ideally suited to rapid and high throughput genotyping for these common food-borne pathogens.
Resumo:
Germ-line mutations in CDKN2A have been shown to predispose to cutaneous malignant melanoma. We have identified 2 new melanoma kindreds which carry a duplication of a 24bp repeat present in the 5' region of CDKN2A previously identified in melanoma families from Australia and the United States. This mutation has now been reported in 5 melanoma families from 3 continents: Europe, North America, and Australasia. The M53I mutation in exon 2 of CDKN2A has also been documented in 5 melanoma families from Australia and North America. The aim of this study was to determine whether the occurrence of the mutations in these families from geographically diverse populations represented mutation hotspots within CDKN2A or were due to common ancestors. Haplotypes of 11 microsatellite markers flanking CDKN2A were constructed in 5 families carrying the M53I mutation and 5 families carrying the 24bp duplication. There were some differences in the segregating haplotypes due primarily to recombinations and mutations within the short tandem-repeat markers; however, the data provide evidence to indicate that there were at least 3 independent 24bp duplication events and possibly only 1 original M53I mutation. This is the first study to date which indicates common founders in melanoma families from different continents.
Resumo:
Microbial pollution in water periodically affects human health in Australia, particularly in times of drought and flood. There is an increasing need for the control of waterborn microbial pathogens. Methods, allowing the determination of the origin of faecal contamination in water, are generally referred to as Microbial Source Tracking (MST). Various approaches have been evaluated as indicatorsof microbial pathogens in water samples, including detection of different microorganisms and various host-specific markers. However, until today there have been no universal MST methods that could reliably determine the source (human or animal) of faecal contamination. Therefore, the use of multiple approaches is frequently advised. MST is currently recognised as a research tool, rather than something to be included in routine practices. The main focus of this research was to develop novel and universally applicable methods to meet the demands for MST methods in routine testing of water samples. Escherichia coli was chosen initially as the object organism for our studies as, historically and globally, it is the standard indicator of microbial contamination in water. In this thesis, three approaches are described: single nucleotide polymorphism (SNP) genotyping, clustered regularly interspaced short palindromic repeats (CRISPR) screening using high resolution melt analysis (HRMA) methods and phage detection development based on CRISPR types. The advantage of the combination SNP genotyping and CRISPR genes has been discussed in this study. For the first time, a highly discriminatory single nucleotide polymorphism interrogation of E. coli population was applied to identify the host-specific cluster. Six human and one animal-specific SNP profile were revealed. SNP genotyping was successfully applied in the field investigations of the Coomera watershed, South-East Queensland, Australia. Four human profiles [11], [29], [32] and [45] and animal specific SNP profile [7] were detected in water. Two human-specific profiles [29] and [11] were found to be prevalent in the samples over a time period of years. The rainfall (24 and 72 hours), tide height and time, general land use (rural, suburban), seasons, distance from the river mouth and salinity show a lack of relashionship with the diversity of SNP profiles present in the Coomera watershed (p values > 0.05). Nevertheless, SNP genotyping method is able to identify and distinquish between human- and non-human specific E. coli isolates in water sources within one day. In some samples, only mixed profiles were detected. To further investigate host-specificity in these mixed profiles CRISPR screening protocol was developed, to be used on the set of E. coli, previously analysed for SNP profiles. CRISPR loci, which are the pattern of previous DNA coliphages attacks, were considered to be a promising tool for detecting host-specific markers in E. coli. Spacers in CRISPR loci could also reveal the dynamics of virulence in E. coli as well in other pathogens in water. Despite the fact that host-specificity was not observed in the set of E. coli analysed, CRISPR alleles were shown to be useful in detection of the geographical site of sources. HRMA allows determination of ‘different’ and ‘same’ CRISPR alleles and can be introduced in water monitoring as a cost-effective and rapid method. Overall, we show that the identified human specific SNP profiles [11], [29], [32] and [45] can be useful as marker genotypes globally for identification of human faecal contamination in water. Developed in the current study, the SNP typing approach can be used in water monitoring laboratories as an inexpensive, high-throughput and easy adapted protocol. The unique approach based on E. coli spacers for the search for unknown phage was developed to examine the host-specifity in phage sequences. Preliminary experiments on the recombinant plasmids showed the possibility of using this method for recovering phage sequences. Future studies will determine the host-specificity of DNA phage genotyping as soon as first reliable sequences can be acquired. No doubt, only implication of multiple approaches in MST will allow identification of the character of microbial contamination with higher confidence and readability.
Resumo:
Telomeres are the termini of linear eukaryotic chromosomes consisting of tandem repeats of DNA and proteins that bind to these repeat sequences. Telomeres ensure the complete replication of chromosome ends, impart protection to ends from nucleolytic degradation, end-to-end fusion, and guide the localization of chromosomes within the nucleus. In addition, a combination of genetic, biochemical, and molecular biological approaches have implicated key roles for telomeres in diverse cellular processes such as regulation of gene expression, cell division, cell senescence, and cancer. This review focuses on recent advances in our understanding of the organization of telomeres, telomere replication, proteins that bind telomeric DNA, and the establishment of telomere length equilibrium.
Resumo:
In complement activation, Factor H (FH) and C4b-binding protein (C4bp) are the key regulators that prevent the complement cascade from attacking host tissues. Some bacteria may bind and deposit these regulators on their own surfaces and thus provide themselves with an efficient means to avoid complement activation. In consequence, bacteria resist complement-mediated lysis and opsonin-dependent phagocytosis. This study has demonstrated that Y. enterocolitica, similar to many other pathogens, recruits both FH and C4bp to its surface to ensure protection against the complement-mediated killing. YadA and Ail, the most crucial serum resistance factors of Y.enterocolitica, mediate the binding of FH and C4bp. FH - YadA interaction involves multiple higher structural motifs on the YadA stalk and the short consensus repeats (SCRs) of the entire polypeptide chain of FH. The Ail binding site on FH has been located to SCRs 6 and 7. The binding site for FH on Ail, however, remains undetermined. Both YadA- and Ail-bound regulators display full cofactor activity for FI-mediated cleavage of C3b/C4b. FH/C4bp-binding characteristics do, however, differ between YadA and Ail. In addition, Ail captures the regulators only in the absence of blocking lipopolysaccharide O-antigen and outer core, whereas YadA binds FH/C4bp independent of the presence of other surface factors Independent of mode of binding, however, YadA and Ail provide Y. enterocolitica a means to avoid complement-mediated lysis, enhancing chances for the bacteria to survive in the host during various phases of infection.
Resumo:
Schizophrenia, affecting about 1% of population worldwide, is a severe mental disorder characterized by positive and negative symptoms, such as psychosis and anhedonia, as well as cognitive deficits. At present, schizophrenia is considered a complex disorder of neurodevelopmental origin with both genetic and environmental factors contributing to its onset. Although a number of candidate genes for schizophrenia have been highlighted, only very few schizophrenia patients are likely to share identical genetic liability. This study is based on the nation-wide schizophrenia family sample of the National Institute for Health and Welfare, and represents one of the largest and most well-characterized familial series in the world. In the first part of this study, we investigated the roles of the DTNBP1, NRG1, and AKT1 genes in the background of schizophrenia in Finland. Although these genes are associated with schizophrenia liability in several populations, any significant association with clinical diagnostic information of schizophrenia remained absent in our sample of 441 schizophrenia families. In the second part of this study, we first replicated schizophrenia linkage on the long arm of chromosome 7 in 352 schizophrenia families. In the following association analysis, we utilized additional clinical disorder features and intermediate phenotypes – endophenotypes - in addition to diagnostic information from altogether 290 neuropsychologically assessed schizophrenia families. An intragenic short tandem repeat allele of the regional RELN gene, supposed to play a role in the background of several neurodevelopmental disorders, showed significant association with poorer cognitive functioning and more severe schizophrenia symptoms. Additionally, this risk allele was significantly more prevalent among the individuals affected with schizophrenia spectrum disorders. We have previously identified linkage of schizophrenia and its cognitive endophenotypes on the long arms of chromosomes 2, 4, and 5. In the last part of this study, we selected altogether 104 functionally relevant candidate genes from the linked regions. We detected several promising associations, of which especially interesting are the ERBB4 gene, showing association with the severity of schizophrenia symptoms and impairments in traits related to verbal abilities, and the GRIA1 gene, showing association with the severity of schizophrenia symptoms. Our results extend the previous evidence that the genetic risk for schizophrenia is at least partially mediated via the effects of the candidate genes and their combinations on relevant brain systems, resulting in alterations in different disorder domains, such as the cognitive deficits.
Resumo:
Accurate determination of same-sex twin zygosity is important for medical, scientific and personal reasons. Determination may be based upon questionnaire data, blood group, enzyme isoforms and fetal membrane examination, but assignment of zygosity must ultimately be confirmed by genotypic data. Here methods are reviewed for calculating average probabilities of correctly concluding a twin pair is monozygotic, given they share the same genotypes across all loci for commonly utilized multiplex short tandem repeat (STR) kits.
Resumo:
We report here the formation of plasmid linear multimers promoted by the Red-system of phage lambda using a multicopy plasmid comprised of lambda red alpha and red beta genes, under the control of the lambda cI857 repressor. Our observations have revealed that the multimerization of plasmid DNA is dependent on the red beta and recA genes, suggesting a concerted role for these functions in the formation of plasmid multimers. The formation of multimers occurred in a recBCD+ sbcB+ xthA+ lon genetic background at a higher frequency than in the isogenic lon+ host cells. The multimers comprised tandem repeats of monomer plasmid DNA. Treatment of purified plasmid DNA with exonuclease III revealed the presence of free double-chain ends in the molecules. Determination of the size of multimeric DNA, by pulse field gel electrophoresis, revealed that the bulk of the DNA was in the range 50-240 kb, representing approximately 5-24 unit lengths of monomeric plasmid DNA. We provide a conceptual framework for Red-system-promoted formation and enhanced accumulation of plasmid linear multimers in lon mutants of E. coli.
Resumo:
Over the years, a wide range of methods to verify identity have been developed. Molecular markers have been used for identification since the 1920s, commencing with blood types and culminating with the advent of DNA techniques in the 1980s. Identification is required by authorities in many occasions, e.g. in disputed paternity cases, identification of deceased, or crime investigation. To clarify maternal and paternal lineages, uniparental DNA markers in mtDNA and Y-chromosome can be utilized. These markers have several advantages: male specific Y-chromosome can be used to identify a male from a mixture of male and female cells, e.g. in rape cases. MtDNA is durable and has a high copy number, allowing analyses even from old or degraded samples. However, both markers are lineage-specific, not individualizing, and susceptible to genetic drift. Prior to the application of any DNA marker in forensic casework, it is of utmost importance to investigate its qualities and peculiarities in the target population. Earlier studies on the Finnish population have shown reduced variation in the Y-chromosome, but in mtDNA results have been ambiguous. The obtained results confirmed the low diversity in Y-chromosome in Finland. Detailed population analysis revealed large regional differences, and extremely reduced diversity especially in East Finland. Analysis of the qualities affecting Y-chromosomal short tandem repeat (Y-STR) variation and mutation frequencies, and search of new polymorphic markers resulted a set of Y-STRs with especially high diversity in Finland. Contrary to Y-chromosome, neither reduced diversity nor regional differences were found in mtDNA within Finland. In fact, mtDNA diversity was found similar to other European populations. The revealed peculiarities in the uniparental markers are a legacy of the Finnish population history. The obtained results challenge the traditional explanation which emphasizes relatively recent founder effects creating the observed east-west patterns. Uniparentally inherited markers, both mtDNA and Y-chromosome, are applicable for identification purposes in Finland. By adjusting the analysed Y marker set to meet the characteristics of Finnish population, Y-chromosomal diversity increases and the regional differentiation decreases, resulting increase in discrimination power and thus usefulness of Y-chromosomal analysis in forensic casework.