20 resultados para DNA-Directed DNA Polymerase
em Helda - Digital Repository of University of Helsinki
Resumo:
Thesis focuses on mutations of POLG1 gene encoding catalytic subunit polγ-α of mitochondrial DNA polymerase gamma holoenzyme (polG) and the association of mutations with different clinical phenotypes. In addition, particular defective mutant variants of the protein were characterized biochemically in vitro. PolG-holoenzyme is the sole DNA polymerase found in mitochondria. It is involved in replication and repair of the mitochondrial genome, mtDNA. Holoenzyme also includes the accessory subunit polγ-β, which is required for the enhanced processivity of polγ-α. Defective polγ-α causes accumulation of secondary mutations on mtDNA, which leads to a defective oxidative phosphorylation system. The clinical consequences of such mutations are variable, affecting nervous system, skeletal muscles, liver and other post-mitotic tissues. The aims of the studies included: 1) Determination of the role of POLG1 mutations in neurological syndromes with features of mitochondrial dysfunction and an unknown molecular cause. 2) Development and set up of diagnostic tests for routine clinical purposes. 3) Biochemical characterization of the functional consequences of the identified polγ-α variants. Studies describe new neurological phenotypes in addition to PEO caused by POLG1 mutations, including parkinsonism, premature amenorrhea, ataxia and Parkinson s disease (PD). POLG1 mutations and polymorphisms are both common and/or potential genetic risk factors at least among the Finnish population. The major findings and applications reported here are: 1) POLG1 mutations cause parkinsonism and premature menopause in PEO families in either a recessive or a dominant manner. 2) A common recessive POLG1 mutations (A467T and W748S) in the homozygous state causes severe adult or juvenile-onset ataxia without muscular symptoms or histological or mtDNA abnormalities in muscles. 3) A common recessive pathogenic change A467T can also cause a mild dominant disease in heterozygote carriers. 4) The A467T variant shows reduced polymerase activity due to defective template binding. 5) Rare polyglutamine tract length variants of POLG1 are significantly enriched in Finnish idiopathic Parkinson s disease patients. 6) Dominant mutations are clearly restricted to the highly conserved polymerase domain motifs, whereas recessive ones are more evenly distributed along the protein. The present results highlight and confirm the new role of mitochondria in parkinsonism/Parkinson s disease and describe a new mitochondrial ataxia. Based on these results, a POLG1 diagnostic routine has been set up in Helsinki University Central Hospital (HUSLAB).
Resumo:
Common migraine, i.e. migraine with (MA) or without aura (MO), is a chronic neurological disorder affecting about 10% of the Caucasian population. In MA, migraine headache is preceded by visual, sensoric and/or dysphasic reversible aura symptoms. Twin and family studies have suggested a multifactorial mode of inheritance for common migraine, and a stronger genetic component for MA than for MO. Since there is no biological or genetic marker to identify common migraine, aura symptoms provide a distinctive character to identify those suspected of suffering from migraine. The aim of this study was to identify MA susceptibility loci in well-phenotyped migraine samples with familial predisposition using different gene mapping methods. Genes coding for endothelin1 and its receptors EDNRA and ENDRB are potential candidate genes for cortical spreading depression (CSD), which is considered to be the underlying mechanism of migraine aura. The role of these genes in MA was studied in 850 Finnish migraine cases and 890 control individuals. Rare homozygous EDNRA SNPs showed nominal association with MA and with the age of onset trait (20 years). This result was also detected in the pooled analysis on 648 German MA cases and 651 control individuals when the test was adjusted for gender and sample origin. Evaluation of SNP genotyping reactions with two different DNA polymerase enzymes ensured that the genotype quality was high, and thus the discovered associations are considered reliable. The role of the 19p13 region was studied in a linkage analysis of 72 Finnish MA families. This region contains two migraine-associated genes: CACNA1A, which is associated with a predisposition to a rare Mendelian form of MA, familial hemiplegic migraine (FHM), and the insulin receptor gene (INSR) that is associated with common migraine. No evidence of linkage between the 19p13 and MA was detected. A novel visual aura locus was mapped to chromosome 9q21-q22 with significant evidence of linkage using a genome-wide linkage approach in 36 Finnish MA families. Five additional, potential loci were also detected. The 9q21-q22 region has previously been linked to occipitotemporal lobe epilepsy and MA, both of which involve prominent visual symptoms. Our result further supports a shared background for these episodic disorders.
Resumo:
Defects in mitochondrial DNA (mtDNA) maintenance cause a range of human diseases, including autosomal dominant progressive external ophthalmoplegia (adPEO). This study aimed to clarify the molecular background of adPEO. We discovered that deoxynucleoside triphosphate (dNTP) metabolism plays a crucial in mtDNA maintenance and were thus prompted to search for therapeutic strategies based on the modulation of cellular dNTP pools or mtDNA copy number. Human mtDNA is a 16.6 kb circular molecule present in hundreds to thousands of copies per cell. mtDNA is compacted into nucleoprotein clusters called nucleoids. mtDNA maintenance diseases result from defects in nuclear encoded proteins that maintain the mtDNA. These syndromes typically afflict highly differentiated, post-mitotic tissues such as muscle and nerve, but virtually any organ can be affected. adPEO is a disease where mtDNA molecules with large-scale deletions accumulate in patients tissues, particularly in skeletal muscle. Mutations in five nuclear genes, encoding the proteins ANT1, Twinkle, POLG, POLG2 and OPA1, have previously been shown to cause adPEO. Here, we studied a large North American pedigree with adPEO, and identified a novel heterozygous mutation in the gene RRM2B, which encodes the p53R2 subunit of the enzyme ribonucleotide reductase (RNR). RNR is the rate-limiting enzyme in dNTP biosynthesis, and is required both for nuclear and mitochondrial DNA replication. The mutation results in the expression of a truncated form of p53R2, which is likely to compete with the wild-type allele. A change in enzyme function leads to defective mtDNA replication due to altered dNTP pools. Therefore, RRM2B is a novel adPEO disease gene. The importance of adequate dNTP pools and RNR function for mtDNA maintenance has been established in many organisms. In yeast, induction of RNR has previously been shown to increase mtDNA copy number, and to rescue the phenotype caused by mutations in the yeast mtDNA polymerase. To further study the role of RNR in mammalian mtDNA maintenance, we used mice that broadly overexpress the RNR subunits Rrm1, Rrm2 or p53R2. Active RNR is a heterotetramer consisting of two large subunits (Rrm1) and two small subunits (either Rrm2 or p53R2). We also created bitransgenic mice that overexpress Rrm1 together with either Rrm2 or p53R2. In contrast to the previous findings in yeast, bitransgenic RNR overexpression led to mtDNA depletion in mouse skeletal muscle, without mtDNA deletions or point mutations. The mtDNA depletion was associated with imbalanced dNTP pools. Furthermore, the mRNA expression levels of Rrm1 and p53R2 were found to correlate with mtDNA copy number in two independent mouse models, suggesting nuclear-mitochondrial cross talk with regard to mtDNA copy number. We conclude that tight regulation of RNR is needed to prevent harmful alterations in the dNTP pool balance, which can lead to disordered mtDNA maintenance. Increasing the copy number of wild-type mtDNA has been suggested as a strategy for treating PEO and other mitochondrial diseases. Only two proteins are known to cause a robust increase in mtDNA copy number when overexpressed in mice; the mitochondrial transcription factor A (TFAM), and the mitochondrial replicative helicase Twinkle. We studied the mechanisms by which Twinkle and TFAM elevate mtDNA levels, and showed that Twinkle specifically implements mtDNA synthesis. Furthermore, both Twinkle and TFAM were found to increase mtDNA content per nucleoid. Increased mtDNA content in mouse tissues correlated with an age-related accumulation of mtDNA deletions, depletion of mitochondrial transcripts, and progressive respiratory dysfunction. Simultaneous overexpression of Twinkle and TFAM led to a further increase in the mtDNA content of nucleoids, and aggravated the respiratory deficiency. These results suggested that high mtDNA levels have detrimental long-term effects in mice. These data have to be considered when developing and evaluating treatment strategies for elevating mtDNA copy number.
Resumo:
Prostate cancer is the most common noncutaneous malignancy and the second leading cause of cancer mortality in men. In 2004, 5237 new cases were diagnosed and altogether 25 664 men suffered from prostate cancer in Finland (Suomen Syöpärekisteri). Although extensively investigated, we still have a very rudimentary understanding of the molecular mechanisms leading to the frequent transformation of the prostate epithelium. Prostate cancer is characterized by several unique features including the multifocal origin of tumors and extreme resistance to chemotherapy, and new treatment options are therefore urgently needed. The integrity of genomic DNA is constantly challenged by genotoxic insults. Cellular responses to DNA damage involve elegant checkpoint cascades enforcing cell cycle arrest, thus facilitating damage repair, apoptosis or cellular senescence. Cellular DNA damage triggers the activation of tumor suppressor protein p53 and Wee1 kinase which act as executors of the cellular checkpoint responses. These are essential for genomic integrity, and are activated in early stages of tumorigenesis in order to function as barriers against tumor formation. Our work establishes that the primary human prostatic epithelial cells and prostatic epithelium have unexpectedly indulgent checkpoint surveillance. This is evidenced by the absence of inhibitory Tyr15 phosphorylation on Cdk2, lack of p53 response, radioresistant DNA synthesis, lack of G1/S and G2/M phase arrest, and presence of persistent gammaH2AX damage foci. We ascribe the absence of inhibitory Tyr15 phosphorylation to low levels of Wee1A, a tyrosine kinase and negative regulator of cell cycle progression. Ectopic Wee1A kinase restored Cdk2-Tyr15 phosphorylation and efficiently rescued the ionizing radiation-induced checkpoints in the human prostatic epithelial cells. As variability in the DNA damage responses has been shown to underlie susceptibility to cancer, our results imply that a suboptimal checkpoint arrest may greatly increase the accumulation of genetic lesions in the prostate epithelia. We also show that small molecules can restore p53 function in prostatic epithelial cells and may serve as a paradigm for the development of future therapeutic agents for the treatment of prostate cancer We hypothesize that the prostate has evolved to activate the damage surveillance pathways and molecules involved in these pathways only to certain stresses in extreme circumstances. In doing so, this organ inadvertently made itself vulnerable to genotoxic stress, which may have implications in malignant transformation. Recognition of the limited activity of p53 and Wee1 in the prostate could drive mechanism-based discovery of preventative and therapeutic agents.
Resumo:
Megasphaera cerevisiae, Pectinatus cerevisiiphilus, Pectinatus frisingensis, Selenomonas lacticifex, Zymophilus paucivorans and Zymophilus raffinosivorans are strictly anaerobic Gram-stain-negative bacteria that are able to spoil beer by producing off-flavours and turbidity. They have only been isolated from the beer production chain. The species are phylogenetically affiliated to the Sporomusa sub-branch in the class "Clostridia". Routine cultivation methods for detection of strictly anaerobic bacteria in breweries are time-consuming and do not allow species identification. The main aim of this study was to utilise DNA-based techniques in order to improve detection and identification of the Sporomusa sub-branch beer-spoilage bacteria and to increase understanding of their biodiversity, evolution and natural sources. Practical PCR-based assays were developed for monitoring of M. cerevisiae, Pectinatus species and the group of Sporomusa sub-branch beer spoilers throughout the beer production process. The developed assays reliably differentiated the target bacteria from other brewery-related microbes. The contaminant detection in process samples (10 1,000 cfu/ml) could be accomplished in 2 8 h. Low levels of viable cells in finished beer (≤10 cfu/100 ml) were usually detected after 1 3 d culture enrichment. Time saving compared to cultivation methods was up to 6 d. Based on a polyphasic approach, this study revealed the existence of three new anaerobic spoilage species in the beer production chain, i.e. Megasphaera paucivorans, Megasphaera sueciensis and Pectinatus haikarae. The description of these species enabled establishment of phenotypic and DNA-based methods for their detection and identification. The 16S rRNA gene based phylogenetic analysis of the Sporomusa sub-branch showed that the genus Selenomonas originates from several ancestors and will require reclassification. Moreover, Z. paucivorans and Z. raffinosivorans were found to be in fact members of the genus Propionispira. This relationship implies that they were carried to breweries along with plant material. The brewery-related Megasphaera species formed a distinct sub-group that did not include any sequences from other sources, suggesting that M. cerevisiae, M. paucivorans and M. sueciensis may be uniquely adapted to the brewery ecosystem. M. cerevisiae was also shown to exhibit remarkable resistance against many brewery-related stress conditions. This may partly explain why it is a brewery contaminant. This study showed that DNA-based techniques provide useful tools for obtaining more rapid and specific information about the presence and identity of the strictly anaerobic spoilage bacteria in the beer production chain than is possible using cultivation methods. This should ensure financial benefits to the industry and better product quality to customers. In addition, DNA-based analyses provided new insight into the biodiversity as well as natural sources and relations of the Sporomusa sub-branch bacteria. The data can be exploited for taxonomic classification of these bacteria and for surveillance and control of contaminations.
Resumo:
This thesis consists of two parts; in the first part we performed a single-molecule force extension measurement with 10kb long DNA-molecules from phage-λ to validate the calibration and single-molecule capability of our optical tweezers instrument. Fitting the worm-like chain interpolation formula to the data revealed that ca. 71% of the DNA tethers featured a contour length within ±15% of the expected value (3.38 µm). Only 25% of the found DNA had a persistence length between 30 and 60 nm. The correct value should be within 40 to 60 nm. In the second part we designed and built a precise temperature controller to remove thermal fluctuations that cause drifting of the optical trap. The controller uses feed-forward and PID (proportional-integral-derivative) feedback to achieve 1.58 mK precision and 0.3 K absolute accuracy. During a 5 min test run it reduced drifting of the trap from 1.4 nm/min in open-loop to 0.6 nm/min in closed-loop.
Resumo:
This thesis presents methods for locating and analyzing cis-regulatory DNA elements involved with the regulation of gene expression in multicellular organisms. The regulation of gene expression is carried out by the combined effort of several transcription factor proteins collectively binding the DNA on the cis-regulatory elements. Only sparse knowledge of the 'genetic code' of these elements exists today. An automatic tool for discovery of putative cis-regulatory elements could help their experimental analysis, which would result in a more detailed view of the cis-regulatory element structure and function. We have developed a computational model for the evolutionary conservation of cis-regulatory elements. The elements are modeled as evolutionarily conserved clusters of sequence-specific transcription factor binding sites. We give an efficient dynamic programming algorithm that locates the putative cis-regulatory elements and scores them according to the conservation model. A notable proportion of the high-scoring DNA sequences show transcriptional enhancer activity in transgenic mouse embryos. The conservation model includes four parameters whose optimal values are estimated with simulated annealing. With good parameter values the model discriminates well between the DNA sequences with evolutionarily conserved cis-regulatory elements and the DNA sequences that have evolved neutrally. In further inquiry, the set of highest scoring putative cis-regulatory elements were found to be sensitive to small variations in the parameter values. The statistical significance of the putative cis-regulatory elements is estimated with the Two Component Extreme Value Distribution. The p-values grade the conservation of the cis-regulatory elements above the neutral expectation. The parameter values for the distribution are estimated by simulating the neutral DNA evolution. The conservation of the transcription factor binding sites can be used in the upstream analysis of regulatory interactions. This approach may provide mechanistic insight to the transcription level data from, e.g., microarray experiments. Here we give a method to predict shared transcriptional regulators for a set of co-expressed genes. The EEL (Enhancer Element Locator) software implements the method for locating putative cis-regulatory elements. The software facilitates both interactive use and distributed batch processing. We have used it to analyze the non-coding regions around all human genes with respect to the orthologous regions in various other species including mouse. The data from these genome-wide analyzes is stored in a relational database which is used in the publicly available web services for upstream analysis and visualization of the putative cis-regulatory elements in the human genome.
Resumo:
The object of this study is a tailless internal membrane-containing bacteriophage PRD1. It has a dsDNA genome with covalently bound terminal proteins required for replication. The uniqueness of the structure makes this phage a desirable object of research. PRD1 has been studied for some 30 years during which time a lot of information has accumulated on its structure and life-cycle. The two least characterised steps of the PRD1 life-cycle, the genome packaging and virus release are investigated here. PRD1 shares the main principles of virion assembly (DNA packaging in particular) and host cell lysis with other dsDNA bacteriophages. However, this phage has some fascinating individual peculiarities, such as DNA packaging into a membrane vesicle inside the capsid, absence of apparent portal protein, holin inhibitor and procapsid expansion. In the course of this study we have identified the components of the DNA packaging vertex of the capsid, and determined the function of protein P6 in packaging. We managed to purify the procapsids for an in vitro packaging system, optimise the reaction and significantly increase its efficiency. We developed a new method to determine DNA translocation and were able to quantify the efficiency and the rate of packaging. A model for PRD1 DNA packaging was also proposed. Another part of this study covers the lysis of the host cell. As other dsDNA bacteriophages PRD1 has been proposed to utilise a two-component lysis system. The existence of this lysis system in PRD1 has been proven by experiments using recombinant proteins and the multi-step nature of the lysis process has been established.
Resumo:
Double-stranded RNA and associated proteins are known to regulate the gene expression of most eukaryotic organisms. These regulation pathways have different components, outcomes and distinct nomenclature depending on the model system, and often they are referred to collectively as RNA silencing. In many cases, RNA-dependent RNA polymerases (RdRPs) are found to be involved in the RNA silencing, but their targets, activities, interaction partners and reaction products remain enigmatic. In the filamentous fungus Neurospora crassa, the RdRP QDE-1 is critical for silencing of transgenes a phenomenon known as quelling. In this thesis the structure, biochemical activities and biological functions of QDE-1 were extensively studied. This dimeric RdRP was shown to possess five distinct catalytic in vitro activities that could be dissected by mutagenesis and by altering reaction conditions. The biochemical characterization implied that QDE-1 is actually an active DNA-dependent RNA polymerase that has additional RdRP activity. It also provided a structural explanation for the dimerization and suggested a biological framework for the functions of QDE-1 in vivo. (I) QDE-1 was also studied in a broader context along with the other components of the quelling pathway. It was shown that DNA damage in Neurospora causes a dramatic increase in the expression level of the Argonaute protein QDE-2 as well as the synthesis of a novel class of small RNAs known as qiRNAs. The accumulation of qiRNAs was shown to be dependent on several quelling components, and particularly to be derived from an aberrant ssRNA (aRNA) molecule that is synthesized by QDE-1 in the nucleus. The genomic distribution of qiRNA targets was analyzed and the possible biological significance of qiRNAs was studied. Importantly, qiRNAs are the first class of small RNAs that are induced by DNA damage. (II) After establishing that QDE-1 is a multifunctional RNA polymerase with several activities, template specificities and subcellular locations, the focus was turned onto its interaction partners. It had been previously known that QDE-1 associates with Replication Protein A (RPA), but the RecQ helicase QDE-3 was now shown to regulate this interaction. RPA was also observed to promote QDE-1 dependent dsRNA synthesis in vitro. By characterizing the interplay between QDE-1, QDE-3 and RPA, a working model of quelling and qiRNA pathways in Neurospora was presented. (III) This work sheds light on the complexity of the various RNA silencing pathways of a fungal model system. It shows how an RdRP can regulate gene expression on many levels, and suggests novel lines of research in other eukaryotic organisms.
Resumo:
For most RNA viruses RNA-dependent RNA polymerases (RdRPs) encoded by the virus are responsible for the entire RNA metabolism. Thus, RdRPs are critical components in the viral life cycle. However, it is not fully understood how these important enzymes function during viral replication. Double-stranded RNA (dsRNA) viruses perform the synthesis of their RNA genome within a proteinacous viral particle containing an RdRP as a minor constituent. The phi6 bacteriophage is the best-studied dsRNA virus, providing an excellent background for studies of its RNA synthesis. The purified recombinant phi6 RdRP is highly active in vitro and it possesses both RNA replication and transcription activities. The crystal structure of the phi6 polymerase, solved in complex with a number of ligands, provides a working model for detailed in vitro studies of RNA-dependent RNA polymerization. In this thesis, the primer-independent initiation of the phi6 RdRP was studied in vitro using biochemical and structural methods. A C-terminal, four-amino-acid-long loop protruding into the central cavity of the phi6 RdRP has been suggested to stabilize the incoming nucleotides of the initiation complex formation through stacking interactions. A similar structural element has been found from several other viral RdRPs. In this thesis, this so-called initiation platform loop was subjected to site-directed mutagenesis to address its role in the initiation. It was found that the initiation mode of the mutants is primer-dependent, requiring either an oligonucleotide primer or a back-priming initiation mechanism for the RNA synthesis. The crystal structure of a mutant RdRP with altered initiation platform revealed a set of contacts important for primer-independent initiation. Since phi6 RdRP is structurally and functionally homologous to several viral RdRPs, among them the hepatitis C virus RdRP, these results provide further general insight to understand primer-independent initiation. In this study it is demonstrated that manganese phasing could be used as a practical tool for solving structures of large proteins with a bound manganese ion. The phi6 RdRP was used as a case study to obtain phases for crystallographic analysis. Manganese ions are naturally bound to the phi6 RdRP at the palm domain of the enzyme. In a crystallographic experiment, X-ray diffraction data from a phi6 RdRP crystal were collected at a wavelength of 1.89 Å, which is the K edge of manganese. With this data an automatically built model of the core region of the protein could be obtained. Finally, in this work terminal nucleotidyl transferase (TNTase) activity of the phi6 RdRP was documented in the isolated polymerase as well as in the viral particle. This is the first time that such an activity has been reported in a polymerase of a dsRNA virus. The phi6 RdRP used uridine triphosphates as the sole substrate in a TNTase reaction but could accept several heterologous templates. The RdRP was able to add one or a few non-templated nucleotides to the 3' end of the single- or double-stranded RNA substrate. Based on the results on particle-mediated TNTase activity and previous structural information of the polymerase, a model for termination of the RNA-dependent RNA synthesis is suggested in this thesis.
Resumo:
Extraintestinal pathogenic Escherichia coli (ExPEC) represent a diverse group of strains of E. coli, which infect extraintestinal sites, such as the urinary tract, the bloodstream, the meninges, the peritoneal cavity, and the lungs. Urinary tract infections (UTIs) caused by uropathogenic E. coli (UPEC), the major subgroup of ExPEC, are among the most prevalent microbial diseases world wide and a substantial burden for public health care systems. UTIs are responsible for serious morbidity and mortality in the elderly, in young children, and in immune-compromised and hospitalized patients. ExPEC strains are different, both from genetic and clinical perspectives, from commensal E. coli strains belonging to the normal intestinal flora and from intestinal pathogenic E. coli strains causing diarrhea. ExPEC strains are characterized by a broad range of alternate virulence factors, such as adhesins, toxins, and iron accumulation systems. Unlike diarrheagenic E. coli, whose distinctive virulence determinants evoke characteristic diarrheagenic symptoms and signs, ExPEC strains are exceedingly heterogeneous and are known to possess no specific virulence factors or a set of factors, which are obligatory for the infection of a certain extraintestinal site (e. g. the urinary tract). The ExPEC genomes are highly diverse mosaic structures in permanent flux. These strains have obtained a significant amount of DNA (predictably up to 25% of the genomes) through acquisition of foreign DNA from diverse related or non-related donor species by lateral transfer of mobile genetic elements, including pathogenicity islands (PAIs), plasmids, phages, transposons, and insertion elements. The ability of ExPEC strains to cause disease is mainly derived from this horizontally acquired gene pool; the extragenous DNA facilitates rapid adaptation of the pathogen to changing conditions and hence the extent of the spectrum of sites that can be infected. However, neither the amount of unique DNA in different ExPEC strains (or UPEC strains) nor the mechanisms lying behind the observed genomic mobility are known. Due to this extreme heterogeneity of the UPEC and ExPEC populations in general, the routine surveillance of ExPEC is exceedingly difficult. In this project, we presented a novel virulence gene algorithm (VGA) for the estimation of the extraintestinal virulence potential (VP, pathogenicity risk) of clinically relevant ExPECs and fecal E. coli isolates. The VGA was based on a DNA microarray specific for the ExPEC phenotype (ExPEC pathoarray). This array contained 77 DNA probes homologous with known (e.g. adhesion factors, iron accumulation systems, and toxins) and putative (e.g. genes predictably involved in adhesion, iron uptake, or in metabolic functions) ExPEC virulence determinants. In total, 25 of DNA probes homologous with known virulence factors and 36 of DNA probes representing putative extraintestinal virulence determinants were found at significantly higher frequency in virulent ExPEC isolates than in commensal E. coli strains. We showed that the ExPEC pathoarray and the VGA could be readily used for the differentiation of highly virulent ExPECs both from less virulent ExPEC clones and from commensal E. coli strains as well. Implementing the VGA in a group of unknown ExPECs (n=53) and fecal E. coli isolates (n=37), 83% of strains were correctly identified as extraintestinal virulent or commensal E. coli. Conversely, 15% of clinical ExPECs and 19% of fecal E. coli strains failed to raster into their respective pathogenic and non-pathogenic groups. Clinical data and virulence gene profiles of these strains warranted the estimated VPs; UPEC strains with atypically low risk-ratios were largely isolated from patients with certain medical history, including diabetes mellitus or catheterization, or from elderly patients. In addition, fecal E. coli strains with VPs characteristic for ExPEC were shown to represent the diagnostically important fraction of resident strains of the gut flora with a high potential of causing extraintestinal infections. Interestingly, a large fraction of DNA probes associated with the ExPEC phenotype corresponded to novel DNA sequences without any known function in UTIs and thus represented new genetic markers for the extraintestinal virulence. These DNA probes included unknown DNA sequences originating from the genomic subtractions of four clinical ExPEC isolates as well as from five novel cosmid sequences identified in the UPEC strains HE300 and JS299. The characterized cosmid sequences (pJS332, pJS448, pJS666, pJS700, and pJS706) revealed complex modular DNA structures with known and unknown DNA fragments arranged in a puzzle-like manner and integrated into the common E. coli genomic backbone. Furthermore, cosmid pJS332 of the UPEC strain HE300, which carried a chromosomal virulence gene cluster (iroBCDEN) encoding the salmochelin siderophore system, was shown to be part of a transmissible plasmid of Salmonella enterica. Taken together, the results of this project pointed towards the assumptions that first, (i) homologous recombination, even within coding genes, contributes to the observed mosaicism of ExPEC genomes and secondly, (ii) besides en block transfer of large DNA regions (e.g. chromosomal PAIs) also rearrangements of small DNA modules provide a means of genomic plasticity. The data presented in this project supplemented previous whole genome sequencing projects of E. coli and indicated that each E. coli genome displays a unique assemblage of individual mosaic structures, which enable these strains to successfully colonize and infect different anatomical sites.