23 results for primer DNA

in Helda - Digital Repository of University of Helsinki


Relevance:

20.00% 20.00%

Publisher:

Abstract:

Prostate cancer is the most common noncutaneous malignancy and the second leading cause of cancer mortality in men. In 2004, 5237 new cases were diagnosed and altogether 25 664 men suffered from prostate cancer in Finland (Suomen Syöpärekisteri). Although extensively investigated, we still have a very rudimentary understanding of the molecular mechanisms leading to the frequent transformation of the prostate epithelium. Prostate cancer is characterized by several unique features including the multifocal origin of tumors and extreme resistance to chemotherapy, and new treatment options are therefore urgently needed. The integrity of genomic DNA is constantly challenged by genotoxic insults. Cellular responses to DNA damage involve elegant checkpoint cascades enforcing cell cycle arrest, thus facilitating damage repair, apoptosis or cellular senescence. Cellular DNA damage triggers the activation of tumor suppressor protein p53 and Wee1 kinase which act as executors of the cellular checkpoint responses. These are essential for genomic integrity, and are activated in early stages of tumorigenesis in order to function as barriers against tumor formation. Our work establishes that the primary human prostatic epithelial cells and prostatic epithelium have unexpectedly indulgent checkpoint surveillance. This is evidenced by the absence of inhibitory Tyr15 phosphorylation on Cdk2, lack of p53 response, radioresistant DNA synthesis, lack of G1/S and G2/M phase arrest, and presence of persistent gammaH2AX damage foci. We ascribe the absence of inhibitory Tyr15 phosphorylation to low levels of Wee1A, a tyrosine kinase and negative regulator of cell cycle progression. Ectopic Wee1A kinase restored Cdk2-Tyr15 phosphorylation and efficiently rescued the ionizing radiation-induced checkpoints in the human prostatic epithelial cells. 
As variability in the DNA damage responses has been shown to underlie susceptibility to cancer, our results imply that a suboptimal checkpoint arrest may greatly increase the accumulation of genetic lesions in the prostate epithelia. We also show that small molecules can restore p53 function in prostatic epithelial cells and may serve as a paradigm for the development of future therapeutic agents for the treatment of prostate cancer. We hypothesize that the prostate has evolved to activate the damage surveillance pathways, and the molecules involved in these pathways, only in response to certain stresses in extreme circumstances. In doing so, this organ inadvertently made itself vulnerable to genotoxic stress, which may have implications in malignant transformation. Recognition of the limited activity of p53 and Wee1 in the prostate could drive mechanism-based discovery of preventative and therapeutic agents.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

In this thesis, two separate single nucleotide polymorphism (SNP) genotyping techniques were set up at the Finnish Genome Center, pooled genotyping was evaluated as a screening method for large-scale association studies, and finally, these approaches were used to identify genetic factors predisposing to two distinct complex diseases by utilizing large epidemiological cohorts and also taking environmental factors into account. The first genotyping platform was based on traditional but improved restriction-fragment-length-polymorphism (RFLP) analysis utilizing 384-well microtiter plates, multiplexing, small reaction volumes (5 µl), and automated genotype calling. We participated in the development of the second genotyping method, based on single nucleotide primer extension (SNuPe™ by Amersham Biosciences), by carrying out the alpha and beta tests for the chemistry and the allele-calling software. Both techniques proved to be accurate, reliable, and suitable for projects with thousands of samples and tens of markers. Pooled genotyping (genotyping of pooled instead of individual DNA samples) was evaluated with Sequenom's MassArray MALDI-TOF, in addition to the SNuPe™ and PCR-RFLP techniques. We used MassArray mainly as a point of comparison, because it is known to be well suited for pooled genotyping. All three methods were shown to be accurate, the standard deviations between measurements being 0.017 for the MassArray, 0.022 for the PCR-RFLP, and 0.026 for the SNuPe™. The largest source of error in the process of pooled genotyping was shown to be the volumetric error, i.e., the preparation of pools. We also demonstrated that it would have been possible to narrow down the genetic locus underlying congenital chloride diarrhea (CLD), an autosomal recessive disorder, by using the pooling technique instead of genotyping individual samples. 
Although the approach seems to be well suited for traditional case-control studies, it is difficult to apply if any kind of stratification based on environmental factors is needed. Therefore we chose to continue with individual genotyping in the following association studies. Samples in the two separate large epidemiological cohorts were genotyped with the PCR-RFLP and SNuPe™ techniques. The first of these association studies concerned various pregnancy complications among 100,000 consecutive pregnancies in Finland, of which we genotyped 2292 patients and controls, in addition to a population sample of 644 blood donors, with 7 polymorphisms in the potentially thrombotic genes. In this thesis, the analysis of a sub-study of pregnancy-related venous thromboses was included. We showed that the factor V Leiden polymorphism, but not the other tested polymorphisms, had a fairly large impact on pregnancy-related venous thrombosis (odds ratio 11.6; 95% CI 3.6-33.6), and that this impact increased multiplicatively when combined with other risk factors such as obesity or advanced age. Owing to our study design, we were also able to estimate the risks at the population level. The second epidemiological cohort was the Helsinki Birth Cohort of men and women who were born during 1924-1933 in Helsinki. The aim was to identify genetic factors that might modify the well-known link between small birth size and adult metabolic diseases, such as type 2 diabetes and impaired glucose tolerance. Among ~500 individuals with detailed birth measurements and current metabolic profile, we found that an insertion/deletion polymorphism of the angiotensin converting enzyme (ACE) gene was associated with the duration of gestation, and weight and length at birth. Interestingly, the ACE insertion allele was also associated with higher indices of insulin secretion (p=0.0004) in adult life, but only among individuals who were born small (those among the lowest third of birth weight). 
Likewise, low birth weight was associated with higher indices of insulin secretion (p=0.003), but only among carriers of the ACE insertion allele. An association with birth measurements was also found for a common haplotype of the glucocorticoid receptor (GR) gene. Furthermore, the association between short length at birth and adult impaired glucose tolerance was confined to carriers of this haplotype (p=0.007). These associations exemplify the interaction between environmental factors and genotype, which, possibly due to altered gene expression, predisposes to complex metabolic diseases. Indeed, we showed that the common GR gene haplotype was associated with reduced mRNA expression in the thymus of three individuals (p=0.0002).
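The odds ratio and confidence interval quoted for factor V Leiden are the kind of figures that come out of a 2x2 case-control table. The following is a minimal sketch of one common way to compute them (the log-based Woolf interval); the abstract does not state which method the thesis used, and the counts in the usage line are purely hypothetical, not data from the study.

```python
import math

def odds_ratio_ci(exposed_cases, unexposed_cases,
                  exposed_controls, unexposed_controls, z=1.96):
    """Odds ratio with an approximate 95% CI (Woolf's log method).

    All four cell counts must be positive. z=1.96 corresponds to a
    two-sided 95% interval under a normal approximation.
    """
    a, b = exposed_cases, unexposed_cases
    c, d = exposed_controls, unexposed_controls
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical counts for illustration only (not from the thesis)
or_, lo, hi = odds_ratio_ci(10, 20, 5, 40)
print(f"OR = {or_:.1f}, 95% CI {lo:.1f}-{hi:.1f}")
```

With small cell counts, as is typical for rare carriers, the interval becomes wide, which is consistent with the broad interval reported in the abstract.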

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Megasphaera cerevisiae, Pectinatus cerevisiiphilus, Pectinatus frisingensis, Selenomonas lacticifex, Zymophilus paucivorans and Zymophilus raffinosivorans are strictly anaerobic Gram-stain-negative bacteria that are able to spoil beer by producing off-flavours and turbidity. They have only been isolated from the beer production chain. The species are phylogenetically affiliated to the Sporomusa sub-branch in the class "Clostridia". Routine cultivation methods for detection of strictly anaerobic bacteria in breweries are time-consuming and do not allow species identification. The main aim of this study was to utilise DNA-based techniques in order to improve detection and identification of the Sporomusa sub-branch beer-spoilage bacteria and to increase understanding of their biodiversity, evolution and natural sources. Practical PCR-based assays were developed for monitoring of M. cerevisiae, Pectinatus species and the group of Sporomusa sub-branch beer spoilers throughout the beer production process. The developed assays reliably differentiated the target bacteria from other brewery-related microbes. The contaminant detection in process samples (10-1,000 cfu/ml) could be accomplished in 2-8 h. Low levels of viable cells in finished beer (≤10 cfu/100 ml) were usually detected after 1-3 d of culture enrichment. The time saving compared to cultivation methods was up to 6 d. Based on a polyphasic approach, this study revealed the existence of three new anaerobic spoilage species in the beer production chain, i.e. Megasphaera paucivorans, Megasphaera sueciensis and Pectinatus haikarae. The description of these species enabled establishment of phenotypic and DNA-based methods for their detection and identification. The 16S rRNA gene based phylogenetic analysis of the Sporomusa sub-branch showed that the genus Selenomonas originates from several ancestors and will require reclassification. 
Moreover, Z. paucivorans and Z. raffinosivorans were found to be in fact members of the genus Propionispira. This relationship implies that they were carried to breweries along with plant material. The brewery-related Megasphaera species formed a distinct sub-group that did not include any sequences from other sources, suggesting that M. cerevisiae, M. paucivorans and M. sueciensis may be uniquely adapted to the brewery ecosystem. M. cerevisiae was also shown to exhibit remarkable resistance against many brewery-related stress conditions. This may partly explain why it is a brewery contaminant. This study showed that DNA-based techniques provide useful tools for obtaining more rapid and specific information about the presence and identity of the strictly anaerobic spoilage bacteria in the beer production chain than is possible using cultivation methods. This should ensure financial benefits to the industry and better product quality to customers. In addition, DNA-based analyses provided new insight into the biodiversity as well as natural sources and relations of the Sporomusa sub-branch bacteria. The data can be exploited for taxonomic classification of these bacteria and for surveillance and control of contaminations.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This thesis consists of two parts. In the first part we performed single-molecule force-extension measurements with 10 kb long DNA molecules from phage λ to validate the calibration and single-molecule capability of our optical tweezers instrument. Fitting the worm-like chain interpolation formula to the data revealed that ca. 71% of the DNA tethers featured a contour length within ±15% of the expected value (3.38 µm). Only 25% of the DNA tethers had a persistence length between 30 and 60 nm, whereas the correct value should be within 40 to 60 nm. In the second part we designed and built a precise temperature controller to remove thermal fluctuations that cause drifting of the optical trap. The controller uses feed-forward and PID (proportional-integral-derivative) feedback to achieve 1.58 mK precision and 0.3 K absolute accuracy. During a 5 min test run it reduced drifting of the trap from 1.4 nm/min in open-loop to 0.6 nm/min in closed-loop.
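The worm-like chain interpolation formula mentioned above can be sketched as follows, assuming the abstract refers to the commonly used Marko-Siggia form, F = (k_B T / L_p) [1 / (4 (1 - x/L)^2) - 1/4 + x/L]. The function name, the default temperature of 298 K, and the rise of 0.338 nm per base pair (chosen because it reproduces the quoted 3.38 µm for 10 kb) are assumptions for illustration, not details taken from the thesis.

```python
K_B = 1.380649e-23  # Boltzmann constant in J/K (exact SI value)

def wlc_force(extension_m, contour_length_m, persistence_length_m,
              temperature_k=298.0):
    """Force (N) from the Marko-Siggia worm-like chain interpolation.

    extension_m must satisfy 0 <= extension < contour length; the
    formula diverges as the extension approaches the contour length.
    """
    x = extension_m / contour_length_m
    if not 0.0 <= x < 1.0:
        raise ValueError("extension must be in [0, contour length)")
    thermal_energy = K_B * temperature_k
    return (thermal_energy / persistence_length_m) * (
        1.0 / (4.0 * (1.0 - x) ** 2) - 0.25 + x
    )

# Expected contour length of a 10 kb tether, assuming 0.338 nm per bp
contour_length_m = 10_000 * 0.338e-9

# Force at half extension for an assumed persistence length of 50 nm
force_n = wlc_force(0.5 * contour_length_m, contour_length_m, 50e-9)
print(f"{force_n * 1e12:.3f} pN")
```

In a fit, the contour length and persistence length would be the free parameters, which is how the percentages of tethers with acceptable values could be obtained.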

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This thesis presents methods for locating and analyzing cis-regulatory DNA elements involved with the regulation of gene expression in multicellular organisms. The regulation of gene expression is carried out by the combined effort of several transcription factor proteins collectively binding the DNA on the cis-regulatory elements. Only sparse knowledge of the 'genetic code' of these elements exists today. An automatic tool for discovery of putative cis-regulatory elements could help their experimental analysis, which would result in a more detailed view of the cis-regulatory element structure and function. We have developed a computational model for the evolutionary conservation of cis-regulatory elements. The elements are modeled as evolutionarily conserved clusters of sequence-specific transcription factor binding sites. We give an efficient dynamic programming algorithm that locates the putative cis-regulatory elements and scores them according to the conservation model. A notable proportion of the high-scoring DNA sequences show transcriptional enhancer activity in transgenic mouse embryos. The conservation model includes four parameters whose optimal values are estimated with simulated annealing. With good parameter values the model discriminates well between the DNA sequences with evolutionarily conserved cis-regulatory elements and the DNA sequences that have evolved neutrally. In further inquiry, the set of highest scoring putative cis-regulatory elements were found to be sensitive to small variations in the parameter values. The statistical significance of the putative cis-regulatory elements is estimated with the Two Component Extreme Value Distribution. The p-values grade the conservation of the cis-regulatory elements above the neutral expectation. The parameter values for the distribution are estimated by simulating the neutral DNA evolution. 
The conservation of the transcription factor binding sites can be used in the upstream analysis of regulatory interactions. This approach may provide mechanistic insight into transcription-level data from, e.g., microarray experiments. Here we give a method to predict shared transcriptional regulators for a set of co-expressed genes. The EEL (Enhancer Element Locator) software implements the method for locating putative cis-regulatory elements. The software facilitates both interactive use and distributed batch processing. We have used it to analyze the non-coding regions around all human genes with respect to the orthologous regions in various other species including mouse. The data from these genome-wide analyses are stored in a relational database which is used in the publicly available web services for upstream analysis and visualization of the putative cis-regulatory elements in the human genome.
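To make the idea of scoring conserved clusters of binding sites with dynamic programming more concrete, here is a toy sketch. It is not the EEL algorithm or its scoring model: the `Site` type, the two penalty parameters `lam` and `mu`, and the scoring rule are hypothetical simplifications (the abstract mentions four parameters without naming them). The sketch chains matching binding sites that appear in the same order in two orthologous sequences and penalizes both the distance between consecutive sites and the difference in that distance between species.

```python
from typing import List, NamedTuple

class Site(NamedTuple):
    pos: int      # position of the binding site in the sequence (bp)
    tf: str       # transcription factor predicted to bind here
    score: float  # strength of the binding-site match

def best_conserved_cluster(a: List[Site], b: List[Site],
                           lam: float = 0.01, mu: float = 0.05) -> float:
    """Best local chain score of same-TF site pairs between two species.

    a and b must be sorted by position. table[i][j] holds the best
    score of a chain that ends with site a[i] paired with site b[j].
    """
    table = [[0.0] * len(b) for _ in a]
    best = 0.0
    for i, site_a in enumerate(a):
        for j, site_b in enumerate(b):
            if site_a.tf != site_b.tf:
                continue  # only sites of the same factor can be paired
            extend = 0.0  # 0.0 means "start a new chain here"
            for k in range(i):
                for l in range(j):
                    if table[k][l] <= 0.0:
                        continue
                    gap_a = site_a.pos - a[k].pos
                    gap_b = site_b.pos - b[l].pos
                    penalty = (lam * (gap_a + gap_b) / 2
                               + mu * abs(gap_a - gap_b))
                    extend = max(extend, table[k][l] - penalty)
            table[i][j] = site_a.score + site_b.score + extend
            best = max(best, table[i][j])
    return best

human = [Site(0, "X", 1.0), Site(10, "Y", 1.0)]
mouse = [Site(0, "X", 1.0), Site(10, "Y", 1.0)]
print(best_conserved_cluster(human, mouse))
```

This naive version takes time proportional to the square of the product of the two site counts; it is meant only to show the recurrence, not to be efficient.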

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The object of this study is the tailless, internal membrane-containing bacteriophage PRD1. It has a dsDNA genome with covalently bound terminal proteins required for replication. The uniqueness of the structure makes this phage a desirable object of research. PRD1 has been studied for some 30 years, during which time a lot of information has accumulated on its structure and life-cycle. The two least characterised steps of the PRD1 life-cycle, genome packaging and virus release, are investigated here. PRD1 shares the main principles of virion assembly (DNA packaging in particular) and host cell lysis with other dsDNA bacteriophages. However, this phage has some fascinating individual peculiarities, such as DNA packaging into a membrane vesicle inside the capsid, absence of apparent portal protein, holin inhibitor and procapsid expansion. In the course of this study we have identified the components of the DNA packaging vertex of the capsid, and determined the function of protein P6 in packaging. We managed to purify the procapsids for an in vitro packaging system, optimise the reaction and significantly increase its efficiency. We developed a new method to determine DNA translocation and were able to quantify the efficiency and the rate of packaging. A model for PRD1 DNA packaging was also proposed. Another part of this study covers the lysis of the host cell. Like other dsDNA bacteriophages, PRD1 has been proposed to utilise a two-component lysis system. The existence of this lysis system in PRD1 has been proven by experiments using recombinant proteins, and the multi-step nature of the lysis process has been established.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Methanogens are microbes of the domain Archaea that live in anoxic conditions and whose unique metabolism produces methane. In the atmosphere, methane is a potent greenhouse gas. Wetlands are one of the largest natural sources of methane. Methane emissions from northern mires vary strongly between mires and even within a single mire, depending on, among other things, season, mire type and vegetation. This doctoral thesis studied the microbiological background of the variation in methane emissions. The study examined the effects of mire type, season, ash fertilization and peat depth on methanogen communities and methane production in three Finnish mires. In addition, non-methanogenic archaea and bacteria were studied, because they form the substrates for methane production as part of anaerobic decomposition. The microbial communities were analysed with DNA- and RNA-based methods relying on the polymerase chain reaction (PCR). The marker genes used were the mcrA gene, which is associated with methane production, and the ribosomal 16S RNA gene of archaea and bacteria. Methanogen communities and methane production differed considerably between the acidic, nutrient-poor Sphagnum bog and the more nutrient-rich sedge fens. The bog harboured almost exclusively methanogens of the order Methanomicrobiales, which produce methane from hydrogen and carbon dioxide. The methanogen communities of the fens were more diverse and also included acetate-utilizing methanogens. Season had a significant effect on methane production. In winter, an unexpectedly high methane production potential was observed, together with indications of active methanogens. The composition of the archaeal community, by contrast, varied only slightly. Ash fertilization, which is intended to promote tree growth on drained mires, did not significantly affect methane production or the methane producers. The communities of the drained mire did, however, change with peat depth. 
A comparison of different PCR methods showed that three primer pairs targeting the mcrA gene detected largely the same methanogens in the drained mire, but the relative abundances of the species depended on the primers used. The bacteria detected in the mires belonged to the Deltaproteobacteria, Acidobacteria and Verrucomicrobia. In addition, non-methanogenic archaea belonging to groups 1.1c and 1.3 of the phylum Crenarchaeota were found. The results on the occurrence of these groups in anoxic peat provide a starting point for investigating their possible interactions with methanogens. The results of the study showed that the composition of the methanogen community reflects chemical or vegetational variation that affects methane production, such as mire type. Better knowledge of mire methanogens and their physiology can help predict the effect of environmental change on methane emissions from mires.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

For most RNA viruses, RNA-dependent RNA polymerases (RdRPs) encoded by the virus are responsible for the entire RNA metabolism. Thus, RdRPs are critical components in the viral life cycle. However, it is not fully understood how these important enzymes function during viral replication. Double-stranded RNA (dsRNA) viruses perform the synthesis of their RNA genome within a proteinaceous viral particle containing an RdRP as a minor constituent. The phi6 bacteriophage is the best-studied dsRNA virus, providing an excellent background for studies of its RNA synthesis. The purified recombinant phi6 RdRP is highly active in vitro and it possesses both RNA replication and transcription activities. The crystal structure of the phi6 polymerase, solved in complex with a number of ligands, provides a working model for detailed in vitro studies of RNA-dependent RNA polymerization. In this thesis, the primer-independent initiation of the phi6 RdRP was studied in vitro using biochemical and structural methods. A C-terminal, four-amino-acid-long loop protruding into the central cavity of the phi6 RdRP has been suggested to stabilize the incoming nucleotides during initiation complex formation through stacking interactions. A similar structural element has been found in several other viral RdRPs. In this thesis, this so-called initiation platform loop was subjected to site-directed mutagenesis to address its role in the initiation. It was found that the initiation mode of the mutants is primer-dependent, requiring either an oligonucleotide primer or a back-priming initiation mechanism for the RNA synthesis. The crystal structure of a mutant RdRP with an altered initiation platform revealed a set of contacts important for primer-independent initiation. Since phi6 RdRP is structurally and functionally homologous to several viral RdRPs, among them the hepatitis C virus RdRP, these results provide further general insight into primer-independent initiation. 
In this study it is demonstrated that manganese phasing could be used as a practical tool for solving structures of large proteins with a bound manganese ion. The phi6 RdRP was used as a case study to obtain phases for crystallographic analysis. Manganese ions are naturally bound to the phi6 RdRP at the palm domain of the enzyme. In a crystallographic experiment, X-ray diffraction data from a phi6 RdRP crystal were collected at a wavelength of 1.89 Å, which is the K edge of manganese. With this data an automatically built model of the core region of the protein could be obtained. Finally, in this work terminal nucleotidyl transferase (TNTase) activity of the phi6 RdRP was documented in the isolated polymerase as well as in the viral particle. This is the first time that such an activity has been reported in a polymerase of a dsRNA virus. The phi6 RdRP used uridine triphosphates as the sole substrate in a TNTase reaction but could accept several heterologous templates. The RdRP was able to add one or a few non-templated nucleotides to the 3' end of the single- or double-stranded RNA substrate. Based on the results on particle-mediated TNTase activity and previous structural information of the polymerase, a model for termination of the RNA-dependent RNA synthesis is suggested in this thesis.
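The relation between the quoted X-ray wavelength and the photon energy can be checked with a short conversion, E = hc / λ. The sketch below uses the standard value hc ≈ 12.398 keV·Å; the function name is illustrative. For comparison, the tabulated manganese K absorption edge lies at roughly 6.54 keV, so 1.89 Å corresponds to an energy just above the edge.

```python
HC_KEV_ANGSTROM = 12.398419843  # Planck constant times speed of light, keV * angstrom

def photon_energy_kev(wavelength_angstrom: float) -> float:
    """Photon energy in keV for a given X-ray wavelength in angstroms."""
    if wavelength_angstrom <= 0:
        raise ValueError("wavelength must be positive")
    return HC_KEV_ANGSTROM / wavelength_angstrom

energy = photon_energy_kev(1.89)
print(f"{energy:.2f} keV")
```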

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Extraintestinal pathogenic Escherichia coli (ExPEC) represent a diverse group of strains of E. coli, which infect extraintestinal sites, such as the urinary tract, the bloodstream, the meninges, the peritoneal cavity, and the lungs. Urinary tract infections (UTIs) caused by uropathogenic E. coli (UPEC), the major subgroup of ExPEC, are among the most prevalent microbial diseases worldwide and a substantial burden for public health care systems. UTIs are responsible for serious morbidity and mortality in the elderly, in young children, and in immune-compromised and hospitalized patients. ExPEC strains are different, both from genetic and clinical perspectives, from commensal E. coli strains belonging to the normal intestinal flora and from intestinal pathogenic E. coli strains causing diarrhea. ExPEC strains are characterized by a broad range of alternate virulence factors, such as adhesins, toxins, and iron accumulation systems. Unlike diarrheagenic E. coli, whose distinctive virulence determinants evoke characteristic diarrheagenic symptoms and signs, ExPEC strains are exceedingly heterogeneous and are not known to possess any specific virulence factor or set of factors that is obligatory for the infection of a certain extraintestinal site (e.g. the urinary tract). The ExPEC genomes are highly diverse mosaic structures in permanent flux. These strains have obtained a significant amount of DNA (predictably up to 25% of the genomes) through acquisition of foreign DNA from diverse related or non-related donor species by lateral transfer of mobile genetic elements, including pathogenicity islands (PAIs), plasmids, phages, transposons, and insertion elements. The ability of ExPEC strains to cause disease is mainly derived from this horizontally acquired gene pool; the exogenous DNA facilitates rapid adaptation of the pathogen to changing conditions and hence broadens the spectrum of sites that can be infected. 
However, neither the amount of unique DNA in different ExPEC strains (or UPEC strains) nor the mechanisms behind the observed genomic mobility are known. Due to this extreme heterogeneity of the UPEC and ExPEC populations in general, the routine surveillance of ExPEC is exceedingly difficult. In this project, we presented a novel virulence gene algorithm (VGA) for the estimation of the extraintestinal virulence potential (VP, pathogenicity risk) of clinically relevant ExPECs and fecal E. coli isolates. The VGA was based on a DNA microarray specific for the ExPEC phenotype (ExPEC pathoarray). This array contained 77 DNA probes homologous with known (e.g. adhesion factors, iron accumulation systems, and toxins) and putative (e.g. genes predictably involved in adhesion, iron uptake, or in metabolic functions) ExPEC virulence determinants. In total, 25 of the DNA probes homologous with known virulence factors and 36 of the DNA probes representing putative extraintestinal virulence determinants were found at significantly higher frequency in virulent ExPEC isolates than in commensal E. coli strains. We showed that the ExPEC pathoarray and the VGA could be readily used for the differentiation of highly virulent ExPECs both from less virulent ExPEC clones and from commensal E. coli strains. When the VGA was applied to a group of unknown ExPECs (n=53) and fecal E. coli isolates (n=37), 83% of strains were correctly identified as extraintestinal virulent or commensal E. coli. Conversely, 15% of clinical ExPECs and 19% of fecal E. coli strains failed to fall into their respective pathogenic and non-pathogenic groups. Clinical data and virulence gene profiles of these strains supported the estimated VPs; UPEC strains with atypically low risk-ratios were largely isolated from patients with a certain medical history, including diabetes mellitus or catheterization, or from elderly patients. 
In addition, fecal E. coli strains with VPs characteristic of ExPEC were shown to represent the diagnostically important fraction of resident strains of the gut flora with a high potential of causing extraintestinal infections. Interestingly, a large fraction of the DNA probes associated with the ExPEC phenotype corresponded to novel DNA sequences without any known function in UTIs and thus represented new genetic markers for extraintestinal virulence. These DNA probes included unknown DNA sequences originating from the genomic subtractions of four clinical ExPEC isolates as well as from five novel cosmid sequences identified in the UPEC strains HE300 and JS299. The characterized cosmid sequences (pJS332, pJS448, pJS666, pJS700, and pJS706) revealed complex modular DNA structures with known and unknown DNA fragments arranged in a puzzle-like manner and integrated into the common E. coli genomic backbone. Furthermore, cosmid pJS332 of the UPEC strain HE300, which carried a chromosomal virulence gene cluster (iroBCDEN) encoding the salmochelin siderophore system, was shown to be part of a transmissible plasmid of Salmonella enterica. Taken together, the results of this project pointed towards the assumptions that (i) homologous recombination, even within coding genes, contributes to the observed mosaicism of ExPEC genomes and (ii) besides en bloc transfer of large DNA regions (e.g. chromosomal PAIs), rearrangements of small DNA modules also provide a means of genomic plasticity. The data presented in this project supplemented previous whole genome sequencing projects of E. coli and indicated that each E. coli genome displays a unique assemblage of individual mosaic structures, which enable these strains to successfully colonize and infect different anatomical sites.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Microbial degradation pathways play a key role in the detoxification and the mineralization of polyaromatic hydrocarbons (PAHs), which are widespread pollutants in soil and constituents of petroleum hydrocarbons. In microbiology, the aromatic degradation pathways are traditionally studied in single bacterial strains with the capacity to degrade a certain pollutant. In soil, the degradation of aromatics is performed by a diverse community of micro-organisms. The aim of this thesis was to study biodegradation on different levels, starting from a versatile aromatic degrader Sphingobium sp. HV3 and its megaplasmid, extending to revealing the diversity of key catabolic enzymes in the environment and finally studying birch rhizoremediation in PAH-polluted soil. To understand biodegradation of aromatics on the bacterial species level, the aromatic degradation capacity of Sphingobium sp. HV3 and the role of the plasmid pSKY4 were studied. Toluene, m-xylene, biphenyl, fluorene and phenanthrene were identified as carbon and energy sources of the HV3 strain. Tn5 transposon mutagenesis linked the degradation capacity of toluene, m-xylene, biphenyl and naphthalene to the pSKY4 plasmid, and qPCR expression analysis showed that plasmid extradiol dioxygenase genes (bphC and xylE) are induced by phenanthrene, m-xylene and biphenyl, whereas the 2,4-dichlorophenoxyacetic acid herbicide induced the chlorocatechol 1,2-dioxygenase gene (tfdC) from the ortho-pathway. A method to study upper meta-pathway extradiol dioxygenase gene diversity in soil was developed. The extradiol dioxygenases catalyse cleavage of the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon (meta-cleavage). A high diversity of extradiol dioxygenases was detected in polluted soils. The detected extradiol dioxygenases showed sequence similarity to known catabolic genes of Alpha-, Beta-, and Gammaproteobacteria. 
Five groups of extradiol dioxygenases contained sequences with no close homologues in the database, representing novel genes. In a rhizoremediation experiment with birch (Betula pendula), treatment-specific changes in extradiol dioxygenase communities were shown. PAH pollution changed the bulk soil extradiol dioxygenase community structure, and the birch rhizosphere contained a more diverse extradiol dioxygenase community than the bulk soil, showing a rhizosphere effect. The degradation of pyrene in soil was enhanced with birch seedlings compared to soil without birch. The complete 280,923 bp nucleotide sequence of the pSKY4 plasmid was determined. The open reading frames (ORFs) of pSKY4 were divided into putative conjugative transfer, aromatic degradation, replication/maintenance and transposition/integration function-encoding proteins. Aromatic degradation ORFs shared high similarity to corresponding genes in pNL1, a plasmid from the deep subsurface strain Novosphingobium aromaticivorans F199. The plasmid backbones were considerably more divergent, with lower similarity, which suggests that the aromatic pathway has functioned as a plasmid-independent mobile genetic element. The functional diversity of microbial communities in soil is still largely unknown. Several novel clusters of extradiol dioxygenases, representing catabolic bacteria whose function, biodegradation pathways and phylogenetic position are not known, were amplified with a single primer pair from polluted soils. These extradiol dioxygenase communities were shown to change upon PAH pollution, which indicates that their hosts function in PAH biodegradation in soil. Although the degradation pathways of specific bacterial species are substantially better described than pathways in situ, the evolution of degradation pathways for xenobiotic compounds is largely unknown. The pSKY4 plasmid contains aromatic degradation genes in a putative mobile genetic element, causing flexibility/instability in the pathway. 
The localisation of the aromatic biodegradation pathway in mobile genetic elements suggests that gene transfer and rearrangements are a competitive advantage for Sphingomonas bacteria in the environment.