29 resultados para DNA Sequencing
em Helda - Digital Repository of University of Helsinki
Resumo:
Microarrays are high throughput biological assays that allow the screening of thousands of genes for their expression. The main idea behind microarrays is to compute for each gene a unique signal that is directly proportional to the quantity of mRNA that was hybridized on the chip. A large number of steps and errors associated with each step make the generated expression signal noisy. As a result, microarray data need to be carefully pre-processed before their analysis can be assumed to lead to reliable and biologically relevant conclusions. This thesis focuses on developing methods for improving gene signal and further utilizing this improved signal for higher level analysis. To achieve this, first, approaches for designing microarray experiments using various optimality criteria, considering both biological and technical replicates, are described. A carefully designed experiment leads to signal with low noise, as the effect of unwanted variations is minimized and the precision of the estimates of the parameters of interest are maximized. Second, a system for improving the gene signal by using three scans at varying scanner sensitivities is developed. A novel Bayesian latent intensity model is then applied on these three sets of expression values, corresponding to the three scans, to estimate the suitably calibrated true signal of genes. Third, a novel image segmentation approach that segregates the fluorescent signal from the undesired noise is developed using an additional dye, SYBR green RNA II. This technique helped in identifying signal only with respect to the hybridized DNA, and signal corresponding to dust, scratch, spilling of dye, and other noises, are avoided. Fourth, an integrated statistical model is developed, where signal correction, systematic array effects, dye effects, and differential expression, are modelled jointly as opposed to a sequential application of several methods of analysis. The methods described in here have been tested only for cDNA microarrays, but can also, with some modifications, be applied to other high-throughput technologies. Keywords: High-throughput technology, microarray, cDNA, multiple scans, Bayesian hierarchical models, image analysis, experimental design, MCMC, WinBUGS.
Resumo:
Archaea were long thought to be a group of ancient bacteria, which mainly lived in extreme environments. Due to the development of DNA sequencing methods and molecular phylogenetic analyses, it was shown that the living organisms are in fact divided into three domains; the Archaea, Bacteria and the Eucarya. Since the beginning of the previous decade, it was shown that archaea generally inhabit moderate environments and that these non-extremophilic archaea are more ubiquitous than the extremophiles. Group 1 of non-extreme archaea affiliate with the phylum Crenarchaeota. The most commonly found soil archaea belong to the subgroup 1.1b. However, the Crenarchaeota found in the Fennoscandian boreal forest soil belong to the subgroup 1.1c. The organic top layer of the boreal forest soil, the humus, is dominated by ectomycorrhizal fungal hyphae. These colonise virtually all tree fine root tips in the humus layer and have been shown to harbour distinct bacterial populations different from those in the humus. The archaea have also been shown to colonise both boreal forest humus and the rhizospheres of plants. In this work, studies on the archaeal communities in the ectomycorrhizospheres of boreal forest trees were conducted in microcosms. Archaea belonging to the group 1.1c Crenarchaeota and Euryarchaeota of the genera Halobacterium and Methanolobus were detected. The archaea generally colonised fungal habitats, such as ectomycorrhizas and external mycelia, rather than the non-mycorrhizal fine roots of trees. The species of ectomycorrhizal fungus had a great impact on the archaeal community composition. A stable euryarchaeotal community was detected especially in the mycorrhizas, of most of the tested Scots pine colonising ectomycorrhizal fungi. The Crenarchaeota appeared more sporadically in these habitats, but had a greater diversity than the Euryarchaeota. P. involutus mycorrhizas had a higher diversity of 1.1c Crenarchaeota than the other ectomycorrhizal fungi. The detection level of archaea in the roots of boreal trees was generally low although archaea have been shown to associate with roots of different plants. However, alder showed a high diversity of 1.1c Crenarchaeota, exceeding that of any of the tested mycorrhizas. The archaeal 16S rRNA genes detected from the non-mycorrhizal roots were different from those of the P. involutus mycorrhizas. In the phylogenetic analyses, the archaeal 16S rRNA gene sequences obtained from non-mycorrhizal fine roots fell in a separate cluster within the group 1.1c Crenarchaeota than those from the mycorrhizas. When the roots of the differrent tree species were colonised by P. involutus, the diversity and frequency of the archaeal populations of the different tree species were more similar to each other. Both Cren- and Euryarchaeota were enriched in cultures to which C-1 substrates were added. The 1.1c Crenarchaeota grew anaerobically in mineral medium with CH4 and CO2 as the only available C sources, and in yeast extract media with CO2 and CH4 or H2. The crenarchaeotal diversity was higher in aerobic cultures on mineral medium with CH4 or CH3OH than in the anaerobic cultures. Ecological functions of the mycorrhizal 1.1c Crenarchaeota in both anaerobic and aerobic cycling of C-1 compounds were indicated. The phylogenetic analyses did not divide the detected Crenarchaeota into anaerobic and aerobic groups. This may suggest that the mycorrhizospheric crenarchaeotal communities consist of closely related groups of anaerobic and aerobic 1.1c Crenarchaeota, or the 1.1c Crenarchaeota may be facultatively anaerobic. Halobacteria were enriched in non-saline anaerobic yeast extract medium cultures in which CH4 was either added or produced, but were not detected in the aerobic cultures. They may potentially be involved in anaerobic CH4 cycling in ectomycorrhizas. The CH4 production of the mycorrhizal samples was over 10 times higher than for humus devoid of mycorrhizal hyphae, indicating a high CH4 production potential of the mycorrhizal metanogenic community. Autofluorescent methanogenic archaea were detected by microscopy and 16S rRNA gene sequences of the genus Methanolobus were obtained. The archaeal community depended on both tree species and the type of ectomycorrhizal fungus colonising the roots and the Cren- and Euryarchaeota may have different ecological functions in the different parts of the boreal forest tree rhizosphere and mycorrhizosphere. By employing the results of this study, it may be possible to isolate both 1.1c Crenarchaeota as well as non-halophilic halobacteria and aerotolerant methanogens from mycorrhizospheres. These archaea may be used as indicators for change in the boreal forest soil ecosystem due to different factors, such as exploitations of forests and the rise in global temperature. More information about the microbial populations with apparently low cell numbers but significant ecological impacts, such as the boreal forest soil methanogens, may be of crucial importance to counteract human impacts on such globally important ecosystems as the boreal forests.
Resumo:
Fish farming introduces nutrients, microbes and a wide variety of chemicals such as heavy metals, antifoulants and antibiotics to the surrounding environment. Introduction of antibiotics has been linked with the increased incidence of antibiotic resistant pathogenic bacteria in the farm vicinities. In this thesis molecular methods such as quantitative PCR and DNA sequencing were applied to analyze bacterial communities in sediments from fish farms and pristine locations. Altogether four farms and four pristine sites were sampled in the Baltic Sea. Two farm and two pristine locations were sampled over a surveillance period of four years. Furthermore, a new methodology was developed as a part of the study that permits amplifying single microbial genomes and capturing them according to any genetic traits, including antibiotic resistance genes. The study revealed that several resistance genes for tetracycline were found at the sediment underneath the aquaculture farms. The copy number of these genes remained elevated even at a farm that had not used any antibiotics since year 2000, six years before this study started. Similarly, an increase in the amount of mercury resistance gene merA was observed at the aquaculture sediment. The persistence of the resistance genes in absence of any selection pressure from antibiotics or heavy metals suggests that the genes may be introduced to the sediment by the farming process. This is also supported by the diversity pattern of the merA gene between farm and pristine sediments. The bacterial community-level changes in response to fish farming were very complex and no single phylogenetic groups were found that would be typical to fish farm sediments. However, the community structures had some correlation with the exposure to fish farming. Our studies suggest that the established approaches to deal with antibiotic resistance at the aquaculture, such as antibiotic cycling, are fundamentally flawed because they cannot prevent the introduction of the resistance genes and resistant bacteria to the farm area by the farming process. Further studies are required to study the entire fish farming process to identify the sources of the resistance genes and the resistant bacteria. The results also suggest that in order to prevent major microbiological changes in the surrounding aquatic environment, the farms should not be founded in shallow water where currents do not transport sedimenting matter from the farms. Finally, the technique to amplify and select microbial genomes will potentially have a considerable impact in microbial ecology and genomics.
Resumo:
Epidemiological studies have shown an elevation in the incidence of asthma, allergic symptoms and respiratory infections among people living or working in buildings with moisture and mould problems. Microbial growth is suspected to have a key role, since the severity of microbial contamination and symptoms show a positive correlation, while the removal of contaminated materials relieves the symptoms. However, the cause-and-effect relationship has not been well established and knowledge of the causative agents is incomplete. The present consensus of indoor microbes relies on culture-based methods. Microbial cultivation and identification is known to provide qualitatively and quantitatively biased results, which is suspected to be one of the reasons behind the often inconsistent findings between objectively measured microbiological attributes and health. In the present study the indoor microbial communities were assessed using culture-independent, DNA based methods. Fungal and bacterial diversity was determined by amplifying and sequencing the nucITS- and16S-gene regions, correspondingly. In addition, the cell equivalent numbers of 69 mould species or groups were determined by quantitative PCR (qPCR). The results from molecular analyses were compared with results obtained using traditional plate cultivation for fungi. Using DNA-based tools, the indoor microbial diversity was found to be consistently higher and taxonomically wider than viable diversity. The dominant sequence types of fungi, and also of bacteria were mainly affiliated with well-known microbial species. However, in each building they were accompanied by various rare, uncultivable and unknown species. In both moisture-damaged and undamaged buildings the dominant fungal sequence phylotypes were affiliated with the classes Dothideomycetes (mould-like filamentous ascomycetes); Agaricomycetes (mushroom- and polypore-like filamentous basidiomycetes); Urediniomycetes (rust-like basidiomycetes); Tremellomycetes and the family Malasseziales (both yeast-like basidiomycetes). The most probable source for the majority of fungal types was the outdoor environment. In contrast, the dominant bacterial phylotypes in both damaged and undamaged buildings were affiliated with human-associated members within the phyla Actinobacteria and Firmicutes. Indications of elevated fungal diversity within potentially moisture-damage-associated fungal groups were recorded in two of the damaged buildings, while one of the buildings was characterized by an abundance of members of the Penicillium chrysogenum and P. commune species complexes. However, due to the small sample number and strong normal variation firm conclusions concerning the effect of moisture damage on the species diversity could not be made. The fungal communities in dust samples showed seasonal variation, which reflected the seasonal fluctuation of outdoor fungi. Seasonal variation of bacterial communities was less clear but to some extent attributable to the outdoor sources as well. The comparison of methods showed that clone library sequencing was a feasible method for describing the total microbial diversity, indicated a moderate quantitative correlation between sequencing and qPCR results and confirmed that culture based methods give both a qualitative and quantitative underestimate of microbial diversity in the indoor environment. However, certain important indoor fungi such as Penicillium spp. were clearly underrepresented in the sequence material, probably due to their physiological and genetic properties. Species specific qPCR was a more efficient and sensitive method for detecting and quantitating individual species than sequencing, but in order to exploit the full advantage of the method in building investigations more information is needed about the microbial species growing on damaged materials. In the present study, a new method was also developed for enhanced screening of the marker gene clone libraries. The suitability of the screening method to different kinds of microbial environments including biowaste compost material and indoor settled dusts was evaluated. The usability was found to be restricted to environments that support the growth and subsequent dominance of a small number microbial species, such as compost material.
Resumo:
Extraintestinal pathogenic Escherichia coli (ExPEC) represent a diverse group of strains of E. coli, which infect extraintestinal sites, such as the urinary tract, the bloodstream, the meninges, the peritoneal cavity, and the lungs. Urinary tract infections (UTIs) caused by uropathogenic E. coli (UPEC), the major subgroup of ExPEC, are among the most prevalent microbial diseases world wide and a substantial burden for public health care systems. UTIs are responsible for serious morbidity and mortality in the elderly, in young children, and in immune-compromised and hospitalized patients. ExPEC strains are different, both from genetic and clinical perspectives, from commensal E. coli strains belonging to the normal intestinal flora and from intestinal pathogenic E. coli strains causing diarrhea. ExPEC strains are characterized by a broad range of alternate virulence factors, such as adhesins, toxins, and iron accumulation systems. Unlike diarrheagenic E. coli, whose distinctive virulence determinants evoke characteristic diarrheagenic symptoms and signs, ExPEC strains are exceedingly heterogeneous and are known to possess no specific virulence factors or a set of factors, which are obligatory for the infection of a certain extraintestinal site (e. g. the urinary tract). The ExPEC genomes are highly diverse mosaic structures in permanent flux. These strains have obtained a significant amount of DNA (predictably up to 25% of the genomes) through acquisition of foreign DNA from diverse related or non-related donor species by lateral transfer of mobile genetic elements, including pathogenicity islands (PAIs), plasmids, phages, transposons, and insertion elements. The ability of ExPEC strains to cause disease is mainly derived from this horizontally acquired gene pool; the extragenous DNA facilitates rapid adaptation of the pathogen to changing conditions and hence the extent of the spectrum of sites that can be infected. However, neither the amount of unique DNA in different ExPEC strains (or UPEC strains) nor the mechanisms lying behind the observed genomic mobility are known. Due to this extreme heterogeneity of the UPEC and ExPEC populations in general, the routine surveillance of ExPEC is exceedingly difficult. In this project, we presented a novel virulence gene algorithm (VGA) for the estimation of the extraintestinal virulence potential (VP, pathogenicity risk) of clinically relevant ExPECs and fecal E. coli isolates. The VGA was based on a DNA microarray specific for the ExPEC phenotype (ExPEC pathoarray). This array contained 77 DNA probes homologous with known (e.g. adhesion factors, iron accumulation systems, and toxins) and putative (e.g. genes predictably involved in adhesion, iron uptake, or in metabolic functions) ExPEC virulence determinants. In total, 25 of DNA probes homologous with known virulence factors and 36 of DNA probes representing putative extraintestinal virulence determinants were found at significantly higher frequency in virulent ExPEC isolates than in commensal E. coli strains. We showed that the ExPEC pathoarray and the VGA could be readily used for the differentiation of highly virulent ExPECs both from less virulent ExPEC clones and from commensal E. coli strains as well. Implementing the VGA in a group of unknown ExPECs (n=53) and fecal E. coli isolates (n=37), 83% of strains were correctly identified as extraintestinal virulent or commensal E. coli. Conversely, 15% of clinical ExPECs and 19% of fecal E. coli strains failed to raster into their respective pathogenic and non-pathogenic groups. Clinical data and virulence gene profiles of these strains warranted the estimated VPs; UPEC strains with atypically low risk-ratios were largely isolated from patients with certain medical history, including diabetes mellitus or catheterization, or from elderly patients. In addition, fecal E. coli strains with VPs characteristic for ExPEC were shown to represent the diagnostically important fraction of resident strains of the gut flora with a high potential of causing extraintestinal infections. Interestingly, a large fraction of DNA probes associated with the ExPEC phenotype corresponded to novel DNA sequences without any known function in UTIs and thus represented new genetic markers for the extraintestinal virulence. These DNA probes included unknown DNA sequences originating from the genomic subtractions of four clinical ExPEC isolates as well as from five novel cosmid sequences identified in the UPEC strains HE300 and JS299. The characterized cosmid sequences (pJS332, pJS448, pJS666, pJS700, and pJS706) revealed complex modular DNA structures with known and unknown DNA fragments arranged in a puzzle-like manner and integrated into the common E. coli genomic backbone. Furthermore, cosmid pJS332 of the UPEC strain HE300, which carried a chromosomal virulence gene cluster (iroBCDEN) encoding the salmochelin siderophore system, was shown to be part of a transmissible plasmid of Salmonella enterica. Taken together, the results of this project pointed towards the assumptions that first, (i) homologous recombination, even within coding genes, contributes to the observed mosaicism of ExPEC genomes and secondly, (ii) besides en block transfer of large DNA regions (e.g. chromosomal PAIs) also rearrangements of small DNA modules provide a means of genomic plasticity. The data presented in this project supplemented previous whole genome sequencing projects of E. coli and indicated that each E. coli genome displays a unique assemblage of individual mosaic structures, which enable these strains to successfully colonize and infect different anatomical sites.
Resumo:
The growing interest for sequencing with higher throughput in the last decade has led to the development of new sequencing applications. This thesis concentrates on optimizing DNA library preparation for Illumina Genome Analyzer II sequencer. The library preparation steps that were optimized include fragmentation, PCR purification and quantification. DNA fragmentation was performed with focused sonication in different concentrations and durations. Two column based PCR purification method, gel matrix method and magnetic bead based method were compared. Quantitative PCR and gel electrophoresis in a chip were compared for DNA quantification. The magnetic bead purification was found to be the most efficient and flexible purification method. The fragmentation protocol was changed to produce longer fragments to be compatible with longer sequencing reads. Quantitative PCR correlates better with the cluster number and should thus be considered to be the default quantification method for sequencing. As a result of this study more data have been acquired from sequencing with lower costs and troubleshooting has become easier as qualification steps have been added to the protocol. New sequencing instruments and applications will create a demand for further optimizations in future.
Resumo:
Microbes in natural and artificial environments as well as in the human body are a key part of the functional properties of these complex systems. The presence or absence of certain microbial taxa is a correlate of functional status like risk of disease or course of metabolic processes of a microbial community. As microbes are highly diverse and mostly notcultivable, molecular markers like gene sequences are a potential basis for detection and identification of key types. The goal of this thesis was to study molecular methods for identification of microbial DNA in order to develop a tool for analysis of environmental and clinical DNA samples. Particular emphasis was placed on specificity of detection which is a major challenge when analyzing complex microbial communities. The approach taken in this study was the application and optimization of enzymatic ligation of DNA probes coupled with microarray read-out for high-throughput microbial profiling. The results show that fungal phylotypes and human papillomavirus genotypes could be accurately identified from pools of PCR amplicons generated from purified sample DNA. Approximately 1 ng/μl of sample DNA was needed for representative PCR amplification as measured by comparisons between clone sequencing and microarray. A minimum of 0,25 amol/μl of PCR amplicons was detectable from amongst 5 ng/μl of background DNA, suggesting that the detection limit of the test comprising of ligation reaction followed by microarray read-out was approximately 0,04%. Detection from sample DNA directly was shown to be feasible with probes forming a circular molecule upon ligation followed by PCR amplification of the probe. In this approach, the minimum detectable relative amount of target genome was found to be 1% of all genomes in the sample as estimated from 454 deep sequencing results. Signal-to-noise of contact printed microarrays could be improved by using an internal microarray hybridization control oligonucleotide probe together with a computational algorithm. The algorithm was based on identification of a bias in the microarray data and correction of the bias as shown by simulated and real data. The results further suggest semiquantitative detection to be possible by ligation detection, allowing estimation of target abundance in a sample. However, in practise, comprehensive sequence information of full length rRNA genes is needed to support probe design with complex samples. This study shows that DNA microarray has the potential for an accurate microbial diagnostic platform to take advantage of increasing sequence data and to replace traditional, less efficient methods that still dominate routine testing in laboratories. The data suggests that ligation reaction based microarray assay can be optimized to a degree that allows good signal-tonoise and semiquantitative detection.
Resumo:
Prostate cancer is the most common noncutaneous malignancy and the second leading cause of cancer mortality in men. In 2004, 5237 new cases were diagnosed and altogether 25 664 men suffered from prostate cancer in Finland (Suomen Syöpärekisteri). Although extensively investigated, we still have a very rudimentary understanding of the molecular mechanisms leading to the frequent transformation of the prostate epithelium. Prostate cancer is characterized by several unique features including the multifocal origin of tumors and extreme resistance to chemotherapy, and new treatment options are therefore urgently needed. The integrity of genomic DNA is constantly challenged by genotoxic insults. Cellular responses to DNA damage involve elegant checkpoint cascades enforcing cell cycle arrest, thus facilitating damage repair, apoptosis or cellular senescence. Cellular DNA damage triggers the activation of tumor suppressor protein p53 and Wee1 kinase which act as executors of the cellular checkpoint responses. These are essential for genomic integrity, and are activated in early stages of tumorigenesis in order to function as barriers against tumor formation. Our work establishes that the primary human prostatic epithelial cells and prostatic epithelium have unexpectedly indulgent checkpoint surveillance. This is evidenced by the absence of inhibitory Tyr15 phosphorylation on Cdk2, lack of p53 response, radioresistant DNA synthesis, lack of G1/S and G2/M phase arrest, and presence of persistent gammaH2AX damage foci. We ascribe the absence of inhibitory Tyr15 phosphorylation to low levels of Wee1A, a tyrosine kinase and negative regulator of cell cycle progression. Ectopic Wee1A kinase restored Cdk2-Tyr15 phosphorylation and efficiently rescued the ionizing radiation-induced checkpoints in the human prostatic epithelial cells. As variability in the DNA damage responses has been shown to underlie susceptibility to cancer, our results imply that a suboptimal checkpoint arrest may greatly increase the accumulation of genetic lesions in the prostate epithelia. We also show that small molecules can restore p53 function in prostatic epithelial cells and may serve as a paradigm for the development of future therapeutic agents for the treatment of prostate cancer We hypothesize that the prostate has evolved to activate the damage surveillance pathways and molecules involved in these pathways only to certain stresses in extreme circumstances. In doing so, this organ inadvertently made itself vulnerable to genotoxic stress, which may have implications in malignant transformation. Recognition of the limited activity of p53 and Wee1 in the prostate could drive mechanism-based discovery of preventative and therapeutic agents.
Resumo:
Colorectal cancer (CRC) is the third most common cancer in Finland. Of all CRC tumors, 15% display microsatellite-instability (MSI) caused by defective cellular mismatch repair. Cells displaying MSI accumulate a high number of mutations genome-wide, especially in short repeat areas, microsatellites. When targeting genes essential for cell growth or death, MSI can promote tumorigenesis. In non-coding areas, microsatellite mutations are generally considered as passenger events. Since the discovery of MSI and its linkage to cancer, more that 200 genes have been investigated for a role in MSI tumorigenesis. Although various criteria have been suggested for MSI target gene identification, the challenge has been to distinguish driver mutations from passenger mutations. This study aimed to clarify these key issues in the research field of MSI cancer. Prior to this, background mutation rate in MSI cancer has not been studied in a large-scale. We investigated the background mutation rate in MSI CRC by analyzing the spectrum of microsatellite mutations in non-coding areas. First, semenogelin I was studied for a possible role in MSI carcinogenesis. The intronic T9 repeat of semenogelin I was frequently mutated but no evidence for selection during tumorigenesis was obtained. Second, a sequencing approach was utilized to evaluate the general background mutation rate in MSI CRC. Both intronic and intergenic repeats harbored extremely high mutation rates of ≤ 87% and intergenic repeats were more unstable than the intronic repeats. As mutation rates of presumably neutral microsatellites can be high in MSI CRC in the absence of apparent selection pressure, high mutation frequency alone is not sufficient evidence for identification of driver MSI target genes. Next, an unbiased approach was designed to identify the mutatome of MSI CRC. By combining expression array data and a database search we identified novel genes possibly related to MSI CRC carcinogenesis. One of the genes was studied further. In the functional analysis this gene was observed to cause an abnormal cancer-prone cellular phenotype, possibly through altered responses to DNA damage. In our recent study, smooth muscle myosin heavy chain 11 (MYH11) was identified as a novel MSI CRC gene. Additionally, MYH11 has a well established role in acute myeloid leukemia (AML) through an oncogenic fusion protein CBFB-MYH11. We investigated further the role of MYH11 in AML by sequencing. Three novel missense variants of MYH11 were identified. None of the variants were present in the population-based control material. One of the identified variants, V71A, lies in the N-terminal SH3-like domain of MYH11 of unknown function. The other two variants, K1059E and R1792Q are located in the coil-coiled myosin rod essential for the regulation and filament formation of MYH11. The variant K1059E lies in the close proximity of the K1044N that has been functionally assessed in our earlier work of CRC and has been reported to cause total loss of MYH11 protein regulation. As the functional significance of the three novel variants examined in this work remains unknown, future studies should clarify the further role of MYH11 in AML leukaemogenesis and in other malignancies.
Resumo:
Nemaline myopathy (NM) is a rare muscle disorder characterised by muscle weakness and nemaline bodies in striated muscle tissue. Nemaline bodies are derived from sarcomeric Z discs and may be detected by light microscopy. The disease can be divided into six subclasses varying from very severe, in some cases lethal forms to milder forms. NM is usually the consequence of a gene mutation and the mode of inheritance varies between NM subclasses and different families. Mutations in six genes are known to cause NM; nebulin (NEB), alpha-actin, alpha-tropomyosin (TPM3), troponin T1, beta-tropomyosin (TPM2) and cofilin 2, of which nebulin and -actin are the most common. One of the main interests of my research is NEB. Nebulin is a giant muscle protein (600-900 kDa) expressed mainly in the thin filaments of striated muscle. Mutations in NEB are the main cause of autosomal recessive NM. The gene consists of 183 exons. Thus being gigantic, NEB is very challenging to investigate. NEB was screened for mutations using denaturing High Performance Liquid Chromatography (dHPLC) and sequencing. DNA samples from 44 families were included in this study, and we found and published 45 different mutations in them. To date, we have identified 115 mutations in NEB in a total of 96 families. In addition, we determined the occurrence in a world-wide sample cohort of a 2.5 kb deletion containing NEB exon 55 identified in the Ashkenazi Jewish population. In order to find the seventh putative NM gene a genome-wide linkage study was performed in a series of Turkish families. In two of these families, we identified a homozygous mutation disrupting the termination signal of the TPM3 gene, a previously known NM-causing gene. This mutation is likely a founder mutation in the Turkish population. In addition, we described a novel recessively inherited distal myopathy, named distal nebulin myopathy, caused by two different homozygous missense mutations in NEB in six Finnish patients. Both mutations, when combined in compound heterozygous form with a more disruptive mutation, are known to cause NM. This study consisted of molecular genetic mutation analyses, light and electron microscopic studies of muscle biopsies, muscle imaging and clinical examination of patients. In these patients the distribution of muscle weakness was different from NM. Nemaline bodies were not detectable with routine light microscopy, and they were inconspicuous or absent even using electron microscopy. No genetic cause was known to underlie cap myopathy, a congenital myopathy characterised by cap-like structures in the muscle fibres, until we identified a deletion of one codon of the TPM2 gene, in a 30-year-old cap myopathy patient. This mutation does not change the reading frame of the gene, but a deletion of one amino acid does affect the conformation of the protein produced. In summary, this thesis describes a novel distal myopathy caused by mutations in the nebulin gene, several novel nebulin mutations associated with nemaline myopathy, the first molecular genetic cause of cap myopathy, i.e. a mutation in the beta-tropomyosin gene, and a founder mutation in the alpha-tropomyosin gene underlying autosomal recessive nemaline myopathy in the Turkish population.
Resumo:
My work describes two sectors of the human bacterial environment: 1. The sources of exposure to infectious non-tuberculous mycobacteria. 2. Bacteria in dust, reflecting the airborne bacterial exposure in environments protecting from or predisposing to allergic disorders. Non-tuberculous mycobacteria (NTM) transmit to humans and animals from the environment. Infection by NTM in Finland has increased during the past decade beyond that by Mycobacterium tuberculosis. Among the farm animals, porcine mycobacteriosis is the predominant NTM disease in Finland. Symptoms of mycobacteriosis are found in 0.34 % of slaughtered pigs. Soil and drinking water are suspected as sources for humans and bedding materials for pigs. To achieve quantitative data on the sources of human and porcine NTM exposure, methods for quantitation of environmental NTM are needed. We developed a quantitative real-time PCR method, utilizing primers targeted at the 16S rRNA gene of the genus of Mycobacterium. With this method, I found in Finnish sphagnum peat, sandy soils and mud high contents of mycobacterial DNA, 106 to 107 genome equivalents per gram. A similar result was obtained by a method based on the Mycobacterium-specific hybridization of 16S rRNA. Since rRNA is found mainly in live cells, this result shows that the DNA detected by qPCR mainly represented live mycobacteria. Next, I investigated the occurrence of environmental mycobacteria in the bedding materials obtained from 5 pig farms with high prevalence (>4 %) of mycobacteriosis. When I used for quantification the same qPCR methods as for the soils, I found that piggery samples contained non-mycobacterial DNA that was amplified in spite of several mismatches with the primers. I therefore improved the qPCR assay by designing Mycobacterium-specific detection probes. Using the probe qPCR assay, I found 105 to 107 genome equivalents of mycobacterial DNA in unused bedding materials and up to 1000 fold more in the bedding collected after use in the piggery. This result shows that there was a source of mycobacteria in the bedding materials purchased by the piggery and that mycobacteria increased in the bedding materials during use in the piggery. Allergic diseases have reached epidemic proportions in urbanized countries. At the same time, childhood in rural environment or simple living conditions appears to protect against allergic disorders. Exposure to immunoreactive microbial components in rural environments seems to prevent allergies. I searched for differences in the bacterial communities of two indoor dusts, an urban house dust shown to possess immunoreactivity of the TH2-type and a farm barn dust with TH1-activity. The immunoreactivities of the dusts were revealed by my collaborators, in vitro in human dendritic cells and in vivo in mouse. The dusts accumulated >10 years in the respiratory zone (>1.5 m above floor), thus reflecting the long-term content of airborne bacteria at the two sites. I investigated these dusts by cloning and sequencing of bacterial 16S rRNA genes from dust contained DNA. From the TH2-active urban house dust, I isolated 139 16S rRNA gene clones. The most prevalent genera among the clones were Corynebacterium (5 species, 34 clones), Streptococcus (8 species, 33 clones), Staphylococcus (5 species, 9 clones) and Finegoldia (1 species, 9 clones). Almost all of these species are known as colonizers of the human skin and oral cavity. Species of Corynebacterium and Streptococcus have been reported to contain anti-inflammatory lipoarabinomannans and immunmoreactive beta-glucans respectively. Streptococcus mitis, found in the urban house dust is known as an inducer of TH2 polarized immunity, characteristic of allergic disorders. I isolated 152 DNA clones from the TH1-active farm barn dust and found species quite different from those found from the urban house dust. Among others, I found DNA clones representing Bacillus licheniformis, Acinetobacter lwoffii and Lactobacillus each of which was recently reported to possess anti-allergy immunoreactivity. Moreover, the farm barn dust contained dramatically higher bacterial diversity than the urban house dust. Exposure to this dust thus stimulated the human dendritic cells by multiple microbial components. Such stimulation was reported to promote TH1 immunity. The biodiversity in dust may thus be connected to its immunoreactivity. Furthermore, the bacterial biomass in the farm barn dust consisted of live intact bacteria mainly. In the urban house dust only ~1 % of the biomass appeared as intact bacteria, as judged by microscoping. Fragmented microbes may possess bioactivity different from that of intact cells. This was recently shown for moulds. If this is also valid for bacteria, the different immunoreactivities of the two dusts may be explained by the intactness of dustborne bacteria. Based on these results, we offer three factors potentially contributing to the polarized immunoreactivities of the two dusts: (i) the species-composition, (ii) the biodiversity and (iii) the intactness of the dustborne bacterial biomass. The risk of childhood atopic diseases is 4-fold lower in the Russian compared with the Finnish Karelia. This difference across the country border is not explainable by different geo-climatic factors or genetic susceptibilities of the two populations. Instead, the explanation must be lifestyle-related. It has already been reported that the microbiological quality of drinking water differs on the two sides of the borders. In collaboration with allergists, I investigated dusts collected from homes in the Russian Karelia and in the Finnish Karelia. I found that bacterial 16S rRNA genes cloned from the Russian Karelian dusts (10 homes, 234 clones) predominantly represented Gram-positive taxa (the phyla Actinobacteria and Firmicutes, 67%). The Russian Karelian dusts contained nine-fold more of muramic acid (60 to 70 ng mg-1) than the Finnish Karelian dusts (3 to 11 ng mg-1). Among the DNA clones isolated from the Finnish side (n=231), Gram-negative taxa (40%) outnumbered the Gram-positives (34%). Out of the 465 DNA clones isolated from the Karelian dusts, 242 were assigned to cultured validly described bacterial species. In Russian Karelia, animal-associated species e.g. Staphylococcus and Macrococcus were numerous (27 clones, 14 unique species). This finding may connect to the difference in the prevalence of allergy, as childhood contacts with pets and farm animals have been connected with low allergy risk. Plant-associated bacteria and plant-borne 16S rRNA genes (chloroplast) were frequent among the DNA clones isolated from the Finnish Karelia, indicating components originating from plants. In conclusion, my work revealed three major differences between the bacterial communtites in the Russian and in the Finnish Karelian homes: (i) the high prevalence of Gram-positive bacteria on the Russian side and of Gram-negative bacteria on the Finnish side and (ii) the rich presence of animal-associated bacteria on the Russian side whereas (iii) plant-associated bacteria prevailed on the Finnish side. One or several of these factors may connect to the differences in the prevalence of allergy.
Resumo:
Rhizoremediation is the use of microbial populations present in the rhizosphere of plants for environmental cleanup. The idea of this work was that bacteria living in the rhizosphere of a nitrogen-fixing leguminous plant, goat's rue (Galega orientalis), could take part in the degradation of harmful monoaromatic hydrocarbons, such as benzene, toluene and xylene (BTEX), from oil-contaminated soils. In addition to chemical (e.g. pollutant concentration) and physical (e.g. soil structure) information, the knowledge of biological aspects (e.g. bacteria and their catabolic genes) is essential when developing the rhizoremediation into controlled and effective bioremediation practice. Therefore, the need for reliable biomonitoring methods is obvious. The main aims of this thesis were to evaluate the symbiotic G. orientalis - Rhizobium galegae system for rhizoremediation of oil-contaminated soils, to develop molecular methods for biomonitoring, and to apply these methods for studying the microbiology of rhizoremediation. In vitro, Galega plants and rhizobia remained viable in m-toluate concentrations up to 3000 mg/l. Plant growth and nodulation were inhibited in 500 mg/l m-toluate, but were restored when plants were transferred to clean medium. In the greenhouse, Galega showed good growth, nodulation and nitrogen fixation, and developed a strong rhizosphere in soils contaminated with oil or spiked with 2000 mg/l m-toluate. The high aromatic tolerance of R. galegae and the viability of Galega plants in oil-polluted soils proved this legume system to be a promising method for the rhizoremediation of oil-contaminated soils. Molecular biomonitoring methods were designed and/or developed further for bacteria and their degradation genes. A combination of genomic fingerprinting ((GTG)5-PCR), taxonomic ribotyping of 16S rRNA genes and partial 16S rRNA gene sequencing were chosen for molecular grouping of culturable, heterogeneous rhizosphere bacteria. PCR primers specific for the xylE gene were designed for TOL plasmid detection. Amplified enzyme-coding DNA restriction analysis (AEDRA) with AluI was used to profile both TOL plasmids (xylE primers) and, in general, aromatics-degrading plasmids (C230 primers). The sensitivity of the direct monitoring of TOL plasmids in soil was enhanced by nested C23O-xylE-PCR. Rhizosphere bacteria were isolated from the greenhouse and field lysimeter experiments. High genetic diversity was observed among the 50 isolated, m-toluate tolerating rhizosphere bacteria in the form of five major lineages of the Bacteria domain. Gram-positive Rhodococcus, Bacillus and Arthrobacter and gram-negative Pseudomonas were the most abundant genera. The inoculum Pseudomonas putida PaW85/pWW0 was not found in the rhizosphere samples. Even if there were no ecological niches available for the bioaugmentation bacterium itself, its conjugative catabolic plasmid might have had some additional value for other bacterial species and thus, for rhizoremediation. Only 10 to 20% of the isolated, m-toluate tolerating bacterial strains were also able to degrade m-toluate. TOL plasmids were a major group of catabolic plasmids among these bacteria. The ability to degrade m-toluate by using enzymes encoded by a TOL plasmid was detected only in species of the genus Pseudomonas, and the best m-toluate degraders were these Pseudomonas species. Strain-specific differences in degradation abilities were found for P.oryzihabitans and P. migulae: some of these strains harbored a TOL plasmid - a new finding observed in this work, indicating putative horizontal plasmid transfer in the rhizosphere. One P. oryzihabitans strain harbored the pWW0 plasmid that had probably conjugated from the bioaugmentation Pseudomonas. Some P. migulae and P. oryzihabitans strains seemed to harbor both the pWW0- and the pDK1-type TOL plasmid. Alternatively, they might have harbored a TOL plasmid with both the pWW0- and the pDK1-type xylE gene. The breakdown of m-toluate by gram-negative bacteria was not restricted to the TOL pathway. Also some gram-positive Rhodococcus erythropolis and Arthrobacter aurescens strains were able to degrade m-toluate in the absence of a TOL plasmid. Three aspects of the rhizosphere effect of G. orientalis were manifested in oil-contaminated soil in the field: 1) G. orientalis and Pseudomonas bioaugmentation increased the amount of rhizosphere bacteria. G. orientalis especially together with Pseudomonas bioaugmentation increased the numbers of m-toluate utilizing and catechol positive bacteria indicating an increase in degradation potential. 2) Also the bacterial diversity, when measured as the amount of ribotypes, was increased in the Galega rhizosphere with or without Pseudomonas bioaugmentation. However, the diversity of m-toluate utilizing bacteria did not significantly increase. At the community level, by using the 16S rRNA gene PCR-DGGE method, the highest diversity of species was also observed in vegetated soils compared with non-vegetated soils. Diversified communities may best guarantee the overall success in rhizoremediation by offering various genetic machineries for catabolic processes. 3) At the end of the experiment, no TOL plasmid could be detected by direct DNA analysis in soil treated with both G. orientalis and Pseudomonas. The detection limit for TOL plasmids was encountered indicating decreased amount of degradation plasmids and thus, the success of rhizoremediation. The use of G. orientalis for rhizoremediation is unique. In this thesis new information was obtained about the rhizosphere effect of Galega orientalis in BTEX contaminated soils. The molecular biomonitoring methods can be applied for several purposes within environmental biotechnology, such as for evaluating the intrinsic biodegradation potential, monitoring the enhanced bioremediation, and estimating the success of bioremediation. Environmental protection by using nature's own resources and thus, acting according to the principle of sustainable development, would be both economically and environmentally beneficial for society. Keywords: molecular biomonitoring, genetic fingerprinting, soil bacteria, bacterial diversity, TOL plasmid, catabolic genes, horizontal gene transfer, rhizoremediation, rhizosphere effect, Galega orientalis, aerobic biodegradation, petroleum hydrocarbons, BTEX
Resumo:
Megasphaera cerevisiae, Pectinatus cerevisiiphilus, Pectinatus frisingensis, Selenomonas lacticifex, Zymophilus paucivorans and Zymophilus raffinosivorans are strictly anaerobic Gram-stain-negative bacteria that are able to spoil beer by producing off-flavours and turbidity. They have only been isolated from the beer production chain. The species are phylogenetically affiliated to the Sporomusa sub-branch in the class "Clostridia". Routine cultivation methods for detection of strictly anaerobic bacteria in breweries are time-consuming and do not allow species identification. The main aim of this study was to utilise DNA-based techniques in order to improve detection and identification of the Sporomusa sub-branch beer-spoilage bacteria and to increase understanding of their biodiversity, evolution and natural sources. Practical PCR-based assays were developed for monitoring of M. cerevisiae, Pectinatus species and the group of Sporomusa sub-branch beer spoilers throughout the beer production process. The developed assays reliably differentiated the target bacteria from other brewery-related microbes. The contaminant detection in process samples (10 1,000 cfu/ml) could be accomplished in 2 8 h. Low levels of viable cells in finished beer (≤10 cfu/100 ml) were usually detected after 1 3 d culture enrichment. Time saving compared to cultivation methods was up to 6 d. Based on a polyphasic approach, this study revealed the existence of three new anaerobic spoilage species in the beer production chain, i.e. Megasphaera paucivorans, Megasphaera sueciensis and Pectinatus haikarae. The description of these species enabled establishment of phenotypic and DNA-based methods for their detection and identification. The 16S rRNA gene based phylogenetic analysis of the Sporomusa sub-branch showed that the genus Selenomonas originates from several ancestors and will require reclassification. Moreover, Z. paucivorans and Z. raffinosivorans were found to be in fact members of the genus Propionispira. This relationship implies that they were carried to breweries along with plant material. The brewery-related Megasphaera species formed a distinct sub-group that did not include any sequences from other sources, suggesting that M. cerevisiae, M. paucivorans and M. sueciensis may be uniquely adapted to the brewery ecosystem. M. cerevisiae was also shown to exhibit remarkable resistance against many brewery-related stress conditions. This may partly explain why it is a brewery contaminant. This study showed that DNA-based techniques provide useful tools for obtaining more rapid and specific information about the presence and identity of the strictly anaerobic spoilage bacteria in the beer production chain than is possible using cultivation methods. This should ensure financial benefits to the industry and better product quality to customers. In addition, DNA-based analyses provided new insight into the biodiversity as well as natural sources and relations of the Sporomusa sub-branch bacteria. The data can be exploited for taxonomic classification of these bacteria and for surveillance and control of contaminations.