11 results for DNA sequence

in Helda - Digital Repository of University of Helsinki


Relevance:

70.00%

Abstract:

Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have the potential to affect the genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutations and natural selection. The main challenges in understanding the genetic basis of evolutionary changes are to distinguish the adaptive selection forces that give rise to existing DNA sequence variants and to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the extent to which Darwinian adaptive evolution affected coding regions of the growth hormone (GH) gene during teleost evolution was investigated based on the sequence data existing in public databases. Secondly, a target gene approach was used to identify within-population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphism (SNP) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of the existence of Darwinian selection during teleost GH evolution was presented. 
In addition, existing sequence data were exploited to investigate GH1 gene variation within Atlantic salmon populations throughout the species' range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species, e.g. when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that only a few SNPs gave signals highly similar to those of the microsatellite markers in some of the population genetic analyses. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using difficult biological samples in genetic studies, as SNPs can be applied to relatively highly degraded DNA.

Relevance:

60.00%

Abstract:

Advancements in analysis techniques have led to a rapid accumulation of biological data in databases. Such data are often in the form of sequences of observations, examples including DNA sequences and amino acid sequences of proteins. The scale and quality of the data promise answers to various biologically relevant questions in more detail than has been possible before. For example, one may wish to identify areas in an amino acid sequence that are important for the function of the corresponding protein, or investigate how characteristics at the level of DNA sequence affect the adaptation of a bacterial species to its environment. Many of the interesting questions are intimately associated with the understanding of the evolutionary relationships among the items under consideration. The aim of this work is to develop novel statistical models and computational techniques to meet the challenge of deriving meaning from the increasing amounts of data. Our main focus is on modeling the evolutionary relationships based on the observed molecular data. We operate within a Bayesian statistical framework, which allows a probabilistic quantification of the uncertainties related to a particular solution. As the basis of our modeling approach we utilize a partition model, which is used to describe the structure of data by appropriately dividing the data items into clusters of related items. Generalizations and modifications of the partition model are developed and applied to various problems. Large-scale data sets also pose a computational challenge. The models used to describe the data must be realistic enough to capture the essential features of the current modeling task but, at the same time, simple enough to make it possible to carry out the inference in practice. The partition model fulfills these two requirements. The problem-specific features can be taken into account by modifying the prior probability distributions of the model parameters. 
The computational efficiency stems from the ability to integrate out the parameters of the partition model analytically, which enables the use of efficient stochastic search algorithms.
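
To illustrate what integrating out the parameters analytically can look like, the sketch below scores a partition of aligned DNA sequences with a Dirichlet-multinomial marginal likelihood. This is only one common conjugate choice and is an assumption here: the abstract does not state which likelihood or prior the thesis uses, and the function names, the symmetric prior weight `alpha`, and the four-letter alphabet are all hypothetical.

```python
from collections import Counter
from math import lgamma

ALPHABET = "ACGT"  # assumption: sequences contain only these four symbols


def log_marginal_column(symbols, alpha=1.0):
    """Log marginal likelihood of one alignment column.

    The column is modeled as multinomial draws whose probabilities have a
    symmetric Dirichlet(alpha) prior; the probabilities are integrated out
    in closed form, so no parameter values need to be sampled or fitted.
    """
    counts = Counter(symbols)
    n = len(symbols)
    k = len(ALPHABET)
    result = lgamma(k * alpha) - lgamma(k * alpha + n)
    for base in ALPHABET:
        result += lgamma(alpha + counts[base]) - lgamma(alpha)
    return result


def log_marginal_partition(clusters, alpha=1.0):
    """Sum the column-wise log marginals over every cluster of a partition.

    `clusters` is a list of clusters; each cluster is a list of
    equal-length aligned sequences.
    """
    total = 0.0
    for cluster in clusters:
        for column in zip(*cluster):
            total += log_marginal_column(column, alpha)
    return total
```

Because the score has a closed form, a stochastic search can compare candidate partitions (for example, after moving one sequence between clusters) by re-evaluating only the clusters that changed.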

Relevance:

60.00%

Abstract:

The ultimate goal of this study has been to construct metabolically engineered microbial strains capable of fermenting glucose into the pentitols D-arabitol and, especially, xylitol. The path that was chosen to achieve this goal required discovery, isolation and sequencing of at least two pentitol phosphate dehydrogenases of different specificity, followed by cloning and expression of their genes and characterization of recombinant arabitol and xylitol phosphate dehydrogenases. An enzyme of a previously unknown specificity, D-arabitol phosphate dehydrogenase (APDH), was discovered in Enterococcus avium. The enzyme was purified to homogeneity from E. avium strain ATCC 33665. SDS/PAGE revealed that the enzyme has a molecular mass of 41 ± 2 kDa, whereas a molecular mass of 160 ± 5 kDa was observed under non-denaturing conditions, implying that the APDH may exist as a tetramer with identical subunits. Purified APDH was found to have narrow substrate specificity, converting only D-arabitol 1-phosphate and D-arabitol 5-phosphate into D-xylulose 5-phosphate and D-ribulose 5-phosphate, respectively, in the oxidative reaction. Both NAD+ and NADP+ were accepted as co-factors. Based on the partial protein sequences, the gene encoding APDH was cloned. Homology comparisons place APDH within the medium chain dehydrogenase family. Unlike most members of this family, APDH requires Mn2+ but no Zn2+ for enzymatic activity. The DNA sequence surrounding the gene suggests that it belongs to an operon that also contains several components of the phosphotransferase system (PTS). The apparent role of the enzyme is to participate in arabitol catabolism via the arabitol phosphate route similar to the ribitol and xylitol catabolic routes described previously. Xylitol phosphate dehydrogenase (XPDH) was isolated from Lactobacillus rhamnosus strain ATCC 15820. The enzyme was partially sequenced. Amino acid sequences were used to isolate the gene encoding the enzyme. 
The homology comparisons of the deduced amino acid sequence of L. rhamnosus XPDH revealed several similar enzymes in genomes of various species of Gram-positive bacteria. Two enzymes of Clostridium difficile and an enzyme of Bacillus halodurans were cloned and their substrate specificities together with the substrate specificity of L. rhamnosus XPDH were compared. It was found that one of the XPDH enzymes of C. difficile and the XPDH of L. rhamnosus had the highest selectivity towards D-xylulose 5-phosphate. A known transketolase-deficient and D-ribose-producing mutant of Bacillus subtilis (ATCC 31094) was further modified by disrupting its rpi (D-ribose phosphate isomerase) gene to create a D-ribulose- and D-xylulose-producing strain. Expression of APDH of E. avium and XPDH of L. rhamnosus and C. difficile in the D-ribulose- and D-xylulose-producing strain of B. subtilis resulted in strains capable of converting D-glucose into D-arabitol and xylitol, respectively. The D-arabitol yield on D-glucose was 38 % (w/w). Xylitol production was accompanied by co-production of ribitol, limiting the xylitol yield to 23 %.

Relevance:

60.00%

Abstract:

Phylogenetic studies of cyanobacterial lichens. Lichens are symbiotic assemblages between fungi (mycobiont) and green algae (phycobiont) and/or cyanobacteria (cyanobiont). Fossil records show that lichen-like symbioses occurred already 600 million years ago. Lichen symbiosis has since then become an important life strategy for the Fungi, particularly for species in the phylum Ascomycota, as approximately 98% of the lichenized fungal species are ascomycetes. The taxonomy of lichen associations is based on the mycobiont. We reconstructed, using DNA sequence data, hypotheses of phylogenetic relationships of lichen-forming fungi that include species associated with cyanobacteria. These hypotheses of phylogeny should form the basis for the taxonomy. They also allowed studies of the origin and the evolution of specific symbioses. Genetic diversity and phylogenetic relationships of symbiotic cyanobionts were also studied in order to examine selectivity of cyanobionts and mycobionts as well as possible co-evolution between partners involved in lichen associations. The suggested circumscription of the family Stereocaulaceae to include Stereocaulon and Lepraria is supported. The recently described crustose Stereocaulon species seem to be correctly placed in the genus, although Stereocaulon traditionally included only fruticose species. The monospecific crustose genus Muhria is also shown to be best placed in Stereocaulon. Family Lobariaceae as currently delimited is monophyletic. Within Lobariaceae, the genus Sticta, including Dendriscocaulon dendroides, forms a monophyletic group, while the genera Lobaria and Pseudocyphellaria are non-monophyletic. A new classification of Lobariaceae is obviously needed. Further studies are, however, required before a final proposal for a new classification can be made. Our results show that the cyanobacterial symbiotic state has been gained repeatedly in the Ascomycota while losses of symbiotic cyanobacteria appear to be rare. 
The symbiosis with green algae is confirmed to have been gained repeatedly in Ascomycota but also repeatedly lost. Cyanobacterial symbioses therefore seem to be more stable than green algal associations. Cyanobacteria are perhaps more beneficial for the lichen fungi and therefore maintained. The results indicate a dynamic association of the lichen symbiosis. This evolutionary instability may be important for the lichen fungi, as the ability to use alternative options may enable lichens to colonize new substrates and survive environmental changes. Some cyanobacterial lichen genera seem to be highly selective towards the cyanobiont while others form symbioses with a broad spectrum of cyanobacteria. No evidence of co-evolution between fungi and cyanobacteria in cyanolichens could be demonstrated.

Relevance:

60.00%

Abstract:

Human parvovirus B19 is a minute ssDNA virus causing a wide variety of diseases, including erythema infectiosum, arthropathy, anemias, and fetal death. After primary infection, genomic DNA of B19 has been shown to persist in solid tissues of not only symptomatic but also of constitutionally healthy, immunocompetent individuals. In this thesis, the viral DNA was shown to persist as an apparently intact molecule of full length, and without persistence-specific mutations. Thus, although the mere presence of B19 DNA in tissue cannot be used as a diagnostic criterion, a possible role in the pathogenesis of diseases, e.g. through mRNA or protein production, cannot be excluded. The molecular mechanism, the host-cell type and the possible clinical significance of B19 DNA tissue persistence are yet to be elucidated. At the beginning of this work, the B19 genomic sequence was considered highly conserved. However, new variants were found: V9 was detected in 1998 in France, in serum of a child with aplastic crisis. This variant differed from the prototypic B19 sequences by ~10 %. In 2002 we found, persisting in skin of constitutionally healthy humans, DNA of another novel B19 variant, LaLi. Genetically this variant differed from both the prototypic sequences and the variant V9 also by ~10 %. Simultaneously, B19 isolates with DNA sequences similar to LaLi were introduced by two other groups, in the USA and France. Based on phylogeny, a classification scheme based on three genotypes (B19 types 1-3) was proposed. Although the B19 virus is mainly transmitted via the respiratory route, blood and plasma-derived products contaminated with high levels of B19 DNA have also been shown to be infectious. The European Pharmacopoeia stipulates that, in Europe, from the beginning of 2004, plasma pools for manufacture must contain less than 10^4 IU/ml of B19 DNA. Quantitative PCR screening is therefore a prerequisite for restriction of the B19 DNA load and obtaining of safe plasma products. 
Due to the DNA sequence variation among the three B19 genotypes, however, B19 PCR methods might fail to detect the new variants. We therefore examined the suitability of the two commercially available quantitative B19 PCR tests, LightCycler-Parvovirus B19 quantification kit (Roche Diagnostics) and RealArt Parvo B19 LC PCR (Artus), for detection, quantification and differentiation of the three B19 types known, including B19 types 2 and 3. The former method was highly sensitive for detection of the B19 prototype but was not suitable for detection of types 2 and 3. The latter method detected and differentiated all three B19 virus types. However, one of the two type-3 strains was detected at a lower sensitivity. Then, we assessed the prevalence of the three B19 virus types among Finnish blood donors, by screening pooled plasma samples derived from >140 000 blood-donor units: none of the pools contained detectable levels of B19 virus types 2 or 3. According to the results of other groups, B19 type 2 was absent also among Danish blood donors, and extremely rare among symptomatic European patients. B19 type 3 has been encountered endemically in Ghana and (apparently) in Brazil, and sporadic cases have been detected in France and the UK. We next examined the biological characteristics of these virus types. The p6 promoter regions of virus types 1-3 were cloned in front of a reporter gene, the constructs were transfected into different cell lines, and the promoter activities were measured. As a result, we found that the activities of the three p6 promoters, although differing in sequence by >20%, were of equal strength, and most active in B19-permissive cells. Furthermore, the infectivity of the three B19 types was examined in two B19-permissive cell lines. RT-PCR revealed synthesis of spliced B19 mRNAs, and immunofluorescence verified the production of NS1 and VP proteins in the infected cells. 
These experiments suggested similar host-cell tropism and showed that the three virus types are strains of the same species, i.e. human parvovirus B19. Last but not least, the sera from subjects infected in the past either with B19 type 1 or type 2 (as evidenced by tissue persistence of the respective DNAs), revealed in VP1/2- and VP2-EIAs a 100 % cross-reactivity between virus types 1 and 2. These results, together with similar studies by others, indicate that the three B19 genotypes constitute a single serotype.
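
The sequence differences quoted in the abstract above (~10 % between variants, >20 % between promoter regions) are pairwise divergences. As a rough illustration, the sketch below computes the proportion of differing sites (the uncorrected p-distance) between two sequences that are assumed to be already aligned. The abstract does not say how the figures were calculated, so the function name, the gap handling, and the example sequences are all hypothetical.

```python
def p_distance(seq_a, seq_b, gap="-"):
    """Proportion of differing sites between two pre-aligned sequences.

    Columns where either sequence has a gap are skipped. No correction for
    multiple substitutions at the same site is applied.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # ignore alignment gaps
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Made-up 10-nucleotide example with one mismatch.
print(p_distance("ACGTACGTAC", "ACGTACGTAT"))
```

A divergence of about 0.10 on this scale corresponds to the ~10 % difference reported between the B19 genotypes.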

Relevance:

60.00%

Abstract:

Social behaviour affects dispersal of animals and is an important modifier of genetic population structures. The female sex is often philopatric, which maintains coancestry within the breeding groups and promotes cooperative behaviours. This also enables inclusive fitness returns from altruism and explains why some individuals sacrifice personal reproduction for the good of others in social insects such as ants. However, reduced dispersal and population substructuring at the level of colonies may also entail inbreeding, loss of genetic diversity, and vulnerability. In addition, the most vulnerable ants are species that have evolved to parasitize colonies of other ants, and which trade off the ability to disperse against the efficiency of parasitizing the host. On the other hand, certain social organisations of ant colonies may enable a species to disperse outside its natural range and become a pest. Altogether, knowledge of the genetic structuring of ant populations, as well as of the evolution of their life histories, can contribute to conservation biology and population management. The aim of this thesis was to investigate population structures and phylogenetic evolution of the ant Plagiolepis pygmaea and its two obligatory, workerless social parasites (inquilines) P. xene and P. grassei with genetic markers and DNA sequence data. The results support the general assumption that populations of inquiline parasites are highly fragmented and genetically vulnerable. Comparison of the two parasites suggests that differences in their relative abundance may follow from their interaction with the host, i.e. how well the species is adapted to reproduce in the host colonies. The results also indicate that the most recent free-living ancestor of these two parasite species is their common host. This is considered to provide evidence for the controversial issue of sympatric speciation. 
Further, given that the level of adaptations to parasitic life history depends on the evolutionary time since the free-living ancestor, the results establish a link between species rarity and its evolutionary age. The populations of the host species P. pygmaea displayed significantly reduced dispersal both among the females (queens) and males, and high levels of inbreeding which may enhance worker altruism. In addition, the queens were found to mate with multiple males. Given the high relatedness between the queens and their mates, this probably occurs for non-genetic reasons, i.e. without the benefits associated with genetically more diverse offspring. The results hence caution that the contribution of non-genetic factors to the prevailing mating patterns and genetic population structures should not be underestimated.

Relevance:

60.00%

Abstract:

In the boreal coniferous forest zone, nitrogen is often the factor limiting plant growth. The nitrogen reserves of forest soil consist mainly of nitrogen compounds bound to organic matter, especially amino acids. Ectomycorrhizal fungi take part in the nitrogen cycle of forest soil by decomposing organic nitrogen compounds and transporting them for use by plants. So far, relatively little is known about the mineralization of amino acids that takes place inside the fungal cell. Amino acid oxidases catalyze the mineralization of amino acids. L-amino acid oxidases (LAO) have been demonstrated in some genera of ectomycorrhiza-forming basidiomycetes. Until now, no LAO gene had been known from basidiomycetes. In this work, a LAO gene from basidiomycetes was described for the first time. The nucleotide sequences of the 5´ and 3´ ends of the cDNA of the LAO1 gene of hiekkatympönen (a Hebeloma species) were determined by RACE-PCR, and on the basis of the resulting sequences primers were designed to amplify the cDNA and the genomic DNA of the whole gene. The structure of the hiekkatympönen LAO1 gene was determined from the genomic DNA and cDNA sequences. The hiekkatympönen LAO1 gene consists of five exons and four introns. A binding site for a protein involved in the regulation of nitrogen metabolism was found in the upstream region of the hiekkatympönen LAO1 gene. The partial genomic DNA sequence of the gene preceding the LAO1 gene was determined. In the genome of kangaslohisieni (Laccaria bicolor), the gene preceding the LAO1 gene had been predicted to be a pyruvate decarboxylase. In addition, the partial cDNA sequence of a second LAO homolog of hiekkatympönen was determined. The LAO gene of another basidiomycete, kangaslohisieni, was also identified. The kangaslohisieni gene model identified as the LAO gene had previously been predicted in the NCBI database to be a protein of unknown function. Based on the phylogenetic tree of the proteins, the ancestral form of the LAO of hiekkatympönen and kangaslohisieni has undergone duplication. 
The results of this work provide entirely new information, at the level of molecular biology, on the catabolic reactions of amino acids in ectomycorrhizal fungi. The ammonium ions formed as a result of amino acid mineralization may also be a significant source of nitrogen for other soil microbes and for plants. It is possible that the LAO enzyme of ectomycorrhizal fungi is one significant factor in the nitrogen cycle of forest soil.

Relevance:

60.00%

Abstract:

Transcription factors play a key role in tumor development, in which dysfunction of genes regulating tissue growth and differentiation is a central phenomenon. The GATA family of transcription factors consists of six members that bind to a consensus DNA sequence (A/T)GATA(A/G) in gene promoters and enhancers. The two GATA factors expressed in the adrenal cortex are GATA-4 and GATA-6. In both mice and humans, GATA-4 can be detected only during the fetal period, whereas GATA-6 expression is abundant both throughout development and in the adult. It is already established that GATA factors are important in both normal development and tumorigenesis of several endocrine organs, and expression of GATA-4 and GATA-6 is detected in adrenocortical tumors. The aim of this study was to elucidate the function of these factors in adrenocortical tumor growth. In embryonal development, the adrenocortical cells arise and differentiate from a common pool with gonadal steroidogenic cells, the urogenital ridge. As the adult adrenal cortex undergoes constant renewal, it is hypothesized that undifferentiated adrenocortical progenitor cells reside adjacent to the adrenal capsule and give rise to daughter cells that differentiate and migrate centripetally. A diverse array of hormones controls the differentiation, growth and survival of steroidogenic cells in the adrenal gland and the gonads. Factors such as luteinizing hormone and inhibins, traditionally associated with gonadal steroidogenic cells, can also influence the function of adrenocortical cells in physiological and pathophysiological states. Certain inbred strains of mice develop subcapsular adrenocortical tumors in response to gonadectomy. In this study, we found that these tumors express GATA-4, normally absent from the adult adrenal cortex, while GATA-6 expression is downregulated. 
Gonadal markers such as luteinizing hormone receptor, anti-Müllerian hormone and P450c17 are also expressed in the neoplastic cells, and the tumors produce gonadal hormones. The tumor cells have lost the expression of melanocortin-2 receptor and the CYP enzymes necessary for the synthesis of corticosterone and aldosterone. By way of xenograft studies utilizing NU/J nude mice, we confirmed that chronic gonadotropin elevation is sufficient to induce adrenocortical tumorigenesis in susceptible inbred strains. Collectively, these studies suggest that subcapsular adrenocortical progenitor cells can, under certain conditions, adopt a gonadal fate. We studied the molecular mechanisms involved in gene regulation in endocrine cells in order to elucidate the role of GATA factors in endocrine tissues. Ovarian granulosa cells express both GATA-4 and GATA-6, and the TGF-β signaling pathway is active in these cells. Inhibin-α is both a target gene for, and an atypical or antagonistic member of the TGF-β growth factor superfamily. In this study, we show that GATA-4 is required for TGF-β-mediated inhibin-α promoter activation in granulosa cells, and that GATA-4 physically interacts with Smad3, a TGF-β downstream protein. Apart from the regulation of steroidogenesis and other events in normal tissues, TGF-β signaling is implicated in tumors of multiple organs, including the adrenal cortex. Another signaling pathway found often to be aberrantly active in adrenocortical tumors is the Wnt pathway. As both of these pathways regulate the expression of inhibin-α, a transcriptional target for GATA-4 and GATA-6, we wanted to investigate whether GATA factors are associated with the components of these signaling cascades in human adrenocortical tumors. We found that the expression of Wnt co-receptors LRP5 and LRP6, Smad3, GATA-6 and SF-1 was diminished in adrenocortical carcinomas with poor outcome. 
All of these factors drive inhibin-α expression, and their expression in adrenocortical tumors correlated with that of inhibin-α. The results support a tumor suppressor role previously suggested for inhibin-α in the mouse adrenal cortex, and offer putative pathways associated with adrenocortical tumor aggressiveness. Unraveling the role of GATA factors and associated molecules in human and mouse adrenocortical tumors could ultimately contribute to the development of diagnostic tools and future therapies for these diseases.
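
The consensus GATA binding sequence (A/T)GATA(A/G) mentioned in the abstract above can be written as a regular expression. The sketch below scans both strands of a DNA string for exact consensus matches. It is only an illustration of the motif notation: real binding-site prediction usually relies on position weight matrices, and the function name and example sequences are hypothetical.

```python
import re

# (A/T)GATA(A/G) on the forward strand; lookahead allows overlapping hits.
FORWARD = re.compile(r"(?=([AT]GATA[AG]))")
# Reverse complement of the motif, (C/T)TATC(A/T), for sites on the other strand.
REVERSE = re.compile(r"(?=([CT]TATC[AT]))")


def find_gata_sites(sequence):
    """Return sorted (position, strand, matched_text) tuples.

    Positions are 0-based offsets on the given (forward) strand.
    """
    seq = sequence.upper()
    hits = [(m.start(), "+", m.group(1)) for m in FORWARD.finditer(seq)]
    hits += [(m.start(), "-", m.group(1)) for m in REVERSE.finditer(seq)]
    return sorted(hits)


# Made-up promoter fragment with one site on each strand.
for hit in find_gata_sites("ccAGATAAggttTTATCTcc"):
    print(hit)
```

Scanning the reverse complement matters because a transcription factor can bind a site on either strand of a promoter or enhancer.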

Relevance:

60.00%

Abstract:

Neurofibromatosis 2 (NF2) is an autosomal dominant disorder manifested by the formation of multiple benign tumors of the nervous system. Affected individuals typically develop bilateral vestibular schwannomas which lead to deafness and balance disorders. The syndrome is caused by inactivation of the NF2 tumor suppressor gene, and mutation or loss of the NF2 product, merlin, is sufficient for tumorigenesis in both hereditary and sporadic NF2-associated tumors. Merlin belongs to the band 4.1 superfamily of cytoskeletal proteins, which also contains the related ezrin, radixin, and moesin (ERM) proteins. The ERM members provide a link between the cell cytoskeleton and membrane by connecting membrane-associated proteins to actin filaments. By stabilizing complexes in the cell cortex, the ERMs modulate morphology, growth, and migration of cells. Despite their structural homology, overlapping subcellular distribution, direct molecular association, and partial overlap of molecular interactions, merlin and ezrin exert opposite effects on cell proliferation. Merlin suppresses cell proliferation, whereas ezrin expression is linked to oncogenic activity. We hypothesized that the regions which differ between the proteins might explain merlin's specificity as a tumor suppressor. We therefore analyzed the regions that differ most between merlin and ezrin: the N-terminal tail and the C-terminus. To determine the properties of the C-terminal region, we studied the two most predominant merlin isoforms together with truncation variants similar to those found in patients. We also focused on the evolutionarily conserved C-terminal residues, E545-E547, whose corresponding DNA sequence harbors disease-causing mutations. In addition to inhibiting cell proliferation, merlin regulates cytoskeletal organization. The morphogenic properties of merlin may play a role in tumor suppression, since patient-derived tumor cells demonstrate cytoskeletal abnormalities. 
We analyzed the mechanisms of merlin-induced extension formation and determined that the C-terminal region of amino acids 538-568 is particularly important for the morphogenic activity. We also characterized the role of C-terminal merlin residues in the regulation of proliferation, phosphorylation, and intramolecular associations. In contrast to previous reports, we demonstrated that both merlin isoforms are able to suppress cell proliferation, whereas C-terminally mutated merlin constructs showed reduced growth inhibition. Phosphorylation serves as a mechanism to regulate the tumor suppressive activity of merlin. The C-terminal serine 518 is phosphorylated in response to both p21-activated kinase (PAK) and protein kinase A (PKA), which inactivates the growth inhibitory function of merlin. However, at least three differentially phosphorylated forms of the protein exist. In this study we demonstrated that the N-terminus of merlin is also phosphorylated by AGC kinases, and that both PKA and Akt phosphorylate merlin at serine 10 (S10). We evaluated the impact of this N-terminal tail phosphorylation, and showed that the phosphorylation state of S10 is an important regulator of merlin's ability to modulate cytoskeletal organization and also regulates the stability of the protein. In summary, this study describes the functional effect of merlin-specific regions. We demonstrate that both S10 in the N-terminal tail and residues E545-E547 in the C-terminus are essential for merlin activity and function.

Relevance:

30.00%

Abstract:

This thesis presents methods for locating and analyzing cis-regulatory DNA elements involved in the regulation of gene expression in multicellular organisms. The regulation of gene expression is carried out by the combined effort of several transcription factor proteins collectively binding the DNA on the cis-regulatory elements. Only sparse knowledge of the 'genetic code' of these elements exists today. An automatic tool for discovery of putative cis-regulatory elements could help their experimental analysis, which would result in a more detailed view of the cis-regulatory element structure and function. We have developed a computational model for the evolutionary conservation of cis-regulatory elements. The elements are modeled as evolutionarily conserved clusters of sequence-specific transcription factor binding sites. We give an efficient dynamic programming algorithm that locates the putative cis-regulatory elements and scores them according to the conservation model. A notable proportion of the high-scoring DNA sequences show transcriptional enhancer activity in transgenic mouse embryos. The conservation model includes four parameters whose optimal values are estimated with simulated annealing. With good parameter values the model discriminates well between the DNA sequences with evolutionarily conserved cis-regulatory elements and the DNA sequences that have evolved neutrally. In further inquiry, the set of highest scoring putative cis-regulatory elements was found to be sensitive to small variations in the parameter values. The statistical significance of the putative cis-regulatory elements is estimated with the Two Component Extreme Value Distribution. The p-values grade the conservation of the cis-regulatory elements above the neutral expectation. The parameter values for the distribution are estimated by simulating the neutral DNA evolution. 
The conservation of the transcription factor binding sites can be used in the upstream analysis of regulatory interactions. This approach may provide mechanistic insight into the transcription level data from, e.g., microarray experiments. Here we give a method to predict shared transcriptional regulators for a set of co-expressed genes. The EEL (Enhancer Element Locator) software implements the method for locating putative cis-regulatory elements. The software facilitates both interactive use and distributed batch processing. We have used it to analyze the non-coding regions around all human genes with respect to the orthologous regions in various other species including mouse. The data from these genome-wide analyses are stored in a relational database which is used in the publicly available web services for upstream analysis and visualization of the putative cis-regulatory elements in the human genome.

Relevance:

30.00%

Abstract:

Microbes in natural and artificial environments as well as in the human body are a key part of the functional properties of these complex systems. The presence or absence of certain microbial taxa is a correlate of functional status like risk of disease or course of metabolic processes of a microbial community. As microbes are highly diverse and mostly not cultivable, molecular markers like gene sequences are a potential basis for detection and identification of key types. The goal of this thesis was to study molecular methods for identification of microbial DNA in order to develop a tool for analysis of environmental and clinical DNA samples. Particular emphasis was placed on specificity of detection, which is a major challenge when analyzing complex microbial communities. The approach taken in this study was the application and optimization of enzymatic ligation of DNA probes coupled with microarray read-out for high-throughput microbial profiling. The results show that fungal phylotypes and human papillomavirus genotypes could be accurately identified from pools of PCR amplicons generated from purified sample DNA. Approximately 1 ng/μl of sample DNA was needed for representative PCR amplification as measured by comparisons between clone sequencing and microarray. A minimum of 0.25 amol/μl of PCR amplicons was detectable from amongst 5 ng/μl of background DNA, suggesting that the detection limit of the test, consisting of a ligation reaction followed by microarray read-out, was approximately 0.04%. Detection from sample DNA directly was shown to be feasible with probes forming a circular molecule upon ligation followed by PCR amplification of the probe. In this approach, the minimum detectable relative amount of target genome was found to be 1% of all genomes in the sample as estimated from 454 deep sequencing results. 
The signal-to-noise ratio of contact printed microarrays could be improved by using an internal microarray hybridization control oligonucleotide probe together with a computational algorithm. The algorithm was based on identification of a bias in the microarray data and correction of the bias as shown by simulated and real data. The results further suggest semiquantitative detection to be possible by ligation detection, allowing estimation of target abundance in a sample. However, in practice, comprehensive sequence information of full length rRNA genes is needed to support probe design with complex samples. This study shows that DNA microarrays have the potential to serve as an accurate microbial diagnostic platform that takes advantage of increasing sequence data and replaces traditional, less efficient methods that still dominate routine testing in laboratories. The data suggest that a ligation reaction-based microarray assay can be optimized to a degree that allows good signal-to-noise and semiquantitative detection.