10 results for Genome-specific Sequence

in Helda - Digital Repository of University of Helsinki


Relevance: 80.00%

Abstract:

Trimeric autotransporters are a family of secreted outer membrane proteins in Gram-negative bacteria. These obligate homotrimeric proteins share a conserved C-terminal region, termed the translocation unit. This domain consists of an integral membrane β-barrel anchor and associated α-helices which pass through the pore of the barrel. The α-helices link to the extracellular portion of the protein, the passenger domain. Autotransportation refers to the way in which the passenger domain is secreted into the extracellular space. It appears that the translocation unit mediates the transport of the passenger domain across the outer membrane, and no external factors, such as ATP, ion gradients or other proteins, are required. The passenger domain of autotransporters contains the specific activities of each protein. These are usually related to virulence. In trimeric autotransporters, the main function of the proteins is to act as adhesins. One such protein is the Yersinia adhesin YadA, found in enteropathogenic species of Yersinia. The main activity of YadA from Y. enterocolitica is to bind collagen, and it also mediates adhesion to other molecules of the extracellular matrix. In addition, YadA is involved in serum resistance, phagocytosis resistance, binding to epithelial cells and autoagglutination. YadA is an essential virulence factor of Y. enterocolitica, and removal of this protein from the bacteria leads to avirulence. In this study, I investigated the YadA-collagen interaction by studying the binding of YadA to collagen-mimicking peptides by several biochemical and biophysical methods. YadA bound as tightly to the triple-helical model peptide (Pro-Hyp-Gly)10 as to native collagen type I. However, YadA failed to bind a similar peptide that does not form a collagenous triple helix. As (Pro-Hyp-Gly)10 does not contain a specific sequence, we concluded that a triple-helical conformation is necessary for YadA binding, but no specific sequence is required.
To further investigate binding determinants for YadA in collagens, I examined the binding of YadA to a library of collagen-mimicking peptides that span the entire triple-helical sequences of human collagens type II and type III. YadA bound promiscuously to many but not all peptides, indicating that a triple-helical conformation alone is not sufficient for binding. The high-binding peptides did not share a clear binding motif, but these peptides were rich in hydroxyproline residues and contained a low number of charged residues. YadA thus binds collagens without sequence specificity. This strategy of promiscuous binding may be advantageous for pathogenic bacteria. The Eib proteins from Escherichia coli are immunoglobulin (Ig)-binding homologues of YadA. I showed conclusively that recombinant EibA, EibC, EibD and EibF bind to IgG Fc. I crystallised a fragment of the passenger domain of EibD, which binds IgA in addition to IgG. The structure has a YadA-like head domain and an extended coiled-coil stalk. The top half of the coiled-coil is right-handed with hendecad periodicity, whereas the lower half is a canonical left-handed coiled-coil. At the transition from right- to left-handedness, a small β-sheet protrudes from each monomer. Using truncations and site-directed mutagenesis, I was able to map the binding regions for IgG and IgA to the coiled-coil stalk and to identify residues critical for Ig binding.

Relevance: 40.00%

Abstract:

Growth is a fundamental aspect of the life cycle of all organisms. Body size varies greatly in most animal groups, such as mammals. Moreover, growth of a multicellular organism is not a uniform enlargement of size, but different body parts and organs grow to their characteristic sizes at different times. Currently very little is known about the molecular mechanisms governing this organ-specific growth. The genome sequencing projects have provided complete genomic DNA sequences of several species over the past decade. The amount of genomic sequence information, including sequence variants within species, is constantly increasing. Based on the universal genetic code, we can make sense of this sequence information as far as it codes for proteins. However, less is known about the molecular mechanisms that control expression of genes, and about the variations in gene expression that underlie many pathological states in humans. This is caused in part by lack of information about the second genetic code that consists of the binding specificities of transcription factors and the combinatorial code by which transcription factor binding sites are assembled to form tissue-specific and/or ligand-regulated enhancer elements. This thesis presents a high-throughput assay for identification of transcription factor binding specificities, which was then used to measure the DNA binding profiles of transcription factors involved in growth control. We developed ‘enhancer element locator’, a computational tool, which can be used to predict functional enhancer elements. A genome-wide prediction of human and mouse enhancer elements generated a large database of enhancer elements. This database can be used to identify target genes of signaling pathways, and to predict activated transcription factors based on changes in gene expression.
Predictions validated in transgenic mouse embryos revealed the presence of multiple tissue-specific enhancers in mouse c- and N-Myc genes, which has implications for organ-specific growth control and tumor type specificity of oncogenes. Furthermore, we were able to map a single nucleotide polymorphism (SNP) that confers susceptibility to colorectal cancer to an enhancer element, and to propose a mechanism by which this SNP might be involved in the development of colorectal cancer.

Relevance: 30.00%

Abstract:

NMR spectroscopy enables the study of biomolecules from peptides and carbohydrates to proteins at atomic resolution. The technique uniquely allows for structure determination of molecules in the solution state. It also gives insights into dynamics and intermolecular interactions important for determining biological function. Detailed molecular information is entangled in the nuclear spin states. The information can be extracted by pulse sequences designed to measure the desired molecular parameters. Advancement of pulse sequence methodology therefore plays a key role in the development of biomolecular NMR spectroscopy. A range of novel pulse sequences for solution-state NMR spectroscopy is presented in this thesis. The pulse sequences are described in relation to the molecular information they provide. The pulse sequence experiments represent several advances in NMR spectroscopy with particular emphasis on applications for proteins. Some of the novel methods focus on methyl-containing amino acids, which are pivotal for structure determination. Methyl-specific assignment schemes are introduced for increasing the size range of 13C,15N labeled proteins amenable to structure determination without resorting to more elaborate labeling schemes. Furthermore, cost-effective means are presented for monitoring amide and methyl correlations simultaneously. Residual dipolar couplings can be applied for structure refinement as well as for studying dynamics. Accurate methods for measuring residual dipolar couplings in small proteins are devised along with special techniques applicable when proteins require high pH or high temperature solvent conditions. Finally, a new technique is demonstrated to diminish strong-coupling induced artifacts in HMBC, a routine experiment for establishing long-range correlations in unlabeled molecules. The presented experiments facilitate structural studies of biomolecules by NMR spectroscopy.

Relevance: 30.00%

Abstract:

This thesis presents methods for locating and analyzing cis-regulatory DNA elements involved in the regulation of gene expression in multicellular organisms. The regulation of gene expression is carried out by the combined effort of several transcription factor proteins collectively binding the DNA on the cis-regulatory elements. Only sparse knowledge of the 'genetic code' of these elements exists today. An automatic tool for discovery of putative cis-regulatory elements could help their experimental analysis, which would result in a more detailed view of the cis-regulatory element structure and function. We have developed a computational model for the evolutionary conservation of cis-regulatory elements. The elements are modeled as evolutionarily conserved clusters of sequence-specific transcription factor binding sites. We give an efficient dynamic programming algorithm that locates the putative cis-regulatory elements and scores them according to the conservation model. A notable proportion of the high-scoring DNA sequences show transcriptional enhancer activity in transgenic mouse embryos. The conservation model includes four parameters whose optimal values are estimated with simulated annealing. With good parameter values the model discriminates well between the DNA sequences with evolutionarily conserved cis-regulatory elements and the DNA sequences that have evolved neutrally. On further inquiry, the set of highest scoring putative cis-regulatory elements was found to be sensitive to small variations in the parameter values. The statistical significance of the putative cis-regulatory elements is estimated with the Two Component Extreme Value Distribution. The p-values grade the conservation of the cis-regulatory elements above the neutral expectation. The parameter values for the distribution are estimated by simulating the neutral DNA evolution.
The conservation of the transcription factor binding sites can be used in the upstream analysis of regulatory interactions. This approach may provide mechanistic insight into the transcription level data from, e.g., microarray experiments. Here we give a method to predict shared transcriptional regulators for a set of co-expressed genes. The EEL (Enhancer Element Locator) software implements the method for locating putative cis-regulatory elements. The software facilitates both interactive use and distributed batch processing. We have used it to analyze the non-coding regions around all human genes with respect to the orthologous regions in various other species including mouse. The data from these genome-wide analyses is stored in a relational database which is used in the publicly available web services for upstream analysis and visualization of the putative cis-regulatory elements in the human genome.
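
The significance estimate mentioned in this abstract can be illustrated with a minimal sketch. It assumes the standard form of the two-component extreme value distribution, F(x) = exp(-λ1·exp(-x/θ1) - λ2·exp(-x/θ2)), and takes the p-value of a score as 1 - F(score). The function names and the parameter values are hypothetical placeholders chosen for illustration; they are not taken from the thesis or from the EEL software, where the parameters are estimated by simulating neutral DNA evolution.

```python
import math


def tcev_exponent(x, lam1, theta1, lam2, theta2):
    """Return a(x) such that the TCEV CDF is F(x) = exp(-a(x))."""
    return lam1 * math.exp(-x / theta1) + lam2 * math.exp(-x / theta2)


def tcev_cdf(x, lam1, theta1, lam2, theta2):
    """CDF of the two-component extreme value distribution (assumed form)."""
    return math.exp(-tcev_exponent(x, lam1, theta1, lam2, theta2))


def tcev_p_value(score, lam1, theta1, lam2, theta2):
    """Upper-tail probability P(X > score) = 1 - F(score).

    expm1 keeps precision when the p-value is very small.
    """
    return -math.expm1(-tcev_exponent(score, lam1, theta1, lam2, theta2))


# Hypothetical parameters, for illustration only.
PARAMS = dict(lam1=50.0, theta1=20.0, lam2=5.0, theta2=60.0)

if __name__ == "__main__":
    for score in (100.0, 200.0, 300.0):
        print(score, tcev_p_value(score, **PARAMS))
```

Under these assumptions a higher conservation score always yields a smaller p-value, so the p-values rank candidate elements by how far they exceed the neutral expectation.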

Relevance: 30.00%

Abstract:

Viral genomes are encapsidated within protective protein shells. This encapsidation can be achieved either by a co-condensation reaction of the nucleic acid and coat proteins, or by first forming empty viral particles which are subsequently packaged with nucleic acid, the latter mechanism being typical for many dsDNA bacteriophages. Bacteriophage PRD1 is an icosahedral, non-tailed dsDNA virus that has an internal lipid membrane, the hallmark of the Tectiviridae family. Although PRD1 has been known to assemble empty particles into which the genome is subsequently packaged, the mechanism for this has been unknown, and there has been no evidence for a separate packaging vertex, similar to the portal structures used for packaging in the tailed bacteriophages and herpesviruses. In this study, a unique DNA packaging vertex was identified for PRD1, containing the packaging ATPase P9, packaging factor P6 and two small membrane proteins, P20 and P22, extending the packaging vertex to the internal membrane. Lack of small membrane protein P20 was shown to totally abolish packaging, making it an essential part of the PRD1 packaging mechanism. The minor capsid protein P6 was shown to be an important packaging factor, its absence leading to greatly reduced packaging efficiency. An in vitro DNA packaging system consisting of recombinant packaging ATPase P9, empty procapsids and mutant PRD1 DNA with a LacZ-insert was developed for the analysis of PRD1 packaging, the first such system ever for a virus containing an internal membrane. A new tectiviral sequence, a linear plasmid called pBClin15, was identified in Bacillus cereus, providing material for sequence analysis of the tectiviruses. Analysis of PRD1 P9 and other putative tectiviral ATPase sequences revealed several conserved sequence motifs, among them a new tectiviral packaging ATPase motif. Mutagenesis studies on PRD1 P9 were used to confirm the significance of the motifs.
P9-type putative ATPase sequences carrying a similar sequence motif were identified in several other membrane-containing dsDNA viruses of bacterial, archaeal and eukaryotic hosts, suggesting that these viruses may have similar packaging mechanisms. Interestingly, almost the same set of viruses that were found to have similar putative packaging ATPases had earlier been found to share similar coat protein folds and capsid structures, and a common origin for these viruses had been suggested. The finding in this study of similar packaging proteins further supports the idea that these viruses are descendants of a common ancestor.

Relevance: 30.00%

Abstract:

Autoimmune diseases are a major health problem. Usually autoimmune disorders are multifactorial and their pathogenesis involves a combination of predisposing variations in the genome and other factors such as environmental triggers. APECED (autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy) is a rare, recessively inherited, autoimmune disease caused by mutations in a single gene. Patients with APECED suffer from several organ-specific autoimmune disorders, often affecting the endocrine glands. The defective gene, AIRE, codes for a transcriptional regulator. The AIRE (autoimmune regulator) protein controls the expression of hundreds of genes, representing a substantial subset of tissue-specific antigens which are presented to developing T cells in the thymus, and it has proven to be a key molecule in the establishment of immunological tolerance. However, the molecular mechanisms by which AIRE mediates its functions are still largely obscure. The aim of this thesis has been to elucidate the functions of AIRE by studying the molecular interactions it is involved in by utilizing different cultured cell models. A potential molecular mechanism for exceptional, dominant, inheritance of APECED in one family, carrying a glycine 228 to tryptophan (G228W) mutation, was described in this thesis. It was shown that the AIRE polypeptide with G228W mutation has a dominant negative effect by binding the wild type AIRE and inhibiting its transactivation capacity in vitro. The data also emphasize the importance of homomultimerization of AIRE in vivo. Furthermore, two novel protein families interacting with AIRE were identified. The importin alpha molecules regulate the nuclear import of AIRE by binding to the nuclear localization signal of AIRE, delineated as a classical monopartite signal sequence. The interaction of AIRE with PIAS E3 SUMO ligases indicates a link to the sumoylation pathway, which plays an important role in the regulation of nuclear architecture.
It was shown that AIRE is not a target for SUMO modification but enhances the localization of SUMO1 and PIAS1 proteins to nuclear bodies. Additional support for the suggestion that AIRE would preferably up-regulate genes with a tissue-specific expression pattern and down-regulate housekeeping genes was obtained from transactivation studies performed with two models: human insulin and cystatin B promoters. Furthermore, AIRE and PIAS activate the insulin promoter concurrently in a transactivation assay, indicating that their interaction is biologically relevant. Identification of novel interaction partners for AIRE provides us with information about the molecular pathways involved in the establishment of immunological tolerance and deepens our understanding of the role played by AIRE not only in APECED but possibly also in several other autoimmune diseases.

Relevance: 30.00%

Abstract:

Evolutionary genetics incorporates traditional population genetics and studies of the origins of genetic variation by mutation and recombination, and the molecular evolution of genomes. Among the primary forces that have potential to affect the genetic variation within and among populations, including those that may lead to adaptation and speciation, are genetic drift, gene flow, mutations and natural selection. The main challenges in knowing the genetic basis of evolutionary changes are to distinguish the adaptive selection forces that cause existent DNA sequence variants and also to identify the nucleotide differences responsible for the observed phenotypic variation. To understand the effects of various forces, interpretation of gene sequence variation has been the principal basis of many evolutionary genetic studies. The main aim of this thesis was to assess different forms of teleost gene sequence polymorphisms in evolutionary genetic studies of Atlantic salmon (Salmo salar) and other species. Firstly, the level of Darwinian adaptive evolution affecting coding regions of the growth hormone (GH) gene during the teleost evolution was investigated based on the sequence data existing in public databases. Secondly, a target gene approach was used to identify within-population variation in the growth hormone 1 (GH1) gene in salmon. Then, a new strategy for single nucleotide polymorphisms (SNPs) discovery in salmonid fishes was introduced, and, finally, the usefulness of a limited number of SNP markers as molecular tools in several applications of population genetics in Atlantic salmon was assessed. This thesis showed that the gene sequences in databases can be utilized to perform comparative studies of molecular evolution, and some putative evidence of the existence of Darwinian selection during the teleost GH evolution was presented.
In addition, existent sequence data was exploited to investigate GH1 gene variation within Atlantic salmon populations throughout its range. Purifying selection is suggested to be the predominant evolutionary force controlling the genetic variation of this gene in salmon, and some support for gene flow between continents was also observed. The novel approach to SNP discovery in species with duplicated genome fragments introduced here proved to be an effective method, and this may have several applications in evolutionary genetics with different species - e.g. when developing gene-targeted markers to investigate quantitative genetic variation. The thesis also demonstrated that only a few SNPs produced signals highly similar to those of the microsatellite markers in some of the population genetic analyses. This may have useful applications when estimating genetic diversity in genes having a potential role in ecological and conservation issues, or when using difficult biological samples in genetic studies, as SNPs can be applied with relatively highly degraded DNA.

Relevance: 30.00%

Abstract:

Filamentous fungi of the subphylum Pezizomycotina are well known as protein and secondary metabolite producers. Various industries take advantage of these capabilities. However, the molecular biology of yeasts, i.e. Saccharomycotina and especially that of Saccharomyces cerevisiae, the baker's yeast, is much better known. In an effort to explain fungal phenotypes through their genotypes we have compared protein-coding gene contents of Pezizomycotina and Saccharomycotina. Only biomass degradation and secondary metabolism-related protein families seem to have expanded recently in Pezizomycotina. Of the protein families clearly diverged between Pezizomycotina and Saccharomycotina, those related to mitochondrial functions emerge as the most prominent. However, the primary metabolism as described in S. cerevisiae is largely conserved in all fungi. Apart from the known secondary metabolism, Pezizomycotina have pathways that could link secondary metabolism to primary metabolism and a wealth of undescribed enzymes. Previous studies of individual Pezizomycotina genomes have shown that regardless of the difference in production efficiency and diversity of secreted proteins, the content of the known secretion machinery genes in Pezizomycotina and Saccharomycotina appears very similar. Genome-wide analysis of gene products is therefore needed to better understand the efficient secretion of Pezizomycotina. We have developed methods applicable to transcriptome analysis of non-sequenced organisms. TRAC (Transcriptional profiling with the aid of affinity capture) has been previously developed at VTT for fast, focused transcription analysis. We introduce a version of TRAC that allows more powerful signal amplification and multiplexing. We also present computational optimisations of transcriptome analysis of non-sequenced organisms and TRAC analysis in general. Trichoderma reesei is one of the most commonly used Pezizomycotina in the protein production industry.
In order to understand its secretion system better and find clues for improvement of its industrial performance, we have analysed its transcriptomic response to protein secretion stress conditions. In comparison to S. cerevisiae, the response of T. reesei appears different, but still affects the same cellular functions. We also discovered in T. reesei interesting similarities to the mammalian protein secretion stress response. Together these findings highlight targets for more detailed studies.

Relevance: 30.00%

Abstract:

Enzymes offer many advantages in industrial processes, such as high specificity, mild treatment conditions and low energy requirements. Therefore, the industry has exploited them in many sectors including food processing. Enzymes can modify food properties by acting on small molecules or on polymers such as carbohydrates or proteins. Crosslinking enzymes such as tyrosinases and sulfhydryl oxidases catalyse the formation of novel covalent bonds between specific residues in proteins and/or peptides, thus forming or modifying the protein network of food. In this study, novel secreted fungal proteins with sequence features typical of tyrosinases and sulfhydryl oxidases were identified through a genome mining study. Representatives of both of these enzyme families were selected for heterologous production in the filamentous fungus Trichoderma reesei and biochemical characterisation. Firstly, a novel family of putative tyrosinases carrying a shorter sequence than the previously characterised tyrosinases was discovered. These proteins lacked the whole linker and C-terminal domain that possibly play a role in cofactor incorporation, folding or protein activity. One of these proteins, AoCO4 from Aspergillus oryzae, was produced in T. reesei with a production level of about 1.5 g/l. The enzyme AoCO4 was correctly folded and bound the copper cofactors with a type-3 copper centre. However, the enzyme had only a low level of activity with the phenolic substrates tested. Highest activity was obtained with 4-tert-butylcatechol. Since tyrosine was not a substrate for AoCO4, the enzyme was classified as catechol oxidase. Secondly, the genome analysis for secreted proteins with sequence features typical of flavin-dependent sulfhydryl oxidases pinpointed two previously uncharacterised proteins AoSOX1 and AoSOX2 from A. oryzae. These two novel sulfhydryl oxidases were produced in T. reesei with production levels of 70 and 180 mg/l, respectively, in shake flask cultivations.
AoSOX1 and AoSOX2 were FAD-dependent enzymes with a dimeric tertiary structure and they both showed activity on small sulfhydryl compounds such as glutathione and dithiothreitol, and were drastically inhibited by zinc sulphate. AoSOX2 showed good stability to thermal and chemical denaturation, being superior to AoSOX1 in this respect. Thirdly, the suitability of AoSOX1 as a possible baking improver was elucidated. The effect of AoSOX1, alone and in combination with the widely used improver ascorbic acid, was tested on yeasted wheat dough, both fresh and frozen, and on fresh water-flour dough. In all cases, AoSOX1 had no effect on the fermentation properties of fresh yeasted dough. AoSOX1 negatively affected the fermentation properties of frozen doughs and accelerated the damaging effects of the frozen storage, i.e. giving a softer dough with poorer gas retention abilities than the control. In combination with ascorbic acid, AoSOX1 gave harder doughs. Accordingly, rheological studies in yeast-free dough showed that the presence of only AoSOX1 resulted in weaker and more extensible dough whereas a dough with opposite properties was obtained if ascorbic acid was also used. Doughs containing ascorbic acid and increasing amounts of AoSOX1 were harder in a dose-dependent manner. Sulfhydryl oxidase AoSOX1 had an enhancing effect on the dough hardening mechanism of ascorbic acid. This was ascribed mainly to the production of hydrogen peroxide in the SOX reaction which is able to convert the ascorbic acid to the actual improver dehydroascorbic acid. In addition, AoSOX1 could possibly oxidise the free glutathione in the dough and thus prevent the loss of dough strength caused by the spontaneous reduction of the disulfide bonds constituting the dough protein network. Sulfhydryl oxidase AoSOX1 is therefore able to enhance the action of ascorbic acid in wheat dough and could potentially be applied in wheat dough baking.

Relevance: 30.00%

Abstract:

Lactobacillus rhamnosus GG is a probiotic bacterium that is known worldwide. Since its discovery in 1985, the health effects and biology of this health-promoting strain have been researched at an increasing rate. However, knowledge of the molecular biology responsible for these health effects is limited, even though research in this area has continued to grow since the publication of the whole genome sequence of L. rhamnosus GG in 2009. In this thesis, the molecular biology of L. rhamnosus GG was explored by mapping the changes in protein levels in response to diverse stress factors and environmental conditions. The proteomics data were supplemented with transcriptome level mapping of gene expression. The harsh conditions of the gastro-intestinal tract, which involve acidic conditions and detergent-like bile acids, are a notable challenge to the survival of probiotic bacteria. To simulate these conditions, L. rhamnosus GG was exposed to a sudden bile stress, and several stress response mechanisms were revealed, including various changes in the cell envelope properties. L. rhamnosus GG also responded in various ways to mild acid stress, which probiotic bacteria may face in dairy fermentations and product formulations. The acid stress response of L. rhamnosus GG included changes in central metabolism and specific responses related to the control of intracellular pH. Altogether, L. rhamnosus GG was shown to possess a large repertoire of mechanisms for responding to stress conditions, which is a beneficial characteristic of a probiotic organism. Adaptation to different growth conditions was studied by comparing the proteome level responses of L. rhamnosus GG to divergent growth media and to different phases of growth. Comparing different growth phases revealed that the metabolism of L. rhamnosus GG is modified markedly during the shift from the exponential to the stationary phase of growth.
These changes were seen at both the proteome and transcriptome levels and in various cellular functions. When the growth of L. rhamnosus GG in a rich laboratory medium and in an industrial whey-based medium was compared, various differences in metabolism and in factors affecting the cell surface properties could be seen. These results led us to recommend that industrial-type media be used in laboratory studies of L. rhamnosus GG and other probiotic bacteria to achieve a physiological state similar to that found in industrial products, which would thus yield more relevant information about the bacteria. In addition, an interesting phenomenon of protein phosphorylation was observed in L. rhamnosus GG. Phosphorylation of several proteins of L. rhamnosus GG was detected, and there were hints that the degree of phosphorylation may be dependent on the growth pH.