920 resultados para The genetic code
Resumo:
Part I. Complexes of Biological Bases and Oligonucleotides with RNA
The physical nature of complexes of several biological bases and oligonucleotides with single-stranded ribonucleic acids have been studied by high resolution proton magnetic resonance spectroscopy. The importance of various forces in the stabilization of these complexes is also discussed.
Previous work has shown that purine forms an intercalated complex with single-stranded nucleic acids. This complex formation led to severe and stereospecific broadening of the purine resonances. From the field dependence of the linewidths, T1 measurements of the purine protons and nuclear Overhauser enhancement experiments, the mechanism for the line broadening was ascertained to be dipole-dipole interactions between the purine protons and the ribose protons of the nucleic acid.
The interactions of ethidium bromide (EB) with several RNA residues have been studied. EB forms vertically stacked aggregates with itself as well as with uridine, 3'-uridine monophosphate and 5'-uridine monophosphate and forms an intercalated complex with uridylyl (3' → 5') uridine and polyuridylic acid (poly U). The geometry of EB in the intercalated complex has also been determined.
The effect of chain length of oligo-A-nucleotides on their mode of interaction with poly U in D20 at neutral pD have also been studied. Below room temperatures, ApA and ApApA form a rigid triple-stranded complex involving a stoichiometry of one adenine to two uracil bases, presumably via specific adenine-uracil base pairing and cooperative base stacking of the adenine bases. While no evidence was obtained for the interaction of ApA with poly U above room temperature, ApApA exhibited complex formation of a 1:1 nature with poly U by forming Watson-Crick base pairs. The thermodynamics of these systems are discussed.
Part II. Template Recognition and the Degeneracy of the Genetic Code
The interaction of ApApG and poly U was studied as a model system for the codon-anticodon interaction of tRNA and mRNA in vivo. ApApG was shown to interact with poly U below ~20°C. The interaction was of a 1:1 nature which exhibited the Hoogsteen bonding scheme. The three bases of ApApG are in an anti conformation and the guanosine base appears to be in the lactim tautomeric form in the complex.
Due to the inadequacies of previous models for the degeneracy of the genetic code in explaining the observed interactions of ApApG with poly U, the "tautomeric doublet" model is proposed as a possible explanation of the degenerate interactions of tRNA with mRNA during protein synthesis in vivo.
Resumo:
Although the genetic code is generally viewed as immutable, alterations to its standard form occur in the three domains of life. A remarkable alteration to the standard genetic code occurs in many fungi of the Saccharomycotina CTG clade where the Leucine CUG codon has been reassigned to Serine by a novel transfer RNA (Ser-tRNACAG). The host laboratory made a major breakthrough by reversing this atypical genetic code alteration in the human pathogen Candida albicans using a combination of tRNA engineering, gene recombination and forced evolution. These results raised the hypothesis that synthetic codon ambiguities combined with experimental evolution may release codons from their frozen state. In this thesis we tested this hypothesis using S. cerevisiae as a model system. We generated ambiguity at specific codons in a two-step approach, involving deletion of tRNA genes followed by expression of non-cognate tRNAs that are able to compensate the deleted tRNA. Driven by the notion that rare codons are more susceptible to reassignment than those that are frequently used, we used two deletion strains where there is no cognate tRNA to decode the rare CUC-Leu codon and AGG-Arg codon. We exploited the vulnerability of the latter by engineering mutant tRNAs that misincorporate Ser at these sites. These recombinant strains were evolved over time using experimental evolution. Although there was a strong negative impact on the growth rate of strains expressing mutant tRNAs at high level, such expression at low level had little effect on cell fitness. We found that not only codon ambiguity, but also destabilization of the endogenous tRNA pool has a strong negative impact in growth rate. After evolution, strains expressing the mutant tRNA at high level recovered significantly in several growth parameters, showing that these strains adapt and exhibit higher tolerance to codon ambiguity. A fluorescent reporter system allowing the monitoring of Ser misincorporation showed that serine was indeed incorporated and possibly codon reassignment was achieved. Beside the overall negative consequences of codon ambiguity, we demonstrated that codons that tolerate the loss of their cognate tRNA can also tolerate high Ser misincorporation. This raises the hypothesis that these codons can be reassigned to standard and eventually to new amino acids for the production of proteins with novel properties, contributing to the field of synthetic biology and biotechnology.
Resumo:
We give a list of all possible schemes for performing amino acid and codon assignments in algebraic models for the genetic code, which are consistent with a few simple symmetry principles, in accordance with the spirit of the algebraic approach to the evolution of the genetic code proposed by Hornos and Hornos. Our results are complete in the sense of covering all the algebraic models that arise within this approach, whether based on Lie groups/Lie algebras, on Lie superalgebras or on finite groups.
Resumo:
We investigate the possibility of interpreting the degeneracy of the genetic code, i.e., the feature that different codons (base triplets) of DNA are transcribed into the same amino acid, as the result of a symmetry breaking process, in the context of finite groups. In the first part of this paper, we give the complete list of all codon representations (64-dimensional irreducible representations) of simple finite groups and their satellites (central extensions and extensions by outer automorphisms). In the second part, we analyze the branching rules for the codon representations found in the first part by computational methods, using a software package for computational group theory. The final result is a complete classification of the possible schemes, based on finite simple groups, that reproduce the multiplet structure of the genetic code. (C) 2010 Elsevier Ltd. All rights reserved.
Resumo:
The objective of this work is to characterize the genome of the chromosome 1 of A.thaliana, a small flowering plants used as a model organism in studies of biology and genetics, on the basis of a recent mathematical model of the genetic code. I analyze and compare different portions of the genome: genes, exons, coding sequences (CDS), introns, long introns, intergenes, untranslated regions (UTR) and regulatory sequences. In order to accomplish the task, I transformed nucleotide sequences into binary sequences based on the definition of the three different dichotomic classes. The descriptive analysis of binary strings indicate the presence of regularities in each portion of the genome considered. In particular, there are remarkable differences between coding sequences (CDS and exons) and non-coding sequences, suggesting that the frame is important only for coding sequences and that dichotomic classes can be useful to recognize them. Then, I assessed the existence of short-range dependence between binary sequences computed on the basis of the different dichotomic classes. I used three different measures of dependence: the well-known chi-squared test and two indices derived from the concept of entropy i.e. Mutual Information (MI) and Sρ, a normalized version of the “Bhattacharya Hellinger Matusita distance”. The results show that there is a significant short-range dependence structure only for the coding sequences whose existence is a clue of an underlying error detection and correction mechanism. No doubt, further studies are needed in order to assess how the information carried by dichotomic classes could discriminate between coding and noncoding sequence and, therefore, contribute to unveil the role of the mathematical structure in error detection and correction mechanisms. Still, I have shown the potential of the approach presented for understanding the management of genetic information.
Resumo:
The complementary Watson-Crick base-pairs, A:T and G:C, have long been recognized as pivotal to both the stability of the DNA double helix and replication/transcription. Recently, the replacement of the Watson-Crick base-pairs with other molecular entities has received considerable attention. In this tutorial review we highlight different approaches used to replace natural base-pairs and equip them with novel function. We also discuss the advantages that non-natural base-pairs convey with respect to practical applications.
Resumo:
The trinucleotide/amino acid relationships of the present-day genetic code are established by the amino-acylation reactions of tRNA synthetases, whereby each of 20 specific amino acids is attached to its cognate tRNAs, which bear anticodon trinucleotides. Because of its universality, the appearance of the modern genetic code is thought to predate the separation of prokaryotic and eukaryotic organisms in the universal phylogenetic tree. In the light of new sequence information, we present here a phylogenetic analysis that shows an unusual picture for tyrosyl- and tryptophanyl-tRNA synthetases. Ij particular, the eukaryotic tyrosyl- and tryptophanyl-tRNA synthetases are more related to each other than to their respective prokaryotic counterparts. In contrast, each of the other 18 eukaryotic synthetases is more related to its prokaryotic counterpart than to any eukaryotic synthetase specific for a different amino acid. Our results raise the possibility that present day tyrosyl- and tryptophanyl-tRNA synthetases appeared after the separation of nucleated cells from eubacteria. The results have implications for the development of the genetic code.
Resumo:
mRNA translation in many ciliates utilizes variant genetic codes where stop codons are reassigned to specify amino acids. To characterize the repertoire of ciliate genetic codes, we analyzed ciliate transcriptomes from marine environments. Using codon substitution frequencies in ciliate protein-coding genes and their orthologs, we inferred the genetic codes of 24 ciliate species. Nine did not match genetic code tables currently assigned by NCBI. Surprisingly, we identified a novel genetic code where all three standard stop codons (TAA, TAG, and TGA) specify amino acids in Condylostoma magnum. We provide evidence suggesting that the functions of these codons in C. magnum depend on their location within mRNA. They are decoded as amino acids at internal positions, but specify translation termination when in close proximity to an mRNA 3' end. The frequency of stop codons in protein coding sequences of closely related Climacostomum virens suggests that it may represent a transitory state.mRNA translation in many ciliates utilizes variant genetic codes where stop codons are reassigned to specify amino acids. To characterize the repertoire of ciliate genetic codes, we analyzed ciliate transcriptomes from marine environments. Using codon substitution frequencies in ciliate protein-coding genes and their orthologs, we inferred the genetic codes of 24 ciliate species. Nine did not match genetic code tables currently assigned by NCBI. Surprisingly, we identified a novel genetic code where all three standard stop codons (TAA, TAG, and TGA) specify amino acids in Condylostoma magnum. We provide evidence suggesting that the functions of these codons in C. magnum depend on their location within mRNA. They are decoded as amino acids at internal positions, but specify translation termination when in close proximity to an mRNA 3' end. The frequency of stop codons in protein coding sequences of closely related Climacostomum virens suggests that it may represent a transitory state.
Resumo:
The genetic code establishes the rules that govern gene translation into proteins. It was established more than 3.5 billion years ago and it is one of the most conserved features of life. Despite this, several alterations to the standard genetic code have been discovered in both prokaryotes and eukaryotes, namely in the fungal CTG clade where a unique seryl transfer RNA (tRNACAG Ser) decodes leucine CUG codons as serine. This tRNACAG Ser appeared 272±25 million years ago through insertion of an adenosine in the middle position of the anticodon of a tRNACGA Ser gene, which changed its anticodon from 5´-CGA-3´ to 5´-CAG-3´. This most dramatic genetic event restructured the proteome of the CTG clade species, but it is not yet clear how and why such deleterious genetic event was selected and became fixed in those fungal genomes. In this study we have attempted to shed new light on the evolution of this fungal genetic code alteration by reconstructing its evolutionary pathway in vivo in the yeast Saccharomyces cerevisiae. For this, we have expressed wild type and mutant versions of the C. albicans tRNACGA Ser gene into S. cerevisiae and evaluated the impact of the mutant tRNACGA Ser on fitness, tRNA stability, translation efficiency and aminoacylation kinetics. Our data demonstrate that these mutants are expressed and misincorporate Ser at CUGs, but their expression is repressed through an unknown molecular mechanism. We further demonstrate, using in vivo forced evolution methodologies, that the tRNACAG Ser can be easily inactivated through natural mutations that prevent its recognition by the seryl-tRNA synthetase. The overall data show that repression of expression of the mistranslating tRNACAG Ser played a critical role on the evolution of CUG reassignment from Leu to Ser. In order to better understand the evolution of natural genetic code alterations, we have also engineered partial reassignment of various codons in yeast. The data confirmed that genetic code ambiguity affects fitness, induces protein aggregation, interferes with the cell cycle and results in nuclear and morphologic alterations, genome instability and gene expression deregulation. Interestingly, it also generates phenotypic variability and phenotypes that confer growth advantages in certain environmental conditions. This study provides strong evidence for direct and critical roles of the environment on the evolution of genetic code alterations.
Resumo:
The genetic code is not universal. Alterations to its standard form have been discovered in both prokaryotes and eukaryotes and demolished the dogma of an immutable code. For instance, several Candida species translate the standard leucine CUG codon as serine. In the case of the human pathogen Candida albicans, a serine tRNA (tRNACAGSer) incorporates in vivo 97% of serine and 3% of leucine in proteins at CUG sites. Such ambiguity is flexible and the level of leucine incorporation increases significantly in response to environmental stress. To elucidate the function of such ambiguity and clarify whether the identity of the CUG codon could be reverted from serine back to leucine, we have developed a forced evolution strategy to increase leucine incorporation at CUGs and a fluorescent reporter system to monitor such incorporation in vivo. Leucine misincorporation increased from 3% up to nearly 100%, reverting CUG identity from serine back to leucine. Growth assays showed that increasing leucine incorporation produced impressive arrays of phenotypes of high adaptive potential. In particular, strains with high levels of leucine misincorporation exhibited novel phenotypes and high level of tolerance to antifungals. Whole genome re-sequencing revealed that increasing levels of leucine incorporation were associated with accumulation of single nucleotide polymorphisms (SNPs) and loss of heterozygozity (LOH) in the higher misincorporating strains. SNPs accumulated preferentially in genes involved in cell adhesion, filamentous growth and biofilm formation, indicating that C. albicans uses its natural CUG ambiguity to increase genetic diversity in pathogenesis and drug resistance related processes. The overall data provided evidence for unantecipated flexibility of the C. albicans genetic code and highlighted new roles of codon ambiguity on the evolution of genetic and phenotypic diversity.
Resumo:
Mathematical models, as instruments for understanding the workings of nature, are a traditional tool of physics, but they also play an ever increasing role in biology - in the description of fundamental processes as well as that of complex systems. In this review, the authors discuss two examples of the application of group theoretical methods, which constitute the mathematical discipline for a quantitative description of the idea of symmetry, to genetics. The first one appears, in the form of a pseudo-orthogonal (Lorentz like) symmetry, in the stochastic modelling of what may be regarded as the simplest possible example of a genetic network and, hopefully, a building block for more complicated ones: a single self-interacting or externally regulated gene with only two possible states: ` on` and ` off`. The second is the algebraic approach to the evolution of the genetic code, according to which the current code results from a dynamical symmetry breaking process, starting out from an initial state of complete symmetry and ending in the presently observed final state of low symmetry. In both cases, symmetry plays a decisive role: in the first, it is a characteristic feature of the dynamics of the gene switch and its decay to equilibrium, whereas in the second, it provides the guidelines for the evolution of the coding rules.
Resumo:
In distinction to single-stranded anticodons built of G, C, A, and U bases, their presumable double-stranded precursors at the first three positions of the acceptor stem are composed almost invariably of G-C and C-G base pairs. Thus, the “second” operational RNA code responsible for correct aminoacylation seems to be a (G,C) code preceding the classic genetic code. Although historically rooted, the two codes were destined to diverge quite early. However, closer inspection revealed that two complementary catalytic domains of class I and class II aminoacyl-tRNA synthetases (aaRSs) multiplied by two, also complementary, G2-C71 and C2-G71 targets in tRNA acceptors, yield four (2 × 2) different modes of recognition. It appears therefore that the core four-column organization of the genetic code, associated with the most conservative central base of anticodons and codons, was in essence predetermined by these four recognition modes of the (G,C) operational code. The general conclusion follows that the genetic code per se looks like a “frozen accident” but only beyond the “2 × 2 = 4” scope. The four primordial modes of tRNA–aaRS recognition are amenable to direct experimental verification.
Resumo:
This work is concerned with the genetic basis of normal human pigmentation variation. Specifically, the role of polymorphisms within the solute carrier family 45 member 2 (SLC45A2 or membrane associated transporter protein; MATP) gene were investigated with respect to variation in hair, skin and eye colour ― both between and within populations. SLC45A2 is an important regulator of melanin production and mutations in the gene underly the most recently identified form of oculocutaneous albinism. There is evidence to suggest that non-synonymous polymorphisms in SLC45A2 are associated with normal pigmentation variation between populations. Therefore, the underlying hypothesis of this thesis is that polymorphisms in SLC45A2 will alter the function or regulation of the protein, thereby altering the important role it plays in melanogenesis and providing a mechanism for normal pigmentation variation. In order to investigate the role that SLC45A2 polymorphisms play in human pigmentation variation, a DNA database was established which collected pigmentation phenotypic information and blood samples of more than 700 individuals. This database was used as the foundation for two association studies outlined in this thesis, the first of which involved genotyping two previously-described non-synonymous polymorphisms, p.Glu272Lys and p.Phe374Leu, in four different population groups. For both polymorphisms, allele frequencies were significantly different between population groups and the 272Lys and 374Leu alleles were strongly associated with black hair, brown eyes and olive skin colour in Caucasians. This was the first report to show that SLC45A2 polymorphisms were associated with normal human intra-population pigmentation variation. The second association study involved genotyping several SLC45A2 promoter polymorphisms to determine if they also played a role in pigmentation variation. Firstly, the transcription start site (TSS), and hence putative proximal promoter region, was identified using 5' RNA ligase mediated rapid amplification of cDNA ends (RLM-RACE). Two alternate TSSs were identified and the putative promoter region was screened for novel polymorphisms using denaturing high performance liquid chromatography (dHPLC). A novel duplication (c.–1176_–1174dupAAT) was identified along with other previously described single nucleotide polymorphisms (c.–1721C>G and c.–1169G>A). Strong linkage disequilibrium ensured that all three polymorphisms were associated with skin colour such that the –1721G, +dup and –1169A alleles were associated with olive skin in Caucasians. No linkage disequilibrium was observed between the promoter and coding region polymorphisms, suggesting independent effects. The association analyses were complemented with functional data, showing that the –1721G, +dup and –1169A alleles significantly decreased SLC45A2 transcriptional activity. Based on in silico bioinformatic analysis that showed these alleles remove a microphthalmia-associated transcription factor (MITF) binding site, and that MITF is a known regulator of SLC45A2 (Baxter and Pavan, 2002; Du and Fisher, 2002), it was postulated that SLC45A2 promoter polymorphisms could contribute to the regulation of pigmentation by altering MITF binding affinity. Further characterisation of the SLC45A2 promoter was carried out using luciferase reporter assays to determine the transcriptional activity of different regions of the promoter. Five constructs were designed of increasing length and their promoter activity evaluated. Constitutive promoter activity was observed within the first ~200 bp and promoter activity increased as the construct size increased. The functional impact of the –1721G, +dup and –1169A alleles, which removed a MITF consensus binding site, were assessed using electrophoretic mobility shift assays (EMSA) and expression analysis of genotyped melanoblast and melanocyte cell lines. EMSA results confirmed that the promoter polymorphisms affected DNA-protein binding. Interestingly, however, the protein/s involved were not MITF, or at least MITF was not the protein directly binding to the DNA. In an effort to more thoroughly characterise the functional consequences of SLC45A2 promoter polymorphisms, the mRNA expression levels of SLC45A2 and MITF were determined in melanocyte/melanoblast cell lines. Based on SLC45A2’s role in processing and trafficking TYRP1 from the trans-Golgi network to stage 2 melanosmes, the mRNA expression of TYRP1 was also investigated. Expression results suggested a coordinated expression of pigmentation genes. This thesis has substantially contributed to the field of pigmentation by showing that SLC45A2 polymorphisms not only show allele frequency differences between population groups, but also contribute to normal pigmentation variation within a Caucasian population. In addition, promoter polymorphisms have been shown to have functional consequences for SLC45A2 transcription and the expression of other pigmentation genes. Combined, the data presented in this work supports the notion that SLC45A2 is an important contributor to normal pigmentation variation and should be the target of further research to elucidate its role in determining pigmentation phenotypes. Understanding SLC45A2’s function may lead to the development of therapeutic interventions for oculocutaneous albinism and other disorders of pigmentation. It may also help in our understanding of skin cancer susceptibility and evolutionary adaptation to different UV environments, and contribute to the forensic application of pigmentation phenotype prediction.