990 resultados para transfer rna
Resumo:
The complete sequence of the 16,539 nucleotide mitochondrial genome from the single species of the catfish family Cranoglanididae, the helmet catfish Cranoglanis bouderius, was determined using the long and accurate polymerase chain reaction (LA PCR) method. The nucleotide sequences of C. bouderius mitochondrial DNA have been compared with those of three other catfish species in the same order. The contents of the C. bouderius mitochondrial genome are 13 protein-coding genes, two ribosomal RNA and 22 transfer RNA genes, and a non-coding control region, the gene order of which is identical to that observed in most other vertebrates. Phylogenetic analyses for 13 otophysan fishes were performed using Bayesian method based on the concatenated mtDNA protein-coding gene sequence and the individual protein-coding gene sequence data set. The competing otophysan topologies were then tested by using the approximately unbiased test, the Kishino-Hasegawa test, and the Shimodaira-Hasegawa test. The results show that the grouping ((((Characifonnes, Gymnotiformes), Siluriformes), Cyprinifionnes), outgroup) is the most likely but there is no significant difference between this one and the other alternative hypotheses. In addition, the phylogenetic placement of the family Cranoglanididae among siluriform families was also discussed. (c) 2006 Elsevier B.V. All rights reserved.
Resumo:
The structure of phenylalanine transfer ribonucleic acid (tRNA(Phe)) in solution was explored by H-1 NMR spectroscopy to evaluate the effect of lanthanide ion on the structural and conformational change. It was found that La3+ ions possess specific effects on the imino proton region of the H-1 NMR spectra for yeast tRNA(Phe). The dependence of the imino proton spectra of yeast tRNA(Phe) as a function of La3+ concentration was examined, and the results suggest that the tertiary base pair G(15). C-48, which is located in the terminal in the augmented dihydrouridine helix (D-helix), was markedly affected by La3+ (shifted to downfield by as much as 0.35). Base pair U-8. A(14) in yeast tRNA(Phe), which are stacked on G(15). C-48, was also affected by added La3+ when 1 similar to 2 Mg2+ were also present. Another imino proton that may be affected by La3+ in yeast tRNA(Phe) is that of the tertiary base pair G(19). C-56. The assignment of this resonance in yeast tRNA(Phe) is tentative since it is located in the region of highly overlapping resonances beween 12.6 and 12.2. This base pair helps to anchor the D-loop to the T Psi C loop. The binding of La3+ caused conformational change of tRNA, which is responsible for shifts to upfield or downfield in H-1 NMR spectra.
Resumo:
Complete mitochondrial genome plays an important role in the accurate revelation of phylogenetic relationships among metazoans. Here we present the complete mitochondrial genome sequence from a sea cucumber Apostichopus japonicus (Echinodermata: Holothuroidea), which is the first representative from the subclass Aspidochirotacea. The mitochondrial genome of A. japonicus is 16,096 bp in length. The heavy strand consists of 31.8% A, 20.2% C, 17.9% G, and 30.1% T bases (AT skew = 0.027: GC skew = 0.062). It contains thirteen protein-coding genes (PCGs), twenty-two transfer RNA genes, and two ribosomal RNA genes. There are a total of 3793 codons in all thirteen mitochondrial PCGs, excluding incomplete termination codons. The most frequently used amino acid is Leu (15.77%), followed by Set (9.73%), Met (8.62%), Phe (7.94%), and Ala (7.28%). Intergenetic regions in the mitochondrial genome of A. japonicus are 839 bp in total, with three relatively large regions of Unassigned Sequences (UAS) greater than 100 bp. The gene order of A. japonicus is identical to that observed in the five studied sea urchins, which confirms that the gene order shared by the two classes (Holothuroidea and Echinoidea) is a ground pattern of echinoderm mitochondrial genomes. Bayesian tree based on the cob gene supports the following relationship: (outgroup, (Crinoids, (Asteroids, Ophiuroids, (Echinoids, Holothuroids)))). (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
Complete mitochondrial genomes have proven extremely valuable in helping to understand the evolutionary relationships among metazoans. However, uneven taxon sampling may lead to unclear or even erroneous phylogenetic topologies. The decapod crustaceans are relatively well-sampled, but sampling is still uneven within this group. We have sequenced the mitochondrial genomes of two shrimps Litopenaeus vannamei and Fenneropenaeus chinensis. As seen in other metazoans, the genomes contain a standard set of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes and an AT-rich non-coding region. The gene arrangements are consistent with the pancrustacean ground pattern. Both the pattern of gene rearrangements and phylogenomic analyses using concatenated nucleic acid and amino acid sequences of the 13 mitochondrial protein-coding genes strengthened the support that Caridea and Palinura are primitive members of Pleocyemata. These sequences, in combination with two previously published penaeid mitochondrial genomes, suggest that genera within the family Penaeidae have the following relationship: (((Penaeits + Fenneropenaett.) + Litopeiiaelts) + Marsupenaeus). The analyses of nucleic acid and amino acid sequences of the mitochondrial genomes also strongly support the monophyly of Penaeidae, Brachyura and Pleocyemata. In addition, the analyses of the average Ka/Ks in the 13 mitochondrial protein-coding genes of penaeid shrimps indicated a strong purifying selection within this group.
Resumo:
Given the commercial and ecological importance of the Asian paddle crab, Charybdis japonica, there is a clearly need for genetic and molecular research on this species. Here, we present the complete mitochondrial genome sequence of C. japonica, determined by the long-polymerase chain reaction and primer walking sequencing method. The entire genome is 15,738 bp in length, encoding a standard set of 13 protein-coding genes, two ribosomal RNA genes, and 22 transfer RNA genes, plus the putative control region, which is typical for metazoans. The total A+T content of the genome is 69.2%, lower than the other brachyuran crabs except for Callinectes sapidus. The gene order is identical to the published marine brachyurans and differs from the ancestral pancrustacean order by only the position of the tRNA (His) gene. Phylogenetic analyses using the concatenated nucleotide and amino acid sequences of 13 protein-coding genes strongly support the monophyly of Dendrobranchiata and Pleocyemata, which is consistent with the previous taxonomic classification. However, the systematic status of Charybdis within subfamily Thalamitinae of family Portunidae is not supported. C. japonica, as the first species of Charybdis with complete mitochondrial genome available, will provide important information on both genomics and molecular ecology of the group.
Resumo:
Background: There are many advantages to the application of complete mitochondrial (mt) genomes in the accurate reconstruction of phylogenetic relationships in Metazoa. Although over one thousand metazoan genomes have been sequenced, the taxonomic sampling is highly biased, left with many phyla without a single representative of complete mitochondrial genome. Sipuncula (peanut worms or star worms) is a small taxon of worm-like marine organisms with an uncertain phylogenetic position. In this report, we present the mitochondrial genome sequence of Phascolosoma esculenta, the first complete mitochondrial genome of the phylum. Results: The mitochondrial genome of P. esculenta is 15,494 bp in length. The coding strand consists of 32.1% A, 21.5% C, 13.0% G, and 33.4% T bases (AT = 65.5%; AT skew = -0.019; GC skew = -0.248). It contains thirteen protein-coding genes (PCGs) with 3,709 codons in total, twenty-two transfer RNA genes, two ribosomal RNA genes and a non-coding AT-rich region (AT = 74.2%). All of the 37 identified genes are transcribed from the same DNA strand. Compared with the typical set of metazoan mt genomes, sipunculid lacks trnR but has an additional trnM. Maximum Likelihood and Bayesian analyses of the protein sequences show that Myzostomida, Sipuncula and Annelida (including echiurans and pogonophorans) form a monophyletic group, which supports a closer relationship between Sipuncula and Annelida than with Mollusca, Brachiopoda, and some other lophotrochozoan groups. Conclusion: This is the first report of a complete mitochondrial genome as a representative within the phylum Sipuncula. It shares many more similar features with the four known annelid and one echiuran mtDNAs. Firstly, sipunculans and annelids share quite similar gene order in the mitochondrial genome, with all 37 genes located on the same strand; secondly, phylogenetic analyses based on the concatenated protein sequences also strongly support the sipunculan + annelid clade (including echiurans and pogonophorans). Hence annelid "key-characters" including segmentation may be more labile than previously assumed.
Resumo:
The complete mitochondrial (mt) DNA sequence was determined for a ridgetail white prawn, Exopalaemon carinicauda Holthuis, 1950 (Crustacea: Decopoda: Palaemonidae). The mt genome is 15,730 bp in length, encoding a standard set of 13 protein-coding genes, 2 ribosomal RNA genes, and 22 transfer RNA genes, which is typical for metazoans. The majority-strand consists of 33.6% A, 23.0% C, 13.4% G, and 30.0% T bases (AT skew = 0.057: GC skew = -0.264). A total of 1045 bp of non-coding nucleotides were observed in 16 intergenic regions,,including a major A+ T rich (79.7%) noncoding region (886 bp). A novel translocation of tRNA(Pro) and tRNA(Thr) was found when comparing this genome with the pancrustacean ground pattern indicating that gene order is not conserved among caridean mitochondria. Furthermore, the rate of Ka/Ks in 13 protein-coding genes between three caridean species is Much less than 1, which indicates a strong Purifying selection within this group. To investigate the phylogenetic relationship within Malacostraca, phylogenetic trees based oil Currently available malacostracan complete mitochondrial sequences were built with the maximum likelihood and Bayesian models. All analyses based oil nucleotide and amino acid data strongly support the monophyly of Decapoda. The Penaeidae, Reptantia, Caridea, and Meiura clades were also recovered as monophyletic groups with Strong Statistical Support. However, the phylogenetic relationships within Pleocyemata are unstable, as represented by the inclusion or exclusion of Caridea. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
The genetic diversity of liver fluke populations in three different countries from Eastern Europe (Greece, Bulgaria, and Poland) in comparison with available data from other countries was determined. Specifically, SNPs from regions of two nuclear genes, 28S rDNA, ß-tubulin 3 and an informative region of the mitochondrial genome were examined. Two major lineages for the 28S rDNA gene based on the highly polymorphic 105th nucleotide position were found. These lineages were widely and almost equally spread not only through the countries studied but also in other investigated geographical areas. Two basic lineages and additional haplotypes were defined for the mtDNA gene region, consisting of the cytochrome c oxidase subunit III gene, transfer RNA histidine gene and cytochome b gene. The basic lineages were observed within Greek, Bulgarian, and Polish Fasciola hepatica populations but the distribution of additional haplotypes differed between the populations from the three countries. For the ß-tubulin 3 gene multiple polymorphic sites were revealed but no explicit clades. The SNPs were spread unequally in all studied geographical regions with an evident distinction between the Greek and Polish specimens. Additional genotypes for the 28S rDNA region as well as haplotypes of the mtDNA region that were typical for the Greek or Polish populations were observed. Significant polymorphisms for ß-tubulin 3 gene were displayed with decreasing percentage of presence within populations from Greece to Poland. There was an amino acid substitution in ß-tubulin 3 protein found only among Polish specimens. It is hypothesized that genotypic differences between Greek, Bulgarian, and Polish liver fluke populations are due to territorial division and genetic drift in past epochs.
Resumo:
The genetic code establishes the rules that govern gene translation into proteins. It was established more than 3.5 billion years ago and it is one of the most conserved features of life. Despite this, several alterations to the standard genetic code have been discovered in both prokaryotes and eukaryotes, namely in the fungal CTG clade where a unique seryl transfer RNA (tRNACAG Ser) decodes leucine CUG codons as serine. This tRNACAG Ser appeared 272±25 million years ago through insertion of an adenosine in the middle position of the anticodon of a tRNACGA Ser gene, which changed its anticodon from 5´-CGA-3´ to 5´-CAG-3´. This most dramatic genetic event restructured the proteome of the CTG clade species, but it is not yet clear how and why such deleterious genetic event was selected and became fixed in those fungal genomes. In this study we have attempted to shed new light on the evolution of this fungal genetic code alteration by reconstructing its evolutionary pathway in vivo in the yeast Saccharomyces cerevisiae. For this, we have expressed wild type and mutant versions of the C. albicans tRNACGA Ser gene into S. cerevisiae and evaluated the impact of the mutant tRNACGA Ser on fitness, tRNA stability, translation efficiency and aminoacylation kinetics. Our data demonstrate that these mutants are expressed and misincorporate Ser at CUGs, but their expression is repressed through an unknown molecular mechanism. We further demonstrate, using in vivo forced evolution methodologies, that the tRNACAG Ser can be easily inactivated through natural mutations that prevent its recognition by the seryl-tRNA synthetase. The overall data show that repression of expression of the mistranslating tRNACAG Ser played a critical role on the evolution of CUG reassignment from Leu to Ser. In order to better understand the evolution of natural genetic code alterations, we have also engineered partial reassignment of various codons in yeast. The data confirmed that genetic code ambiguity affects fitness, induces protein aggregation, interferes with the cell cycle and results in nuclear and morphologic alterations, genome instability and gene expression deregulation. Interestingly, it also generates phenotypic variability and phenotypes that confer growth advantages in certain environmental conditions. This study provides strong evidence for direct and critical roles of the environment on the evolution of genetic code alterations.
Resumo:
A fidelidade da síntese proteica é fundamental para a estabilidade do proteoma e para a homeostasia celular. Em condições fisiológicas normais as células têm uma taxa de erro basal associada e esta muitas vezes aumenta com o envelhecimento e doença. Problemas na síntese das proteínas estão associados a várias doenças humanas e aos processos de envelhecimento. De facto, a incorporação de erros nas proteínas devido a tRNAs carregados pelas aminoacil-tRNA sintetases com o amino ácido errado causa doenças neurodegenerativas em humanos e ratos. Ainda não é claro como é que estas doenças se desenvolvem e se são uma consequência directa da disrupção do proteoma ou se são o resultado da toxicidade produzida pela acúmulação de proteínas mal traduzidas ao nível do ribossoma. Para elucidar como é que as células eucarióticas lidam com proteínas aberrantes e agregados proteicos (stress proteotóxico) desenvolvemos uma estratégia para destabilizar o proteoma. Para isso estabelecemos um sistema de erros de tradução em embriões de peixe zebra que assenta em tRNAs mutantes capazes de incorporar erradamente serina nas proteínas. As proteínas produzidas neste sistema despoletam as vias de resposta ao stress, nomeadamente a via da ubiquitina-proteassoma (UPP – “ubiquitin protesome pathway”) e a via do retículo endoplasmático (UPR – “unfolded protein response”). O stress proteotóxico gerado pelos erros de tradução altera a expressão génica e perfis de expressão de miRNAs, o desenvolvimento embrionário e viabilidade, aumenta a produção de espécies reactivas de oxigénio (ROS), leva ainda à acumulação de agregados proteicos e à disfunção mitocondrial. As malformações embrionárias e fenótipos de viabilidade que observámos foram revertidos por antioxidantes, o que sugere que os ROS desempenham papéis importantes nos fenótipos degenerativos celulares induzidos pela produção de proteínas aberrantes e agregação proteica. Estabelecemos ainda uma linha de peixe zebra transgénica para o estudo do stress proteotóxico. Este trabalho mostra que a destabilização do proteoma em embriões de peixe zebra com tRNAs mutantes é uma boa metodologia para estudar a biologia do stress proteotóxico visto que permite a agregação controlada do proteoma, mimetizando os processos de agregação de proteínas que ocorrem naturalmente durante o envelhecimento e em doenças conformacionais humanas.
Resumo:
Although the genetic code is generally viewed as immutable, alterations to its standard form occur in the three domains of life. A remarkable alteration to the standard genetic code occurs in many fungi of the Saccharomycotina CTG clade where the Leucine CUG codon has been reassigned to Serine by a novel transfer RNA (Ser-tRNACAG). The host laboratory made a major breakthrough by reversing this atypical genetic code alteration in the human pathogen Candida albicans using a combination of tRNA engineering, gene recombination and forced evolution. These results raised the hypothesis that synthetic codon ambiguities combined with experimental evolution may release codons from their frozen state. In this thesis we tested this hypothesis using S. cerevisiae as a model system. We generated ambiguity at specific codons in a two-step approach, involving deletion of tRNA genes followed by expression of non-cognate tRNAs that are able to compensate the deleted tRNA. Driven by the notion that rare codons are more susceptible to reassignment than those that are frequently used, we used two deletion strains where there is no cognate tRNA to decode the rare CUC-Leu codon and AGG-Arg codon. We exploited the vulnerability of the latter by engineering mutant tRNAs that misincorporate Ser at these sites. These recombinant strains were evolved over time using experimental evolution. Although there was a strong negative impact on the growth rate of strains expressing mutant tRNAs at high level, such expression at low level had little effect on cell fitness. We found that not only codon ambiguity, but also destabilization of the endogenous tRNA pool has a strong negative impact in growth rate. After evolution, strains expressing the mutant tRNA at high level recovered significantly in several growth parameters, showing that these strains adapt and exhibit higher tolerance to codon ambiguity. A fluorescent reporter system allowing the monitoring of Ser misincorporation showed that serine was indeed incorporated and possibly codon reassignment was achieved. Beside the overall negative consequences of codon ambiguity, we demonstrated that codons that tolerate the loss of their cognate tRNA can also tolerate high Ser misincorporation. This raises the hypothesis that these codons can be reassigned to standard and eventually to new amino acids for the production of proteins with novel properties, contributing to the field of synthetic biology and biotechnology.
Resumo:
The neuropeptide Th1RFamide with the sequence Phe-Met-Arg-Phe-amide was originally isolated in the clam Macrocallista nimbosa (price and Greenberg, 1977). Since its discovery, a large family ofFl\1RFamide-related peptides termed FaRPs have been found to be present in all major animal phyla with functions ranging from modulation of neuronal activity to alteration of muscular contractions. However, little is known about the genetics encoding these peptides, especially in invertebrates. As FaRP-encoding genes have yet to be investigated in the invertebrate Malacostracean subphylum, the isolation and characterization ofFaRP-encoding DNA and mRNA was pursued in this project. The immediate aims of this thesis were: (1) to amplify mRNA sequences of Procambarus clarkii using a degenerate oligonucleotide primer deduced from the common amino acid sequence ofisolated Procambarus FaRPS, (2) to determine if these amplification products encode FaRP gene sequences, and (3) to create a selective cDNA library of sequences recognized by the degenerate oligonucleotide primer. The polymerase chain reaction - rapid amplification of cDNA ends (PCR-RACE) is a procedure in which a single gene-specific primer is used in conjunction with a generalized 3' or 5' primer to amplify copies ofthe region between a single point in the transcript and the 3' or 5' end of cDNA of interest (Frohman et aI., 1988). PCRRACE reactions were optimized with respect to primers used, buffer composition, cycle number, nature ofgenetic substrate to be amplified, annealing, extension and denaturation temperatures and times, and use of reamplification procedures. Amplification products were cloned into plasmid vectors and recombinant products were isolated, as were the recombinant plaques formed in the selective cDNA library. Labeled amplification products were hybridized to recombinant bacteriophage to determine ligated amplification product presence. When sequenced, the five isolated PCR-RACE amplification products were determined not to possess FaRP-encoding sequences. The 200bp, 450bp, and 1500bp sequences showed homology to the Caenorhabditis elegans cosmid K09A11, which encodes for cytochrome P450; transfer-RNA; transposase; and tRNA-Tyr, while the 500bp and 750bp sequences showed homology with the complete genome of the Vaccinia virus. Under the employed amplification conditions the degenerate oligonucleotide primer was observed to bind to and to amplify sequences with either 9 or 10bp of 17bp identity. The selective cDNA library was obselVed to be of extremely low titre. When library titre was increased, white. plaques were isolated. Amplification analysis of eight isolated Agt11 sequences from these plaques indicated an absence of an insertion sequence. The degenerate 17 base oligonucleotide primer synthesized from the common amino acid sequence ofisolated Procambarus FaRPs was thus determined to be non-specific in its binding under the conditions required for its use, and to be insufficient for the isolation and identification ofFaRP-encoding sequences. A more specific primer oflonger sequence, lower degeneracy, and higher melting temperature (TJ is recommended for further investigation into the FaRP-encoding genes of Procambarlls clarkii.
Resumo:
La plupart des molécules d’ARN doivent se replier en structure tertiaire complexe afin d’accomplir leurs fonctions biologiques. Cependant, les déterminants d’une chaîne de polynucléotides qui sont nécessaires à son repliement et à ses interactions avec d’autres éléments sont essentiellement inconnus. L’établissement des relations structure-fonction dans les grandes molécules d’ARN passe inévitablement par l’analyse de chaque élément de leur structure de façon individuelle et en contexte avec d’autres éléments. À l’image d’une construction d’immeuble, une structure d’ARN est composée d’unités répétitives assemblées de façon spécifique. Les motifs récurrents d’ARN sont des arrangements de nucléotides retrouvés à différents endroits d’une structure tertiaire et possèdent des conformations identiques ou très similaires. Ainsi, une des étapes nécessaires à la compréhension de la structure et de la fonction des molécules d’ARN consiste à identifier de façon systématique les motifs récurrents et d’en effectuer une analyse comparative afin d’établir la séquence consensus. L’analyse de tous les cas d’empaquetage de doubles hélices dans la structure du ribosome a permis l’identification d’un nouvel arrangement nommé motif d’empaquetage le long du sillon (AGPM) (along-groove packing motif). Ce motif est retrouvé à 14 endroits dans la structure du ribosome de même qu’entre l’ARN ribosomique 23S et les molécules d’ARN de transfert liées aux sites ribosomaux P et E. Le motif se forme par l’empaquetage de deux doubles hélices via leur sillon mineur. Le squelette sucre-phosphate d’une hélice voyage le long du sillon mineur de l’autre hélice et vice versa. Dans chacune des hélices, la région de contact comprend quatre paires de bases. L’empaquetage le plus serré est retrouvé au centre de l’arrangement où l’on retrouve souvent une paire de bases GU dans une hélice interagissant avec une paire de bases Watson-Crick (WC) dans l’autre hélice. Même si la présence des paires de bases centrales GU versus WC au centre du motif augmente sa stabilité, d’autres alternatives existent pour différents représentants du motif. L’analyse comparative de trois librairies combinatoires de gènes d’AGPM, où les paires de bases centrales ont été variées de manière complètement aléatoire, a montré que le contexte structural influence l’étendue de la variabilité des séquences de nucléotides formant les paires de bases centrales. Le fait que l’identité des paires de bases centrales puisse varier suggérait la présence d’autres déterminants responsables au maintien de l’intégrité du motif. L’analyse de tous les contacts entre les hélices a révélé qu’en dehors du centre du motif, les interactions entre les squelettes sucre-phosphate s’effectuent via trois contacts ribose-ribose. Pour chacun de ces contacts, les riboses des nucléotides qui interagissent ensemble doivent adopter des positions particulières afin d’éviter qu’ils entrent en collision. Nous montrons que la position de ces riboses est modulée par des conformations spécifiques des paires de bases auxquelles ils appartiennent. Finalement, un autre motif récurrent identifié à l’intérieur même de la structure de trois cas d’AGPM a été nommé « adenosine-wedge ». Son analyse a révélé que ce dernier est lui-même composé d’un autre arrangement, nommé motif triangle-NAG (NAG-triangle). Nous montrons que le motif « adenosine-wedge » représente un arrangement complexe d’ARN composé de quatre éléments répétitifs, c’est-à-dire des motifs AGPM, « hook-turn », « A-minor » et triangle-NAG. Ceci illustre clairement l’arrangement hiérarchique des structures d’ARN qui peut aussi être observé pour d’autres motifs d’ARN. D’un point de vue plus global, mes résultats enrichissent notre compréhension générale du rôle des différents types d’interactions tertiaires dans la formation des molécules d’ARN complexes.
Resumo:
Les ARN non codants (ARNnc) sont des transcrits d'ARN qui ne sont pas traduits en protéines et qui pourtant ont des fonctions clés et variées dans la cellule telles que la régulation des gènes, la transcription et la traduction. Parmi les nombreuses catégories d'ARNnc qui ont été découvertes, on trouve des ARN bien connus tels que les ARN ribosomiques (ARNr), les ARN de transfert (ARNt), les snoARN et les microARN (miARN). Les fonctions des ARNnc sont étroitement liées à leurs structures d’où l’importance de développer des outils de prédiction de structure et des méthodes de recherche de nouveaux ARNnc. Les progrès technologiques ont mis à la disposition des chercheurs des informations abondantes sur les séquences d'ARN. Ces informations sont accessibles dans des bases de données telles que Rfam, qui fournit des alignements et des informations structurelles sur de nombreuses familles d'ARNnc. Dans ce travail, nous avons récupéré toutes les séquences des structures secondaires annotées dans Rfam, telles que les boucles en épingle à cheveux, les boucles internes, les renflements « bulge », etc. dans toutes les familles d'ARNnc. Une base de données locale, RNAstem, a été créée pour faciliter la manipulation et la compilation des données sur les motifs de structure secondaire. Nous avons analysé toutes les boucles terminales et internes ainsi que les « bulges » et nous avons calculé un score d’abondance qui nous a permis d’étudier la fréquence de ces motifs. Tout en minimisant le biais de la surreprésentation de certaines classes d’ARN telles que l’ARN ribosomal, l’analyse des scores a permis de caractériser les motifs rares pour chacune des catégories d’ARN en plus de confirmer des motifs communs comme les boucles de type GNRA ou UNCG. Nous avons identifié des motifs abondants qui n’ont pas été étudiés auparavant tels que la « tetraloop » UUUU. En analysant le contenu de ces motifs en nucléotides, nous avons remarqué que ces régions simples brins contiennent beaucoup plus de nucléotides A et U. Enfin, nous avons exploré la possibilité d’utiliser ces scores pour la conception d’un filtre qui permettrait d’accélérer la recherche de nouveaux ARN non-codants. Nous avons développé un système de scores, RNAscore, qui permet d’évaluer un ARN en se basant sur son contenu en motifs et nous avons testé son applicabilité avec différents types de contrôles.
Resumo:
Alterations to the genetic code – codon reassignments – have occurred many times in life’s history, despite the fact that genomes are coadapted to their genetic codes and therefore alterations are likely to be maladaptive. A potential mechanism for adaptive codon reassignment, which could trigger either a temporary period of codon ambiguity or a permanent genetic code change, is the reactivation of a pseudogene by a nonsense suppressor mutant transfer RNA. I examine the population genetics of each stage of this process and find that pseudogene rescue is plausible and also readily explains some features of extant variability in genetic codes.