85 results for non-coding region of RNA

at Université de Lausanne, Switzerland


Relevance:

100.00% 100.00%

Publisher:

Abstract:

In oviparous vertebrates vitellogenin, the precursor of the major yolk proteins, is synthesized in the liver of mature females under the control of estrogen. We have established the organization and primary structure of the 5' end region of the Xenopus laevis vitellogenin A2 gene and of the major chicken vitellogenin gene. The first three homologous exons have exactly the same length in both species, namely 53, 21 and 152 nucleotides, and present an overall sequence homology of 60%. In both species, the 5'-non-coding region of the vitellogenin mRNA measures only 13 nucleotides, nine of which are conserved. In contrast, the corresponding introns of the Xenopus and the chicken vitellogenin gene show no significant sequence homology. Within the 500 nucleotides preceding the 5' end of the genes, at least six blocks with sequence homologies of greater than 70% were detected. It remains to be demonstrated which of these conserved sequences, if any, are involved in the hormone-regulated expression of the vitellogenin genes.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Cardiovascular diseases and in particular heart failure are major causes of morbidity and mortality in the Western world. Recently, the notion of promoting cardiac regeneration as a means to replace lost cardiomyocytes in the damaged heart has engendered considerable research interest. These studies envisage the utilization of both endogenous and exogenous cellular populations, which undergo highly specialized cell fate transitions to promote cardiomyocyte replenishment. Such transitions are under the control of regenerative gene regulatory networks, which are enacted by the integrated execution of specific transcriptional programs. In this context, it is emerging that the non-coding portion of the genome is dynamically transcribed generating thousands of regulatory small and long non-coding RNAs, which are central orchestrators of these networks. In this review, we discuss more particularly the biological roles of two classes of regulatory non-coding RNAs, i.e. microRNAs and long non-coding RNAs, with a particular emphasis on their known and putative roles in cardiac homeostasis and regeneration. Indeed, manipulating non-coding RNA-mediated regulatory networks could provide keys to unlock the dormant potential of the mammalian heart to regenerate. This should ultimately improve the effectiveness of current regenerative strategies and discover new avenues for repair. This article is part of a Special Issue entitled: Cardiomyocyte Biology: Cardiac Pathways of Differentiation, Metabolism and Contraction.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Numerous links between genetic variants and phenotypes are known, and genome-wide association studies have dramatically increased the number of genetic variants associated with traits during the last decade. However, how changes in the DNA perturb molecular mechanisms and impact the phenotype of an organism remains elusive. Studies suggest that many trait-associated variants are in the non-coding region of the genome and probably act through regulation of gene expression. During my thesis I investigated how genetic variants affect gene expression through gene regulatory mechanisms. The first chapter was a collaborative project with a pharmaceutical company, where we investigated genome-wide copy number variants (CNVs) among Cynomolgus monkeys (Macaca fascicularis) used in pharmaceutical studies, and associated them with changes in gene expression. We found substantial copy number variation and identified CNVs linked to tissue-specific expression changes of proximal genes. The second and third chapters focus on genetic variation in humans and its effects on gene regulatory mechanisms and gene expression. The second chapter studies two human trios, where the allelic effects of genetic variation on genome-wide gene expression, protein-DNA binding and chromatin modifications were investigated. We found abundant allele-specific activity across all measured molecular phenotypes and show extended coordinated behavior among them. In the third chapter, we investigated the impact of genetic variation on these phenotypes in 47 unrelated individuals. We found that chromatin phenotypes are organized into local variable modules, often linked to genetic variation and gene expression. Our results suggest that chromatin variation emerges as a result of perturbations of cis-regulatory elements by genetic variants, leading to gene expression changes. The work of this thesis provides novel insights into how genetic variation impacts gene expression by perturbing regulatory mechanisms.
-- Numerous links between genetic variants and phenotypes are known. Genome-wide association studies have considerably increased the number of genetic variants associated with phenotypes over the last decade. However, how these changes perturb molecular mechanisms and affect the phenotype of an organism still eludes us. Studies suggest that many phenotype-associated variants are located in the non-coding regions of the genome and are likely to act by modifying the regulation of gene expression. During my thesis, I studied how genetic variants affect gene expression levels by perturbing the mechanisms that regulate their expression. The work presented in the first chapter is a collaborative project with a pharmaceutical company. We studied the copy number variants (CNVs) present in the cynomolgus (crab-eating) macaque (Macaca fascicularis), which is used in pharmaceutical studies, and associated them with changes in gene expression. We found substantial variability in copy number and identified CNVs linked to expression changes of genes located in their vicinity. These associations are present or absent in a tissue-specific manner. The second and third chapters focus on genetic variation in human populations and its effects on gene regulatory mechanisms and gene expression. The former examines two human trios (father, mother and child), in which we studied the allelic effects of genetic variants on gene expression, protein-DNA binding and chromatin modifications.
We found that allele-specific activity is abundant across all of these molecular phenotypes and showed that they behave in a coordinated manner. In the latter, we examined the impact of genetic variation on these molecular phenotypes in 47 unrelated individuals. We observed that chromatin phenotypes are organized into local modules, which are linked to genetic variation and gene expression. Our results suggest that chromatin variability is due to genetic variants that perturb cis-regulatory elements, and that it can lead to changes in gene expression. The work presented in this thesis provides new leads for understanding the impact of genetic variation on gene expression through regulatory mechanisms.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The opportunistic, ubiquitous pathogen Pseudomonas aeruginosa strain PAO1 is a versatile Gram-negative bacterium that has the extraordinary capacity to colonize a wide diversity of ecological niches and to cause severe and persistent infections in humans. To ensure optimal coordination of the genes involved in nutrient utilization, this bacterium uses the NtrB/C and/or the CbrA/B two-component systems to sense nutrient availability and to regulate accordingly the expression of genes involved in nutrient uptake and catabolism. NtrB/C is specialized in nitrogen utilization, while the CbrA/B system is involved in both carbon and nitrogen utilization; both systems activate expression of their target genes in concert with the alternative sigma factor RpoN. Moreover, the NtrB/C and CbrA/B two-component systems regulate the secondary metabolism of the bacterium, such as the production of virulence factors. In addition to this fine-tuned transcriptional regulation, P. aeruginosa can rapidly modulate its metabolism using small non-coding regulatory RNAs (sRNAs), which regulate gene expression at the post-transcriptional level by diverse and sophisticated mechanisms and contribute to the fast physiological adaptability of this bacterium. In our search for novel RpoN-dependent sRNAs modulating the nutritional adaptation of P. aeruginosa PAO1, we discovered NrsZ (nitrogen-regulated sRNA), a novel RpoN-dependent sRNA that is induced under nitrogen starvation by the NtrB/C two-component system. NrsZ has a unique architecture, formed of three similar stem-loop structures (SL I, II and III) separated by variant spacer sequences. Moreover, this sRNA is processed into short individual stem-loop molecules by internal cleavage involving the endoribonuclease RNase E. Concerning the functions of NrsZ in P. aeruginosa PAO1, this sRNA was shown to trigger swarming motility and the production of rhamnolipid biosurfactants.
This regulation is due to the NrsZ-mediated activation of the expression of rhlA, a gene encoding an enzyme essential for swarming motility and rhamnolipid production. Interestingly, the SL I structure of NrsZ is sufficient to ensure its regulatory function on rhlA expression, suggesting that the similar SLs are the functional units of this modular sRNA. However, the mechanism by which NrsZ activates rhlA expression remains unclear and is currently being investigated. Additionally, the NrsZ regulatory network was investigated by transcriptome analysis, which suggested that numerous genes involved in both primary and secondary metabolism are regulated by this sRNA. To emphasize the importance of NrsZ, we investigated its conservation in other Pseudomonas species and demonstrated that NrsZ is conserved and expressed under nitrogen limitation in Pseudomonas protegens Pf-5, Pseudomonas putida KT2442, Pseudomonas entomophila L48 and Pseudomonas syringae pv. tomato DC3000, strains with different ecological features, suggesting an important role for NrsZ in the adaptation of pseudomonads to nitrogen starvation. Interestingly, the architecture of the different NrsZ homologs is similarly composed of SL structures and variant spacer sequences. However, the number of SL repeats is not identical, and one to six SLs were predicted in the different NrsZ homologs. Moreover, NrsZ is processed into short molecules in all the strains, as previously observed in P. aeruginosa PAO1, and heterologous expression of the NrsZ homologs restored rhlA expression, swarming motility and rhamnolipid production in the P. aeruginosa NrsZ mutant. In many respects, NrsZ is an atypical sRNA in the bacterial panorama. To our knowledge, NrsZ is the first described sRNA induced by the NtrB/C system. Moreover, its unique modular architecture and its processing into similar short SL molecules suggest that NrsZ belongs to a novel family of bacterial sRNAs.
-- The opportunistic and ubiquitous pathogen Pseudomonas aeruginosa strain PAO1 is a versatile Gram-negative bacterium with the extraordinary capacity to colonize different ecological niches and to cause severe and persistent infections in humans. To ensure optimal coordination of the genes involved in the utilization of different nutrients, this bacterium uses two-component systems such as NtrB/C and CbrA/B to detect the availability of nutritional resources and then regulate accordingly the expression of the genes involved in their import and catabolism. The NtrB/C system regulates the utilization of nitrogen sources, whereas the CbrA/B system is involved in the utilization of both carbon and nitrogen sources. Both systems activate the expression of their target genes in concert with the alternative sigma factor RpoN. In addition, NtrB/C and CbrA/B also regulate secondary metabolism, notably controlling the production of important virulence factors. Beyond all this fine genetic regulation at the transcriptional level, P. aeruginosa can also modulate its metabolism using small non-coding regulatory RNAs (ncRNAs), which regulate gene expression at the post-transcriptional level through various sophisticated mechanisms and help make the physiological adaptation of this bacterium particularly rapid. In the course of our search for new ncRNAs that depend on the sigma factor RpoN and are involved in the nutritional adaptation of P. aeruginosa PAO1, we discovered NrsZ (Nitrogen regulated sRNA), an ncRNA induced by the NtrB/C-RpoN cascade under nitrogen starvation. NrsZ has a unique architecture, composed of three highly similar stem-loop structures (SL I, II and III) separated by "spacers" with variable sequences.
In addition, this ncRNA is cleaved into small fragments corresponding to the three stem-loop molecules by an internal cleavage process involving the endoribonuclease RNase E. Regarding the functions of NrsZ in P. aeruginosa PAO1, this ncRNA is able to induce swarming motility and the production of biosurfactants called rhamnolipids. This regulation is due to the activation by NrsZ of the expression of rhlA, a gene essential for swarming motility and for rhamnolipid production. Surprisingly, the SL I structure alone is able to carry out the regulatory function of NrsZ on rhlA expression, suggesting that these SL molecules are the functional units of this modular ncRNA. However, the molecular mechanism by which NrsZ activates rhlA expression remains uncertain to date and is currently under study. In addition, the network of NrsZ-mediated regulation was studied by transcriptome analysis, which indicated that numerous genes involved in primary or secondary metabolism may be regulated by NrsZ. To emphasize the importance of NrsZ, we studied its conservation in other Pseudomonas species. We thus demonstrated that NrsZ is conserved and expressed under nitrogen starvation by the strains Pseudomonas protegens Pf-5, Pseudomonas putida KT2442, Pseudomonas entomophila L48 and Pseudomonas syringae pv. tomato DC3000, four species with very different ecological characteristics, suggesting that NrsZ plays an important role in the adaptation of the genus Pseudomonas to nitrogen starvation. In all the strains studied, the different NrsZ homologs display a similar architecture made of conserved SLs and spacers. However, the number of SLs is not identical and can vary from one to six copies depending on the strain. The different versions of NrsZ are cleaved into small molecules in these four strains, as observed in P. aeruginosa PAO1.
Moreover, heterologous expression of the different NrsZ variants is able to restore rhlA expression, swarming motility and rhamnolipid production in a P. aeruginosa strain in which nrsZ has been inactivated. In many respects, NrsZ is an atypical ncRNA in the bacterial world. To our knowledge, NrsZ is the first ncRNA described as being regulated by the NtrB/C system. Furthermore, its unique modular architecture and its cleavage into similar small molecules suggest that NrsZ belongs to a new family of bacterial ncRNAs.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

INTRODUCTION: Intrauterine Growth Restriction (IUGR) is a multifactorial disease defined by an inability of the fetus to reach its growth potential. IUGR not only increases the risk of neonatal mortality/morbidity, but also the risk of metabolic syndrome during adulthood. Certain placental proteins have been shown to be implicated in IUGR development, such as proteins from the GH/IGF axis and angiogenesis/apoptosis processes. METHODS: Twelve patients with term IUGR pregnancy (birth weight < 10th percentile) and 12 CTRLs were included. mRNA was extracted from the fetal part of the placenta and submitted to a subtraction method (Clontech PCR-Select cDNA Subtraction). RESULTS: One candidate gene identified was the long non-coding RNA NEAT1 (nuclear paraspeckle assembly transcript 1). NEAT1 is the core component of a subnuclear structure called the paraspeckle. This structure is responsible for the retention of hyperedited mRNAs in the nucleus. Overall, NEAT1 mRNA expression was increased 4.14 (±1.16)-fold in IUGR vs. CTRL placentas (P = 0.009). NEAT1 was exclusively localized in the nuclei of the villous trophoblasts and was expressed in more nuclei and with greater intensity in IUGR placentas than in CTRLs. PSPC1, one of the three main proteins of the paraspeckle, co-localized with NEAT1 in the villous trophoblasts. The expression of NEAT1_2 mRNA, the long isoform of NEAT1, was only modestly increased in IUGR vs. CTRL placentas. DISCUSSION/CONCLUSION: The increase in NEAT1 and its co-localization with PSPC1 suggest an increase in paraspeckles in IUGR villous trophoblasts. This could lead to an increased retention of important mRNAs in villous trophoblast nuclei. Given that the villous trophoblasts are crucial for the barrier function of the placenta, this could in part explain placental dysfunction in idiopathic IUGR fetuses.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Pancreatic β-cells play a central role in glucose homeostasis by tightly regulating insulin release according to the organism's demand. Impairment of β-cell function due to hostile environment, such as hyperglycaemia and hyperlipidaemia, or due to autoimmune destruction of β-cells, results in diabetes onset. Both environmental factors and genetic predisposition are known to be involved in the development of the disease, but the exact mechanisms leading to β-cell dysfunction and death remain to be characterized. Non-coding RNA molecules, such as microRNAs (miRNAs), have been suggested to be necessary for proper β-cell development and function. The present review aims at summarizing the most recent findings about the role of non-coding RNAs in the control of β-cell functions and their involvement in diabetes. We will also provide a perspective view of the future research directions in the field of non-coding RNAs. In particular, we will discuss the implications for diabetes research of the discovery of a new communication mechanism based on cell-to-cell miRNA transfer. Moreover, we will highlight the emerging interconnections between miRNAs and epigenetics and the possible role of long non-coding RNAs in the control of β-cell activities.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Canine distemper virus (CDV) produces a glycosylated type I fusion protein (F) with an internal hydrophobic signal sequence beginning around 115 residues downstream of the first AUG used for translation initiation. Cleavage of the signal sequence yields the F0 molecule, which is cleaved into the F1 and F2 subunits. Surprisingly, when all in-frame AUGs located in the first third of the F gene were mutated, a protein of the same molecular size as the F0 molecule was still expressed from both the Onderstepoort (OP) and A75/17-CDV F genes. We designated this protein, which is initiated from a non-AUG codon, Fx. Site-directed mutagenesis identified codon 85, a GCC codon coding for alanine, as the most likely position from which translation initiation of Fx occurs in OP-CDV. Deletion analysis demonstrated that at least 60 nucleotides upstream of the GCC codon are required for efficient Fx translation. This sequence is GC-rich, suggesting extensive folding. Secondary structure may therefore be important for translation initiation at codon 85.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Starting from a biologically active recombinant DNA clone of exogenous unintegrated GR mouse mammary tumor virus, we have generated three subclones of PstI fragments of 1.45, 1.1, and 2.0 kb in the plasmid vector pBR322. The nucleotide sequence has been determined for the clone of 1.45 kb which includes almost the complete region of the long terminal repeat (LTR) plus an adjacent stretch of unique sequence DNA. A short region of the 2.0 kb clone, containing the beginning of the LTR, has also been sequenced. Starting with the A of an initiation codon outside the LTR, we detected an open reading frame of 960 nucleotides, potentially coding for a protein of 320 amino acids (36K). Two hundred nucleotides downstream from the termination codon, and approximately 25 nucleotides upstream from the presumptive initiation site of viral RNA synthesis, we found a promoter-like sequence. The sequence AGTAAA was detected approximately 15-20 nucleotides upstream from the 3' end of virion RNA and probably serves as a polyadenylation signal. The 1.45 kb PstI fragment has been transfected into Ltk- cells together with a plasmid containing the thymidine kinase gene of herpes simplex virus. The virus-specific RNA synthesis detected in a Tk+ cell clone was strongly stimulated by the addition of dexamethasone.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The opportunistic pathogen Pseudomonas aeruginosa PAO1 has a remarkable capacity to adapt to various environments and to survive with limited nutrients. Here, we report the discovery and characterization of a novel small non-coding RNA: NrsZ (nitrogen-regulated sRNA). We show that under nitrogen limitation, NrsZ is induced by the NtrB/C two-component system, an important regulator of nitrogen assimilation and P. aeruginosa's swarming motility, in concert with the alternative sigma factor RpoN. Furthermore, we demonstrate that NrsZ modulates P. aeruginosa motility by controlling the production of rhamnolipid surfactants, virulence factors notably needed for swarming motility. This regulation takes place through the post-transcriptional control of rhlA, a gene essential for rhamnolipid synthesis. Interestingly, we also observed that NrsZ is processed into three similar short modules, and that the first short module, encompassing the first 60 nucleotides, is sufficient for NrsZ regulatory functions.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Gene expression changes may underlie much of phenotypic evolution. The development of high-throughput RNA sequencing protocols has opened the door to unprecedented large-scale and cross-species transcriptome comparisons by allowing accurate and sensitive assessments of transcript sequences and expression levels. Here, we review the initial wave of the new generation of comparative transcriptomic studies in mammals and vertebrate outgroup species in the context of earlier work. Together with various large-scale genomic and epigenomic data, these studies have unveiled commonalities and differences in the dynamics of gene expression evolution for various types of coding and non-coding genes across mammalian lineages, organs, developmental stages, chromosomes and sexes. They have also provided intriguing new clues to the regulatory basis and phenotypic implications of evolutionary gene expression changes.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

AIM: Heart disease is recognized as a consequence of dysregulation of cardiac gene regulatory networks. Previously unappreciated components of such networks are the long non-coding RNAs (lncRNAs). Their roles in the heart remain to be elucidated. Thus, this study aimed to systematically characterize the cardiac long non-coding transcriptome post-myocardial infarction and to elucidate the potential roles of lncRNAs in cardiac homoeostasis. METHODS AND RESULTS: We annotated the mouse transcriptome after myocardial infarction via RNA sequencing and ab initio transcript reconstruction, and integrated genome-wide approaches to associate specific lncRNAs with developmental processes and physiological parameters. Expression of specific lncRNAs strongly correlated with defined parameters of cardiac dimensions and function. Using chromatin maps to infer lncRNA function, we identified many with potential roles in cardiogenesis and pathological remodelling. The vast majority were associated with active cardiac-specific enhancers. Importantly, oligonucleotide-mediated knockdown implicated novel lncRNAs in controlling expression of key regulatory proteins involved in cardiogenesis. Finally, we identified hundreds of human orthologues and demonstrated that particular candidates were differentially modulated in human heart disease. CONCLUSION: These findings reveal hundreds of novel heart-specific lncRNAs with unique regulatory and functional characteristics relevant to maladaptive remodelling, cardiac function and possibly cardiac regeneration. This new class of molecules represents potential therapeutic targets for cardiac disease. Furthermore, their exquisite correlation with cardiac physiology renders them attractive candidate biomarkers to be used in the clinic.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

AIMS/HYPOTHESIS: Exposure of pancreatic beta cells to cytokines released by islet-infiltrating immune cells induces alterations in gene expression, leading to impaired insulin secretion and apoptosis in the initial phases of type 1 diabetes. Long non-coding RNAs (lncRNAs) are a new class of transcripts participating in the development of many diseases. As little is known about their role in insulin-secreting cells, this study aimed to evaluate their contribution to beta cell dysfunction. METHODS: The expression of lncRNAs was determined by microarray in the MIN6 beta cell line exposed to proinflammatory cytokines. The changes induced by cytokines were further assessed by real-time PCR in islets of control and NOD mice. The involvement of selected lncRNAs modified by cytokines was assessed after their overexpression in MIN6 cells and primary islet cells. RESULTS: MIN6 cells were found to express a large number of lncRNAs, many of which were modified by cytokine treatment. The changes in the level of selected lncRNAs were confirmed in mouse islets and an increase in these lncRNAs was also seen in prediabetic NOD mice. Overexpression of these lncRNAs in MIN6 and mouse islet cells, either alone or in combination with cytokines, favoured beta cell apoptosis without affecting insulin production or secretion. Furthermore, overexpression of lncRNA-1 promoted nuclear translocation of nuclear factor of κ light polypeptide gene enhancer in B cells 1 (NF-κB). CONCLUSIONS/INTERPRETATION: Our study shows that lncRNAs are modulated during the development of type 1 diabetes in NOD mice, and that their overexpression sensitises beta cells to apoptosis, probably contributing to their failure during the initial phases of the disease.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The discovery of long non-coding RNA (lncRNA) has dramatically altered our understanding of cancer. Here, we describe a comprehensive analysis of lncRNA alterations at transcriptional, genomic, and epigenetic levels in 5,037 human tumor specimens across 13 cancer types from The Cancer Genome Atlas. Our results suggest that the expression and dysregulation of lncRNAs are highly cancer type specific compared with protein-coding genes. Using the integrative data generated by this analysis, we present a clinically guided small interfering RNA screening strategy and a co-expression analysis approach to identify cancer driver lncRNAs and predict their functions. This provides a resource for investigating lncRNAs in cancer and lays the groundwork for the development of new diagnostics and treatments.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

BACKGROUND: The comparison of complete genomes has revealed surprisingly large numbers of conserved non-protein-coding (CNC) DNA regions. However, the biological function of CNC remains elusive. CNC differ in two aspects from conserved protein-coding regions. They are not conserved across phylum boundaries, and they do not contain readily detectable sub-domains. Here we characterize the persistence length and time of CNC and conserved protein-coding regions in the vertebrate and insect lineages. RESULTS: The persistence length is the length of a genome region over which a certain level of sequence identity is consistently maintained. The persistence time is the evolutionary period during which a conserved region evolves under the same selective constraints. Our main findings are: (i) Insect genomes contain 1.60 times less conserved information than vertebrates; (ii) Vertebrate CNC have a higher persistence length than conserved coding regions or insect CNC; (iii) CNC have shorter persistence times as compared to conserved coding regions in both lineages. CONCLUSION: Higher persistence length of vertebrate CNC indicates that the conserved information in vertebrates and insects is organized in functional elements of different lengths. These findings might be related to the higher morphological complexity of vertebrates and give clues about the structure of active CNC elements. Shorter persistence time might explain the previously puzzling observations of highly conserved CNC within each phylum, and of a lack of conservation between phyla. It suggests that CNC divergence might be a key factor in vertebrate evolution. Further evolutionary studies will help to relate individual CNC to specific developmental processes.
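
The persistence length defined in this abstract can be illustrated with a small sketch. The abstract does not say how the quantity was actually computed, so everything below is an assumption made for illustration: the function name `persistence_length`, the sliding-window approach, the default window size and identity threshold, and the choice to count gap columns as mismatches. It takes two aligned sequences of equal length and returns the longest stretch over which every window keeps at least the requested identity.

```python
def persistence_length(seq_a: str, seq_b: str,
                       window: int = 10, min_identity: float = 0.8) -> int:
    """Longest aligned stretch in which every sliding window of `window`
    columns keeps identity >= `min_identity`.

    Illustrative operational definition only; not the method of the study.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")

    # 1 where the two aligned columns agree, 0 otherwise (gaps count as mismatches).
    matches = [int(a == b and a != "-") for a, b in zip(seq_a, seq_b)]
    n = len(matches)
    if n < window:
        return 0

    window_sum = sum(matches[:window])
    best = 0
    run = 0  # number of consecutive windows that pass the threshold
    for start in range(n - window + 1):
        if start > 0:
            # Slide the window one column to the right.
            window_sum += matches[start + window - 1] - matches[start - 1]
        if window_sum / window >= min_identity:
            run += 1
            # `run` consecutive windows cover run + window - 1 columns.
            best = max(best, run + window - 1)
        else:
            run = 0
    return best


if __name__ == "__main__":
    conserved = "A" * 10
    print(persistence_length(conserved + "C" * 10, conserved + "G" * 10,
                             window=5, min_identity=1.0))
```

With a window of 5 and a 100% identity requirement, the example reports the 10 identical leading columns as the conserved stretch. A lower threshold or a wider window tolerates isolated substitutions inside a conserved element.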

Relevance:

100.00% 100.00%

Publisher:

Abstract:

We present and validate BlastR, a method for efficiently and accurately searching non-coding RNAs. Our approach relies on the comparison of di-nucleotides using BlosumR, a new log-odd substitution matrix. In order to use BlosumR for comparison, we recoded RNA sequences into protein-like sequences. We then showed that BlosumR can be used along with the BlastP algorithm in order to search non-coding RNA sequences. Using Rfam as a gold standard, we benchmarked this approach and show BlastR to be more sensitive than BlastN. We also show that BlastR is both faster and more sensitive than BlastP used with a single nucleotide log-odd substitution matrix. BlastR, when used in combination with WU-BlastP, is about 5% more accurate than WU-BlastN and about 50 times slower. The approach shown here is equally effective when combined with the NCBI-Blast package. The software is an open source freeware available from www.tcoffee.org/blastr.html.
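
The recoding step described above, turning an RNA sequence into a protein-like sequence of dinucleotide symbols, can be sketched as follows. This is a minimal illustration and not the BlastR implementation: the abstract does not give the actual dinucleotide-to-letter table, so the 16 letters in `AMINO_LETTERS` are arbitrary placeholders, and the use of overlapping dinucleotides, the `X` symbol for ambiguous bases and the function name `recode_rna` are all assumptions.

```python
from itertools import product

NUCLEOTIDES = "ACGU"

# Hypothetical table: 16 arbitrary amino-acid letters, one per dinucleotide.
# The real BlastR mapping is not given in the abstract.
AMINO_LETTERS = "ACDEFGHIKLMNPQRS"
DINUC_TO_LETTER = {
    a + b: AMINO_LETTERS[i]
    for i, (a, b) in enumerate(product(NUCLEOTIDES, repeat=2))
}


def recode_rna(seq: str) -> str:
    """Recode an RNA (or DNA) sequence as a protein-like string.

    Each overlapping dinucleotide becomes one letter; dinucleotides that
    contain an ambiguous base are written as 'X' (an assumption).
    """
    rna = seq.upper().replace("T", "U")
    return "".join(
        DINUC_TO_LETTER.get(rna[i:i + 2], "X")
        for i in range(len(rna) - 1)
    )


if __name__ == "__main__":
    print(recode_rna("ACGU"))
```

A sequence of n nucleotides yields n - 1 letters under this scheme. Once recoded, the sequences could in principle be searched with a protein aligner and a 16-letter substitution matrix such as the BlosumR matrix the abstract mentions; that step is not shown here.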