987 resultados para EUKARYOTIC GENOMES
Resumo:
Free-living amoebae serve as hosts for a variety of amoebae-resisting microorganisms, including giant viruses and certain bacteria. The latter include symbiotic bacteria as well as bacteria exhibiting a pathogenic phenotype towards amoebae. Amoebae-resisting bacteria have been shown to be widespread in water and to use the amoebae as a reservoir, a replication niche, a protective armour as well as a training ground to select virulence traits allowing survival in the face of microbicidal effects of macrophages, the first line of defense against invading pathogens. More importantly, amoebae play a significant role as a melting pot for genetic exchanges. These ecological and evolutionary roles of amoebae might also be at play for giant viruses and knowledge derived from the study of amoebae-resisting bacteria is useful for the study and understanding of interactions between amoebae and giant viruses. This is especially important since some genes have spread in all domains of life and the exponential availability of eukaryotic genomes and metagenomic sequences will allow researchers to explore these genetic exchanges in a more comprehensive way, thus completely changing our perception of the evolutionary history of organisms. Thus, a large part of this review is dedicated to report current known gene exchanges between the different amoebae-resisting organisms and between amoebae and the internalized bacteria.
Resumo:
Background: Transposable elements (TEs) constitute a substantial amount of all eukaryotic genomes. They induce an important proportion of deleterious mutations by insertion into genes or gene regulatory regions. However, their mutational capabilities are not always adverse but can contribute to the genetic diversity and evolution of organisms. Knowledge of their distribution and activity in the genomes of populations under different environmental and demographic regimes, is important to understand their role in species evolution. In this work we study the chromosomaldistribution of two TEs, gypsy and bilbo, in original and colonizing populations of Drosophilasubobscura to reveal the putative effect of colonization on their insertion profile.Results: Chromosomal frequency distribution of two TEs in one original and three colonizingpopulations of D. subobscura, is different. Whereas the original population shows a low insertionfrequency in most TE sites, colonizing populations have a mixture of high (frequency ¿ 10%) andlow insertion sites for both TEs. Most highly occupied sites are coincident among colonizingpopulations and some of them are correlated to chromosomal arrangements. Comparisons of TEcopy number between the X chromosome and autosomes show that gypsy occupancy seems to becontrolled by negative selection, but bilbo one does not. Conclusion: These results are in accordance that TEs in Drosophila subobscura colonizing populations are submitted to a founder effect followed by genetic drift as a consequence of colonization. This would explain the high insertion frequencies of bilbo and gypsy in coincident sites of colonizing populations. High occupancy sites would represent insertion events prior to colonization. Sites of low frequency would be insertions that occurred after colonization and/orcopies from the original population whose frequency is decreasing in colonizing populations. Thiswork is a pioneer attempt to explain the chromosomal distribution of TEs in a colonizing specieswith high inversion polymorphism to reveal the putative effect of arrangements in TE insertionprofiles. In general no associations between arrangements and TE have been found, except in a fewcases where the association is very strong. Alternatively, founder drift effects, seem to play aleading role in TE genome distribution in colonizing populations.
Resumo:
Matrix attachment regions are DNA sequences found throughout eukaryotic genomes that are believed to define boundaries interfacing heterochromatin and euchromatin domains, thereby acting as epigenetic regulators. When included in expression vectors, MARs can improve and sustain transgene expression, and a search for more potent novel elements is therefore actively pursued to further improve recombinant protein production. Here we describe the isolation of new MARs from the mouse genome using a modified in silico analysis. One of these MARs was found to be a powerful activator of transgene expression in stable transfections. Interestingly, this MAR also increased GFP and/or immunoglobulin expression from some but not all expression vectors in transient transfections. This effect was attributed to the presence or absence of elements on the vector backbone, providing an explanation for earlier discrepancies as to the ability of this class of elements to affect transgene expression under such conditions.
Resumo:
Eukaryotic genomes are compartmentalized in different structural domains that can affect positively or negatively gene expression. These regions of euchromatin and heterochromatin are characterized by distinct histones marks which can facilitate or repress gene transcription. The chromatin environment represents thus one of the main problems to control gene expression in biotechnological applications or gene therapy, since its expression is affected by the chromatin neighboring its locus of insertion. Some chromatin regions like telomeres are composed of constitutive heterochromatin which leads to the telomeric position effect (TPE) that silences genes adjacent to the telomere. TPE is known to spread by the selfrecruitment of the SIR histone deacetylase complex from the telomere in S.cerevisiae, but the histone marks that are associated to telomeric chromatin in mammalian cells remain mostly unknown. The transcription factor CTF1 has shown antisilencing properties in mammalian cells and also a boundary activity against TPE in yeast cells when fused to the yeast Gal4 DNA binding domain. In the work presented here, we describe a dual-reporter system to assess the boundary activity of proteins such as CTF1 at human telomeres. When located between the two reporter genes, CTF1 shields the telomere distal gene from TPE, while the telomereproximal gene remains silenced by telomeric heterochromatin. The boundary activity of CTF1 is shown to act regardless its function of transcriptional activator, by opposition to the transcriptional activator VP16 which activates indifferently both transgenes. Moreover, this study shows that CTF1 boundary activity is linked to its H3 binding function, as expected from a chromatin remodeler. ChIP experiments showed that histone deacetylation is the main histone modification involved in gene silencing at mammalian cell telomeres. Distinctly to yeast cells, the histone deacetylation signal in human cells extented over a short range along the chromosome. CTF1 may help to block this propagation and therefore to restore histones acetylation level on telomere protected locus. Surprisingly, other histone marks such as trimethyl-H3K9 or trimethyl-H4K20 were found on telomere protected locus, while in another clone, unsilencing of telomere distal transgene was associated with recruitment of the histone variant H2A.Z. Thus, I conclude that CTF1 displays a chromatin boundary function which is independent of its transcriptional activity and therefore exhibit features required for use as chromatin insulator in biotechnological applications. RESUME Les génomes eucaryotes sont compartementalisés en domaines structurels qui peuvent affecter positivement ou négativement l'expression des gènes avoisinants. Ces régions dites d'euchromatine ou d'hétérochromatine sont caractérisées par des modifications posttraductionnelles des histones qui peuvent faciliter ou au contraire inhiber la transcription des gènes qui s'y trouvent. Ainsi, isoler un gène de son environnement chromatinien est problème fréquent lorsqu'il s'agit de contrôler son expression dans le cadre d'applications en biotechnologie ou encore en thérapie génique. Certaines régions de chromatine telles que les télomères sont composées d'hétérochromatine constitutive qui mène au silençage des gènes avoisinants. Cet effet de position télomérique (TPE) est connu dans la levure S.cerevisiae comme se propageant par auto-recrutement du complexe de déacétylation d'histone SIR, alors que peu de modifications de chromatine ont pu être associées à ce phénomène dans les cellules de mammifères. Le facteur de transcription CTF1 a montré des propriétés d'anti-silençage dans les cellules de mammifères, ainsi qu'une activité barrière contre le silençage télomérique dans les cellules de levures lorsqu'il est fusionné au domaine de liaison à l'ADN de la protéine de levure Gal4. Dans le travail présenté ci-après est décrit un système à deux gènes rapporteurs permettant de mesurer l'activité barrière de protéines telles que CTF1 aux télomères humains, et les modifications de chromatine qui y sont associées. Lorsque CTF1 est placé entre les deux gènes rapporteurs, le gène distant du télomère est protégé du silençage qui lui est associé, alors que le gène proche du télomère reste soumis à ce silençage induit par l'hétérochromatine télomérique. L'activité barrière de CTF1 est montrée ici comme agissant indépendamment de son activité transcriptionnelle, par opposition à l'activateur transcriptionnel VP16 qui active indifféremment les deux transgènes. En outre, cette étude appuie l'hypothèse stipulant que CTF1 agisse comme remodeleur chromatinien puisqu'elle démontre que son activité barrière est directement dépendante de son activité de liaison avec l'histone H3. De plus, des expériences d'immuno-précipitation de la chromatine démontrent que la déacétylation des histones est le majeur phénomène intervenant dans le silençage télomérique. Par opposition à la levure, ce signal de déacétylation ne se propage dans les cellules humaines que sur une courte distance le long du chromosome. CTF1 agit ainsi en bloquant cette propagation et en restaurant le niveau d'acétylation des histones sur le locus protégé du télomère. De manière surprenante et inattendue, d'autres modifications d'histones telles que 4 les H3K9 et H4K20 triméthylées sont aussi observées à ce locus, tandis le recrutement du variant H2A.Z peut aussi être suffisant à restaurer l'expression du gène distant du télomère. En terme de cette analyse, CTF1 exhibe ainsi une fonction de barrière chromatinienne qui exclue une activité transcriptionnelle non désirée - propriété qui est requise dans l'établissement des isolateurs visant à permettre le contrôle d'un transgène dans le cadre d'applications en biotechnologies.
Resumo:
Background: It has been shown in a variety of organisms, including mammals, that genes that appeared recently in evolution, for example orphan genes, evolve faster than older genes. Low functional constraints at the time of origin of novel genes may explain these results. However, this observation has been recently attributed to an artifact caused by the inability of Blast to detect the fastest genes in different eukaryotic genomes. Distinguishing between these two possible explanations would be of great importance for any studies dealing with the taxon distribution of proteins and the origin of novel genes. Results: Here we used simulations of protein sequences to examine the capacity of Blast to detect proteins of diverse evolutionary rates in the different species of an eukaryotic phylogenetic tree that included metazoans, fungi and plants. We simulated the evolution of protein genes with the same evolutionary rates than those observed in functional mammalian genes and with among-site rate heterogeneity. Under these conditions, we found that only a very small percentage of simulated ancestral eukaryotic proteins was affected by the Blast artifact. We show that the good detectability of Blast is due to the heterogeneity of protein evolutionary rates at different sites, since only a small conserved motif in a sequence suffices to detect its homologues. Our results indicate that Blast, at least when applied within eukaryotes, only misses homologues of extremely fast-evolving sequences, which are rare in the mammalian genome, as well as sequences evolving homogeneously or pseudogenes.Conclusion: Although great care should be exercised in the recognition of remote homologues, most functional mammalian genes can be detected in eukaryotic genomes by Blast. That is, the majority of functional mammalian genes are not as fast as for not being detected in other metazoans, fungi or plants, if they had been present in these organisms. Thus, the correlation previously found between age and rate seems not to be due to a pure Blast artifact, at least for mammals. This may have important implications to understand the mechanisms by which novel genes originate.
Resumo:
Motivation: We compare phylogenetic approaches for inferring functional gene links. The approaches detect independent instances of the correlated gain and loss of pairs of genes from species' genomes. We investigate the effect on results of basing evidence of correlations on two phylogenetic approaches, Dollo parsminony and maximum likelihood (ML). We further examine the effect of constraining the ML model by fixing the rate of gene gain at a low value, rather than estimating it from the data. Results: We detect correlated evolution among a test set of pairs of yeast (Saccharomyces cerevisiae) genes, with a case study of 21 eukaryotic genomes and test data derived from known yeast protein complexes. If the rate at which genes are gained is constrained to be low, ML achieves by far the best results at detecting known functional links. The model then has fewer parameters but it is more realistic by preventing genes from being gained more than once. Availability: BayesTraits by M. Pagel and A. Meade, and a script to configure and repeatedly launch it by D. Barker and M. Pagel, are available at http://www.evolution.reading.ac.uk .
Resumo:
Trypanosoma cruzi is highly diverse genetically and has been partitioned into six discrete typing units (DTUs), recently re-named T. cruzi I-VI. Although T. cruzi reproduces predominantly by binary division, accumulating evidence indicates that particular DTUs are the result of hybridization events. Two major scenarios for the origin of the hybrid lineages have been proposed. It is accepted widely that the most heterozygous TcV and TcVI DTUs are the result of genetic exchange between TcII and TcIII strains. On the other hand, the participation of a TcI parental in the current genome structure of these hybrid strains is a matter of debate. Here, sequences of the T. cruzi-specific 195-bp satellite DNA of TcI, TcII, Tat, TcV, and TcVI strains have been used for inferring network genealogies. The resulting genealogy showed a high degree of reticulation, which is consistent with more than one event of hybridization between the Tc DTUs. The data also strongly suggest that Tat is a hybrid with two distinct sets of satellite sequences, and that genetic exchange between TcI and TcII parentals occurred within the pedigree of the TcV and TcVI DTUs. Although satellite DNAs belong to the fast-evolving portion of eukaryotic genomes, in >100 satellite units of nine T. cruzi strains we found regions that display 100% identity. No DTU-specific consensus motifs were identified, inferring species-wide conservation. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Transposable elements (TE) are major components of eukaryotic genomes and involved in cell regulation and organism evolution. We have analyzed 123,889 expressed sequence tags of the Eucalyptus Genome Project database and found 124 sequences representing 76 TE in 9 groups, of which copia, MuDR and FAR1 groups were the most abundant. The low amount of sequences of TE may reflect the high efficiency of repression of these elements, a process that is called TE silencing. Frequency of groups of TE in Eucalyptus libraries which were prepared with different tissues or physiologic conditions from seedlings or adult plants indicated that developing plants experience the expression of a much wider spectrum of TE groups than that seen in adult plants. These are preliminary results that identify the most relevant TE groups involved with Eucalyptus development, which is important for industrial wood production. Copyright by the Brazilian Society of Genetics.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Transposons are abundant components of eukaryotic genomes, and play important role in genome evolution. The knowledge about these elements should contribute to the understanding of their impact on the host genomes. The hAT transposon superfamily is one of the best characterized superfamilies in diverse organisms, nevertheless, a detailed study of these elements was never carried in sugarcane. To address this question we analyzed 32 cDNAs similar to that of hAT superfamily of transposons previously identified in the sugarcane transcriptome. Our results revealed that these hAT-like transposases cluster in one highly homogeneous and other more heterogeneous lineage. We present evidences that support the hypothesis that the highly homogeneous group is a domesticated transposase while the remainder of the lineages are composed of transposon units. The first is common to grasses, clusters significantly with domesticated transposases from Arabidopsis, rice and sorghum and is expressed in different tissues of two sugarcane cultivars analyzed. In contrast, the more heterogeneous group represents at least two transposon lineages. We recovered five genomic versions of one lineage, characterizing a novel transposon family with conserved DDE motif, named SChAT. These results indicate the presence of at least three distinct lineages of hAT-like transposase paralogues in sugarcane genome, including a novel transposon family described in Saccharum and a domesticated transposase. Taken together, these findings permit to follow the diversification of some hAT transposase paralogues in sugarcane, aggregating knowledge about the co-evolution of transposons and their host genomes.
Resumo:
Mobile elements are widely present in eukaryotic genomes. They are repeated DNA segments that are able to move from one locus to another within the genome. They are divided into two main categories, depending on their mechanism of transposition, involving RNA (class I) or DNA (class II) molecules. The mariner-like elements are class II transposons. They encode their own transposase, which is necessary and sufficient for transposition in the absence of host factors. They are flanked by a short inverted terminal repeat and a TA dinucleotide target site, which is duplicated upon insertion. The transposase consists of two domains, an N-terminal inverted terminal repeat binding domain and a C-terminal catalytic domain. We identified a transposable element with molecular characteristics of a mariner-like element in Atta sexdens rubropilosa genome. Identification started from a PCR with degenerate primers and queen genomic DNA templates, with which it was possible to amplify a fragment with mariner transposable-element homology. Phylogenetic analysis demonstrated that this element belongs to the mauritiana subfamily of mariner-like elements and it was named Asmar1. We found that Asmar1 is homologous to a transposon described from another ant, Messor bouvieri. The predicted transposase sequence demonstrated that Asmar1 has a truncated transposase ORF. This study is part of a molecular characterization of mobile elements in the Atta spp genome. Our finding of mariner-like elements in all castes of this ant could be useful to help understand the dynamics of mariner-like element distribution in the Hymenoptera.
Resumo:
The continuous increase of genome sequencing projects produced a huge amount of data in the last 10 years: currently more than 600 prokaryotic and 80 eukaryotic genomes are fully sequenced and publically available. However the sole sequencing process of a genome is able to determine just raw nucleotide sequences. This is only the first step of the genome annotation process that will deal with the issue of assigning biological information to each sequence. The annotation process is done at each different level of the biological information processing mechanism, from DNA to protein, and cannot be accomplished only by in vitro analysis procedures resulting extremely expensive and time consuming when applied at a this large scale level. Thus, in silico methods need to be used to accomplish the task. The aim of this work was the implementation of predictive computational methods to allow a fast, reliable, and automated annotation of genomes and proteins starting from aminoacidic sequences. The first part of the work was focused on the implementation of a new machine learning based method for the prediction of the subcellular localization of soluble eukaryotic proteins. The method is called BaCelLo, and was developed in 2006. The main peculiarity of the method is to be independent from biases present in the training dataset, which causes the over‐prediction of the most represented examples in all the other available predictors developed so far. This important result was achieved by a modification, made by myself, to the standard Support Vector Machine (SVM) algorithm with the creation of the so called Balanced SVM. BaCelLo is able to predict the most important subcellular localizations in eukaryotic cells and three, kingdom‐specific, predictors were implemented. In two extensive comparisons, carried out in 2006 and 2008, BaCelLo reported to outperform all the currently available state‐of‐the‐art methods for this prediction task. BaCelLo was subsequently used to completely annotate 5 eukaryotic genomes, by integrating it in a pipeline of predictors developed at the Bologna Biocomputing group by Dr. Pier Luigi Martelli and Dr. Piero Fariselli. An online database, called eSLDB, was developed by integrating, for each aminoacidic sequence extracted from the genome, the predicted subcellular localization merged with experimental and similarity‐based annotations. In the second part of the work a new, machine learning based, method was implemented for the prediction of GPI‐anchored proteins. Basically the method is able to efficiently predict from the raw aminoacidic sequence both the presence of the GPI‐anchor (by means of an SVM), and the position in the sequence of the post‐translational modification event, the so called ω‐site (by means of an Hidden Markov Model (HMM)). The method is called GPIPE and reported to greatly enhance the prediction performances of GPI‐anchored proteins over all the previously developed methods. GPIPE was able to predict up to 88% of the experimentally annotated GPI‐anchored proteins by maintaining a rate of false positive prediction as low as 0.1%. GPIPE was used to completely annotate 81 eukaryotic genomes, and more than 15000 putative GPI‐anchored proteins were predicted, 561 of which are found in H. sapiens. In average 1% of a proteome is predicted as GPI‐anchored. A statistical analysis was performed onto the composition of the regions surrounding the ω‐site that allowed the definition of specific aminoacidic abundances in the different considered regions. Furthermore the hypothesis that compositional biases are present among the four major eukaryotic kingdoms, proposed in literature, was tested and rejected. All the developed predictors and databases are freely available at: BaCelLo http://gpcr.biocomp.unibo.it/bacello eSLDB http://gpcr.biocomp.unibo.it/esldb GPIPE http://gpcr.biocomp.unibo.it/gpipe