25 resultados para Genomes
Resumo:
In unicellular eukaryotes, such as Saccharomyces cerevisiae, and in multicellular organisms, the replication origin is recognized by the heterohexamer origin recognition complex (ORC) containing six proteins, Orc1 to Orc6, while in members of the domain Archaea, the replication origin is recognized by just one protein, Orc1/Cdc6; the sequence of Orc1/Cdc6 is highly related to those of Orc1 and Cdc6. Similar to Archaea, trypanosomatid genomes contain only one gene encoding a protein named Orc1. Since trypanosome Orc1 is also homologous to Cdc6, in this study we named the Orc1 protein from trypanosomes Orc1/Cdc6. Here we show that the recombinant Orc1/Cdc6 from Trypanosoma cruzi (TcOrc1/Cdc6) and from Trypanosoma brucei (TbOrc1/Cdc6) present ATPase activity, typical of prereplication machinery components. Also, TcOrc1/Cdc6 and TbOrc1/Cdc6 replaced yeast Cdc6 but not Orc1 in a phenotypic complementation assay. The induction of Orc1/Cdc6 silencing by RNA interference in T. brucei resulted in enucleated cells, strongly suggesting the involvement of Orc1/Cdc6 in DNA replication. Orc1/Cdc6 is expressed during the entire cell cycle in the nuclei of trypanosomes, remaining associated with chromatin in all stages of the cell cycle. These results allowed us to conclude that Orc1/Cdc6 is indeed a member of the trypanosome prereplication machinery and point out that trypanosomes carry a prereplication machinery that is less complex than other eukaryotes and closer to archaea.
Resumo:
Motivation: DNA assembly programs classically perform an all-against-all comparison of reads to identify overlaps, followed by a multiple sequence alignment and generation of a consensus sequence. If the aim is to assemble a particular segment, instead of a whole genome or transcriptome, a target-specific assembly is a more sensible approach. GenSeed is a Perl program that implements a seed-driven recursive assembly consisting of cycles comprising a similarity search, read selection and assembly. The iterative process results in a progressive extension of the original seed sequence. GenSeed was tested and validated on many applications, including the reconstruction of nuclear genes or segments, full-length transcripts, and extrachromosomal genomes. The robustness of the method was confirmed through the use of a variety of DNA and protein seeds, including short sequences derived from SAGE and proteome projects.
Resumo:
Leptospirosis is a world spread zoonosis caused by members of the genus Leptospira. Although leptospires were identified as the causal agent of leptospirosis almost 100 years ago, little is known about their biology, which hinders the development of new treatment and prevention strategies. One of the several aspects of the leptospiral biology not yet elucidated is the process by which outer membrane proteins (OMPs) traverse the periplasm and are inserted into the outer membrane. The crystal structure determination of the conserved hypothetical protein LIC12922 from Leptospira interrogans revealed a two domain protein homologous to the Escherichia coli periplasmic chaperone SurA. The LIC12922 NC-domain is structurally related to the chaperone modules of E. coli SurA and trigger factor, whereas the parvulin domain is devoid of peptidyl prolyl cis-trans isomerase activity. Phylogenetic analyses suggest a relationship between LIC12922 and the chaperones PrsA, PpiD and SurA. Based on our structural and evolutionary analyses, we postulate that LIC12922 is a periplasmic chaperone involved in OMPs biogenesis in Leptospira spp. Since LIC12922 homologs were identified in all spirochetal genomes sequenced to date, this assumption may have implications for the OMPs biogenesis studies not only in leptospires but in the entire Phylum Spirochaetes. (C) 2010 Elsevier Inc. All rights reserved.
Resumo:
This paper explores the structural continuum in CATH and the extent to which superfamilies adopt distinct folds. Although most superfamilies are structurally conserved, in some of the most highly populated superfamilies (4% of all superfamilies) there is considerable structural divergence. While relatives share a similar fold in the evolutionary conserved core, diverse elaborations to this core can result in significant differences in the global structures. Applying similar protocols to examine the extent to which structural overlaps occur between different fold groups, it appears this effect is confined to just a few architectures and is largely due to small, recurring super-secondary motifs (e.g., alpha beta-motifs, alpha-hairpins). Although 24% of superfamilies overlap with superfamilies having different folds, only 14% of nonredundant structures in CATH are involved in overlaps. Nevertheless, the existence of these overlaps suggests that, in some regions of structure space, the fold universe should be seen as more continuous.
Resumo:
The latest version of CATH (class, architecture, topology, homology) (version 3.2), released in July 2008 (http://www.cathdb.info), contains 1 14215 domains, 2178 Homologous superfamilies and 1110 fold groups. We have assigned 20 330 new domains, 87 new homologous superfamilies and 26 new folds since CATH release version 3.1. A total of 28 064 new domains have been assigned since our NAR 2007 database publication (CATH version 3.0). The CATH website has been completely redesigned and includes more comprehensive documentation. We have revisited the CATH architecture level as part of the development of a `Protein Chart` and present information on the population of each architecture. The CATHEDRAL structure comparison algorithm has been improved and used to characterize structural diversity in CATH superfamilies and structural overlaps between superfamilies. Although the majority of superfamilies in CATH are not structurally diverse and do not overlap significantly with other superfamilies, similar to 4% of superfamilies are very diverse and these are the superfamilies that are most highly populated in both the PDB and in the genomes. Information on the degree of structural diversity in each superfamily and structural overlaps between superfamilies can now be downloaded from the CATH website.
Resumo:
This study develops a simplified model describing the evolutionary dynamics of a population composed of obligate sexually and asexually reproducing, unicellular organisms. The model assumes that the organisms have diploid genomes consisting of two chromosomes, and that the sexual organisms replicate by first dividing into haploid intermediates, which then combine with other haploids, followed by the normal mitotic division of the resulting diploid into two new daughter cells. We assume that the fitness landscape of the diploids is analogous to the single-fitness-peak approach often used in single-chromosome studies. That is, we assume a master chromosome that becomes defective with just one point mutation. The diploid fitness then depends on whether the genome has zero, one, or two copies of the master chromosome. We also assume that only pairs of haploids with a master chromosome are capable of combining so as to produce sexual diploid cells, and that this process is described by second-order kinetics. We find that, in a range of intermediate values of the replication fidelity, sexually reproducing cells can outcompete asexual ones, provided the initial abundance of sexual cells is above some threshold value. The range of values where sexual reproduction outcompetes asexual reproduction increases with decreasing replication rate and increasing population density. We critically evaluate a common approach, based on a group selection perspective, used to study the competition between populations and show its flaws in addressing the evolution of sex problem.
Resumo:
Xylella fastidiosa is the etiologic agent of a wide range of plant diseases, including citrus variegated chlorosis (CVC), a major threat to citrus industry. The genomes of several strains of this phytopathogen were completely sequenced, enabling large-scale functional studies. DNA microarrays representing 2,608 (91.6%) coding sequences (CDS) of X. fastidiosa CVC strain 9a5c were used to investigate transcript levels during growth with different iron availabilities. When treated with the iron chelator 2,2`-dipyridyl, 193 CDS were considered up-regulated and 216 were considered down-regulated. Upon incubation with 100 mu M ferric pyrophosphate, 218 and 256 CDS were considered up- and down-regulated, respectively. Differential expression for a subset of 44 CDS was further evaluated by reverse transcription-quantitative PCR. Several CDS involved with regulatory functions, pathogenicity, and cell structure were modulated under both conditions assayed, suggesting that major changes in cell architecture and metabolism occur when X. fastidiosa cells are exposed to extreme variations in iron concentration. Interestingly, the modulated CDS include those related to colicin V-like bacteriocin synthesis and secretion and to functions of pili/fimbriae. We also investigated the contribution of the ferric uptake regulator Fur to the iron stimulon of X. fastidiosa. The promoter regions of the strain 9a5c genome were screened for putative Fur boxes, and candidates were analyzed by electrophoretic mobility shift assays. Taken together, our data support the hypothesis that Fur is not solely responsible for the modulation of the iron stimulon of X fastidiosa, and they present novel evidence for iron regulation of pathogenicity determinants.
Resumo:
Trypanosoma cruzi is highly diverse genetically and has been partitioned into six discrete typing units (DTUs), recently re-named T. cruzi I-VI. Although T. cruzi reproduces predominantly by binary division, accumulating evidence indicates that particular DTUs are the result of hybridization events. Two major scenarios for the origin of the hybrid lineages have been proposed. It is accepted widely that the most heterozygous TcV and TcVI DTUs are the result of genetic exchange between TcII and TcIII strains. On the other hand, the participation of a TcI parental in the current genome structure of these hybrid strains is a matter of debate. Here, sequences of the T. cruzi-specific 195-bp satellite DNA of TcI, TcII, Tat, TcV, and TcVI strains have been used for inferring network genealogies. The resulting genealogy showed a high degree of reticulation, which is consistent with more than one event of hybridization between the Tc DTUs. The data also strongly suggest that Tat is a hybrid with two distinct sets of satellite sequences, and that genetic exchange between TcI and TcII parentals occurred within the pedigree of the TcV and TcVI DTUs. Although satellite DNAs belong to the fast-evolving portion of eukaryotic genomes, in >100 satellite units of nine T. cruzi strains we found regions that display 100% identity. No DTU-specific consensus motifs were identified, inferring species-wide conservation. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
It has been postulated that noncoding RNAs (ncRNAs) are involved in the posttranscriptional control of gene expression, and may have contributed to the emergence of the complex attributes observed in mammalians. We show here that the complement of ncRNAs expressed from intronic regions of the human and mouse genomes comprises at least 78,147 and 39,660 transcriptional units, respectively. To identify conserved intronic sequences expressed in both humans and mice, we used custom-designed human cDNA microarrays to separately interrogate RNA from mouse and human liver, kidney, and prostate tissues. An overlapping tissue expression signature was detected for both species, comprising 198 transcripts; among these, 22 RNAs map to intronic regions with evidence of evolutionary conservation in humans and mice. Transcription of selected human-mouse intronic ncRNAs was confirmed using strand-specific RT-PCR. Altogether, these results support an evolutionarily conserved role of intronic ncRNAs in human and mouse, which are likely to be involved in the fine tuning of gene expression regulation in different mammalian tissues. (C) 2008 Elsevier Inc. All rights reserved.
Resumo:
The 195-bp satellite DNA is the most abundant Trypanosoma cruzi repetitive sequence. Here we show by RNA blotting and RT-PCR that 195 SAT is intensely transcribed. We observed a positive correlation between the level of satellite RNA and the abundance of the satellite copies in the genome of T cruzi strains and that the satellite expression is not developmentally regulated. By analyzing CL Brener individual reads, we estimated that 195 SAT corresponds to approximately 5% of the CL Brener genome. 195 SAT elements were found in only 37 annotated contigs, indicating that a large number of satellite copies were not incorporated into the assembled data. The assembled satellite units are distributed in non-syntenic regions with Trypanosoma brucei and Leishmania major genomes, enriched with surface proteins, retroelements, RHS and hypothetical proteins. Satellite repeats were not observed in annotated subtelomeric regions. We report that 12 satellite sequences are truncated by the retroelement VIPER. (C) 2008 Elsevier B.V. All rights reserved.