927 resultados para PROTEIN-CODING GENES


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Protozoan parasites cause thousands of deaths each year in developing countries. The genome projects of these parasites opened a new era in the identification of therapeutic targets. However, the putative function could be predicted for fewer than half of the protein-coding genes. In this work, all Trypanosoma cruzi proteins containing predicted transmembrane spans were processed through an automated computational routine and further analyzed in order to assign the most probable function. The analysis consisted of dissecting the whole predicted protein in different regions. More than 5,000 sequences were processed, and the predicted biological functions were grouped into 19 categories according to the hits obtained after analysis. One focus of interest, due to the scarce information available on trypanosomatids, is the proteins involved in signal-transduction processes. In the present work, we identified 54 proteins belonging to this group, which were individually analyzed. The results show that by means of a simple pipeline it was possible to attribute probable functions to sequences annotated as coding for "hypothetical proteins.'' Also, we successfully identified the majority of candidates participating in the signal-transduction pathways in T. cruzi.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In silico analyses of Leishmania spp. genome data are a powerful resource to improve the understanding of these pathogens' biology. Trypanosomatids such as Leishmania spp. have their protein-coding genes grouped in long polycistronic units of functionally unrelated genes. The control of gene expression happens by a variety of posttranscriptional mechanisms. The high degree of synteny among Leishmania species is accompanied by highly conserved coding sequences (CDS) and poorly conserved intercoding untranslated sequences. To identify the elements involved in the control of gene expression, we conducted an in silico investigation to find conserved intercoding sequences (CICS) in the genomes of L major, L infantum, and L braziliensis. We used a combination of computational tools, such as Linux-Shell, PERL and R languages, BLAST, MSPcrunch, SSAKE, and Pred-A-Term algorithms to construct a pipeline which was able to: (i) search for conservation in target-regions, (ii) eliminate CICS redundancy and mask repeat elements, (iii) predict the mRNA's extremities, (iv) analyze the distribution of orthologous genes within the generated LeishCICS-clusters, (v) assign GO terms to the LeishCICS-clusters. and (vi) provide statistical support for the gene-enrichment annotation. We associated the LeishCICS-cluster data, generated at the end of the pipeline, with the expression profile oft. donovani genes during promastigote-amastigote differentiation, as previously evaluated by others (GEO accession: GSE21936). A Pearson's correlation coefficient greater than 0.5 was observed for 730 LeishCICS-clusters containing from 2 to 17 genes. The designed computational pipeline is a useful tool and its application identified potential regulatory cis elements and putative regulons in Leishmania. (C) 2012 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The addition of a capped mini-exon [spliced leader (SL)] through trans-splicing is essential for the maturation of RNA polymerase (pol) II-transcribed polycistronic pre-mRNAs in all members of the Trypanosomatidae family. This process is an inter-molecular splicing reaction that follows the same basic rules of cis-splicing reactions. In this study, we demonstrated that mini-exons were added to precursor ribosomal RNA (pre-rRNA) are transcribed by RNA pol I, including the 5' external transcribed spacer (ETS) region. Additionally, we detected the SL-5' ETS molecule using three distinct methods and located the acceptor site between two known 5' ETS rRNA processing sites (A' and A1) in four different trypanosomatids. Moreover, we detected a polyadenylated 5' ETS upstream of the trans-splicing acceptor site, which also occurs in pre-mRNA trans-splicing. After treatment with an indirect trans-splicing inhibitor (sinefungin), we observed SL-5' ETS decay. However, treatment with 5-fluorouracil (a precursor of RNA synthesis that inhibits the degradation of pre-rRNA) led to the accumulation of SL-5' ETS, suggesting that the molecule may play a role in rRNA degradation. The detection of trans-splicing in these molecules may indicate broad RNA-joining properties, regardless of the polymerase used for transcription.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Schistosoma mansoni is responsible for schistosomiasis, a parasitic disease that affects 200 million people worldwide. Molecular mechanisms of host-parasite interaction are complex and involve a crosstalk between host signals and parasite receptors. TGF-beta signaling pathway has been shown to play an important role in S. mansoni development and embryogenesis. In particular human (h) TGF-beta has been shown to bind to a S. mansoni receptor, transduce a signal that regulates the expression of a schistosome target gene. Here we describe 381 parasite genes whose expression levels are affected by in vitro treatment with hTGF-beta. Among these differentially expressed genes we highlight genes related to morphology, development and cell cycle that could be players of cytokine effects on the parasite. We confirm by qPCR the expression changes detected with microarrays for 5 out of 7 selected genes. We also highlight a set of non-coding RNAs transcribed from the same loci of protein-coding genes that are differentially expressed upon hTCF-beta treatment. These datasets offer potential targets to be explored in order to understand the molecular mechanisms behind the possible role of hTGF-beta effects on parasite biology. (C) 2012 Elsevier B.V. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Schistosoma mansoni is one of the agents of schistosomiasis, a chronic and debilitating disease. Here we, present a transcriptome-wide characterization of adult S. mansoni males by high-throughput RNA-sequencing. We obtained 1,620,432 high-quality ESTs from a directional strand-specific cDNA library, resulting in a 26% higher coverage of genome bases than that of the public ESTs available at NCBI. With a 15 x-deep coverage of transcribed genomic regions, our data were able to (i) confirm for the first time 990 predictions without previous evidence of transcription; (ii) correct gene predictions; (iii) discover 989 and 1196 RNA-seq contigs that map to intergenic and intronic genomic regions, respectively, where no gene had been predicted before. These contigs could represent new protein-coding genes or non-coding RNAs (ncRNAs). Interestingly, we identified 11 novel Micro-exon genes (MEGs). These data reveal new features of the S. mansoni transcriptional landscape and significantly advance our understanding of the parasite transcriptome. (c) 2011 Elsevier Inc. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Abstract Background The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome. Results Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA+ northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis. In agreement with the proposal that this locus is co-regulated in response to microorganisms infection, we show here that SP212 is also up-regulated upon injury. Conclusion Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that the computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Abstract Background Pancreatic ductal adenocarcinoma (PDAC) is known by its aggressiveness and lack of effective therapeutic options. Thus, improvement in current knowledge of molecular changes associated with pancreatic cancer is urgently needed to explore novel venues of diagnostics and treatment of this dismal disease. While there is mounting evidence that long noncoding RNAs (lncRNAs) transcribed from intronic and intergenic regions of the human genome may play different roles in the regulation of gene expression in normal and cancer cells, their expression pattern and biological relevance in pancreatic cancer is currently unknown. In the present work we investigated the relative abundance of a collection of lncRNAs in patients' pancreatic tissue samples aiming at identifying gene expression profiles correlated to pancreatic cancer and metastasis. Methods Custom 3,355-element spotted cDNA microarray interrogating protein-coding genes and putative lncRNA were used to obtain expression profiles from 38 clinical samples of tumor and non-tumor pancreatic tissues. Bioinformatics analyses were performed to characterize structure and conservation of lncRNAs expressed in pancreatic tissues, as well as to identify expression signatures correlated to tissue histology. Strand-specific reverse transcription followed by PCR and qRT-PCR were employed to determine strandedness of lncRNAs and to validate microarray results, respectively. Results We show that subsets of intronic/intergenic lncRNAs are expressed across tumor and non-tumor pancreatic tissue samples. Enrichment of promoter-associated chromatin marks and over-representation of conserved DNA elements and stable secondary structure predictions suggest that these transcripts are generated from independent transcriptional units and that at least a fraction is under evolutionary selection, and thus potentially functional. Statistically significant expression signatures comprising protein-coding mRNAs and lncRNAs that correlate to PDAC or to pancreatic cancer metastasis were identified. Interestingly, loci harboring intronic lncRNAs differentially expressed in PDAC metastases were enriched in genes associated to the MAPK pathway. Orientation-specific RT-PCR documented that intronic transcripts are expressed in sense, antisense or both orientations relative to protein-coding mRNAs. Differential expression of a subset of intronic lncRNAs (PPP3CB, MAP3K14 and DAPK1 loci) in metastatic samples was confirmed by Real-Time PCR. Conclusion Our findings reveal sets of intronic lncRNAs expressed in pancreatic tissues whose abundance is correlated to PDAC or metastasis, thus pointing to the potential relevance of this class of transcripts in biological processes related to malignant transformation and metastasis in pancreatic cancer.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Abstract Background Intronic and intergenic long noncoding RNAs (lncRNAs) are emerging gene expression regulators. The molecular pathogenesis of renal cell carcinoma (RCC) is still poorly understood, and in particular, limited studies are available for intronic lncRNAs expressed in RCC Methods Microarray experiments were performed with custom-designed arrays enriched with probes for lncRNAs mapping to intronic genomic regions. Samples from 18 primary RCC tumors and 11 nontumor adjacent matched tissues were analyzed. Meta-analyses were performed with microarray expression data from three additional human tissues (normal liver, prostate tumor and kidney nontumor samples), and with large-scale public data for epigenetic regulatory marks and for evolutionarily conserved sequences. Results A signature of 29 intronic lncRNAs differentially expressed between RCC and nontumor samples was obtained (false discovery rate (FDR) <5%). A signature of 26 intronic lncRNAs significantly correlated with the RCC five-year patient survival outcome was identified (FDR <5%, p-value ≤0.01). We identified 4303 intronic antisense lncRNAs expressed in RCC, of which 22% were significantly (p <0.05) cis correlated with the expression of the mRNA in the same locus across RCC and three other human tissues. Gene Ontology (GO) analysis of those loci pointed to 'regulation of biological processes’ as the main enriched category. A module map analysis of the protein-coding genes significantly (p <0.05) trans correlated with the 20% most abundant lncRNAs, identified 51 enriched GO terms (p <0.05). We determined that 60% of the expressed lncRNAs are evolutionarily conserved. At the genomic loci containing the intronic RCC-expressed lncRNAs, a strong association (p <0.001) was found between their transcription start sites and genomic marks such as CpG islands, RNA Pol II binding and histones methylation and acetylation. Conclusion Intronic antisense lncRNAs are widely expressed in RCC tumors. Some of them are significantly altered in RCC in comparison with nontumor samples. The majority of these lncRNAs is evolutionarily conserved and possibly modulated by epigenetic modifications. Our data suggest that these RCC lncRNAs may contribute to the complex network of regulatory RNAs playing a role in renal cell malignant transformation.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Insect storage proteins accumulate at high levels during larval development of holometabolous insects. During metamorphosis they are degraded, supplying energy and amino acids for the completion of adult development. The genome of Culex quinquefasciatus contains eleven storage protein-coding genes. Their transcripts are more abundant in larvae than in pupae and in adults. In fact, only four of these genes are transcribed in adults, two of which in blood-fed adult females but not in adult males. Transcripts corresponding to all Cx. quinquefasciatus storage proteins were detected by RT-PCR, while mass spectrometric analysis of larval and pupal proteins identified all storage proteins with the exception of one encoded by Cq LSP1.8. Our results indicate that the identified Cx. quinquefasciatus storage protein-coding genes are candidates for identifying regulatory sequences for the development of molecular tools for vector control

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Animal neocentromeres are defined as ectopic centromeres that have formed in non-centromeric locations and avoid some of the features, like the DNA satellite sequence, that normally characterize canonical centromeres. Despite this, they are stable functional centromeres inherited through generations. The only existence of neocentromeres provide convincing evidence that centromere specification is determined by epigenetic rather than sequence-specific mechanisms. For all this reasons, we used them as simplified models to investigate the molecular mechanisms that underlay the formation and the maintenance of functional centromeres. We collected human cell lines carrying neocentromeres in different positions. To investigate the region involved in the process at the DNA sequence level we applied a recent technology that integrates Chromatin Immuno-Precipitation and DNA microarrays (ChIP-on-chip) using rabbit polyclonal antibodies directed against CENP-A or CENP-C human centromeric proteins. These DNA binding-proteins are required for kinetochore function and are exclusively targeted to functional centromeres. Thus, the immunoprecipitation of DNA bound by these proteins allows the isolation of centromeric sequences, including those of the neocentromeres. Neocentromeres arise even in protein-coding genes region. We further analyzed if the increased scaffold attachment sites and the corresponding tighter chromatin of the region involved in the neocentromerization process still were permissive or not to transcription of within encoded genes. Centromere repositioning is a phenomenon in which a neocentromere arisen without altering the gene order, followed by the inactivation of the canonical centromere, becomes fixed in population. It is a process of chromosome rearrangement fundamental in evolution, at the bases of speciation. The repeat-free region where the neocentromere initially forms, progressively acquires extended arrays of satellite tandem repeats that may contribute to its functional stability. In this view our attention focalized to the repositioned horse ECA11 centromere. ChIP-on-chip analysis was used to define the region involved and SNPs studies, mapping within the region involved into neocentromerization, were carried on. We have been able to describe the structural polymorphism of the chromosome 11 centromeric domain of Caballus population. That polymorphism was seen even between homologues chromosome of the same cells. That discovery was the first described ever. Genomic plasticity had a fundamental role in evolution. Centromeres are not static packaged region of genomes. The key question that fascinates biologists is to understand how that centromere plasticity could be combined to the stability and maintenance of centromeric function. Starting from the epigenetic point of view that underlies centromere formation, we decided to analyze the RNA content of centromeric chromatin. RNA, as well as secondary chemically modifications that involve both histones and DNA, represents a good candidate to guide somehow the centromere formation and maintenance. Many observations suggest that transcription of centromeric DNA or of other non-coding RNAs could affect centromere formation. To date has been no thorough investigation addressing the identity of the chromatin-associated RNAs (CARs) on a global scale. This prompted us to develop techniques to identify CARs in a genome-wide approach using high-throughput genomic platforms. The future goal of this study will be to focalize the attention on what strictly happens specifically inside centromere chromatin.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Die phylogenetische Position der Mollusken innerhalb der Trochozoa sowie die interne Evolution der Klassen der Mollusca sind weitgehend unbekannt und wurden in meiner Arbeit anhand molekularer Merkmale untersucht. Phylogenomische Analysen zeigten in der Vergangenheit eine gute Auflösung für ursprüngliche Speziationsereignisse. Daher wurden hier drei neue EST Datensätze generiert: für Sipunculus nudus (Sipuncula), Barentsia elongata (Kamptozoa) und Lepidochitona cinerea, (Polyplacophora, Mollusca). Zusätzlich wurden gezielt Gene verschiedener Mollusken mittels RT-PCR amplifiziert. rnSowohl Kamptozoen als auch Sipunculiden wurden aufgrund morphologischer Kriterien bisher als mögliche Schwestergruppe der Mollusken gehandelt, aber die hier erzielten Ergebnisse zur Evolution der Hämerythrine, Gen-Anordnungen der mitochondrialen Genome und phylogenetische Analysen der ribosomalen und der mitochondriellen Proteine stützen diese Hypothese nicht. Die Position der Kamptozoa erwies sich hier generell als unbeständig; phylogenomische Analysen deuten eine Nähe zu den Bryozoen an, aber diese Position wird stark durch die Auswahl der Taxa beeinflusst. Dagegen weisen meine Analysen klar auf eine nähere Beziehung zwischen Annelida und Sipuncula hin. Die ribosomalen Proteine zeigen Sipuncula (und Echiura) sogar als Subtaxa der Anneliden. Wie den Mollusken fehlt den Sipunculiden jegliche Segmentierung und meine Ergebnisse legen hier die Möglichkeit des Verlusts dieses Merkmals innerhalb der Anneliden bei den Sipunculiden nahe. Innerhalb der Mollusken wurden die Solenogastren bereits als Schwestergruppe aller rezenten Mollusken vorgeschlagen. Im Rahmen meiner Arbeit wurden von drei verschiedenen Solenogastren-Arten die ersten zuverlässigen 18S rRNA-Sequenzen ermittelt, und es zeigte sich, dass alle bisher veröffentlichten 18S-Sequenzen dieser Molluskenklasse höchst unvollständig oder fehlerhaft sind. rnRibosomale Proteine sind gute phylogenetische Marker und hier wurden die Auswahl und Anzahl dieser Gene für phylogenetische Analysen optimiert. Über Sonden-basierte Detektion wurde eine sampling-Strategie getestet, die im Vergleich mit standard-phylogenomischen Ansätzen zukünftige molekulare Stammbaumrekonstruktionen mit größerem Taxonsampling ermöglicht.rn

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Mesenchymale Stamzellen (MSC) sind Vertreter der adulten Stammzellen. Sie bergen durch ihre große Plastizität ein immenses Potential für die klinische Nutzung in Form von Stammzelltherapien. Zellen dieses Typs kommen vornehmlich im Knochenmark der großen Röhrenknochen vor und können zu Knochen, Knorpel und Fettzellen differenzieren. MSC leisten einen wichtigen Beitrag im Rahmen regenerativer Prozesse, beispielsweise zur Heilung von Frakturen. Breite Studien demonstrieren bereits jetzt auch bei komplexeren Erkrankungen (z.B. Osteoporose) therapeutisch vielversprechende Einsatzmöglichkeiten. Oft kommen hierbei aus MSC gezielt differenzierte Folgelinien aus Zellkulturen zum Einsatz. Dies bedingt eine kontrollierte Steuerung der Differenzierungsprozesse in vitro. Der Differenzierung einer Stammzelle liegt eine komplexe Veränderung ihrer Genexpression zugrunde. Genexpressionsmuster zur Erhaltung und Proliferation der Stammzellen müssen durch solche, die der linienspezifischen Differenzierung dienen, ersetzt werden. Die mit der Differenzierung einhergehende, transkriptomische Neuausrichtung ist für das Verständnis der Prozesse grundlegend und wurde bislang nur unzureichend untersucht. Ziel der vorliegenden Arbeit ist eine transkriptomweite und vergleichende Genexpressionsanalyse Mesenchymaler Stammzellen und deren in vitro differenzierten Folgelinien mittels Plasmid - DNA Microarrays und Sequenziertechniken der nächsten Generation (RNA-Seq, Illumina Plattform). In dieser Arbeit diente das Hausrind (Bos taurus) als Modellorganismus, da es genetisch betrachtet eine hohe Ähnlichkeit zum Menschen aufweist und Knochenmark als Quelle von MSC gut verfügbar ist. Primärkulturen Mesenchymaler Stammzellen konnten aus dem Knochenmark von Rindern erfolgreich isoliert werden. Es wurden in vitro Zellkultur - Versuche durchgeführt, um die Zellen zu Osteoblasten, Chondrozyten und Adipozyten zu differenzieren. Zur Genexpressionsanalyse wurde RNA aus jungen MSC und einer MSC Langzeitkultur („alte MSC“), sowie aus den differenzierten Zelllinien isoliert und für nachfolgende Experimente wo nötig amplifiziert. Der Erfolg der Differenzierungen konnte anhand der Genexpression von spezifischen Markergenen und mittels histologischer Färbungen belegt werden. Hierbei zeigte sich die Differenzierung zu Osteoblasten und Adipozyten erfolgreich, während die Differenzierung zu Chondrozyten trotz diverser Modifikationen am Protokoll nicht erfolgreich durchgeführt werden konnte. Eine vergleichende Hybridisierung zur Bestimmung differentieller Genexpression (MSC vs. Differenzierung) mittels selbst hergestellter Plasmid - DNA Microarrays ergab für die Osteogenese mit Genen wie destrin und enpp1, für die undifferenzierten MSC mit dem Gen sema3c neue Kandidatengene, deren biologische Funktion aufzuklären in zukünftigen Experimenten vielversprechende Ergebnisse liefern sollte. Die Analyse der transkriptomweiten Genexpression mittels NGS lieferte einen noch umfangreicheren Einblick ins Differenzierungsgeschehen. Es zeigte sich eine hohe Ähnlichkeit im Expressionsprofil von jungen MSC und Adipozyten, sowie zwischen den Profilen der alten MSC (eine Langzeitkultur) und Osteoblasten. Die alten MSC wiesen deutliche Anzeichen für eine spontane Differenzierung in die osteogene Richtung auf. Durch Analyse der 100 am stärksten exprimierten Gene jeder Zelllinie ließen sich für junge MSC und Adipozyten besonders Gene der extrazellulären Matrix (z.B col1a1,6 ; fn1 uvm.) auffinden. Sowohl Osteoblasten, als auch die alten MSC exprimieren hingegen verstärkt Gene mit Bezug zur oxidativen Phosphorylierung, sowie ribosomale Proteine. Eine Betrachtung der differentiellen Genexpression (junge MSC vs. Differenzierung) mit anschließender Pathway Analyse und Genontologie Anreicherungsstatistik unterstützt diese Ergebnisse vor allem bei Osteoblasten, wo nun jedoch zusätzlich auch Gene zur Regulation der Knochenentwicklung und Mineralisierung in den Vordergrund treten. Für Adipozyten konnte mit Genen des „Jak-STAT signaling pathway“, der Fokalen Adhäsion, sowie Genen des „Cytokine-cytokine receptor interaction pathway“ sehr spannende Einsichten in die Biologie dieses Zelltyps erlangt werden, die sicher weiterer Untersuchungen bedürfen. In undifferenzierten MSC konnte durch differentielle Genexpressionsanalyse die Rolle des nicht kanonischen Teils des WNT Signalweges als für die Aufrechterhaltung des Stammzellstatus potentiell äußerst einflussreich ermittelt werden. Die hier diskutierten Ergebnisse zeigen beispielhaft, dass besonders mittels Genexpressionsanalyse im Hochdurchsatzverfahren wertvolle Einblicke in die komplexe Biologie der Stammzelldifferenzierung möglich sind. Als Grundlage für nachfolgende Arbeiten konnten interessante Gene ermittelt und Hypothesen zu deren Einfluss auf Stammzelleigenschaften und Differenzierungsprozesse aufgestellt werden. Um einen besseren Einblick in den Differenzierungsverlauf zu ermöglichen, könnten künftig NGS Analysen zu unterschiedlichen Differenzierungszeitpunkten durchgeführt werden. Zudem wären weitere Anstrengungen zur erfolgreichen Etablierung der chondrogenen Differenzierung zur vollständigen Analyse der Genexpression des trilinearen Differenzierungspotentials von MSC wünschenswert.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The macronuclear genome of the ciliate Oxytricha trifallax displays an extreme and unique eukaryotic genome architecture with extensive genomic variation. During sexual genome development, the expressed, somatic macronuclear genome is whittled down to the genic portion of a small fraction (∼5%) of its precursor "silent" germline micronuclear genome by a process of "unscrambling" and fragmentation. The tiny macronuclear "nanochromosomes" typically encode single, protein-coding genes (a small portion, 10%, encode 2-8 genes), have minimal noncoding regions, and are differentially amplified to an average of ∼2,000 copies. We report the high-quality genome assembly of ∼16,000 complete nanochromosomes (∼50 Mb haploid genome size) that vary from 469 bp to 66 kb long (mean ∼3.2 kb) and encode ∼18,500 genes. Alternative DNA fragmentation processes ∼10% of the nanochromosomes into multiple isoforms that usually encode complete genes. Nucleotide diversity in the macronucleus is very high (SNP heterozygosity is ∼4.0%), suggesting that Oxytricha trifallax may have one of the largest known effective population sizes of eukaryotes. Comparison to other ciliates with nonscrambled genomes and long macronuclear chromosomes (on the order of 100 kb) suggests several candidate proteins that could be involved in genome rearrangement, including domesticated MULE and IS1595-like DDE transposases. The assembly of the highly fragmented Oxytricha macronuclear genome is the first completed genome with such an unusual architecture. This genome sequence provides tantalizing glimpses into novel molecular biology and evolution. For example, Oxytricha maintains tens of millions of telomeres per cell and has also evolved an intriguing expansion of telomere end-binding proteins. In conjunction with the micronuclear genome in progress, the O. trifallax macronuclear genome will provide an invaluable resource for investigating programmed genome rearrangements, complementing studies of rearrangements arising during evolution and disease.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The parasitic protists in the genus Tritrichomonas cause significant disease in domestic cattle and cats. To assess the genetic diversity of feline and bovine isolates of Tritrichomonas foetus (Riedmüller, 1928) Wenrich and Emmerson, 1933, we used 10 different genetic regions, namely the protein coding genes of cysteine proteases 1, 2 and 4-9 (CP1, 2, 4-9) involved in the pathogenesis of the disease caused by the parasite. The cytosolic malate dehydrogenase 1 (MDH1) and internal transcribed spacer region 2 of the rDNA unit (ITS2) were included as additional markers. The gene sequences were compared with those of Tritrichomonas suis (Davaine, 1875) Morgan and Hawkins, 1948 and Tritrichomonas mobilensisCulberson et al., 1986. The study revealed 100% identity for all 10 genes among all feline isolates (=T. foetus cat genotype), 100% identity among all bovine isolates (=T. foetus cattle genotype) and a genetic distinctness of 1% between the cat and cattle genotypes of T. foetus. The cattle genotype of T. foetus was 100% identical to T. suis at nine loci (CP1, 2, 4-8, ITS2, MDH1). At CP9, three out of four T. suis isolates were identical to the T. foetus cattle genotype, while the T. suis isolate SUI-H3B sequence contained a single unique nucleotide substitution. Tritrichomonas mobilensis was 0.4% and 0.7% distinct from the cat and cattle genotypes of T. foetus, respectively. The genetic differences resulted in amino acid changes in the CP genes, most pronouncedly in CP2, potentially providing a platform for elucidation of genotype-specific host-pathogen interactions of T. foetus. On the basis of this data we judge T. suis and T. foetus to be subjective synonyms. For the first time, on objective nomenclatural grounds, the authority of T. suis is given to Davaine, 1875, rather than the commonly cited Gruby and Delafond, 1843. To maintain prevailing usage of T. foetus, we are suppressing the senior synomym T. suisDavaine, 1875 according to Article 23.9, because it has never been used as a valid name after 1899 and T. foetus is widely discussed as the cause of bovine trichomonosis. Thus bovine, feline and porcine isolates should all be given the name T. foetus. This promotes the stability of T. foetus for the veterinary and economically significant venereal parasite causing bovine trichomonosis.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Theoretical and empirical studies were conducted on the pattern of nucleotide and amino acid substitution in evolution, taking into account the effects of mutation at the nucleotide level and purifying selection at the amino acid level. A theoretical model for predicting the evolutionary change in electrophoretic mobility of a protein was also developed by using information on the pattern of amino acid substitution. The specific problems studied and the main results obtained are as follows: (1) Estimation of the pattern of nucleotide substitution in DNA nuclear genomes. The pattern of point mutations and nucleotide substitutions among the four different nucleotides are inferred from the evolutionary changes of pseudogenes and functional genes, respectively. Both patterns are non-random, the rate of change varying considerably with nucleotide pair, and that in both cases transitions occur somewhat more frequently than transversions. In protein evolution, substitution occurs more often between amino acids with similar physico-chemical properties than between dissimilar amino acids. (2) Estimation of the pattern of nucleotide substitution in RNA genomes. The majority of mutations in retroviruses accumulate at the reverse transcription stage. Selection at the amino acid level is very weak, and almost non-existent between synonymous codons. The pattern of mutation is very different from that in DNA genomes. Nevertheless, the pattern of purifying selection at the amino acid level is similar to that in DNA genomes, although selection intensity is much weaker. (3) Evaluation of the determinants of molecular evolutionary rates in protein-coding genes. Based on rates of nucleotide substitution for mammalian genes, the rate of amino acid substitution of a protein is determined by its amino acid composition. The content of glycine is shown to correlate strongly and negatively with the rate of substitution. Empirical formulae, called indices of mutability, are developed in order to predict the rate of molecular evolution of a protein from data on its amino acid sequence. (4) Studies on the evolutionary patterns of electrophoretic mobility of proteins. A theoretical model was constructed that predicts the electric charge of a protein at any given pH and its isoelectric point from data on its primary and quaternary structures. Using this model, the evolutionary change in electrophoretic mobilities of different proteins and the expected amount of electrophoretically hidden genetic variation were studied. In the absence of selection for the pI value, proteins will on the average evolve toward a mildly basic pI. (Abstract shortened with permission of author.) ^