945 resultados para Genome Sequence
Resumo:
The advent and application of high-resolution array-based comparative genome hybridization (array CGH) has led to the detection of large numbers of copy number variants (CNVs) in patients with developmental delay and/or multiple congenital anomalies as well as in healthy individuals. The notion that CNVs are also abundantly present in the normal population challenges the interpretation of the clinical significance of detected CNVs in patients. In this review we will illustrate a general clinical workflow based on our own experience that can be used in routine diagnostics for the interpretation of CNVs.
Resumo:
Understanding the genomic basis of evolutionary adaptation requires insight into the molecular basis underlying phenotypic variation. However, even changes in molecular pathways associated with extreme variation, gains and losses of specific phenotypes, remain largely uncharacterized. Here, we investigate the large interspecific differences in the ability to survive infection by parasitoids across 11 Drosophila species and identify genomic changes associated with gains and losses of parasitoid resistance. We show that a cellular immune defense, encapsulation, and the production of a specialized blood cell, lamellocytes, are restricted to a sublineage of Drosophila, but that encapsulation is absent in one species of this sublineage, Drosophila sechellia. Our comparative analyses of hemopoiesis pathway genes and of genes differentially expressed during the encapsulation response revealed that hemopoiesis-associated genes are highly conserved and present in all species independently of their resistance. In contrast, 11 genes that are differentially expressed during the response to parasitoids are novel genes, specific to the Drosophila sublineage capable of lamellocyte-mediated encapsulation. These novel genes, which are predominantly expressed in hemocytes, arose via duplications, whereby five of them also showed signatures of positive selection, as expected if they were recruited for new functions. Three of these novel genes further showed large-scale and presumably loss-of-function sequence changes in D. sechellia, consistent with the loss of resistance in this species. In combination, these convergent lines of evidence suggest that co-option of duplicated genes in existing pathways and subsequent neofunctionalization are likely to have contributed to the evolution of the lamellocyte-mediated encapsulation in Drosophila.
Resumo:
The complexity of mammalian genome organization demands a complex interplay of DNA and proteins to orchestrate proper gene regulation. CTCF, a highly conserved, ubiquitously expressed protein has been postulated as a primary organizer of genome architecture because of its roles in transcriptional activation/repression, insulation and imprinting. Diverse regulatory functions are exerted through genome wide binding via a central eleven zinc finger DNA binding domain and an array of diverse protein-protein interactions through N- and C- terminal domains. CTCFL has been identified as a paralog of CTCF expressed only in spermatogenic cells of the testis. CTCF and CTCFL have a highly homologous DNA-binding domain, while the flanking amino acid sequences exhibit no significant similarity. Genome- wide mapping of CTCF binding sites has been carried out in many cell types, but no data exist for CTCFL apart from a few identified loci. The lack of high quality antibodies prompted us to generate an endogenously flag-tagged CTCFL mouse model using BAC recombination. IHC staining using anti-flag antibodies confirmed CTCFL localization to type Β spermatogonia and preleptotene spermatocytes and a mutually exclusive pattern of expression with CTCF. ChIP followed by high-throughput sequencing identified 10,382 binding sites showing 70% overlap but representing only 20% of CTCF sites. Consensus sequence analysis identified a significantly longer binding motif with prominently less ambiguity of base calling at every position. The significant difference between CTCF and CTCFL genomic binding patterns proposes that their binding to DNA is differentially regulated. Analysis of CTCFL binding to methylated regions on a genome wide scale identified approximately 1,000 loci. Methylation-independent binding of CTCFL might be at least one of the mechanisms that ensures distinct binding patterns of CTCF and CTCFL since CTCF binding is methylation- sensitive. Co-localization of CTCF with cohesin has been well established and analysis of CTCFL and SMC3 overlap identified around 3,300 binding sites from which two related but distinct consensus sequence motifs were derived. Because virtually all data for cohesin binding originate from mitotically proliferating cells, the anticipated overlap is expected to be considerably higher in meiotic cells. Meiosis-specific cohesin subunit Rec8 is specific for spermatocytes and 6 out of the 12 identified binding sites are also bound by CTCFL. In conclusion, this was the first genome-wide mapping of CTCFL binding sites in spermatocytes, the only cell type where CTCF is not expressed. CTCFL has a unique binding site repertoire distinct from CTCF, binds to methylated sequences and shows a significant overlap with cohesin binding sites. Future efforts will be oriented towards deciphering the role CTCFL plays in conversion of chromatin structure and function from mitotic to meiotic chromosomes. - La complexité de l'organisation du génome des mammifères exige une interaction particulière entre ADN et protéines pour orchestrer une régulation appropriée de l'expression des gènes. CTCFL, une protéine ubiquitaire très conservée, serait le principal organisateur de l'architecture du génome de par son rôle dans l'activation / la répression de la transcription, la protection et la localisation des gènes. Diverses régulations sont opérées, d'une part au travers d'interactions à différents endroits du génome par le biais d'un domaine protéique central de liaison à l'ADN à onze doigts de zinc, et d'autre part par des interactions protéine-protéine variées au niveau de leur domaine N- et C-terminal. CTCFL a été identifié comme un paralogue de CTCF exprimé uniquement dans les cellules spermatiques du testicule. CTCFL et CTCF ont un domaine de liaison à l'ADN très homologue, tandis que les séquences d'acides aminés situées de part et d'autre de ce domaine ne présentent aucune similitude. Une cartographie générale des sites de liaison au CTCF a été réalisée pour de nombreux types cellulaires, mais il n'existe aucune donnée pour CTCFL à l'exception de l'identification de quelques loci. L'absence d'anticorps de bonne qualité nous a conduit à générer un modèle murin portant un CTCFL endogène taggué grâce à un procédé de recombinaison BAC. Une coloration IHC à l'aide d'anticorps anti-FLAG a confirmé la présence de CTCFL au niveau des spermatogonies de type Β et des spermatocytes au stade préleptotène, et une distribution mutuellement exclusive avec CTCF. Une méthode de Chromatine Immunoprecipitation (ChIP) suivie d'un séquençage à haut débit a permis d'identifier 10.382 sites de liaison montrant 70% d'homologie mais ne représentant que 20% des sites CTCF. L'analyse de la séquence consensus révèle un motif de fixation à l'ADN nettement plus long et qui comporte bien moins de bases aléatoires à chaque position nucléotidique. La différence significative entre les séquences génomiques des sites de liaison au CTCF et CTCFL suggère que leur fixation à l'ADN est régulée différemment. Appliquée à l'échelle du génome, l'étude de l'interaction de CTCFL avec des régions méthylées de l'ADN a permis d'identifier environ 1.000 loci. Contrairement à CTCFL, la liaison de CTCF dépend de l'état de méthylation de l'ADN ; cette modification épigénétique constitue donc au moins un des mécanismes de régulation expliquant une localisation de CTCF et CTCFL à des sites distincts du génome. La co- localisation de CTCF avec la cohésine étant établie, l'analyse de la superposition des séquences de CTCFL avec la sous-unité SMC3 identifie environ 3.300 sites de liaison parmi lesquels deux mêmes motifs consensus distincts par leur séquence sont mis en évidence. La presque quasi-totalité des données sur la cohésine ayant été établie à partir de cellules en prolifération mitotique, il est probable que la similitude au sein des séquences consensus soit encore plus grande dans le cas des cellules en méiose. La sous-unité Rec8 de la cohésine propre à l'état de méiose est spécifiquement exprimée dans les spermatocytes. Or 6 des 12 sites de liaison identifiés sont également utilisés par CTCFL. Pour conclure, ce travail constitue la première cartographie à l'échelle du génome des sites de liaison de CTCFL dans les spermatocytes, seul type cellulaire où CTCFL n'est pas exprimé. CTCFL possède un répertoire unique de sites de fixation à l'ADN distinct de CTCF, se lie à des séquences méthylées et présente un nombre important de sites de liaison communs avec la cohésine. Les perspectives futures sont d'élucider le rôle de CTCFL dans le remodelage de la structure de la chromatine et de définir sa fonction dans le processus de méiose.
Resumo:
A total of 880 expressed sequence tags (EST) originated from clones randomly selected from a Trypanosoma cruzi amastigote cDNA library have been analyzed. Of these, 40% (355 ESTs) have been identified by similarity to sequences in public databases and classified according to functional categorization of their putative products. About 11% of the mRNAs expressed in amastigotes are related to the translational machinery, and a large number of them (9% of the total number of clones in the library) encode ribosomal proteins. A comparative analysis with a previous study, where clones from the same library were selected using sera from patients with Chagas disease, revealed that ribosomal proteins also represent the largest class of antigen coding genes expressed in amastigotes (54% of all immunoselected clones). However, although more than thirty classes of ribosomal proteins were identified by EST analysis, the results of the immunoscreening indicated that only a particular subset of them contains major antigenic determinants recognized by antibodies from Chagas disease patients.
Resumo:
Trypanosoma cruzi expresses mucin like glycoproteins encoded by a complex multigene family. In this work, we report the transcription in T. cruzi but not in T. rangeli of a mucin type gene automatically annotated by the T. cruzi genome project. The gene showed no nucleotide similarities with the previously reported T. cruzi mucin like genes, although the computational analysis of the deduced protein showed that it has the characteristic features of mucins: a signal peptide sequence, O-glycosylation sites, and glycosylphosphatidylinositol (GPI) anchor sequence. The presence in this gene of N- terminal and C- terminal coding sequences common to other annotated mucin like genes suggests the existence of a new mucin like gene family.
Resumo:
The CD3ε cytoplasmic tail contains a conserved proline-rich sequence (PRS) that influences TCR-CD3 expression and signaling. Although the PRS can bind the SH3.1 domain of the cytosolic adapter Nck, whether the PRS is constitutively available for Nck binding or instead represents a cryptic motif that is exposed via conformational change upon TCR-CD3 engagement (CD3Δc) is currently unresolved. Furthermore, the extent to which a cis-acting CD3ε basic amino acid-rich stretch (BRS), with its unique phosphoinositide-binding capability, might impact PRS accessibility is not clear. In this study, we found that freshly harvested primary thymocytes expressed low to moderate basal levels of Nck-accessible PRS ("open-CD3"), although most TCR-CD3 complexes were inaccessible to Nck ("closed-CD3"). Ag presentation in vivo induced open-CD3, accounting for half of the basal level found in thymocytes from MHC(+) mice. Additional stimulation with either anti-CD3 Abs or peptide-MHC ligands further elevated open-CD3 above basal levels, consistent with a model wherein antigenic engagement induces maximum PRS exposure. We also found that the open-CD3 conformation induced by APCs outlasted the time of ligand occupancy, marking receptors that had been engaged. Finally, CD3ε BRS-phosphoinositide interactions played no role in either adoption of the initial closed-CD3 conformation or induction of open-CD3 by Ab stimulation. Thus, a basal level of open-CD3 is succeeded by a higher, induced level upon TCR-CD3 engagement, involving CD3Δc and prolonged accessibility of the CD3ε PRS to Nck.
Resumo:
The neuraminidase gene, nanH, is present in the O1, non-toxigenic Vibrio cholerae Amazonia strain. Its location has been assigned to a 150 kb NotI DNA fragment, with the use of pulsed-field gel electrophoresis and DNA hybridization. This NotI fragment is positioned inside 630 kb SfiI and 1900 kb I-CeuI fragments of chromosome 1. Association of the pathogenicity island VPI-2, carrying nanH and other genes, with toxigenic strains has been described by other authors. The presence of nanH in a non-toxigenic strain is an exception to this rule. The Amazonia strain nanH was sequenced (Genbank accession No. AY825932) and compared to available V. cholerae sequences. The sequence is different from those of pandemic strains, with 72 nucleotide substitutions. This is the first description of an O1 strain with a different nanH allele. The most variable domain of the Amazonia NanH is the second lectin wing, comprising 13 out of 17 amino acid substitutions. Based on the presence of nanH in the same region of the genome, and similarity of the adjacent sequences to VPI-2 sequences, it is proposed that the pathogenicity island VPI-2 is present in this strain.
Resumo:
A number of recent studies revealed that epigenetic modifications play a central role in the regulation of lipid and of other metabolic pathways such as cholesterol homeostasis, bile acid synthesis, glucose and energy metabolism. Epigenetics refers to aspects of genome functions regulated in a DNA sequence-independent fashion. Chromatin structure is controlled by epigenetic mechanisms through DNA methylation and histone modifications. The main modifications are histone acetylation and deacetylation on specific lysine residues operated by two different classes of enzymes: Histone acetyltransferases (HATs) and histone deacetylases (HDACs), respectively. The interaction between these enzymes and histones can activate or repress gene transcription: Histone acetylation opens and activates chromatin, while deacetylation of histones and DNA methylation compact chromatin making it transcriptionally silent. The new evidences on the importance of HDACs in the regulation of lipid and other metabolic pathways will open new perspectives in the comprehension of the pathophysiology of metabolic disorders.
Resumo:
Schistosomes have a comparatively large genome, estimated for Schistosoma mansoni to be about 270 megabase pairs (haploid genome). Recent findings have shown that mobile genetic elements constitute significant proportions of the genomes of S. mansoni and S. japonicum. Much less information is available on the genome of the third major human schistosome, S. haematobium. In order to investigate the possible evolutionary origins of the S. mansoni long terminal repeat retrotransposons Boudicca and Sinbad, several genomes were searched by Southern blot for the presence of these retrotransposons. These included three species of schistosomes, S. mansoni, S. japonicum, and S. haematobium, and three related platyhelminth genomes, the liver flukes Fasciola hepatica and Fascioloides magna and the planarian, Dugesia dorotocephala. In addition, Homo sapiens and three snail host genomes, Biomphalaria glabrata, Oncomelania hupensis, and Bulinus truncatus, were examined for possible indications of a horizontal origin for these retrotransposons. Southern hybridization analysis indicated that both Boudicca and Sinbad were present in the genome of S. haematobium. Furthermore, low stringency Southern hybridization analyses suggested that a Boudicca-like retrotransposon was present in the genome of B. truncatus, the snail host of S. haematobium.
Resumo:
In Xenopus laevis four estrogen-responsive genes are expressed simultaneously to produce vitellogenin, the precursor of the yolk proteins. One of these four genes, the gene A2, was sequenced completely, as well as cDNAs representing 75% of the coding region of the gene. From this data the exon-intron structure of the gene was established, revealing 35 exons that give a transcript of 5,619 bp without the poly A-tail. This A2 transcript encodes a vitellogenin of 1,807 amino acids, whose structure is discussed with respect to its function. At the nucleic acid as well as at the protein level no extensive homologies with any sequences other than vitellogenin were observed. Comparison of the amino acid sequence of the vitellogenin A2 molecule with biochemical data obtained from the different yolk proteins allowed us to localize the cleavage products on the vitellogenin precursor as follows: NH2 - lipovitellin I - phosvitin (or phosvette II - phosvette I) - lipovitellin II - COOH.
Resumo:
In vertebrates, genome size has been shown to correlate with nuclear and cell sizes, and influences phenotypic features, such as brain complexity. In three different anuran families, advertisement calls of polyploids exhibit longer notes and intervals than diploids, and difference in cellular dimensions have been hypothesized to cause these modifications. We investigated this phenomenon in green toads (Bufo viridis subgroup) of three ploidy levels, in a different call type (release calls) that may evolve independently from advertisement calls, examining 1205 calls, from ten species, subspecies, and hybrid forms. Significant differences between pulse rates of six diploid and four polyploid (3n, 4n) green toad forms across a range of temperatures from 7 to 27 °C were found. Laboratory data supported differences in pulse rates of triploids vs. tetraploids, but failed to reach significance when including field recordings. This study supports the idea that genome size, irrespective of call type, phylogenetic context, and geographical background, might affect call properties in anurans and suggests a common principle governing this relationship. The nuclear-cell size ratio, affected by genome size, seems the most plausible explanation. However, we cannot rule out hypotheses under which call-influencing genes from an unexamined diploid ancestral species might also affect call properties in the hybrid-origin polyploids.
Resumo:
The hepatitis A virus (HAV) HAF-203 strain was isolated from an acute case of HAV infection. The primary isolation of HAF-203 in Brazil and its adaptation to the FRhK-4 cell lineage allowed the production of large amounts of viral particles enabling molecular characterization of the first HAV isolate in Brazil. The aim of our study was to determine the nucleotide sequence of the HAF-203 strain genome, compare it to other HAV genomes and highlight its genetic variability. The complete nucleotide sequence of the HAF-203 strain (7472 nucleotides) was compared to those obtained earlier by others for other HAV isolates. These analyses revealed 19 HAF-specific nucleotide sequence differences with 10 amino acid substitutions. Most of the non-conservative changes were located at VP1, 2C, and 3D genes, but the 3B region was the most variable. The availability of HAF-203 complementary DNA was useful for the production of the recombinant VP1 protein, which is a major determinant of viral infectivity. This recombinant protein was shown by enzyme-linked immunoassay and blotting, to be immunogenic and resemble the native protein, therefore suggesting its value as a reagent for incorporation into diagnostic tests.
Resumo:
The horizontal transfer of Trypanosoma cruzi mitochondrial minicircle DNA to the genomes of naturally infected humans may play an important role in the pathogenesis of Chagas disease. Minicircle integrations within LINE-1 elements create the potential for foreign DNA mobility within the host genome via the machinery associated with this retrotransposon. Here we document integration of minicircle DNA fragments in clonal human macrophage cell lines and their mobilization over time. The movement of an integration event in a clonal transfected cell line was tracked at three months and three years post-infection. The minicircle sequence integrated into a LINE-1 retrotransposon; one such foreign fragment subsequently relocated to another genomic location in association with associated LINE-1 elements. The p15 locus was altered at three years as a direct effect of minicircle/LINE-1 acquisition, resulting in elimination of p15 mRNA. Here we show for the first time a molecular pathology stemming from mobilization of a kDNA/LINE-1 mutation. These genomic changes and detected transcript variations are consistent with our hypothesis that minicircle integration is a causal component of parasite-independent, autoimmune-driven lesions seen in the heart and other target tissues associated with Chagas disease.
Resumo:
Genomic islands, large potentially mobile regions of bacterial chromosomes, are a major contributor to bacteria evolution. Here, we investigated the fitness cost and phenotypic differences between the bacterium Pseudomonas aeruginosa PAO1 and a derivative carrying one integrated copy of the clc element, a 103-kb genomic island [and integrative and conjugative element (ICE)] originating in Pseudomonas sp. strain B13 and a close relative of genomic islands found in clinical and environmental isolates of P. aeruginosa. By using a combination of whole genome transcriptome profiling, phenotypic arrays, competition experiments, and biofilm formation studies, only few differences became apparent, such as reduced biofilm growth and fourfold stationary phase repression of genes involved in acetoin metabolism in PAO1 containing the clc element. In contrast, PAO1 carrying the clc element acquired the capacity to grow on 3-chlorobenzoate and 2-aminophenol as sole carbon and energy substrates. No fitness loss >1% was detectable in competition experiments between PAO1 and PAO1 carrying the clc element. The genes from the clc element were not silent in PAO1, and excision was observed, although transfer of clc from PAO1 to other recipient bacteria was reduced by two orders of magnitude. Our results indicate that newly acquired mobile DNA not necessarily invoke an important fitness cost on their host. Absence of immediate detriment to the host may have contributed to the wide distribution of genomic islands like clc in bacterial genomes