911 resultados para genome organization
Resumo:
Although patterns of somatic alterations have been reported for tumor genomes, little is known on how they compare with alterations present in non-tumor genomes. A comparison of the two would be crucial to better characterize the genetic alterations driving tumorigenesis. We sequenced the genomes of a lymphoblastoid (HCC1954BL) and a breast tumor (HCC1954) cell line derived from the same patient and compared the somatic alterations present in both. The lymphoblastoid genome presents a comparable number and similar spectrum of nucleotide substitutions to that found in the tumor genome. However, a significant difference in the ratio of non-synonymous to synonymous substitutions was observed between both genomes (P = 0.031). Protein-protein interaction analysis revealed that mutations in the tumor genome preferentially affect hub-genes (P = 0.0017) and are co-selected to present synergistic functions (P < 0.0001). KEGG analysis showed that in the tumor genome most mutated genes were organized into signaling pathways related to tumorigenesis. No such organization or synergy was observed in the lymphoblastoid genome. Our results indicate that endogenous mutagens and replication errors can generate the overall number of mutations required to drive tumorigenesis and that it is the combination rather than the frequency of mutations that is crucial to complete tumorigenic transformation.
Resumo:
Schistosoma mansoni is responsible for the neglected tropical disease schistosomiasis that affects 210 million people in 76 countries. Here we present analysis of the 363 megabase nuclear genome of the blood fluke. It encodes at least 11,809 genes, with an unusual intron size distribution, and new families of micro-exon genes that undergo frequent alternative splicing. As the first sequenced flatworm, and a representative of the Lophotrochozoa, it offers insights into early events in the evolution of the animals, including the development of a body pattern with bilateral symmetry, and the development of tissues into organs. Our analysis has been informed by the need to find new drug targets. The deficits in lipid metabolism that make schistosomes dependent on the host are revealed, and the identification of membrane receptors, ion channels and more than 300 proteases provide new insights into the biology of the life cycle and new targets. Bioinformatics approaches have identified metabolic chokepoints, and a chemogenomic screen has pinpointed schistosome proteins for which existing drugs may be active. The information generated provides an invaluable resource for the research community to develop much needed new control tools for the treatment and eradication of this important and neglected disease.
Resumo:
To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.
Resumo:
The complete genome sequence of wild-type rabies virus (RABV) isolated from a wild Brazilian hoary fox (Dusicyon sp.), the BR-Pfx1 isolate, was determined and compared with fixed RABV strains. The genome structure and organization of the BR-Pfx1 isolate were composed of 11,924 nt and included the five standard genes of rhabdoviruses. Sequences of mRNA start and stop signals for transcription were highly conserved among all structural protein genes of the BR-Pfx1 isolate. All amino acid residues in the glycoprotein (G) gene associated with pathogenicity were retained in the BR-Pfx1 isolate, while unique amino acid substitutions were found in antigenic region I of the nucleoprotein gene and III of G. These results suggest that although the standard genome structure and organization of the RABV isolate are common between the BR-Pfx1 isolate and fixed RABV strains, the unique amino acid substitutions in functional sites of the BR-Pfx1 isolate may result in different biological characteristics from fixed RABV strains.
Resumo:
The complete arrangement of genes in the mitochondrial (mt) genome is known for 12 species of insects, and part of the gene arrangement in the mt genome is known for over 300 other species of insects. The arrangement of genes in the mt genome is very conserved in insects studied, since all of the protein-coding and rRNA genes and most of the tRNA genes are arranged in the same way. We sequenced the entire mt genome of the wallaby louse, Heterodoxus macropus, which is 14,670 bp long and has the 37 genes typical of animals and some noncoding regions. The largest noncoding region is 73 bp long (93% A+T), and the second largest is 47 bp long (92% AST). Both of these noncoding regions seem to be able to form stem-loop structures. The arrangement of genes in the mt genome of this louse is unlike that of any other animal studied. All tRNA genes have moved and/or inverted relative to the ancestral gene arrangement of insects, which is present in the fruit fly Drosophila yakuba. At least nine protein-coding genes (atp6, atp8, cox2, cob, nad1-nad3, nad5, and nad6) have moved; moreover, four of these genes (atp6, atp8, nad1, and nad3) have inverted. The large number of gene rearrangements in the mt genome of H. macropus is unprecedented for an arthropod.
Resumo:
To help understand the mechanisms of gene rearrangement in the mitochondrial (mt) genomes of hemipteroid insects, we sequenced the mt genome of the plague thrips, Thrips imaginis (Thysanoptera). This genome is circular, 15,407 by long, and has many unusual features, including (1) rRNA genes inverted and distant from one another, (2) an extra gene for tRNA-Ser, (3) a tRNA-Val lacking a D-arm, (4) two pseudo-tRNA genes, (5) duplicate control regions, and (6) translocations and/or inversions of 24 of the 37 genes. The mechanism of rRNA gene transcription in T. imaginis may be different from that of other arthropods since the two rRNA genes have inverted and are distant from one another. Further, the rRNA genes are not adjacent or even close to either of the two control regions. Tandem duplication and deletion is a plausible model for the evolution of duplicate control regions and for the gene translocations, but intramitochondrial recombination may account for the gene inversions in T. imaginis. All the 18 genes between control regions #1 and #2 have translocated and/or inverted, whereas only six of the 20 genes outside this region have translocated and/or inverted. Moreover, the extra tRNA gene and the two pseudo-tRNA genes are either in this region or immediately adjacent to one of the control regions. These observations suggest that tandem duplication and deletion may be facilitated by the duplicate control regions and may have occurred a number of times in the lineage leading to T. imaginis. T. imaginis shares two novel gene boundaries with a lepidopsocid species from another order of hemipteroid insects, the Psocoptera. The evidence available suggests that these shared gene boundaries evolved by convergence and thus are not informative for the interordinal phylogeny of hemipteroid insects. We discuss the potential of hemipteroid insects as a model system for studies of the evolution of animal rut genomes and outline some fundamental questions that may be addressed with this system.
Resumo:
The mutualistic symbiosis involving Glomeromycota, a distinctive phylum of early diverging Fungi, is widely hypothesized to have promoted the evolution of land plants during the middle Paleozoic. These arbuscular mycorrhizal fungi (AMF) perform vital functions in the phosphorus cycle that are fundamental to sustainable crop plant productivity. The unusual biological features of AMF have long fascinated evolutionary biologists. The coenocytic hyphae host a community of hundreds of nuclei and reproduce clonally through large multinucleated spores. It has been suggested that the AMF maintain a stable assemblage of several different genomes during the life cycle, but this genomic organization has been questioned. Here we introduce the 153-Mb haploid genome of Rhizophagus irregularis and its repertoire of 28,232 genes. The observed low level of genome polymorphism (0.43 SNP per kb) is not consistent with the occurrence of multiple, highly diverged genomes. The expansion of mating-related genes suggests the existence of cryptic sex-related processes. A comparison of gene categories confirms that R. irregularis is close to the Mucoromycotina. The AMF obligate biotrophy is not explained by genome erosion or any related loss of metabolic complexity in central metabolism, but is marked by a lack of genes encoding plant cell wall-degrading enzymes and of genes involved in toxin and thiamine synthesis. A battery of mycorrhiza-induced secreted proteins is expressed in symbiotic tissues. The present comprehensive repertoire of R. irregularis genes provides a basis for future research on symbiosis-related mechanisms in Glomeromycota.
Resumo:
Ants are powerful model systems for the study of cooperation and sociality. In this review, we discuss how recent advances in ant genomics have contributed to our understanding of the evolution and organization of insect societies at the molecular level.
Resumo:
Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) QC at the study file level, the meta-level across studies and the meta-analysis output level. Real-world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for the use of a powerful and flexible software package called EasyQC. Precise timings will be greatly influenced by consortium size. For consortia of comparable size to the GIANT Consortium, this protocol takes a minimum of about 10 months to complete.
Resumo:
Pendant ma thèse de doctorat, j'ai utilisé des espèces modèles, comme la souris et le poisson-zèbre, pour étudier les facteurs qui affectent l'évolution des gènes et leur expression. Plus précisément, j'ai montré que l'anatomie et le développement sont des facteurs clés à prendre en compte, car ils influencent la vitesse d'évolution de la séquence des gènes, l'impact sur eux de mutations (i.e. la délétion du gène est-elle létale ?), et leur tendance à se dupliquer. Où et quand il est exprimé impose à un gène certaines contraintes ou au contraire lui donne des opportunités d'évoluer. J'ai pu comparer ces tendances aux modèles classiques d'évolution de la morphologie, que l'on pensait auparavant refléter directement les contraintes s'appliquant sur le génome. Nous avons montré que les contraintes entre ces deux niveaux d'organisation ne peuvent pas être transférées simplement : il n'y a pas de lien direct entre la conservation du génotype et celle de phénotypes comme la morphologie. Ce travail a été possible grâce au développement d'outils bioinformatiques. Notamment, j'ai travaillé sur le développement de la base de données Bgee, qui a pour but de comparer l'expression des gènes entre différentes espèces de manière automatique et à large échelle. Cela implique une formalisation de l'anatomie, du développement et de concepts liés à l'homologie grâce à l'utilisation d'ontologies. Une intégration cohérente de données d'expression hétérogènes (puces à ADN, marqueurs de séquence exprimée, hybridations in situ) a aussi été nécessaire. Cette base de données est mise à jour régulièrement et disponible librement. Elle devrait contribuer à étendre les possibilités de comparaison de l'expression des gènes entre espèces pour des études d'évo-devo (évolution du développement) et de génomique. During my PhD, I used model species of vertebrates, such as mouse and zebrafish, to study factors affecting the evolution of genes and their expression. More precisely I have shown that anatomy and development are key factors to take into account, influencing the rate of gene sequence evolution, the impact of mutations (i.e. is the deletion of a gene lethal?), and the propensity of a gene to duplicate. Where and when genes are expressed imposes constraints, or on the contrary leaves them some opportunity to evolve. We analyzed these patterns in relation to classical models of morphological evolution in vertebrates, which were previously thought to directly reflect constraints on the genomes. We showed that the patterns of evolution at these two levels of organization do not translate smoothly: there is no direct link between the conservation of genotype and phenotypes such as morphology. This work was made possible by the development of bioinformatics tools. Notably, I worked on the development of the database Bgee, which aims at comparing gene expression between different species in an automated and large-scale way. This involves the formalization of anatomy, development, and concepts related to homology, through the use of ontologies. A coherent integration of heterogeneous expression data (microarray, expressed sequence tags, in situ hybridizations) is also required. This database is regularly updated and freely available. It should contribute to extend the possibilities for comparison of gene expression between species in evo-devo and genomics studies.
Resumo:
Lancelets ('amphioxus') are the modern survivors of an ancient chordate lineage, with a fossil record dating back to the Cambrian period. Here we describe the structure and gene content of the highly polymorphic approximately 520-megabase genome of the Florida lancelet Branchiostoma floridae, and analyse it in the context of chordate evolution. Whole-genome comparisons illuminate the murky relationships among the three chordate groups (tunicates, lancelets and vertebrates), and allow not only reconstruction of the gene complement of the last common chordate ancestor but also partial reconstruction of its genomic organization, as well as a description of two genome-wide duplications and subsequent reorganizations in the vertebrate lineage. These genome-scale events shaped the vertebrate genome and provided additional genetic variation for exploitation during vertebrate evolution.
Resumo:
Islet-brain 1 (IB1), a regulator of the pancreatic beta-cell function in the rat, is homologous to JIP-1, a murine inhibitor of c-Jun amino-terminal kinase (JNK). Whether IB1 and JIP-1 are present in humans was not known. We report the sequence of the 2133-bp human IB1 cDNA, the expression, structure, and fine-mapping of the human IB1 gene, and the characterization of an IB1 pseudogene. Human IB1 is 94% identical to rat IB1. The tissue-specific expression of IB1 in human is similar to that observed in rodent. The IB1 gene contains 12 exons and maps to chromosome 11 (11p11.2-p12), a region that is deleted in DEFECT-11 syndrome. Apart from an IB1 pseudogene on chromosome 17 (17q21), no additional IB1-related gene was found in the human genome. Our data indicate that the sequence and expression pattern of IB1 are highly conserved between rodent and human and provide the necessary tools to investigate whether IB1 is involved in human diseases.
Resumo:
Background: Integrative and conjugative elements (ICE) form a diverse group of DNA elements that are integrated in the chromosome of the bacterial host, but can occasionally excise and horizontally transfer to a new host cell. ICE come in different families, typically with a conserved core for functions controlling the element's behavior and a variable region providing auxiliary functions to the host. The ICEclc element of Pseudomonas knackmussii strain B13 is representative for a large family of chromosomal islands detected by genome sequencing approaches. It provides the host with the capacity to degrade chloroaromatics and 2-aminophenol. Results: Here we study the transcriptional organization of the ICEclc core region. By northern hybridizations, reverse-transcriptase polymerase chain reaction (RT-PCR) and Rapid Amplification of cDNA Ends (5'-RACE) fifteen transcripts were mapped in the core region. The occurrence and location of those transcripts were further confirmed by hybridizing labeled cDNA to a semi-tiling micro-array probing both strands of the ICEclc core region. Dot blot and semi-tiling array hybridizations demonstrated most of the core transcripts to be upregulated during stationary phase on 3-chlorobenzoate, but not on succinate or glucose. Conclusions: The transcription analysis of the ICEclc core region provides detailed insights in the mode of regulatory organization and will help to further understand the complex mode of behavior of this class of mobile elements. We conclude that ICEclc core transcription is concerted at a global level, more reminiscent of a phage program than of plasmid conjugation.
Resumo:
Selective pressures related to gene function and chromosomal architecture are acting on genome sequences and can be revealed, for instance, by appropriate genometric methods. Cumulative nucleotide skew analyses, i.e., GC, TA, and ORF orientation skews, predict the location of the origin of DNA replication for 88 out of 100 completely sequenced bacterial chromosomes. These methods appear fully reliable for proteobacteria, Gram-positives, and spirochetes as well as for euryarchaeotes. Based on this genome architecture information, coorientation analyses reveal that in prokaryotes, ribosomal RNA (rRNA) genes encoding the small and large ribosomal subunits are all transcribed in the same direction as DNA replication; that is, they are located along the leading strand. This result offers a simple and reliable method for circumscribing the region containing the origin of the DNA replication and reveals a strong selective pressure acting on the orientation of rRNA genes similar to the weaker one acting on the orientation of ORFs. Rate of coorientation of transfer RNA (tRNA) genes with DNA replication appears to be taxon-specific. Analyzing nucleotide biases such as GC and TA skews of genes and plotting one against the other reveals a taxonomic clusterization of species. All ribosomal RNA genes are enriched in Gs and depleted in Cs, the only so far known exception being the rRNA genes of deuterostomian mitochondria. However, this exception can be explained by the fact that in the chromosome of the human mitochondrion, the model of the deuterostomian organelle genome, DNA replication, and rRNA transcription proceed in opposite directions. A general rule is deduced from prokaryotic and mitochondrial genomes: ribosomal RNA genes that are transcribed in the same direction as the DNA replication are enriched in Gs, and those transcribed in the opposite direction are depleted in Gs.
Resumo:
In natural conditions, basidiomycete ectomycorrhizal fungi such as Laccaria bicolor are typically in the dikaryotic state when forming symbioses with trees, meaning that two genetically different individuals have to fuse or 'mate'. Nevertheless, nothing is known about the molecular mechanisms of mating in these ecologically important fungi. Here, advantage was taken of the first sequenced genome of the ectomycorrhizal fungus, Laccaria bicolor, to determine the genes that govern the establishment of cell-type identity and orchestrate mating. The L. bicolor mating type loci were identified through genomic screening. The evolutionary history of the genomic regions that contained them was determined by genome-wide comparison of L. bicolor sequences with those of known tetrapolar and bipolar basidiomycete species, and by phylogenetic reconstruction of gene family history. It is shown that the genes of the two mating type loci, A and B, are conserved across the Agaricales, but they are contained in regions of the genome with different evolutionary histories. The A locus is in a region where the gene order is under strong selection across the Agaricales. By contrast, the B locus is in a region where the gene order is likely under a low selection pressure but where gene duplication, translocation and transposon insertion are frequent.