964 resultados para Comparative genomics
Resumo:
The North Atlantic intertidal community provides a rich set of organismal and environmental material for the study of ecological genetics. Clearly defined environmental gradients exist at multiple spatial scales: there are broad latitudinal trends in temperature, meso-scale changes in salinity along estuaries, and smaller scale gradients in desiccation and temperature spanning the intertidal range. The geology and geography of the American and European coasts provide natural replication of these gradients, allowing for population genetic analyses of parallel adaptation to environmental stress and heterogeneity. Statistical methods have been developed that provide genomic neutrality tests of population differentiation and aid in the process of candidate gene identification. In this paper, we review studies of marine organisms that illustrate associations between an environmental gradient and specific genetic markers. Such highly differentiated markers become candidate genes for adaptation to the environmental factors in question, but the functional significance of genetic variants must be comprehensively evaluated. We present a set of predictions about locus-specific selection across latitudinal, estuarine, and intertidal gradients that are likely to exist in the North Atlantic. We further present new data and analyses that support and contradict these simple selection models. Some taxa show pronounced clinal variation at certain loci against a background of mild clinal variation at many loci. These cases illustrate the procedures necessary for distinguishing selection driven by internal genomic vs. external environmental factors. We suggest that the North Atlantic intertidal community provides a model system for identifying genes that matter in ecology due to the clarity of the environmental stresses and an extensive experimental literature on ecological function. While these organisms are typically poor genetic and genomic models, advances in comparative genomics have provided access to molecular tools that can now be applied to taxa with well-defined ecologies. As many of the organisms we discuss have tight physiological limits driven by climatic factors, this synthesis of molecular population genetics with marine ecology could provide a sensitive means of assessing evolutionary responses to climate change.
Resumo:
The evolution of calcified tissues is a defining feature in vertebrate evolution. Investigating the evolution of proteins involved in tissue calcification should help elucidate how calcified tissues have evolved. The purpose of this study was to collect and compare sequences of matrix and bone γ-carboxyglutamic acid proteins (MGP and BGP, respectively) to identify common features and determine the evolutionary relationship between MGP and BGP. Thirteen cDNAs and genes were cloned using standard methods or reconstructed through the use of comparative genomics and data mining. These sequences were compared with available annotated sequences (a total of 48 complete or nearly complete sequences, 28 BGPs and 20 MGPs) have been identified across 32 different species (representing most classes of vertebrates), and evolutionarily conserved features in both MGP and BGP were analyzed using bioinformatic tools and the Tree-Puzzle software. We propose that: 1) MGP and BGP genes originated from two genome duplications that occurred around 500 and 400 million years ago before jawless and jawed fish evolved, respectively; 2) MGP appeared first concomitantly with the emergence of cartilaginous structures, and BGP appeared thereafter along with bony structures; and 3) BGP derives from MGP. We also propose a highly specific pattern definition for the Gla domain of BGP and MGP. Previous Section Next Section BGP1 (bone Gla protein or osteocalcin) and MGP (matrix Gla protein) belong to the growing family of vitamin K-dependent (VKD) proteins, the members of which are involved in a broad range of biological functions such as skeletogenesis and bone maintenance (BGP and MGP), hemostasis (prothrombin, clotting factors VII, IX, and X, and proteins C, S, and Z), growth control (gas6), and potentially signal transduction (proline-rich Gla proteins 1 and 2). VKD proteins are characterized by the presence of several Gla residues resulting from the post-translational vitamin K-dependent γ-carboxylation of specific glutamates, through which they can bind to calcium-containing mineral such as hydroxyapatite. To date, VKD proteins have only been clearly identified in vertebrates (1) although the presence of a γ-glutamyl carboxylase has been reported in the fruit fly Drosophila melanogaster (2) and in marine snails belonging to the genus Conus (3). Gla residues have also been found in neuropeptides from Conus venoms (4), suggesting a wider prevalence of γ-carboxylation.
Resumo:
Fusobacterium necrophorum is a causative agent of persistent sore throat syndrome, tonsillar abscesses and Lemierre’s syndrome (LS) in humans. LS is characterised by thrombophlebitis of the jugular vein and bacteraemia. It is a Gram-negative, anaerobic bacterium which to date has no available reference genome. Draft genomes suggest it to be a single circular chromosome of approximately 2.2Mb. A reference strain of each of the two F. necrophorum subspecies and a clinical isolate from a LS patient were sequenced on a Roche 454 GS-FLX+. Sequence data was assembled using Roche GS Assembler and the resulting contigs annotated using xBASE, Pfam and BLAST. The annotation data was mined for gene products associated with virulence revealing a leukotoxin, haemolysin, filamentous haemagglutinnin, adhesin, hemin receptor, phage genes, CRISPR-associated proteins, ecotin and a putative type V secretion system. Data will be presented on comparative genomics of the three strains, with a focus on putative virulence genes. Tools such as Artemis Comparison Tool and ClustalO were used for sequence alignments and PhyML was used to generate phylogenetic trees. Conserved motifs associated with virulence were also located. Understanding variations at the genomic level may help to explain the increased virulence of some F. necrophorum strains.
Resumo:
Genome sequence varies in numerous ways among individuals although the gross architecture is fixed for all humans. Retrotransposons create one of the most abundant structural variants in the human genome and are divided in many families, with certain members in some families, e.g., L1, Alu, SVA, and HERV-K, remaining active for transposition. Along with other types of genomic variants, retrotransponson-derived variants contribute to the whole spectrum of genome variants in humans. With the advancement of sequencing techniques, many human genomes are being sequenced at the individual level, fueling the comparative research on these variants among individuals. In this thesis, the evolution and functional impact of structural variations is examined primarily focusing on retrotransposons in the context of human evolution. The thesis comprises of three different studies on the topics that are presented in three data chapters. First, the recent evolution of all human specific AluYb members, representing the second most active subfamily of Alus, was tracked to identify their source/master copy using a novel approach. All human-specific AluYb elements from the reference genome were extracted, aligned with one another to construct clusters of similar copies and each cluster was analyzed to generate the evolutionary relationship between the members of the cluster. The approach resulted in identification of one major driver copy of all human specific Yb8 and the source copy of the Yb9 lineage. Three new subfamilies within the AluYb family – Yb8a1, Yb10 and Yb11 were also identified, with Yb11 being the youngest and most polymorphic. Second, an attempt to construct a relation between transposable elements (TEs) and tandem repeats (TRs) was made at a genome-wide scale for the first time. Upon sequence comparison, positional cross-checking and other relevant analyses, it was observed that over 20% of all TRs are derived from TEs. This result established the first connection between these two types of repetitive elements, and extends our appreciation for the impact of TEs on genomes. Furthermore, only 6% of these TE-derived TRs follow the already postulated initiation and expansion mechanisms, suggesting that the others are likely to follow a yet-unidentified mechanism. Third, by taking a combination of multiple computational approaches involving all types of genetic variations published so far including transposable elements, the first whole genome sequence of the most recent common ancestor of all modern human populations that diverged into different populations around 125,000-100,000 years ago was constructed. The study shows that the current reference genome sequence is 8.89 million base pairs larger than our common ancestor’s genome, contributed by a whole spectrum of genetic mechanisms. The use of this ancestral reference genome to facilitate the analysis of personal genomes was demonstrated using an example genome and more insightful recent evolutionary analyses involving the Neanderthal genome. The three data chapters presented in this thesis conclude that the tandem repeats and transposable elements are not two entirely distinctly isolated elements as over 20% TRs are actually derived from TEs. Certain subfamilies of TEs themselves are still evolving with the generation of newer subfamilies. The evolutionary analyses of all TEs along with other genomic variants helped to construct the genome sequence of the most recent common ancestor to all modern human populations which provides a better alternative to human reference genome and can be a useful resource for the study of personal genomics, population genetics, human and primate evolution.
Resumo:
Une réconciliation entre un arbre de gènes et un arbre d’espèces décrit une histoire d’évolution des gènes homologues en termes de duplications et pertes de gènes. Pour inférer une réconciliation pour un arbre de gènes et un arbre d’espèces, la parcimonie est généralement utilisée selon le nombre de duplications et/ou de pertes. Les modèles de réconciliation sont basés sur des critères probabilistes ou combinatoires. Le premier article définit un modèle combinatoire simple et général où les duplications et les pertes sont clairement identifiées et la réconciliation parcimonieuse n’est pas la seule considérée. Une architecture de toutes les réconciliations est définie et des algorithmes efficaces (soit de dénombrement, de génération aléatoire et d’exploration) sont développés pour étudier les propriétés combinatoires de l’espace de toutes les réconciliations ou seulement les plus parcimonieuses. Basée sur le processus classique nommé naissance-et-mort, un algorithme qui calcule la vraisemblance d’une réconciliation a récemment été proposé. Le deuxième article utilise cet algorithme avec les outils combinatoires décrits ci-haut pour calculer efficacement (soit approximativement ou exactement) les probabilités postérieures des réconciliations localisées dans le sous-espace considéré. Basé sur des taux réalistes (selon un modèle probabiliste) de duplication et de perte et sur des données réelles/simulées de familles de champignons, nos résultats suggèrent que la masse probabiliste de toute l’espace des réconciliations est principalement localisée autour des réconciliations parcimonieuses. Dans un contexte d’approximation de la probabilité d’une réconciliation, notre approche est une alternative intéressante face aux méthodes MCMC et peut être meilleure qu’une approche sophistiquée, efficace et exacte pour calculer la probabilité d’une réconciliation donnée. Le problème nommé Gene Tree Parsimony (GTP) est d’inférer un arbre d’espèces qui minimise le nombre de duplications et/ou de pertes pour un ensemble d’arbres de gènes. Basé sur une approche qui explore tout l’espace des arbres d’espèces pour les génomes considérés et un calcul efficace des coûts de réconciliation, le troisième article décrit un algorithme de Branch-and-Bound pour résoudre de façon exacte le problème GTP. Lorsque le nombre de taxa est trop grand, notre algorithme peut facilement considérer des relations prédéfinies entre ensembles de taxa. Nous avons testé notre algorithme sur des familles de gènes de 29 eucaryotes.
Resumo:
La phagocytose est un processus cellulaire par lequel de larges particules sont internalisées dans une vésicule, le phagosome. Lorsque formé, le phagosome acquiert ses propriétés fonctionnelles à travers un processus complexe de maturation nommé la biogénèse du phagolysosome. Cette voie implique une série d’interactions rapides avec les organelles de l’appareil endocytaire permettant la transformation graduelle du phagosome nouvellement formé en phagolysosome à partir duquel la dégradation protéolytique s’effectue. Chez l’amibe Dictyostelium discoideum, la phagocytose est employée pour ingérer les bactéries de son environnement afin de se nourrir alors que les organismes multicellulaires utilisent la phagocytose dans un but immunitaire, où des cellules spécialisées nommées phagocytes internalisent, tuent et dégradent les pathogènes envahissant de l’organisme et constitue la base de l’immunité innée. Chez les vertébrés à mâchoire cependant, la transformation des mécanismes moléculaires du phagosome en une organelle perfectionnée pour l’apprêtement et la présentation de peptides antigéniques place cette organelle au centre de l’immunité innée et de l’immunité acquise. Malgré le rôle crucial auquel participe cette organelle dans la réponse immunitaire, il existe peu de détails sur la composition protéique et l’organisation fonctionnelles du phagosome. Afin d’approfondir notre compréhension des divers aspects qui relient l’immunité innée et l’immunité acquise, il devient essentiel d’élargir nos connaissances sur les fonctions moléculaire qui sont recrutées au phagosome. Le profilage par protéomique à haut débit de phagosomes isolés fut extrêmement utile dans la détermination de la composition moléculaire de cette organelle. Des études provenant de notre laboratoire ont révélé les premières listes protéiques identifiées à partir de phagosomes murins sans toutefois déterminer le ou les rôle(s) de ces protéines lors du processus de la phagocytose (Brunet et al, 2003; Garin et al, 2001). Au cours de la première étude de cette thèse (Stuart et al, 2007), nous avons entrepris la caractérisation fonctionnelle du protéome entier du phagosome de la drosophile en combinant diverses techniques d’analyses à haut débit (protéomique, réseaux d’intéractions protéique et ARN interférent). En utilisant cette stratégie, nous avons identifié 617 protéines phagosomales par spectrométrie de masse à partir desquelles nous avons accru cette liste en construisant des réseaux d’interactions protéine-protéine. La contribution de chaque protéine à l’internalisation de bactéries fut ensuite testée et validée par ARN interférent à haut débit et nous a amené à identifier un nouveau régulateur de la phagocytose, le complexe de l’exocyst. En appliquant ce modèle combinatoire de biologie systémique, nous démontrons la puissance et l’efficacité de cette approche dans l’étude de processus cellulaire complexe tout en créant un cadre à partir duquel il est possible d’approfondir nos connaissances sur les différents mécanismes de la phagocytose. Lors du 2e article de cette thèse (Boulais et al, 2010), nous avons entrepris la caractérisation moléculaire des étapes évolutives ayant contribué au remodelage des propriétés fonctionnelles de la phagocytose au cours de l’évolution. Pour ce faire, nous avons isolé des phagosomes à partir de trois organismes distants (l’amibe Dictyostelium discoideum, la mouche à fruit Drosophila melanogaster et la souris Mus musculus) qui utilisent la phagocytose à des fins différentes. En appliquant une approche protéomique à grande échelle pour identifier et comparer le protéome et phosphoprotéome des phagosomes de ces trois espèces, nous avons identifié un cœur protéique commun à partir duquel les fonctions immunitaires du phagosome se seraient développées. Au cours de ce développement fonctionnel, nos données indiquent que le protéome du phagosome fut largement remodelé lors de deux périodes de duplication de gènes coïncidant avec l’émergence de l’immunité innée et acquise. De plus, notre étude a aussi caractérisée en détail l’acquisition de nouvelles protéines ainsi que le remodelage significatif du phosphoprotéome du phagosome au niveau des constituants du cœur protéique ancien de cette organelle. Nous présentons donc la première étude approfondie des changements qui ont engendré la transformation d’un compartiment phagotrophe à une organelle entièrement apte pour la présentation antigénique.
Resumo:
Background: Plasmodium vivax malaria remains a major health problem in tropical and sub-tropical regions worldwide. Several rhoptry proteins which are important for interaction with and/or invasion of red blood cells, such as PfRONs, Pf92, Pf38, Pf12 and Pf34, have been described during the last few years and are being considered as potential anti-malarial vaccine candidates. This study describes the identification and characterization of the P. vivax rhoptry neck protein 1 (PvRON1) and examine its antigenicity in natural P. vivax infections. Methods: The PvRON1 encoding gene, which is homologous to that encoding the P. falciparum apical sushi protein (ASP) according to the plasmoDB database, was selected as our study target. The pvron1 gene transcription was evaluated by RT-PCR using RNA obtained from the P. vivax VCG-1 strain. Two peptides derived from the deduced P. vivax Sal-I PvRON1 sequence were synthesized and inoculated in rabbits for obtaining anti-PvRON1 antibodies which were used to confirm the protein expression in VCG-1 strain schizonts along with its association with detergent-resistant microdomains (DRMs) by Western blot, and its localization by immunofluorescence assays. The antigenicity of the PvRON1 protein was assessed using human sera from individuals previously exposed to P. vivax malaria by ELISA. Results: In the P. vivax VCG-1 strain, RON1 is a 764 amino acid-long protein. In silico analysis has revealed that PvRON1 shares essential characteristics with different antigens involved in invasion, such as the presence of a secretory signal, a GPI-anchor sequence and a putative sushi domain. The PvRON1 protein is expressed in parasite's schizont stage, localized in rhoptry necks and it is associated with DRMs. Recombinant protein recognition by human sera indicates that this antigen can trigger an immune response during a natural infection with P. vivax. Conclusions: This study shows the identification and characterization of the P. vivax rhoptry neck protein 1 in the VCG-1 strain. Taking into account that PvRON1 shares several important characteristics with other Plasmodium antigens that play a functional role during RBC invasion and, as shown here, it is antigenic, it could be considered as a good vaccine candidate. Further studies aimed at assessing its immunogenicity and protection-inducing ability in the Aotus monkey model are thus recommended.
Resumo:
The recently described cupin superfamily of proteins includes the germin and germinlike proteins, of which the cereal oxalate oxidase is the best characterized. This superfamily also includes seed storage proteins, in addition to several microbial enzymes and proteins with unknown function. All these proteins are characterized by the conservation of two central motifs, usually containing two or three histidine residues presumed to be involved with metal binding in the catalytic active site. The present study on the coding regions of Synechocystis PCC6803 identifies a previously unknown group of 12 related cupins, each containing the characteristic two-motif signature. This group comprises 11 single-domain proteins, ranging in length from 104 to 289 residues, and includes two phosphomannose isomerases and two epimerases involved in cell wall synthesis, a member of the pirin group of nuclear proteins, a possible transcriptional regulator, and a close relative-of a cytochrome c551 from Rhodococcus. Additionally, there is a duplicated, two-domain protein that has close similarity to an oxalate decarboxylase from the fungus Collybia velutipes and that is a putative progenitor of the storage proteins of land plants.
Resumo:
The cupin superfamily of proteins, named on the basis of a conserved β-barrel fold (‘cupa’ is the Latin term for a small barrel), was originally discovered using a conserved motif found within germin and germin-like proteins from higher plants. Previous analysis of cupins had identified some 18 different functional classes that range from single-domain bacterial enzymes such as isomerases and epimerases involved in the modification of cell wall carbohydrates, through to two-domain bicupins such as the desiccation-tolerant seed storage globulins, and multidomain transcription factors including one linked to the nodulation response in legumes. Recent advances in comparative genomics, and the resolution of many more 3-D structures have now revealed that the largest subset of the cupin superfamily is the 2-oxyglutarate-Fe2+ dependent dioxygenases. The substrates for this subclass of enzyme are many and varied and in total amount to probably 50–100 different biochemical reactions, including several involved in plant growth and development. Although the majority of enzymatic cupins contain iron as an active site metal, other members contain either copper, zinc, cobalt, nickel or manganese ions as a cofactor, with each cofactor allowing a different type of chemistry to occur within the conserved tertiary structure. This review discusses the range of structures and functions found in this most diverse of superfamilies.
Resumo:
BACKGROUND:The Salmonella enterica serovar Derby is frequently isolated from pigs and turkeys whereas serovar Mbandaka is frequently isolated from cattle, chickens and animal feed in the UK. Through comparative genomics, phenomics and mutant construction we previously suggested possible mechanistic reasons why these serovars demonstrate apparently distinct host ranges. Here, we investigate the genetic and phenotypic diversity of these two serovars in the UK. We produce a phylogenetic reconstruction and perform several biochemical assays on isolates of S. Derby and S. Mbandaka acquired from sites across the UK between the years 2000 and 2010. RESULTS:We show that UK isolates of S. Mbandaka comprise of one clonal lineage which is adapted to proficient utilisation of metabolites found in soya beans under ambient conditions. We also show that this clonal lineage forms a biofilm at 25 °C, suggesting that this serovar maybe well adapted to survival ex vivo, growing in animal feed. Conversely, we show that S. Derby is made of two distinct lineages, L1 and L2. These lineages differ genotypically and phenotypically, being divided by the presence and absence of SPI-23 and the ability to more proficiently invade porcine jejunum derived cell line IPEC-J2. CONCLUSION:The results of this study lend support to the hypothesis that the differences in host ranges of S. Derby and S. Mbandaka are adaptations to pathogenesis, environmental persistence, as well as utilisation of metabolites abundant in their respective host environments.
Resumo:
Type XVIII collagen is a component of basement membranes, and expressed prominently in the eye, blood vessels, liver, and the central nervous system. Homozygous mutations in COL18A1 lead to Knobloch Syndrome, characterized by ocular defects and occipital encephalocele. However, relatively little has been described on the role of type XVIII collagen in development, and nothing is known about the regulation of its tissue-specific expression pattern. We have used zebrafish transgenesis to identify and characterize cis-regulatory sequences controlling expression of the human gene. Candidate enhancers were selected from non-coding sequence associated with COL18A1 based on sequence conservation among mammals. Although these displayed no overt conservation with orthologous zebrafish sequences, four regions nonetheless acted as tissue-specific transcriptional enhancers in the zebrafish embryo, and together recapitulated the major aspects of col18a1 expression. Additional post-hoc computational analysis on positive enhancer sequences revealed alignments between mammalian and teleost sequences, which we hypothesize predict the corresponding zebrafish enhancers; for one of these, we demonstrate functional overlap with the orthologous human enhancer sequence. Our results provide important insight into the biological function and regulation of COL18A1, and point to additional sequences that may contribute to complex diseases involving COL18A1. More generally, we show that combining functional data with targeted analyses for phylogenetic conservation can reveal conserved cis-regulatory elements in the large number of cases where computational alignment alone falls short. (C) 2009 Elsevier Inc. All rights reserved.
Resumo:
The biosynthesis of quinolinate, the de novo precursor of nicotinamide adenine dinucleotide (NAD), may be performed by two distinct pathways, namely, the bacterial aspartate (aspartate-to-quinolinate) and the eukaryotic kynurenine (tryptophan-to-quinolinate). Even though the separation into eukaryotic and bacterial routes is long established, recent genomic surveys have challenged this view, because certain bacterial species also carry the genes for the kynurenine pathway. In this work, both quinolinate biosynthetic pathways were investigated in the Bacteria clade and with special attention to Xanthomonadales and Bacteroidetes, from an evolutionary viewpoint. Genomic screening has revealed that a small number of bacterial species possess some of the genes for the kynurenine pathway, which is complete in the genus Xanthomonas and in the order Flavobacteriales, where the aspartate pathway is absent. The opposite pattern (presence of the aspartate pathway and absence of the kynurenine pathway) in close relatives (Xylella ssp. and the order Bacteroidales, respectively) points to the idea of a recent acquisition of the kynurenine pathway through lateral gene transfer in these bacterial groups. In fact, sequence similarity comparison and phylogenetic reconstruction both suggest that at least part of the genes of the kynurenine pathway in Xanthomonas and Flavobacteriales is shared by eukaryotes. These results reinforce the idea of the role that lateral gene transfer plays in the configuration of bacterial genomes, thereby providing alternative metabolic pathways, even with the replacement of primary and essential cell functions, as exemplified by NAD biosynthesis.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Corynebacterium pseudotuberculosis é uma bactéria Gram-positiva, intracelular facultativa, nãoesporulante, não-capsulada e sem mobilidade, contudo possui fímbria, e pode assumir formas cocóides e filamentosas (pleomórfica), além disto, apresenta crescimento ótimo à 37°C. Este patógeno apresenta dois biovares: ovis que geralmente acomete pequenos ruminantes, e causa a doença linfadenite caseosa, e biovar equi, mais comum em equinos, bovinos, camelídeos, e bubalinos causando a Linfangite ulcerativa. A infecção por esta bactéria pode levar a condenação das carcaças e redução de lã (em ovinos e caprinos), leite e carne destes animais, e consequentemente a perdas econômicas para a indústria agropecuária mundial. Atualmente, ainda não existe uma vacina eficaz para estas doenças. A fim de obter um maior entendimento biológico entre as espécies o presente trabalho tem como objetivo principal analisar, por meio da genômica comparativa a linhagem C. pseudotuberculosis 226 biotipo ovis isolada de um caprino na Califórnia com outras linhagens do biovarar ovis e equi. Na análise de sintenia entre as linhagens foi possível identificar que a linhagem 226 apresenta alta conservação da ordem gênica entre as linhagens do biótipo ovis. Através de análises filogenômicas foi possível identificar que as linhagens I19 e 267 apresentaram maior e menor proximidade filogenética com a linhagem 226. A linhagem 1/06-A foi a que apresentou maior proximidade filogenômica entre as linhagens do biovar equi, quando comparadas a linhagem 226. Foram preditas 8 ilhas de patogenicidade, estando presente na ilha 1 os genes relacionados a virulência de C. pseudotuberculosis mais bem descritos na literatura. Não houveram regiões novas relacionadas a genes de virulência entre nenhuma das linhagens. Foram identificados 248 genes ortólogos entre as linhagens I19, 267 e 226 e 282 genes ortólogos entre as linhagens 258,1/06-A e 226. Com base nesse estudo é possível inferir que as linhagens do biovar ovis possuem um repertório gênico pouco variado e que as linhagens do biovar equi apresentam uma quantidade menor de genes compartilhados com a linhagem 226, corroborando com a diversidade gênica entre os biovares.