922 results for DNA-microarray data
Abstract:
Most cases of ACTH-independent bilateral macronodular adrenal hyperplasia (AIMAH) with Cushing's syndrome are due to aberrant expression of various hormone receptors in the adrenal cortex. The genes responsible for familial AIMAH with aberrant receptors have not been identified. The aim of this project is to identify them. A linkage study, aimed at identifying the region or regions of the genome containing the gene or genes that may be involved in familial AIMAH, was carried out using DNA from members of a family (10 affected and 7 unaffected) originating from Quebec, affected by AIMAH and Cushing's syndrome and characterized by expression of the β-adrenergic and V1-vasopressin receptors. Various chromosomal regions distinguishing affected from unaffected family members were highlighted. A total of 707,453 SNPs were obtained and, after statistical analysis, 159 significant SNPs that may be associated with the phenotype were identified between the two groups. Most of these SNPs were found to lie in chromosomal regions 1q32.1 and 16q12.2. A transcriptome study was also carried out using DNA from the tumours of two patients from the family, as well as DNA from other adrenal tumours. Statistical analyses identified 15 genes likely to be related to the disease (11 overexpressed and 4 underexpressed). Using the data from these two studies, we targeted six genes on chromosome 1 (ATP2B4, PPP1R12B, SOX13, CACNA1S, ADORA1 and PHLDA3), one on chromosome 16 (CHD9) and one on chromosome 13 (SPRY2) to search for mutations. Sequencing revealed no nucleotide changes in the PPP1R12B and SOX13 genes.
In the ATP2B4, CACNA1S, ADORA1 and PHLDA3 genes, sequencing revealed nucleotide changes that either caused no amino acid change or caused an amino acid change judged "not relevant" because it did not distinguish unaffected from affected subjects. For CHD9 and SPRY2, sequencing identified nucleotide changes leading to amino acid changes more frequently in affected subjects than in unaffected subjects. In conclusion, our work identified, through linkage analysis and transcriptome analysis, candidate genes that could be responsible for this disease. Sequencing of these candidate genes revealed mutations in CHD9 and SPRY2. These results are promising, since these two genes produce proteins involved in chromatin remodelling and in the regulation of protein kinase signalling. Phenotyping and genotyping of affected patients must be continued for verification.
Abstract:
Genetic studies, such as linkage or association studies, have provided greater knowledge of the etiology of several diseases affecting human populations. Even though some ten thousand genetic studies have been carried out on hundreds of diseases or other traits, a large part of their heritability remains unexplained. Over the past ten years or so, several breakthroughs have been made in genomics. For example, the use of high-density comparative genomic hybridization microarrays demonstrated the large-scale existence of copy number variations and polymorphisms. These can now be detected using DNA microarrays or high-throughput sequencing. In addition, recent studies using high-throughput sequencing have shown that most of the variations present in an individual's exome are rare or even unique to that individual. This led to the design of a new DNA microarray that can quickly and inexpensively genotype several thousand rare variations for a large set of individuals at once. In this context, the general objective of this thesis is the development of new methodologies and new high-performance bioinformatics tools for detecting, to high quality standards, copy number variations and rare nucleotide variations in genetic studies. In the long term, these advances will make it possible to explain a greater part of the missing heritability of complex traits, thereby advancing knowledge of their etiology. An algorithm for partitioning copy number polymorphisms was therefore designed, making it possible to use these structural variations in genetic linkage studies on family data.
Next, an exploratory study characterized the various problems associated with genetic studies using rare copy number variations in unrelated individuals. This study was carried out in collaboration with the Wellcome Trust Centre for Human Genetics at the University of Oxford. Then, the performance of genotyping algorithms was compared when used with a new DNA microarray containing a majority of rare markers. Finally, a bioinformatics tool for filtering genetic data efficiently and quickly was implemented. This tool generates higher-quality data with better reproducibility of results, while reducing the chances of obtaining a false association.
Abstract:
Methicillin-resistant Staphylococcus aureus (MRSA) is a major public health issue. It is responsible for a wide variety of infections. Livestock-associated MRSA (LA-MRSA) are MRSA that originate from production animals such as pigs or poultry. They pose a risk of transmission to humans through the food chain. LA-MRSA can form biofilm, which increases their tolerance to environmental stresses. Biofilm is partially regulated by the Agr system. No data exist on LA-MRSA of avian origin in Quebec. The objectives of this project were (i) to determine the prevalence of these MRSA in chicken meat and broiler chickens in the province of Quebec and (ii) to characterize the isolates recovered. Samples were collected in 43 grocery stores (309 chicken thighs and drumsticks) and in two slaughterhouses (nasal and fecal samples from 200 chickens) in the Montérégie region. MRSA prevalence was estimated at 1.29% (95% CI: 0.35-3.28) in meat and 0% in birds. The isolates tested were resistant to beta-lactams (n=15), tetracycline (n=10), oxytetracycline (n=10), spectinomycin (n=10) and tobramycin (n=1). Typing revealed two different clones (ST398-V, n=10; and ST8-IVa 'USA300', n=5). Antibiotic resistance genes (blaZ, blaR, blaI, erm(A), lnu(A), aad(D), fosB, tet(K), tet(L) and spc) were also detected, along with several genes coding for immune system evasion (IEC), toxin production or biofilm production. Strong biofilm production was observed for most isolates (n=11), with the exception of some ST398 isolates. The expression level of the Agr system showed no particular difference among the MRSA tested. In conclusion, our data indicate a low prevalence of MRSA in poultry and chicken meat.
The isolates were categorized into two genotypes, one carrying more antibiotic resistance genes (ST398) and the other possessing more virulence genes (ST8).
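The prevalence figure in this abstract (1.29% with a 95% interval of 0.35-3.28 for 309 meat samples) can be illustrated with a short calculation. The sketch below assumes that 4 of the 309 samples were positive, since 4/309 is about 1.29%; the abstract does not state the count. It also assumes an exact (Clopper-Pearson) binomial interval, which is one common choice and may not be the method the authors used.

```python
from math import comb


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def clopper_pearson(x, n, alpha=0.05):
    """Exact two-sided (1 - alpha) confidence interval for the proportion x/n."""

    def solve(f, target):
        # f decreases monotonically in p on [0, 1]; bisect for f(p) = target.
        lo, hi = 0.0, 1.0
        for _ in range(100):
            mid = (lo + hi) / 2
            if f(mid) > target:
                lo = mid
            else:
                hi = mid
        return (lo + hi) / 2

    # Lower bound: P(X >= x | p) = alpha/2, i.e. P(X <= x - 1 | p) = 1 - alpha/2.
    lower = 0.0 if x == 0 else solve(lambda p: binom_cdf(x - 1, n, p), 1 - alpha / 2)
    # Upper bound: P(X <= x | p) = alpha/2.
    upper = 1.0 if x == n else solve(lambda p: binom_cdf(x, n, p), alpha / 2)
    return lower, upper


# Assumed count: 4 positives out of 309 samples (inferred, not stated in the abstract).
positives, samples = 4, 309
prevalence = positives / samples
lower, upper = clopper_pearson(positives, samples)
print(f"{100 * prevalence:.2f}% (95% CI: {100 * lower:.2f}-{100 * upper:.2f})")
```

Under these assumptions the interval should land close to the reported bounds; a different interval method (for example Wilson or normal approximation) would give somewhat different limits.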
Abstract:
Microarray data analysis is a data mining tool used to extract meaningful information hidden in biological data. One of its major focuses is the reconstruction of gene regulatory networks, which may provide a broader understanding of the functioning of complex cellular systems. Since cancer is a genetic disease arising from abnormal gene function, identifying cancerous genes and the regulatory pathways they control will provide a better platform for understanding tumor formation and development. The major focus of this thesis is to understand the regulation of genes responsible for the development of cancer, particularly colorectal cancer, by analyzing microarray expression data. In this thesis, four computational algorithms, namely a fuzzy logic algorithm, a modified genetic algorithm, a dynamic neural fuzzy network and a Takagi-Sugeno-Kang-type recurrent neural fuzzy network, are used to extract cancer-specific gene regulatory networks from a plasma RNA dataset of colorectal cancer patients. Plasma RNA is highly attractive for cancer analysis since it requires collecting only a small amount of blood and can be obtained repeatedly at any time, allowing the analysis of disease progression and treatment response.
Abstract:
As technologies for the fabrication of high-quality microarrays advance rapidly, quantification of microarray data becomes a major task. Gridding is the first step in the analysis of microarray images, locating the subarrays and the individual spots within each subarray. For accurate gridding of high-density microarray images in the presence of contamination and background noise, precise calculation of parameters is essential. This paper presents an accurate, fully automatic gridding method for locating subarrays and individual spots using the intensity projection profile of the most suitable subimage. The method processes the image without any user intervention and, unlike many other commercial and academic packages, does not demand any input parameters. According to the results obtained, the accuracy of our algorithm is between 95% and 100% for microarray images with a coefficient of variation less than two. Experimental results show that the method is capable of gridding microarray images with irregular spots, varying surface intensity distribution and more than 50% contamination.
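The idea of locating spots from an intensity projection profile can be sketched on a toy image. This is an illustrative simplification, not the paper's algorithm: the function names, the mean-value threshold and the synthetic image are all assumptions, and the sketch does not cover subimage selection, noise or contamination handling.

```python
def projection_profile(image, axis):
    """Sum pixel intensities along one axis of a 2-D list-of-rows image.

    axis=0 sums down each column (one value per column);
    axis=1 sums across each row (one value per row).
    """
    if axis == 0:
        return [sum(col) for col in zip(*image)]
    return [sum(row) for row in image]


def spot_centres(profile):
    """Centres of the runs where the profile exceeds its mean (a crude threshold)."""
    threshold = sum(profile) / len(profile)
    centres, start = [], None
    for i, value in enumerate(profile + [threshold]):  # sentinel closes a final run
        if value > threshold and start is None:
            start = i
        elif value <= threshold and start is not None:
            centres.append((start + i - 1) / 2)
            start = None
    return centres


# Toy 6x9 image: two rows of three bright 2x2 spots on a dark background.
image = [[0] * 9 for _ in range(6)]
for r0 in (0, 3):
    for c0 in (0, 3, 6):
        for r in (r0, r0 + 1):
            for c in (c0, c0 + 1):
                image[r][c] = 10

column_centres = spot_centres(projection_profile(image, axis=0))
row_centres = spot_centres(projection_profile(image, axis=1))
print(column_centres, row_centres)
```

Crossing the column and row centres gives candidate grid positions. On real images a fixed mean threshold would likely be too fragile, which is presumably why the paper stresses precise parameter calculation.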
Abstract:
Phylogenetic hypotheses for the largely South African genus Pelargonium L'Hér. (Geraniaceae) were derived based on DNA sequence data from nuclear, chloroplast and mitochondrial encoded regions. The datasets were unequally represented and comprised cpDNA trnL-F sequences for 152 taxa, nrDNA ITS sequences for 55 taxa, and mtDNA nad1 b/c exons for 51 taxa. Phylogenetic hypotheses derived from the three separate datasets were congruent overall. A single hypothesis synthesising the information in the three datasets was constructed following a total evidence approach and implementing dataset-specific stepmatrices in order to correct for substitution biases. Pelargonium was found to consist of five main clades, some with contrasting evolutionary patterns with respect to biogeographic distributions, dispersal capacity, pollination biology and karyological diversification. The five main clades are structured in two (subgeneric) clades that correlate with chromosome size. One of these clades includes a "winter rainfall clade" containing more than 70% of all currently described Pelargonium species, all restricted to the South African Cape winter rainfall region. Apart from (woody) shrubs and small herbaceous rosette subshrubs, this clade comprises a large "xerophytic" clade including geophytes and stem and leaf succulents, harbouring in total almost half of the genus. This clade is considered to be the result of in situ proliferation, possibly in response to late-Miocene and Pliocene aridification events. Nested within it is a radiation comprising c. 80 species from the geophytic Pelargonium section Hoarea, all characterised by the possession of (a series of) tunicate tubers.
Abstract:
Background: Microarray based comparative genomic hybridisation (CGH) experiments have been used to study numerous biological problems including understanding genome plasticity in pathogenic bacteria. Typically such experiments produce large data sets that are difficult for biologists to handle. Although there are some programmes available for interpretation of bacterial transcriptomics data and CGH microarray data for looking at genetic stability in oncogenes, there are none specifically to understand the mosaic nature of bacterial genomes. Consequently a bottleneck still persists in accurate processing and mathematical analysis of these data. To address this shortfall we have produced a simple and robust CGH microarray data analysis process that may be automated in the future to understand bacterial genomic diversity. Results: The process involves five steps: cleaning, normalisation, estimating gene presence and absence or divergence, validation, and analysis of data from test strains against three reference strains simultaneously. Each stage of the process is described and we have compared a number of methods available for characterising bacterial genomic diversity and for calculating the cut-off between gene presence and absence or divergence, and shown that a simple dynamic approach using a kernel density estimator performed better than both established methods and a more sophisticated mixture modelling technique. We have also shown that current methods commonly used for CGH microarray analysis in tumour and cancer cell lines are not appropriate for analysing our data. Conclusion: After carrying out the analysis and validation for three sequenced Escherichia coli strains, CGH microarray data from 19 E. coli O157 pathogenic test strains were used to demonstrate the benefits of applying this simple and robust process to CGH microarray studies using bacterial genomes.
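One way a kernel density estimator can yield a presence/absence cut-off is to estimate the density of the log-ratios and take the minimum between its two main modes. The sketch below illustrates that general idea only; the abstract does not describe the authors' exact procedure, and the Gaussian kernel, the fixed bandwidth, the function names and the toy log-ratios are all assumptions.

```python
from math import exp, pi, sqrt


def gaussian_kde(data, bandwidth):
    """Return a function estimating the density of `data` with Gaussian kernels."""
    norm = len(data) * bandwidth * sqrt(2 * pi)

    def density(x):
        return sum(exp(-0.5 * ((x - d) / bandwidth) ** 2) for d in data) / norm

    return density


def kde_cutoff(data, bandwidth, grid_points=400):
    """Locate the density minimum between the two tallest local maxima."""
    density = gaussian_kde(data, bandwidth)
    lo, hi = min(data), max(data)
    xs = [lo + (hi - lo) * i / (grid_points - 1) for i in range(grid_points)]
    ys = [density(x) for x in xs]
    peaks = [i for i in range(1, grid_points - 1) if ys[i - 1] < ys[i] >= ys[i + 1]]
    if len(peaks) < 2:
        raise ValueError("density is not bimodal at this bandwidth")
    # Keep the two tallest peaks, ordered left to right, and search between them.
    left, right = sorted(sorted(peaks, key=lambda i: ys[i])[-2:])
    valley = min(range(left, right + 1), key=lambda i: ys[i])
    return xs[valley]


# Hypothetical log-ratios: a cluster near -3 (absent/divergent) and one near 0 (present).
log_ratios = [-3.2, -3.1, -3.0, -2.9, -2.8, -0.2, -0.1, -0.05, 0.0, 0.05, 0.1, 0.2]
cutoff = kde_cutoff(log_ratios, bandwidth=0.3)
print(f"cut-off log-ratio: {cutoff:.2f}")
```

Because the cut-off is read from each array's own density, it adapts to the data rather than relying on a fixed threshold, which may be what the abstract means by a "dynamic" approach.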
Abstract:
Background: Somatic embryogenesis (SE) in plants is a process by which embryos are generated directly from somatic cells, rather than from the fused products of male and female gametes. Despite the detailed expression analysis of several somatic-to-embryonic marker genes, a comprehensive understanding of SE at a molecular level is still lacking. The present study was designed to generate high-resolution transcriptome datasets for early SE, paving the way for future research to understand the underlying molecular mechanisms that regulate this process. We sequenced Arabidopsis thaliana somatic embryos collected from three distinct developmental time-points (5, 10 and 15 d after in vitro culture) using the Illumina HiSeq 2000 platform. Results: This study yielded a total of 426,001,826 sequence reads mapped to 26,520 genes in the A. thaliana reference genome. Analysis of embryonic cultures after 5 and 10 d showed differential expression of 1,195 genes; these included 778 genes that were more highly expressed after 5 d as compared to 10 d. Moreover, 1,718 genes were differentially expressed in embryonic cultures between 10 and 15 d. Our data also showed at least eight different expression patterns during early SE; the majority of genes are transcriptionally more active in embryos after 5 d. Comparison of transcriptomes derived from somatic embryos and leaf tissues revealed that at least 4,951 genes are transcriptionally more active in embryos than in the leaf; increased expression of genes involved in DNA cytosine methylation and histone deacetylation was noted in embryogenic tissues. In silico expression analysis based on microarray data found that approximately 5% of these genes are transcriptionally more active in somatic embryos than in actively dividing callus and non-dividing leaf tissues. Moreover, this identified 49 genes expressed at a higher level in somatic embryos than in other tissues.
These included several genes with unknown function, as well as others related to oxidative and osmotic stress, and auxin signalling. Conclusions: The transcriptome information provided here will form the foundation for future research on genetic and epigenetic control of plant embryogenesis at a molecular level. In follow-up studies, these data could be used to construct a regulatory network for SE; the genes more highly expressed in somatic embryos than in vegetative tissues can be considered as potential candidates to validate these networks.
Abstract:
In this paper, we present an algorithm for cluster analysis that integrates aspects from cluster ensemble and multi-objective clustering. The algorithm is based on a Pareto-based multi-objective genetic algorithm, with a special crossover operator, which uses clustering validation measures as objective functions. The proposed algorithm can deal with data sets presenting different types of clusters, without the need for expertise in cluster analysis. Its result is a concise set of partitions representing alternative trade-offs among the objective functions. We compare the results obtained with our algorithm, in the context of gene expression data sets, to those achieved with Multi-Objective Clustering with automatic K-determination (MOCK), the algorithm most closely related to ours. (C) 2009 Elsevier B.V. All rights reserved.
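The "set of partitions representing alternative trade-offs" in this abstract corresponds to a Pareto front: the partitions that no other partition beats on every objective at once. The sketch below shows only that selection step, not the genetic algorithm or its crossover operator. The partition names and scores are hypothetical, and both objectives are assumed to be minimised.

```python
def dominates(a, b):
    """True if a is no worse than b on every objective and strictly better on one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(candidates):
    """Keep the candidates whose score tuples are not dominated by any other."""
    return {
        name: scores
        for name, scores in candidates.items()
        if not any(dominates(other, scores) for other in candidates.values())
    }


# Hypothetical partitions scored on two validation measures, both to be minimised.
partitions = {
    "k=2": (0.80, 0.10),
    "k=3": (0.50, 0.30),
    "k=4": (0.55, 0.35),  # worse than k=3 on both measures, so dominated
    "k=5": (0.20, 0.70),
}
front = pareto_front(partitions)
print(sorted(front))
```

Here three partitions survive because each is best on some trade-off between the two measures, while "k=4" is dropped because "k=3" is better on both.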
Abstract:
Blastocladiella emersonii is an aquatic fungus of the Chytridiomycete class. During germination, the zoospore, a motile nongrowing cell, goes through a cascade of morphological changes that culminates with its differentiation into the germling cell, capable of coenocytic vegetative growth. Transcriptome analyses of B. emersonii cells were carried out during germination induced under various environmental conditions. Microarray data analyzing 3,563 distinct B. emersonii genes revealed that 26% of them are differentially expressed during germination in nutrient medium at one or more of the time points investigated. Over 500 genes are upregulated during the time course of germination under those conditions, most being related to cell growth, including genes involved in protein biosynthesis, DNA transcription, energy metabolism, carbohydrate and oligopeptide transport, and cell cycle control. On the other hand, several transcripts stored in the zoospores are downregulated during germination in nutrient medium, such as genes involved in signal transduction, amino acid transport, and chromosome organization. In addition, germination induced in the presence of nutrients was compared with that triggered either by adenine or potassium ions in inorganic salt solution. Several genes involved in cell growth, induced during germination in nutrient medium, do not show increased expression when B. emersonii zoospores germinate in inorganic solution, suggesting that nutrients exert a positive effect on gene transcription. The transcriptome data also revealed that most genes involved in cell signaling show the same expression pattern irrespective of the initial germination stimulus.
Abstract:
Objective. Given their involvement in pathological and physiological angiogenesis, there has been growing interest in understanding and manipulating endothelial progenitor cells (EPC) for therapeutic purposes. However, detailed molecular analysis of EPC before and during endothelial differentiation is lacking and is the subject of the present study. Materials and Methods. We report a detailed microarray gene-expression profile of freshly isolated (day 0) human cord blood (CB)-derived EPC (CD133(+)KDR(+) or CD34(+)KDR(+)), and at different time points during in vitro differentiation (early: day 13; late: day 27). Results. Data obtained reflect an EPC transcriptome enriched in genes related to stem/progenitor cell properties (chromatin remodeling, self-renewal, signaling, cytoskeleton organization and biogenesis, recruitment, and adhesion). Using a complementary DNA microarray enriched in intronic transcribed sequences, we also observed that naturally transcribed intronic noncoding RNAs were specifically expressed at the EPC stage. Conclusion. Taken together, we have defined the global gene-expression profile of CB-derived EPC during the process of endothelial differentiation, which can be used to identify genes involved in different vascular pathologies. (C) 2008 ISEH - Society for Hematology and Stem Cells. Published by Elsevier Inc.
Abstract:
Lycopene is a natural pigment synthesized by plants and microorganisms, and it is mainly found in tomatoes. It is an acyclic isomer of β-carotene and one of the most potent antioxidants. Several studies have demonstrated the ability of lycopene to prevent chemically induced DNA damage; however, the mechanisms involved are still not clear. In the present study, we investigated the antigenotoxic/antimutagenic effects of lycopene in Chinese hamster ovary (CHO) cells treated with hydrogen peroxide, methylmethanesulphonate (MMS), or 4-nitroquinoline-1-oxide (4-NQO). Lycopene (97%), at final concentrations of 10, 25, and 50 μM, was tested under three different protocols: before, simultaneously with, and after the treatment with the mutagens. Comet and cytokinesis-block micronucleus assays were used to evaluate the level of DNA damage. Data showed that lycopene reduced the frequency of micronucleated cells induced by the three mutagens. However, this chemopreventive activity was dependent on the concentrations and treatment schedules used. Similar results were observed in the comet assay, although some enhancements of primary DNA damage were detected when the carotenoid was administered after the mutagens. In conclusion, our findings confirmed the chemopreventive activity of lycopene, and showed that this effect occurs under different mechanisms. (c) 2007 Elsevier Ltd. All rights reserved.
Abstract:
Sixty-five accessions of the species-rich freshwater red algal order Batrachospermales were characterized through DNA sequencing of two regions: the mitochondrial cox1 gene (664 bp), which is proposed as the DNA barcode for red algae, and the UPA (universal plastid amplicon) marker (370 bp), which has been recently identified as a universally amplifying region of the plastid genome. UPGMA phenograms of both markers were consistent in their species-level relationships, although levels of sequence divergence were very different. Intraspecific variation of morphologically identified accessions for the cox1 gene ranged from 0 to 67 bp (divergences were highest for the two taxa with the greatest number of accessions, Batrachospermum helminthosum and Batrachospermum macrosporum), while in contrast, the more conserved universal plastid amplicon exhibited much lower intraspecific variation (generally 0-3 bp). Comparisons to previously published mitochondrial cox2-3 spacer sequences for B. helminthosum indicated that the cox1 gene and cox2-3 spacer were characterized by similar levels of sequence divergence, and phylogeographic patterns based on these two markers were consistent. The two taxa represented by the largest numbers of specimens (B. helminthosum and B. macrosporum) have cox1 intraspecific divergence values that are substantially higher than previously reported, but no morphological differences can be discerned at this time among the intraspecific groups revealed in the analyses. DNA barcode data, which are based on a short fragment of an organellar genome, need to be interpreted in conjunction with other taxonomic characters, and additional batrachospermalean taxa need to be analyzed in detail to be able to draw generalities regarding intraspecific variation in this order.
Nevertheless, these analyses reveal a number of batrachospermalean taxa worthy of more detailed DNA barcode study, and it is predicted that such research will have a substantial effect on the taxonomy of species within the Batrachospermales in the future.
Abstract:
Graduate Program in General and Applied Biology (Pós-graduação em Biologia Geral e Aplicada) - IBB
Abstract:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)