881 results for RNA sequencing
Abstract:
Congenital hypothyroidism due to thyroid dysgenesis (CHTD; ectopy in more than 80% of cases) has a prevalence of 1 in 4,000 live births. CHTD results from a failure of the embryonic thyroid to differentiate, to maintain itself, or to migrate to its anatomical location (the anterior part of the neck), leading either to a complete absence of the thyroid (athyreosis) or to thyroid ectopy (lingual or sublingual). CHTD is mainly non-syndromic (98% of cases are non-familial), has a discordance rate of 92% in monozygotic twins, and shows a female and ethnic (i.e., Caucasian) predominance. Most cases of CHTD have no known cause but are associated with a severe thyroid hormone deficiency (hypothyroidism). Germline mutations in thyroid-related transcription factors (NKX2.1, FOXE1, PAX8, NKX2.5) have been identified in only 3% of patients with sporadic CHTD, and linkage analysis excludes these genes in the rare multiplex families with CHTD. We hypothesize that the lack of clear familial transmission of CHTD may result from the need for at least two different genetic "hits" in genes important for thyroid development. To address our research questions, we used two different approaches: 1) a candidate-gene approach focused on FOXE1, the only gene implicated in ectopy in the mouse model, and 2) an approach using next-generation sequencing (NGS) to find genetic variants that could explain this disorder in a cohort of patients with CHTD. For the first approach, a case-control study was carried out on the FOXE1 promoter. It was recently discovered that a region of the FOXE1 promoter is differentially methylated at two consecutive CpG dinucleotides, defining a region crucial for the control of FOXE1 expression. Haplotype-based association analysis revealed that one haplotype (Hap1: ACCCCCCdel1C) is associated with CHTD in Caucasians (p = 5 × 10⁻³). A significant reduction in luciferase activity was observed for Hap1 (68% reduction, p < 0.001) compared with the wild-type (WT) FOXE1 promoter. A 50% reduction in FOXE1 expression in a human thyroid cell line is sufficient to significantly reduce cell migration (55% reduction, p < 0.05). Another haplotype (Hap2: ACCCCCCC) is observed less frequently in African Americans than in Caucasians (p = 1.7 × 10⁻³), and Hap2 decreases luciferase activity (26% reduction, p < 0.001). Two distinct haplotypes are found frequently in controls of Black African descent. The first (Hap3: GTCCCAAC) is frequent (30.2%) in African-American controls compared with Caucasian controls (6.3%; p = 2.59 × 10⁻⁹), whereas the second (Hap4: GTCCGCAC) is found exclusively in African-American controls (9.4%) and is absent in Caucasian controls (p = 2.59 × 10⁻⁶). For the second approach, exome sequencing of leukocyte DNA from discordant MZ twins revealed no differences. Hence the interest in sequencing DNA and RNA from ectopic and orthotopic thyroids, in which random monoallelic expression was observed, which could explain how a monoallelic mutation can have pathogenic consequences. Finally, exome sequencing of a cohort of 36 patients with CHTD identified new, probably pathogenic variants in the recurrent genes RYR3, SSPO, IKBKE and TNXB. These four genes are involved in focal adhesion (which plays a role in cell migration), suggesting a direct role in thyroid migration defects. Migration assays show a strong decrease (at least 60% at 5 h) in the migration of thyroid cells infected with shRNA compared with shCtrl for 2 of these genes. Zebrafish knockouts (-/- and +/-) for these new genes will be generated to assess their impact on thyroid embryology.
Abstract:
The fruit is one of the most complex and important structures produced by flowering plants, and understanding the development and maturation of fruits in angiosperm species with diverse fruit structures is of immense interest. In the work presented here, molecular genetics and genomic analysis are used to explore the processes that form the fruit in two species: the model organism Arabidopsis and the diploid strawberry Fragaria vesca. One important basic question concerns the molecular genetic basis of fruit patterning. A long-standing model of Arabidopsis fruit (the gynoecium) patterning holds that auxin produced at the apex diffuses downward, forming a gradient that provides apical-basal positional information to specify different tissue types along the gynoecium's length. The proposed gradient, however, has never been observed, and the model appears inconsistent with a number of observations. I present a new, alternative model, wherein auxin acts to establish the adaxial-abaxial domains of the carpel primordia, which in turn ensures proper development of the final gynoecium. A second project uses genomics to identify genes that regulate fruit color by analyzing the genome sequences of Fragaria vesca, a species of wild strawberry. Shared and distinct SNPs among three F. vesca accessions were identified, providing a foundation for locating candidate mutations underlying phenotypic variation among F. vesca accessions. Through systematic analysis of relevant SNP variants, a candidate SNP in FveMYB10 was identified that may underlie fruit color in the yellow-fruited accessions; this was subsequently confirmed by functional assays. Our lab has previously generated extensive RNA-sequencing data that depict genome-scale gene expression profiles in F. vesca fruit and flower tissues at different developmental stages. To make this dataset more accessible, the web-based eFP software was adapted to it, allowing visualization of gene expression in any tissue through user-initiated queries. Together, this thesis work proposes a well-supported new model of fruit patterning in Arabidopsis and provides further resources for F. vesca, including genome-wide variant lists and the ability to visualize gene expression. It will facilitate future work linking traits of economic importance to specific genes and yield novel insights into fruit patterning and development.
Abstract:
Background: Flatfish metamorphosis denotes the extraordinary transformation of a symmetric pelagic larva into an asymmetric benthic juvenile. Metamorphosis in vertebrates is driven by thyroid hormones (THs), but how they orchestrate the cellular, morphological and functional modifications associated with maturation to juvenile/adult states in flatfish is an enigma. Since THs act via thyroid receptors that are ligand-activated transcription factors, we hypothesized that the maturation of tissues during metamorphosis should be preceded by significant modifications in the transcriptome. Targeting the unique metamorphosis of flatfish and taking advantage of the large size of Atlantic halibut (Hippoglossus hippoglossus) larvae, we determined the molecular basis of TH action using RNA sequencing. Results: De novo assembly of sequences for larval head, skin and gastrointestinal tract (GI-tract) yielded 90,676, 65,530 and 38,426 contigs, respectively. More than 57% of the assembled sequences were successfully annotated using a multi-step BLAST approach. A unique set of biological processes and candidate genes was identified that is specifically associated with changes in morphology and function of the head, skin and GI-tract. Transcriptome dynamics during metamorphosis were mapped with SOLiD sequencing of whole larvae and revealed more than 8,000 differentially expressed (DE) genes significantly (p < 0.05) up- or down-regulated in comparison with the juvenile stage. Candidate transcripts quantified by SOLiD and qPCR analysis were significantly correlated (r = 0.843; p < 0.05). The majority (98%) of DE genes during metamorphosis were not TH-responsive. TH-responsive transcripts clustered into 6 groups based on their expression pattern during metamorphosis, and the majority of the 145 DE TH-responsive genes were down-regulated. Conclusions: A transcriptome resource has been generated for metamorphosing Atlantic halibut, and over 8,000 DE transcripts per stage were identified. Unique sets of biological processes and candidate genes were associated with changes in the head, skin and GI-tract during metamorphosis. A small proportion of DE transcripts were TH-responsive, suggesting that they trigger gene networks, signalling cascades and transcription factors, leading to the overt changes in tissue that occur during metamorphosis.
Abstract:
Doctoral thesis, Biomedical Sciences, Departamento de Ciências Biomédicas e Medicina, Universidade do Algarve, 2016
Abstract:
Acute lymphoblastic leukemia (ALL) accounts for approximately 25% of pediatric cancers diagnosed each year. Complete remission is achieved in 80% of cases. However, treatment-resistant patients and patients who relapse have a poor prognosis. Epigenetic alterations are key factors in the development and progression of the disease, as well as in treatment resistance. In a screen of FDA-approved drugs, we discovered molecules with anticancer and epigenetic properties. To evaluate the activity of these molecules, we carried out a secondary screen on several leukemia cell lines. We found that one of these molecules, a cardiac glycoside called proscillaridin A, had anticancer activity specific to leukemic cells. We therefore hypothesize that proscillaridin A could have epigenetic and anticancer effects in preclinical models of ALL. To test this hypothesis, we treated two ALL cell lines, Nalm-6 (pre-B ALL) and Molt-4 (T-ALL), in vitro for 2 to 96 hours at clinically relevant doses. We observed dose-dependent growth inhibition in both cell lines, with 50% growth-inhibition values (IC50) of 3.0 nM for Nalm-6 and 2.3 nM for Molt-4. In addition, our BrdU cell-cycle studies show arrest in G2/M phase. Western blotting also detected significant decreases in the acetylation of histone 3 residues. Expression levels of the enzymes responsible for this acetylation, the histone acetyltransferases CBP, P300 and TIP60, as well as of the oncogene C-MYC, were also decreased. RNA sequencing analyses showed increased expression of genes involved in apoptosis and cell differentiation, and decreased expression of genes involved in cell proliferation, in particular C-MYC target genes. These promising results point to the potential of proscillaridin A as a new therapy for patients with ALL.
Abstract:
I proposed the study of two distinct aspects of the Ten-Eleven Translocation 2 (TET2) protein to understand its specific functions in different body systems. In Part I, I characterized the molecular mechanisms of Tet2 in the hematological system. As the second member of the Ten-Eleven Translocation protein family, TET2 is frequently mutated in leukemic patients. Previous studies have shown that TET2 mutations occur in 20% of myelodysplastic syndrome/myeloproliferative neoplasm (MDS/MPN), 10% of T-cell lymphoma/leukemia and 2% of B-cell lymphoma/leukemia cases. Genetic mouse models also display distinct phenotypes of various types of hematological malignancies. I performed 5-hydroxymethylcytosine (5hmC) chromatin immunoprecipitation sequencing (ChIP-Seq) and RNA sequencing (RNA-Seq) of hematopoietic stem/progenitor cells to determine whether deletion of Tet2 affects the abundance of 5hmC at myeloid, T-cell and B-cell specific gene transcription start sites, ultimately resulting in various hematological malignancies. Subsequent exome sequencing (Exome-Seq) showed that disease-specific genes are mutated in different types of tumors, which suggests that TET2 may protect the genome from being mutated. The direct interaction between TET2 and the Mutator S Homolog 6 (MSH6) protein suggests that TET2 is involved in DNA mismatch repair. Finally, in vivo mismatch repair studies show that the loss of Tet2 causes a mutator phenotype. Taken together, my data indicate that TET2 binds to MSH6 to protect genome integrity. In Part II, I aimed to better understand the role of Tet2 in the nervous system. 5-hydroxymethylcytosine regulates epigenetic modification during neurodevelopment and aging. Thus, Tet2 may play a critical role in regulating adult neurogenesis. To examine the physiological significance of Tet2 in the nervous system, I first showed that deletion of Tet2 reduces 5hmC levels in neural stem cells. Mice lacking Tet2 show abnormal hippocampal neurogenesis, along with 5hmC alterations at different gene promoters and corresponding downregulation of gene expression. In luciferase reporter assays, two neural factors, Neurogenic differentiation 1 (NeuroD1) and Glial fibrillary acidic protein (Gfap), were down-regulated in Tet2 knockout cells. My results suggest that Tet2 regulates neural stem/progenitor cell proliferation and differentiation in the adult brain.
Abstract:
Background: Disease flares of established atopic dermatitis (AD) are generally associated with a low-diversity skin microbiota and Staphylococcus aureus dominance. The temporal transition of the skin microbiome between early infancy and the dysbiosis of established AD is unknown. Methods: We randomly selected 50 children from the Cork Babies After SCOPE: Evaluating the Longitudinal Impact Using Neurological and Nutritional Endpoints (BASELINE) longitudinal birth cohort for microbiome sampling at 3 points in the first 6 months of life at 4 skin sites relevant to AD: the antecubital and popliteal fossae, nasal tip, and cheek. We identified 10 infants with AD and compared them with 10 randomly selected control infants with no AD. We performed bacterial 16S ribosomal RNA sequencing and analysis directly from clinical samples. Results: Bacterial community structures and diversity shifted over time, suggesting that age strongly affects the skin microbiome in infants. Unlike established AD, these patients with infantile AD did not have noticeably dysbiotic communities before or with disease and were not colonized by S aureus. In comparing patients and control subjects, infants who had affected skin at month 12 had statistically significant differences in bacterial communities on the antecubital fossa at month 2 compared with infants who were unaffected at month 12. In particular, commensal staphylococci were significantly less abundant in infants affected at month 12, suggesting that this genus might protect against the later development of AD. Conclusions: This study suggests that 12-month-old infants with AD were not colonized with S aureus before having AD. Additional studies are needed to confirm whether colonization with commensal staphylococci modulates skin immunity and attenuates development of AD.
Abstract:
Aortic valve stenosis (AS) is caused by progressive calcification and fibrosis of the aortic valve. The risk of developing the disease increases with age. With rising life expectancy, AS has become a public health problem. AS is fatal in the absence of medical treatment. Currently, surgery is the only treatment for the severe stage of the disease, but nearly 50% of individuals with AS are not eligible for it, mainly because of comorbidities. Several biological processes have been associated with the disease, but the specific molecular pathways and genes involved in the development and progression of AS are not known. There is therefore an urgent need to discover susceptibility genes for AS in order to identify people at risk, as well as biomarkers and therapeutic targets that could lead to drugs that reverse or limit disease progression. The objective of this doctoral thesis was to identify the molecular basis of AS. Modern genomic approaches, including candidate-gene studies and genome-wide association studies (GWAS), were carried out using DNA collections from a large number of patients well characterized for AS. Complementary transcriptomic studies compared global gene expression profiles between calcified and non-calcified valves using DNA microarrays and RNA sequencing. A first study identified variants in the NOTCH1 gene and suggests, for the first time, the presence of a common polymorphism in this gene conferring susceptibility to AS. The second study combined, by meta-analysis, two GWAS of patients from Quebec City and Paris (France) with the transcriptomic data. This integrative genomics study confirmed the role of RUNX2 in AS and identified a new susceptibility gene, CACNA1C. The third and fourth studies, on gene expression, provided a better understanding of the molecular basis of bicuspid aortic valve calcification and identified new therapeutic targets for AS. The data generated by this project form the basis for important future discoveries that will improve treatment options and quality of life for patients with AS.
Abstract:
Despite the ecological importance of copepods, few next-generation sequencing (NGS) studies have been performed on small crustaceans, and a standard method for RNA extraction is lacking. In this study, we compared three commonly used methods (TRIzol®, Aurum Total RNA Mini Kit and Qiagen RNeasy Micro Kit), in combination with the preservation reagents TRIzol® or RNAlater®, to obtain RNA of high quality and quantity from copepods for NGS. Total RNA was extracted from the copepods Calanus helgolandicus, Centropages typicus and Temora stylifera, and its quantity and quality were evaluated using NanoDrop, agarose gel electrophoresis and Agilent Bioanalyzer. Our results demonstrate that preservation of copepods in RNAlater® followed by extraction with the Qiagen RNeasy Micro Kit was the optimal isolation method for obtaining RNA of high quality and quantity for NGS studies of C. helgolandicus. Intriguingly, C. helgolandicus 28S rRNA is formed by two subunits that separate after heat denaturation and migrate along with 18S rRNA. This unique property of protostome RNA has never been reported in copepods. Overall, our comparative study of RNA extraction protocols will help increase gene expression studies on copepods using high-throughput applications such as RNA-Seq and microarrays.
Abstract:
The advent of next-generation sequencing (NGS) technologies has expanded the area of genomic research, offering high coverage and increased sensitivity over older microarray platforms. Although the current cost of NGS still exceeds that of microarray approaches, rapid advances in NGS will likely make it the platform of choice for future research in differential gene expression. Connectivity mapping is a procedure for examining the connections among diseases, genes and drugs through differential gene expression. It was initially based on microarray technology, with which a large collection of compound-induced reference gene expression profiles has been accumulated. In this work, we aim to test the feasibility of incorporating NGS RNA-Seq data into the current connectivity mapping framework by using the microarray-based reference profiles together with a differentially expressed gene signature constructed from an NGS dataset. This would allow connections to be established between the NGS gene signature and the microarray reference profiles, avoiding the cost of re-creating drug profiles with NGS technology. We applied the connectivity mapping approach to a publicly available NGS dataset of androgen-stimulated LNCaP cells in order to extract candidate compounds that could inhibit the proliferative phenotype of LNCaP cells and to assess their potential in a laboratory setting. In addition, we analyzed an independent microarray dataset with similar experimental settings. We found a high level of concordance between the top compounds identified using the gene signatures from the two datasets. The nicotine derivative cotinine was returned as the top candidate among the overlapping compounds with the potential to suppress this proliferative phenotype. Subsequent lab experiments validated this connectivity mapping hit, showing that cotinine inhibits cell proliferation in an androgen-dependent manner.
Thus the results in this study suggest a promising prospect of integrating NGS data with connectivity mapping. © 2013 McArt et al.
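The idea of connecting a gene signature to a reference profile can be sketched in a few lines of Python. This is a minimal illustration assuming a simple signed-rank connection score; the function names, the toy genes and the scoring rule are assumptions made for illustration and are not taken from the abstract, which does not specify the scoring method used.

```python
def signed_ranks(profile):
    """Map each gene to a signed rank: the largest absolute change gets the largest rank."""
    ordered = sorted(profile, key=lambda gene: abs(profile[gene]))
    ranks = {}
    for position, gene in enumerate(ordered, start=1):
        change = profile[gene]
        sign = 1 if change > 0 else -1 if change < 0 else 0
        ranks[gene] = sign * position
    return ranks


def connection_score(signature, reference):
    """Score in [-1, 1] between a signature (gene -> +1/-1) and a reference profile.

    Hypothetical scoring rule: sum of signature sign times signed reference rank,
    divided by the maximum achievable sum for a signature of that size.
    """
    ranks = signed_ranks(reference)
    n, m = len(reference), len(signature)
    raw = sum(sign * ranks.get(gene, 0) for gene, sign in signature.items())
    max_raw = sum(n - i for i in range(m))  # the m largest ranks
    return raw / max_raw if max_raw else 0.0


# Toy example (invented values): genes up (+1) or down (-1) in the query signature.
signature = {"GENE_A": 1, "GENE_B": -1}
mimic = {"GENE_A": 2.0, "GENE_B": -1.5, "GENE_C": 0.1}
reverser = {"GENE_A": -2.0, "GENE_B": 1.5, "GENE_C": 0.1}
print(connection_score(signature, mimic), connection_score(signature, reverser))
```

Under this rule a strongly negative score marks a reference compound whose profile opposes the signature, which is the kind of candidate one would look for when trying to suppress a phenotype.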
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Abstract:
Background: The involvement of post-transcriptional regulation by microRNAs in the molecular mechanisms underlying cancer is well documented. However, their effects at the cellular level have not been fully explored. Functional in vitro studies are fundamental for understanding their role; nevertheless, results are highly dependent on the cellular model adopted. Next-generation small RNA transcriptomic sequencing data from a tumor cell line and from keratinocytes derived from primary culture were generated in order to characterize the microRNA content of these systems and thus aid in their understanding. Both are cell models for functional studies of microRNAs in head and neck squamous cell carcinoma (HNSCC), a smoking-related cancer. Known microRNAs were quantified and analyzed in the context of gene regulation. New microRNAs were investigated using similarity and structural search, ab initio classification, and prediction of the location of mature microRNAs within would-be precursor sequences. Results were compared with small RNA transcriptomic sequences from HNSCC samples in order to assess the applicability of these cell models for understanding the cancer phenotype and for novel molecule discovery. Results: Ten miRNAs represented over 70% of the mature molecules present in each of the cell types. The most expressed molecules were miR-21, miR-24 and miR-205. Accordingly, miR-21 and miR-205 have previously been shown to play a role in epithelial cell biology. Although miR-21 has been implicated in cancer development and evaluated as a biomarker of HNSCC progression, no significant expression differences were seen between cell types. We demonstrate that differentially expressed mature miRNAs target biological processes related to cell differentiation and apoptosis, indicating that they might represent, with acceptable accuracy, the genetic context from which they derive. Most miRNAs identified in the cancer cell line and in keratinocytes were present in tumor samples and cancer-free samples, respectively, with miR-21, miR-24 and miR-205 still among the most prevalent molecules in all instances. Thirteen miRNA-like structures, containing reads identified by deep sequencing, were predicted from putative miRNA precursor sequences. Strong evidence suggests that one of them could be a new miRNA. This molecule was mostly expressed in the tumor cell line and HNSCC samples, indicating a possible biological function in cancer. Conclusions: Critical biological features of cells must be fully understood before they can be chosen as models for functional studies. Expression levels of miRNAs relate to cell type and tissue context. This study provides insights into the miRNA content of two cell models used for cancer research. Pathways commonly deregulated in HNSCC might be targeted by the most expressed and also by differentially expressed miRNAs. Results indicate that the use of cell models for cancer research demands careful assessment of underlying molecular characteristics for proper data interpretation. Additionally, one new miRNA-like molecule with a potential role in cancer was identified in the cell lines and clinical samples.
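The observation that ten miRNAs account for over 70% of mature molecules is a cumulative-share calculation over read counts. A minimal Python sketch, assuming a hypothetical mapping from miRNA name to read count; the counts below are invented for illustration and are not the study's data:

```python
def top_share(counts, n):
    """Fraction of all reads contributed by the n most abundant entries."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reads to summarize")
    top = sorted(counts.values(), reverse=True)[:n]
    return sum(top) / total


# Invented read counts, for illustration only.
example_counts = {"miR-21": 50, "miR-24": 30, "miR-205": 15, "other": 5}
print(top_share(example_counts, 2))
```

The same function applied with n=10 to a real count table would give the kind of figure quoted in the abstract.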
Abstract:
RNA interference is a process by which small RNA fragments (19-25 nucleotides) can silence gene expression. Its discovery, in 1998, revolutionized thinking in molecular biology, undermining the foundations of the so-called Central Dogma. RNAi has been shown to play fundamental roles in gene regulation mechanisms and in the silencing of expression, and it acts as an innate defense mechanism against various types of viruses. Because of these implications it attracts interest not only scientifically but also medically, since it could be used to develop new treatments. Although the discovery of this activity has aroused the curiosity and interest of many, the various processes involved, especially at the molecular level, are still unclear. This work proposes methods for analyzing data from an experiment produced by the Institute of Molecular and Cellular Biology in Strasbourg. The experiment studies the functions of the enzyme Dicer-2 in the RNA interference pathway (that is, the chain of biomolecular reactions) during viral infection in the fruit fly Drosophila melanogaster. To understand how Dicer-2 acts in silencing, one must determine in which cases and which parts of the RNA are silenced, depending on the type of mutation in the enzyme itself. It is therefore necessary to sequence the RNA under the different experimental conditions, thereby obtaining the data to be analyzed. Some of the statistical methods proposed are rather unconventional, as a consequence of the peculiarity and difficulty of the questions the experiment raises. Since the topics addressed require an increasingly interdisciplinary approach, the demand for experts from other scientific fields, such as mathematicians, computer scientists, physicists, statisticians and engineers, has grown considerably. This collaboration, thanks to a diversity of approaches to problems, can provide new tools for understanding in areas that, until recently, fell exclusively within the competence of biologists.
Abstract:
Next-generation sequencing (NGS) is a valuable tool for the detection and quantification of HIV-1 variants in vivo. However, these technologies require detailed characterization and control of artificially induced errors to be applicable for accurate haplotype reconstruction. To investigate the occurrence of substitutions, insertions, and deletions at the individual steps of RT-PCR and NGS, 454 pyrosequencing was performed on amplified and non-amplified HIV-1 genomes. Artificial recombination was explored by mixing five different HIV-1 clonal strains (5-virus-mix) and applying different RT-PCR conditions followed by 454 pyrosequencing. Error rates ranged from 0.04-0.66% and were similar in amplified and non-amplified samples. Discrepancies were observed between forward and reverse reads, indicating that most errors were introduced during the pyrosequencing step. Using the 5-virus-mix, non-optimized, standard RT-PCR conditions introduced artificial recombinants in a fraction of at least 30% of the reads that subsequently led to an underestimation of true haplotype frequencies. We minimized the fraction of recombinants down to 0.9-2.6% by optimized, artifact-reducing RT-PCR conditions. This approach enabled correct haplotype reconstruction and frequency estimations consistent with reference data obtained by single genome amplification. RT-PCR conditions are crucial for correct frequency estimation and analysis of haplotypes in heterogeneous virus populations. We developed an RT-PCR procedure to generate NGS data useful for reliable haplotype reconstruction and quantification.
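The frequency estimation described for the 5-virus-mix can be illustrated with a short Python sketch. It assumes each read has already been reduced to a string of bases at the variable positions and that the parental haplotypes are known; the function name and the toy reads are hypothetical. Reads that match no parental haplotype are counted together here, although in practice they may be PCR recombinants or sequencing errors, which this sketch does not distinguish.

```python
from collections import Counter


def haplotype_summary(reads, parental):
    """Return (frequency of each parental haplotype, fraction of non-parental reads)."""
    if not reads:
        raise ValueError("no reads supplied")
    counts = Counter(reads)
    total = len(reads)
    # Frequency of every known parental haplotype, 0.0 if it was never observed.
    freqs = {hap: counts[hap] / total for hap in parental}
    # Reads matching no parental haplotype: candidate recombinants or errors.
    non_parental = sum(n for hap, n in counts.items() if hap not in parental)
    return freqs, non_parental / total


# Toy data (invented): two parental strains and one chimeric read.
parental = {"AAAA", "CCCC"}
reads = ["AAAA"] * 6 + ["CCCC"] * 3 + ["AACC"]
freqs, non_parental_fraction = haplotype_summary(reads, parental)
print(freqs, non_parental_fraction)
```

Because chimeric reads are removed from the parental counts, a high non-parental fraction lowers every estimated parental frequency, which is the underestimation effect the abstract reports for non-optimized RT-PCR conditions.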