922 results for High-Throughput Nucleotide Sequencing


Relevance:

100.00% 100.00%

Publisher:

Abstract:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevance:

100.00% 100.00%

Publisher:

Abstract:

High-throughput sequencing (HTS) provides new research opportunities for work on non-model organisms, such as differential expression studies between populations exposed to different environmental conditions. However, such transcriptomic studies first require the production of a reference assembly. The choice of sampling procedure, sequencing strategy and assembly workflow is crucial. To develop a reliable reference transcriptome for Triatoma brasiliensis, the major Chagas disease vector in Northeastern Brazil, different de novo assembly protocols were tested using various datasets and software. Both 454 and Illumina sequencing technologies were applied to RNA extracted from antennae and mouthparts of single or pooled individuals. The 454 library yielded 278 Mb. Fifteen Illumina libraries were constructed and yielded nearly 360 million RNA-seq single reads and 46 million RNA-seq paired-end reads, totaling nearly 45 Gb. For the 454 reads, we used three assemblers, Newbler, CAP3 and/or MIRA, and for the Illumina reads, the Trinity assembler. Ten assembly workflows were compared using these programs separately or in combination. To compare the resulting assemblies, quantitative and qualitative criteria were used, including contig length, N50, contig number and the percentage of chimeric contigs. Completeness of the assemblies was estimated using the CEGMA pipeline. The best assembly (57,657 contigs, completeness of 80 %, < 1 % chimeric contigs) was a hybrid assembly, leading us to recommend the use of (1) a single individual with a large representation of biological tissues, (2) a merge of long reads and short paired-end Illumina reads, and (3) several assemblers in order to combine the specific advantages of each.
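The N50 criterion mentioned in this abstract can be illustrated with a minimal sketch. It assumes the common definition (the contig length L such that contigs of length L or longer hold at least half of the assembled bases); the function name and the example lengths are hypothetical and are not taken from the study.

```python
def n50(contig_lengths):
    """Return the N50 of a list of contig lengths.

    Assumed definition: the length L such that contigs of length >= L
    together contain at least half of all assembled bases.
    """
    lengths = sorted(contig_lengths, reverse=True)
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half_total:
            return length
    # Only reached when the input list is empty.
    raise ValueError("contig_lengths must not be empty")


# Made-up contig lengths, for illustration only.
example_lengths = [100, 200, 300, 400, 500]
print(n50(example_lengths))  # 400
```

A higher N50 alone does not indicate a better transcriptome assembly, which is presumably why the abstract pairs it with contig number, chimera rate and CEGMA completeness.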

Relevance:

100.00% 100.00%

Publisher:

Abstract:

High-throughput sequencing capabilities have made the process of assembling a transcriptome easier, whether or not there is a reference genome. But the quality of a transcriptome assembly must be good enough to capture the most comprehensive catalog of transcripts and their variations, and to support further transcriptomics experiments. There is currently no consensus on which of the many sequencing technologies and assembly tools are the most effective. Many non-model organisms lack a reference genome to guide the transcriptome assembly. One question, therefore, is whether a reference-based assembly gives better results than de novo assembly. The blood-sucking insect Rhodnius prolixus, a vector for Chagas disease, has a reference genome. It is therefore a good model on which to compare reference-based and de novo transcriptome assemblies. In this study, we compared de novo and reference-based assembly strategies using three datasets (454, Illumina, 454 combined with Illumina) and various assembly software. We developed criteria to compare the resulting assemblies: the size distribution and number of transcripts, the proportion of potentially chimeric transcripts, and the completeness of the assembly (evaluated both with the CEGMA software and by the fraction of the R. prolixus proteome retrieved). Moreover, we looked for the presence of two chemosensory gene families (Odorant-Binding Proteins and Chemosensory Proteins) to validate the assembly quality. The reference-based assemblies after genome annotation were clearly better than those generated using de novo strategies alone. Reference-based strategies revealed new transcripts, including new isoforms not predicted by automatic genome annotation. However, a combination of de novo and reference-based strategies gave the best result and allowed us to assemble fragmented transcripts.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Background: Transcript enumeration methods such as SAGE, MPSS, and sequencing-by-synthesis EST "digital northern" are important high-throughput techniques for digital gene expression measurement. Like other counting or voting processes, these measurements constitute compositional data exhibiting properties particular to the simplex space, where the sum of the components is constrained. These properties are not present in regular Euclidean spaces, on which hybridization-based microarray data are often modeled. Therefore, pattern recognition methods commonly used for microarray data analysis may be non-informative for the data generated by transcript enumeration techniques, since they ignore certain fundamental properties of this space. Results: Here we present a software tool, Simcluster, designed to perform clustering analysis for data on the simplex space. We present Simcluster as a stand-alone command-line C package and as a user-friendly on-line tool. Both versions are available at: http://xerad.systemsbiology.net/simcluster. Conclusion: Simcluster is designed in accordance with a well-established mathematical framework for compositional data analysis, which provides principled procedures for dealing with the simplex space, and is thus applicable in a number of contexts, including enumeration-based gene expression data.
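To make the simplex constraint concrete, the sketch below computes a distance between two count vectors using the centered log-ratio (clr) transform, a standard device in compositional data analysis. It is an illustration only and is not claimed to be Simcluster's implementation; the function names are hypothetical, and all counts are assumed to be strictly positive (zeros would need a pseudocount or another treatment).

```python
import math


def closure(counts):
    """Rescale counts so that they sum to 1 (a point on the simplex)."""
    total = sum(counts)
    return [c / total for c in counts]


def clr(composition):
    """Centered log-ratio transform: log of each part minus the mean log."""
    logs = [math.log(x) for x in composition]
    mean_log = sum(logs) / len(logs)
    return [value - mean_log for value in logs]


def aitchison_distance(x, y):
    """Euclidean distance between the clr transforms of two count vectors."""
    cx, cy = clr(closure(x)), clr(closure(y))
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(cx, cy)))


# Two libraries with the same relative abundances but different depth
# are (up to rounding) at distance zero; made-up tag counts.
print(aitchison_distance([10, 20, 30], [100, 200, 300]))
print(aitchison_distance([10, 20, 30], [30, 20, 10]))
```

The first call shows the property that motivates the approach: sequencing depth alone does not separate two samples, whereas a plain Euclidean distance on raw counts would.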

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The Ph chromosome is the most frequent cytogenetic aberration associated with adult ALL, and it represents the single most significant adverse prognostic marker. Although imatinib has led to significant improvements in the treatment of patients with Ph+ ALL, in the majority of cases resistance developed quickly and the disease progressed. Some mechanisms of resistance have been widely described, but the full set of contributing factors driving both the disease and resistance remains to be defined. The observation of rapid development of lymphoblastic leukemia in mice expressing altered Ikaros (Ik) isoforms provided the background of this study. Ikaros is a zinc finger transcription factor required for normal hemopoietic differentiation and proliferation, particularly in the lymphoid lineages. By means of alternative splicing, Ikaros encodes several proteins that differ in their abilities to bind to a consensus DNA-binding site. Shorter, DNA-nonbinding isoforms exert a dominant-negative effect, inhibiting the ability of longer heterodimer partners to bind DNA. The differential expression pattern of Ik isoforms in Ph+ ALL patients was analyzed in order to determine whether molecular abnormalities involving the Ik gene could be associated with resistance to imatinib and dasatinib. Bone marrow and peripheral blood samples from 46 adult patients (median age 55 yrs, 18-76) with Ph+ ALL at diagnosis and during treatment with imatinib (16 pts) or dasatinib (30 pts) were collected. We set up a fast, high-throughput method based on capillary electrophoresis technology to detect and quantify splice variants. Forty-one percent of Ph+ ALL patients expressed high levels of the non-DNA-binding dominant-negative Ik6 isoform, which lacks critical N-terminal zinc fingers and displays an abnormal subcellular compartmentalization pattern. Nuclear extracts from patients expressing Ik6 failed to bind DNA in a mobility shift assay using a DNA probe containing an Ikaros-specific DNA binding sequence. 
In 59% of Ph+ ALL patients, many splice variants corresponding to the Ik1, Ik2, Ik4, Ik4A, Ik5A, Ik6, Ik6 and Ik8 isoforms coexisted in the same PCR sample at the same time. In these patients, aberrant full-length Ikaros isoforms were also identified, characterized by a 60-bp insertion immediately downstream of exon 3 and a recurring 30-bp in-frame deletion at the end of exon 7, most frequently involving the Ik2 and Ik4 isoforms. Both the insertion and the deletion were due to the selection of alternative splice donor and acceptor sites. Molecular monitoring of minimal residual disease showed for the first time in vivo that Ik6 expression strongly correlated with BCR-ABL transcript levels, suggesting that this alteration could depend on Bcr-Abl activity. Patient-derived leukaemia cells expressed dominant-negative Ik6 at diagnosis and at the time of relapse, but never during remission. To demonstrate mechanistically whether overexpression of Ik6 impairs the response to tyrosine kinase inhibitors (TKIs) in vitro and contributes to resistance, an imatinib-sensitive, Ik6-negative Ph+ ALL cell line (SUP-B15) was transfected with the complete Ik6 DNA coding sequence. The expression of Ik6 strongly increased proliferation and inhibited apoptosis in TKI-sensitive cells, establishing a previously unknown link between specific molecular defects involving the Ikaros gene and resistance to TKIs in Ph+ ALL patients. Amplification and genomic sequence analysis of the exon splice junction regions showed the presence of 2 single nucleotide polymorphisms (SNPs): rs10251980 [A/G] in the exon 2/3 splice junction and rs10262731 [A/G] in the exon 7/8 splice junction, in 50% and 36% of patients, respectively. A variant of rs11329346 [-/C] was also found in 16% of patients. Two other single nucleotide substitutions not recognized as SNPs were also observed. 
Some mutations were predicted by computational analyses (RESCUE approach) to alter cis-splicing elements. In conclusion, these findings demonstrate that the post-transcriptional regulation of alternative splicing of the Ikaros gene is defective in the majority of Ph+ ALL patients treated with TKIs. The overexpression of Ik6, by blocking B-cell differentiation, could contribute to resistance by opening a time window during which leukaemia cells acquire secondary transforming events that confer definitive resistance to imatinib and dasatinib.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

In the past decade, the advent of efficient genome sequencing tools and high-throughput experimental biotechnology has led to enormous progress in the life sciences. Among the most important innovations is microarray technology. It makes it possible to quantify the expression of thousands of genes simultaneously by measuring the hybridization from a tissue of interest to probes on a small glass or plastic slide. The characteristics of these data include a fair amount of random noise, a predictor dimension in the thousands, and a sample size in the dozens. One of the most exciting areas to which microarray technology has been applied is the challenge of deciphering complex diseases such as cancer. In these studies, samples are taken from two or more groups of individuals with heterogeneous phenotypes, pathologies, or clinical outcomes. These samples are hybridized to microarrays in an effort to find a small number of genes that are strongly correlated with the group of individuals. Even though methods for analysing the data are now well developed and close to reaching a standard organization (through the efforts of proposed international projects such as the Microarray Gene Expression Data (MGED) Society [1]), it is not uncommon to encounter a clinician's question for which no compelling statistical method is available. The contribution of this dissertation to deciphering disease is the development of new approaches aimed at handling open problems posed by clinicians in specific experimental designs. Chapter 1, starting from a necessary biological introduction, reviews microarray technologies and all the important steps of an experiment, from the production of the array to quality control and the preprocessing steps used for data analysis in the rest of the dissertation. 
Chapter 2 provides a critical review of standard analysis methods, stressing most of their problems. Chapter 3 introduces a method to address the issue of unbalanced design in microarray experiments. In microarray experiments, experimental design is a crucial starting point for obtaining reasonable results. In a two-class problem, an equal or similar number of samples should be collected for the two classes. However, in some cases, e.g. rare pathologies, the approach to be taken is less evident. We propose to address this issue by applying a modified version of SAM [2]. MultiSAM consists of a repeated application of a SAM analysis, comparing the less populated class (LPC) with 1,000 random samplings of the same size from the more populated class (MPC). A list of differentially expressed genes is generated for each SAM application. After 1,000 iterations, each probe is given a "score" ranging from 0 to 1,000 based on how often it recurs as differentially expressed in the 1,000 lists. The performance of MultiSAM was compared with that of SAM and LIMMA [3] on two simulated data sets generated from beta and exponential distributions. The results of all three algorithms on low-noise data sets seem acceptable. However, on a real unbalanced two-channel data set regarding chronic lymphocytic leukemia, LIMMA finds no significant probe, SAM finds 23 significantly changed probes but cannot separate the two classes, while MultiSAM finds 122 probes with score >300 and separates the data into two clusters by hierarchical clustering. We also report extra-assay validation in terms of differentially expressed genes. Although standard algorithms perform well on low-noise simulated data sets, MultiSAM seems to be the only one able to reveal subtle differences in gene expression profiles on real unbalanced data. Chapter 4 describes a method to address the evaluation of similarities in a three-class problem by means of the Relevance Vector Machine [4]. 
Indeed, when microarray data are viewed in a prognostic and diagnostic clinical framework, differences are not the only thing that can play a crucial role. In some cases similarities can give useful, and sometimes even more important, information. Given three classes, the goal could be to establish, with a certain level of confidence, whether the third one is similar to the first or to the second. In this work we show that the Relevance Vector Machine (RVM) [2] could be a possible solution to the limitations of standard supervised classification. RVM offers many advantages compared, for example, with its well-known precursor, the Support Vector Machine (SVM) [3]. Among these advantages, the estimate of the posterior probability of class membership is a key feature for addressing the similarity issue. This is a highly important, but often overlooked, option of any practical pattern recognition system. We focused on a three-class tumor grade problem, with 67 samples of grade 1 (G1), 54 samples of grade 3 (G3) and 100 samples of grade 2 (G2). The goal is to find a model able to separate G1 from G3, and then to evaluate the third class, G2, as a test set to obtain the probability that each G2 sample belongs to class G1 or class G3. The analysis showed that breast cancer samples of grade 2 have a molecular profile more similar to breast cancer samples of grade 1. This result had been conjectured in the literature, but no measure of significance had been given before.
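The resampling scheme described for MultiSAM in this abstract (compare the smaller class against repeated same-size random draws from the larger class, then count how often each probe is called) can be sketched as follows. This is a hedged illustration of the resampling and scoring loop only: a Welch-style t statistic with a fixed cutoff stands in for the SAM statistic, which is a deliberate simplification, and every function name, parameter and default here is hypothetical.

```python
import random
import statistics


def t_like(a, b):
    """Welch-style t statistic; a simple stand-in for the SAM statistic."""
    se = (statistics.variance(a) / len(a) + statistics.variance(b) / len(b)) ** 0.5
    return (statistics.mean(a) - statistics.mean(b)) / se if se > 0 else 0.0


def resampling_scores(lpc, mpc, n_iter=1000, threshold=3.0, seed=0):
    """Count, per probe, how many resampling rounds call it differential.

    lpc, mpc: lists of samples (less / more populated class); each sample
    is a list of probe values in the same probe order.
    """
    rng = random.Random(seed)
    n_probes = len(lpc[0])
    scores = [0] * n_probes
    for _ in range(n_iter):
        # Draw a subset of the larger class with the size of the smaller one.
        subset = rng.sample(mpc, len(lpc))
        for p in range(n_probes):
            a = [sample[p] for sample in lpc]
            b = [sample[p] for sample in subset]
            if abs(t_like(a, b)) > threshold:
                scores[p] += 1
    return scores
```

Probes can then be ranked by score and filtered with a cutoff, in the spirit of the "score >300" selection reported above; the cutoff itself is a choice the abstract does not justify further.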

Relevance:

100.00% 100.00%

Publisher:

Abstract:

This dissertation was written as part of a multicentre EU-funded project on the use of single-nucleotide polymorphisms (SNPs) for individualizing persons, in the context of attributing biological crime-scene traces or identifying unknown deceased persons. The overarching aim of the project was to establish and validate high-resolution genotyping methods that can analyze SNPs simultaneously in multiplex format with high accuracy but little effort. First, 29 Y-chromosomal and 52 autosomal SNPs were selected with the requirement that, as a multiplex, they offer the highest possible power of individualization. This was followed by validation of both multiplex systems and of the SNaPshot™ minisequencing method in systematic studies involving all working groups of the project. The validated reference method based on minisequencing served, on the one hand, for controlled collaboration between different laboratories and, on the other hand, as the basis for developing, in this work, an assay for SNP genotyping using electronic microarray technology. The independent main part of this dissertation describes, using the previously validated autosomal SNPs, the new development and validation of a hybridization assay for the electronic microarray platform from Nanogen. To this end, three different assays that differ in their working principle on the microarray were established beforehand. Of these, the capture-down assay was selected for further development on the basis of performance. After numerous optimization measures concerning PCR product treatment, instrument-specific procedures and analysis-specific oligonucleotide designs, the capture-down assay was ready for the simultaneous typing of three individuals with 32 SNPs each on one microarray. 
This procedure was then validated using 40 DNA samples with known genotypes for the 32 SNPs, and its accuracy was determined by parallel SNaPshot™ typing. The result not only demonstrates the suitability of the validated assay and of electronic microarray technology for certain questions, but also shows their advantages in terms of speed, flexibility and efficiency. The automation, which allows the spatial arrangement of the fragments under study to be set immediately before analysis, reduces unnecessary work steps and thus the error rate and the risk of contamination, while improving time efficiency. With a maximum accuracy of 94%, however, the reliability of the STR systems currently used in forensic genetics cannot yet be matched. The role of the new method will therefore lie not in replacing the established methods but in complementing them to solve special problems, such as the analysis of highly degraded DNA traces.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Network Theory is a prolific and lively field, especially when it approaches Biology. New concepts from this theory find application in areas where extensive datasets are already available for analysis, without the need to invest money to collect them. The only tools that are necessary to accomplish an analysis are easily accessible: a computing machine and a good algorithm. As these two tools progress, thanks to technological advancement and human effort, ever larger datasets can be analysed. The aim of this paper is twofold. The first is to provide an overview of one of these concepts, which originates at the meeting point between Network Theory and Statistical Mechanics: the entropy of a network ensemble. This quantity has been described from different angles in the literature. Our approach tries to be a synthesis of the different points of view. The second part of the work is devoted to presenting a parallel algorithm that can evaluate this quantity over an extensive dataset. Finally, the algorithm will also be used to analyse high-throughput data coming from biology.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

This thesis was developed in the context of the Ritmare project WP1, whose main objective is the development of a sustainable fishery through the identification of population boundaries in commercially important species in Italian seas. Three objectives are discussed in support of the main purpose of identifying stock boundaries in Parapenaeus longirostris: (1) development of a representative sampling design for Italian seas; (2) evaluation of the 2b-RAD protocol; (3) investigation of populations through biological data analysis. First, we defined and carried out a sampling design that properly represents all Italian seas. We then used information and data on the distribution of nursery areas, population abundance and the importance of P. longirostris in local fisheries to develop an experimental design that prioritizes the most important areas, maximizing the results obtainable with the current project funds. We introduced for the first time the use of 2b-RAD on this species, a genotyping method based on sequencing the uniform fragments produced by type IIB restriction endonucleases. Thanks to this method we were able to move from genetics to the more complex field of genomics. To proceed with 2b-RAD we performed several tests to identify the best DNA extraction kit and protocol, and we finally obtained 192 high-quality DNA extracts ready to be processed. We tested 2b-RAD on five samples and, after high-throughput sequencing of the libraries, used the software “Stacks” to analyze the sequences. We obtained positive results, identifying a large number of SNP markers among the five samples. To guarantee a multidisciplinary approach, we used the biological data associated with the collected samples to investigate differences between geographical samples. This approach ensures continuity with other projects, for instance STOCKMED, which also use a combination of molecular and biological analyses.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Diseases of the skeletal system, such as osteoporosis or osteoarthritis, are, alongside cardiovascular diseases and tumors, among the most common human diseases. A better understanding of the formation and maintenance of bone and cartilage tissue is therefore of particular importance. Many previous approaches to identifying the relevant genes, their products and their interactions are based on the study of pathological situations. Consequently, the function of many genes has been described only in connection with diseases. Studies aimed at gene activity during the normal development of bone- and cartilage-forming tissues, by contrast, have been carried out far less often. One of the most interesting tissues in terms of developmental physiology is the epiphyseal plate of the long bones. In this so-called growth plate, particularly in fetal tissue, very high activity of the genes involved in bone and cartilage formation is to be expected. In the present work, RNA was therefore isolated from the epiphyseal plate of calf bones and a cDNA library was constructed. About 4000 clones from this library were sequenced in a classical EST project. The analysis yielded an expression profile comprising roughly 900 genes and identified many transcripts for regulatory and structural components of bone and cartilage development. In addition to the typical genes for components of bone development, components of embryonic developmental processes are also clearly represented. The former include primarily the collagens, above all collagen II alpha 1, by far the most highly expressed gene in the fetal growth plate. After the ribosomal proteins, the collagens, at about 10 % of all evaluable sequences, form the second-largest gene group in the expression profile. 
Proteoglycans and other weakly expressed regulatory elements, such as transcription factors, were found only in very low copy numbers in the EST project because of the low coverage. However, the EST analysis brought to light several interesting, previously unknown transcripts, which were examined in more detail. These include transcripts that could be assigned to LOC618319. In addition to the three exon regions described so far, a further exon was identified in the 3' UTR. In the deduced protein, which is at least 121 amino acids long, a signal peptide and a transmembrane domain were detected. Together with a possible glycosylation, the gene product can be classified among the proteoglycans. Its structure differs slightly from the typical structures of bone- and cartilage-specific proteoglycans, and a possible function of this gene product in the interaction with integrins and in cell-cell interaction, but also in signal transduction, is conceivable. EST sequencing of about 4000 cDNA clones can, however, generally cover only a fraction of the possible transcripts of the tissue studied. The new "next-generation sequencing" technologies offer entirely new possibilities for sequencing and analyzing complete transcriptomes at very high coverage. To support the EST data and to substantially broaden the data basis, the transcriptome of the bovine fetal growth plate was sequenced using both Roche 454/FLX and Illumina/Solexa technology. In evaluating the roughly 40,000 454 sequences and 75 million Illumina sequences, procedures were established for general handling, quality control, clustering, annotation and quantitative evaluation of large amounts of sequence data. Comparison of the high-throughput BLAST analyses in the classical read-count approach with the EST expression profile showed good agreement. 
Deviations between the individual methods could not be explained methodologically in all cases. In some cases, correlations between transcript length and read distribution are apparent. Although even simple methods such as normalization to RPKM (reads per kilobase of transcript per million mappable reads) improve interpretation, systematic errors arising from the sequencing method itself could not always be eliminated. Appropriate normalization of the data is therefore particularly important when comparing datasets generated in different ways. The results from the various analyses discussed here show the new sequencing technologies to be a good complement to, and a potential replacement for, established methods of gene expression analysis.
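The RPKM normalization named in this abstract (reads per kilobase of transcript per million mappable reads) can be written out as a short sketch. The formula follows the standard definition; the function name and the example numbers are hypothetical and do not come from the study.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads.

    RPKM = reads / (length in kb * total mapped reads in millions)
         = reads * 1e9 / (length in bp * total mapped reads)
    """
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total mapped reads must be positive")
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


# Made-up example: 500 reads on a 2,000 bp transcript in a library
# of 10 million mapped reads.
print(rpkm(500, 2000, 10_000_000))  # 25.0
```

Dividing by transcript length addresses the length dependence noted above, since longer transcripts collect more reads at equal expression, but it does not remove platform-specific biases, which is consistent with the abstract's remark that systematic errors remained.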

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Kiwifruit (genus Actinidia) is an important horticultural crop grown in temperate regions. The world's four largest producers are China, Italy, New Zealand and Chile. More than 50 species are recognized in the genus, but the principal species in cultivation are A. deliciosa and A. chinensis. In Italy, as in many other countries, the kiwifruit crop has been considered relatively disease free, and therefore no certification system has been developed for this species to regulate the importation of propagation plant material into the European Union. In recent years a number of fungal and bacterial pathogens have been recorded, such as Botrytis cinerea and Pseudomonas syringae pv. actinidiae. Since 2003, several viruses and virus-like diseases have been identified, and more recent studies have demonstrated that Actinidia spp. can be infected by a wide range of viral agents. In collaboration with the University of Auckland, we have detected thirteen different viral species in kiwifruit plants. During the three years of my PhD I worked on the characterization of Cucumber mosaic virus (CMV) and Pelargonium zonate spot virus (PZSV). The causal agents were determined on the basis of host range, symptom expression in test plant species, morphological properties of the virus particles observed by transmission electron microscopy (TEM), and reverse transcription-polymerase chain reaction (RT-PCR) with specific oligonucleotide primers. Both viruses induced several symptoms on kiwifruit plants. Moreover, with new technologies such as high-throughput sequencing we detected additional viruses: a new member of the family Closteroviridae and a new member of the family Totiviridae. Taken together, the results of my studies make clear that, to minimize the risk of serious viral disease in kiwifruit, it is vital to use virus-free propagation material and so prevent the spread of these viruses.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The investigation of the phylogenetic diversity and functionality of complex microbial communities in relation to changes in environmental conditions represents a major challenge of microbial ecology research. Nowadays, particular attention is paid to microbial communities occurring at environmental sites contaminated by recalcitrant and toxic organic compounds. Extensive research has shown that such communities evolve metabolic abilities leading to the partial degradation or complete mineralization of the contaminants. Determination of such biodegradation potential can be the starting point for the development of cost-effective biotechnological processes for the bioremediation of contaminated matrices. This work showed how metagenomics-based microbial ecology investigations supported the choice or the development of three different bioremediation strategies. First, PCR-DGGE and PCR-cloning approaches were used for the molecular characterization of microbial communities enriched through sequential development stages of an aerobic cometabolic process for the treatment of groundwater contaminated by chlorinated aliphatic hydrocarbons inside an immobilized-biomass packed bed bioreactor (PBR). In this case the analyses revealed homogeneous growth and structure of immobilized communities throughout the PBR and the occurrence of dominant microbial phylotypes of the genera Rhodococcus, Comamonas and Acidovorax, which probably drive the biodegradation process. The same molecular approaches were employed to characterize sludge microbial communities selected and enriched during the treatment of municipal wastewater coupled with the production of polyhydroxyalkanoates (PHA). Known PHA-accumulating microorganisms identified were affiliated with the genera Zoogloea, Acidovorax and Hydrogenophaga. 
Finally, the molecular investigation concerned communities of polycyclic aromatic hydrocarbon (PAH) contaminated soil subjected to rhizoremediation with willow roots or fertilization-based treatments. The metabolic ability to biodegrade naphthalene, as a representative model for PAH, was assessed by means of stable isotope probing in combination with high-throughput sequencing analysis. The phylogenetic diversity of microbial populations able to derive carbon from naphthalene was evaluated as a function of the type of treatment.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

In the present work, a region of the sex-determining chromosome region of C. tentans, the "contig SDR", about 87 kb in size, was studied. To construct the contig, 8 BAC clones were isolated from C. tentans, partially subcloned and sequenced. Within the contig SDR, a total of 13 genes were identified, as well as part of the gene rpS5-like in the distal region of the contig. These are the same genes that had already been identified in the contig SDR of C. thummi. A comparison of the two contigs shows that the gene order is identical in the two species C. thummi and C. tentans. In addition, six regions containing repetitive or transposable elements were located in the contig SDR of C. tentans. A comparison of the larval transcripts (L4 stage) of 11 genes of the contig SDR from C. thummi ♂ and C. thummi ♀ by RT-PCR and high-throughput sequencing has so far revealed no further sex-specific differences, with the exception of the genes luc7(p)-like and fs(1)K10-like. In the gene luc7(p)-like, an alternatively spliced intron within what is actually exon 2 was identified in C. thummi ♀ and C. piger ♀. For fs(1)K10-like, a duplication of the gene was detected in C. thummi ♂. Furthermore, the 5' UTR and 3' UTR regions of the transcripts were analyzed using RACE. This identified differential splice products, which, however, do not occur in a sex-specific manner. Bioinformatic analysis of the proteins encoded by the SDR genes for conserved domains reveals four proteins that may act as transcription or splicing factors. These are the proteins of the genes mi-er1-like, luc7(p)-like, polyhomeotic-like and rpn5-like. For the gene product of fs(1)K10-like, which came into focus because of the male-specific duplication in C. thummi, no domains could be predicted. It therefore cannot be said whether the protein might have a function in sex determination. The sequence identity of the deduced amino acid sequence between the two species C. tentans and C. thummi is 93.3 % for all genes except mi-er1-like (87.6 %).

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Analysis of the development of recent nucleic acid sequencing methodologies, with particular attention to NGS and high-throughput sequencing methods.