948 results for Illumina sequencing
Abstract:
Background: The capacity of European pear fruit (Pyrus communis L.) to ripen after harvest develops during the final stages of growth on the tree. The objective of this study was to characterize changes in 'Bartlett' pear fruit physico-chemical properties and transcription profiles during fruit maturation leading to attainment of ripening capacity. Results: The softening response of pear fruit held for 14 days at 20°C after harvest depended on their maturity. We identified four maturity stages: S1 (failed to soften) and S2 (displayed partial softening), with or without ethylene (ET) treatment; S3 (able to soften following ET); and S4 (able to soften without ET). Illumina sequencing and Trinity assembly generated 68,010 unigenes (mean length of 911 bp), of which 32.8% were annotated to the RefSeq plant database. Higher numbers of differentially expressed transcripts were recorded in the S3-S4 and S1-S2 transitions (2805 and 2505 unigenes, respectively) than in the S2-S3 transition (2037 unigenes). High expression of genes putatively encoding pectin degradation enzymes in the S1-S2 transition suggests that pectic oligomers may be involved as early signals triggering the transition to ethylene responsiveness in pear fruit. Moreover, the co-expression of these genes with expansins (Exps) suggests that they collaborate in modifying the cell wall polysaccharide networks required for fruit growth. K-means cluster analysis revealed that auxin signaling-associated transcripts were enriched in cluster K6, which showed the highest gene expression at S3. AP2/EREBP (APETALA 2/ethylene response element binding protein) and bHLH (basic helix-loop-helix) transcripts were enriched in all three transitions (S1-S2, S2-S3, and S3-S4). Several members of Aux/IAA (Auxin/indole-3-acetic acid), ARF (Auxin response factors), and WRKY appeared to play an important role in orchestrating the S2-S3 transition.
Conclusions: We identified maturity stages associated with the development of ripening capacity in 'Bartlett' pear, and described the transcription profile of fruit at these stages. Our findings suggest that auxin is essential in regulating the transition of pear fruit from being ethylene-unresponsive (S2) to ethylene-responsive (S3), resulting in fruit softening. The transcriptome will be helpful for future studies about specific developmental pathways regulating the transition to ripening. © 2015 Nham et al.
Abstract:
Microorganisms in the plant rhizosphere, the zone under the influence of roots, and phyllosphere, the aboveground plant habitat, exert a strong influence on plant growth, health, and protection. Tomatoes and cucumbers are important players in produce safety, and the microbial life on their surfaces may contribute to their fitness as hosts for foodborne pathogens such as Salmonella enterica and Listeria monocytogenes. External factors such as agricultural inputs and environmental conditions likely also play a major role. However, the relative contributions of the various factors at play concerning the plant surface microbiome remain obscure, although this knowledge could be applied to crop protection from plant and human pathogens. Recent advances in genomic technology have made investigations into the diversity and structure of microbial communities possible in many systems and at multiple scales. Using Illumina sequencing to profile particular regions of the 16S rRNA gene, this study investigates the influences of climate and crop management practices on the field-grown tomato and cucumber microbiome. The first research chapter (Chapter 3) involved application of 4 different soil amendments to a tomato field and profiling of harvest-time phyllosphere and rhizosphere microbial communities. Factors such as water activity, soil texture, and field location influenced microbial community structure more than soil amendment use, indicating that field conditions may exert more influence on the tomato microbiome than certain agricultural inputs. In Chapter 4, the impact of rain on tomato and cucumber-associated microbial community structures was evaluated. Shifts in bacterial community composition and structure were recorded immediately following rain events, an effect which was partially reversed after 4 days and was strongest on cucumber fruit surfaces. 
Chapter 5 focused on the contribution of insect visitors to the tomato microbiota, finding that insects introduced diverse bacterial taxa to the blossom and green tomato fruit microbiome. This study advances our understanding of the factors that influence the microbiomes of tomato and cucumber. Farms are complex environments, and untangling the interactions between farming practices, the environment, and microbial diversity will help us develop a comprehensive understanding of how microbial life, including foodborne pathogens, may be influenced by agricultural conditions.
Abstract:
The complete genome sequence of bovine papillomavirus 2 (BPV2) from the Brazilian Amazon region was determined using multiple-primed rolling circle amplification followed by Illumina sequencing. The genome is 7,947 bp long, with 45.9% GC content. It encodes seven early (E1, E2, E4, E5, E6, E7, and E8) and two late (L1 and L2) genes. A complete BPV2 genome can support future studies, since this BPV type is widely reported worldwide despite the lack of available complete genome sequences.
Abstract:
Globally, peatlands occupy a small portion of terrestrial land area but contain up to one-third of all soil organic carbon. This carbon pool is vulnerable to increased decomposition under projected climate change scenarios but little is known about how plant functional groups will influence microbial communities responsible for regulating carbon cycling processes. Here we examined initial shifts in microbial community structure within two sampling depths under plant functional group manipulations in mesocosms of an oligotrophic bog. Microbial community composition for bacteria and archaea was characterized using targeted 16S rRNA Illumina gene sequencing. We found statistically distinct spatial patterns between the more shallow 10-20 cm sampling depth and the deeper 30-40 cm depth. Significant effects by plant functional groups were found only within the 10-20 cm depth, indicating plant-mediated microbial community shifts respond more quickly near the peat surface. Specifically, the relative abundance of Acidobacteria decreased under ericaceous shrub treatments in the 10-20 cm depth and was replaced by increased abundance of Gammaproteobacteria and Bacteroidetes. In contrast, the sedge rhizosphere continued to be dominated by Acidobacteria but also promoted an increase in the relative recovery of Alphaproteobacteria and Verrucomicrobia. These initial results suggest microbial communities under ericaceous shrubs may be limited by anaerobic soil conditions accompanying high water table conditions, while sedge aerenchyma may be promoting aerobic taxa in the upper peat rhizosphere regardless of ambient soil oxygen limitations.
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Abstract:
With the advent of high-throughput sequencing (HTS), the emerging science of metagenomics is transforming our understanding of the relationships of microbial communities with their environments. While metagenomics aims to catalogue the genes present in a sample, metatranscriptomics, by assessing which genes are actively expressed, can provide a mechanistic understanding of community inter-relationships. To achieve these goals, several challenges need to be addressed, from sample preparation to sequence processing, statistical analysis and functional annotation. Here we use an inbred non-obese diabetic (NOD) mouse model, in which germ-free animals were colonized with a defined mixture of eight commensal bacteria, to explore methods of RNA extraction and to develop a pipeline for the generation and analysis of metatranscriptomic data. Applying the Illumina HTS platform, we sequenced 12 NOD cecal samples prepared using multiple RNA-extraction protocols. The absence of a complete set of reference genomes necessitated a peptide-based search strategy. Up to 16% of sequence reads could be matched to a known bacterial gene. Phylogenetic analysis of the mapped ORFs revealed a distribution consistent with ribosomal RNA, the majority from Bacteroides or Clostridium species. To place these HTS data within a systems context, we mapped the relative abundance of corresponding Escherichia coli homologs onto metabolic and protein-protein interaction networks. These maps identified bacterial processes with components that were well represented in the datasets. In summary, this study highlights the potential of exploiting the economy of HTS platforms for metatranscriptomics.
Abstract:
Funding: This work was supported by the HADEEP projects, funded by the Nippon Foundation, Japan (2009765188), the Natural Environment Research Council, UK (NE/E007171/1) and the Total Foundation, France. We acknowledge additional support from the Marine Alliance for Science and Technology for Scotland (MASTS) funded by the Scottish Funding Council (Ref: HR09011) and contributing institutions. We also acknowledge support from the Leverhulme Trust to SBP. Additional sea time was supported by NIWA’s ‘Impact of Resource Use on Vulnerable Deep-Sea Communities’ project (CO1_0906).
Abstract:
Chromatin immunoprecipitation (ChIP) provides a means of enriching DNA associated with transcription factors, histone modifications, and indeed any other proteins for which suitably characterized antibodies are available. Over the years, sequence detection has progressed from quantitative real-time PCR and Southern blotting to microarrays (ChIP-chip) and now high-throughput sequencing (ChIP-seq). This progression has vastly increased the sequence coverage and data volumes generated. This in turn has enabled informaticians to predict the identity of multi-protein complexes on DNA based on the overrepresentation of sequence motifs in DNA enriched by ChIP with a single antibody against a single protein. In the course of the development of high-throughput sequencing, little has changed in the ChIP methodology until recently. In the last three years, a number of modifications have been made to the ChIP protocol with the goal of enhancing the sensitivity of the method and further reducing the levels of nonspecific background sequences in ChIPped samples. In this chapter, we provide a brief commentary on these methodological changes and describe a detailed ChIP-exo method able to generate narrower peaks and greater peak coverage from ChIPped material.
Abstract:
BACKGROUND: Ultra high throughput sequencing (UHTS) technologies find an important application in targeted resequencing of candidate genes or of genomic intervals from genetic association studies. Despite the extraordinary power of these new methods, they are still rarely used in routine analysis of human genomic variants, in part because of the absence of specific standard procedures. The aim of this work is to provide human molecular geneticists with a tool to evaluate the best UHTS methodology for efficiently detecting DNA changes, from common SNPs to rare mutations. METHODOLOGY/PRINCIPAL FINDINGS: We tested the three most widespread UHTS platforms (Roche/454 GS FLX Titanium, Illumina/Solexa Genome Analyzer II and Applied Biosystems/SOLiD System 3) on a well-studied region of the human genome containing many polymorphisms and a very rare heterozygous mutation located within an intronic repetitive DNA element. We identify the strengths and limitations of each platform and describe some peculiarities of UHTS in resequencing projects. CONCLUSIONS/SIGNIFICANCE: When appropriate filtering and mapping procedures are applied, UHTS technology can be safely and efficiently used as a tool for targeted detection of human DNA variation. Unless particular, platform-dependent characteristics are needed for specific projects, the most relevant parameter to consider in mainstream human genome resequencing procedures is the cost per sequenced base pair associated with each machine.
Abstract:
BACKGROUND: Solexa/Illumina short-read ultra-high throughput DNA sequencing technology produces millions of short tags (up to 36 bases) by parallel sequencing-by-synthesis of DNA colonies. The processing and statistical analysis of such high-throughput data pose new challenges; currently a fair proportion of the tags are routinely discarded due to an inability to match them to a reference sequence, thereby reducing the effective throughput of the technology. RESULTS: We propose a novel base calling algorithm using model-based clustering and probability theory to identify ambiguous bases and code them with IUPAC symbols. We also select optimal sub-tags using a score based on information content to remove uncertain bases towards the ends of the reads. CONCLUSION: We show that the method improves genome coverage and number of usable tags as compared with Solexa's data processing pipeline by an average of 15%. An R package is provided which allows fast and accurate base calling of Solexa's fluorescence intensity files and the production of informative diagnostic plots.
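The two ideas in this abstract, coding uncertain bases with IUPAC symbols and trimming reads to an informative sub-tag, can be illustrated with a minimal Python sketch. This is not the authors' algorithm or their R package: the per-base probabilities are assumed to be given (the abstract's model-based clustering step is not reproduced), and the function names, the `cutoff` of 0.2, and the `min_info` threshold of 1 bit are hypothetical choices made only for illustration.

```python
import math

# Standard IUPAC nucleotide ambiguity codes, keyed by the set of plausible bases.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C", frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y", frozenset("CG"): "S",
    frozenset("AT"): "W", frozenset("GT"): "K", frozenset("AC"): "M",
    frozenset("CGT"): "B", frozenset("AGT"): "D", frozenset("ACT"): "H",
    frozenset("ACG"): "V", frozenset("ACGT"): "N",
}

def call_base(probs, cutoff=0.2):
    """Return the IUPAC symbol covering every base with probability >= cutoff.

    `probs` maps each of A, C, G, T to a probability (assumed input).
    The most probable base is always included, so the set is never empty.
    """
    best = max(probs, key=probs.get)
    plausible = {b for b, p in probs.items() if p >= cutoff} | {best}
    return IUPAC[frozenset(plausible)]

def call_read(prob_rows, cutoff=0.2):
    """Call every position of a read and join the symbols into a string."""
    return "".join(call_base(row, cutoff) for row in prob_rows)

def information(probs):
    """Information content in bits: 2 minus the Shannon entropy of the position."""
    return 2.0 + sum(p * math.log2(p) for p in probs.values() if p > 0)

def best_subtag(prob_rows, min_info=1.0):
    """Return (start, end) of the contiguous run maximising sum(info - min_info).

    Positions carrying less than `min_info` bits count against the run, so
    uncertain bases at the read ends are trimmed (a Kadane-style scan).
    """
    best, best_score = (0, 0), 0.0
    start, score = 0, 0.0
    for i, row in enumerate(prob_rows):
        score += information(row) - min_info
        if score <= 0:
            score, start = 0.0, i + 1
        elif score > best_score:
            best_score, best = score, (start, i + 1)
    return best

# Toy read: two confident positions, one A/G ambiguity, one uninformative position.
toy_read = [
    {"A": 0.97, "C": 0.01, "G": 0.01, "T": 0.01},
    {"A": 0.01, "C": 0.97, "G": 0.01, "T": 0.01},
    {"A": 0.50, "C": 0.03, "G": 0.45, "T": 0.02},
    {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25},
]
```

On the toy read, `call_read(toy_read)` gives `"ACRN"` and `best_subtag(toy_read)` gives `(0, 2)`, meaning only the two confident leading bases are kept.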
Abstract:
Determination of the precise composition and variation of microbiota in cystic fibrosis lungs is crucial since chronic inflammation due to microorganisms leads to lung damage and ultimately, death. However, this constitutes a major technical challenge. Culturing of microorganisms does not provide a complete representation of a microbiota, even when using culturomics (high-throughput culture). So far, only PCR-based metagenomics has been investigated. However, these methods are biased towards certain microbial groups, and suffer from uncertain quantification of the different microbial domains. We have explored whole genome sequencing (WGS) using the Illumina high-throughput technology applied directly to DNA extracted from sputa obtained from two cystic fibrosis patients. To detect all microorganism groups, we used four procedures for DNA extraction, each with a different lysis protocol. We avoided biases due to whole DNA amplification thanks to the high efficiency of current Illumina technology. Phylogenomic classification of the reads by three different methods produced similar results. Our results suggest that WGS provides, in a single analysis, a better qualitative and quantitative assessment of microbiota compositions than cultures and PCRs. WGS identified a high quantity of Haemophilus spp. (patient 1) or Staphylococcus spp. plus Streptococcus spp. (patient 2) together with low amounts of anaerobic (Veillonella, Prevotella, Fusobacterium) and aerobic bacteria (Gemella, Moraxella, Granulicatella). WGS suggested that fungal members represented very low proportions of the microbiota, which were detected by cultures and PCRs because of the selectivity of those methods. Future increases in read length and decreases in cost should ensure the usefulness of WGS for the characterisation of microbiota.
Abstract:
Colorectal cancer (CRC) is the third most common cancer and the fourth leading cause of cancer death worldwide. About 85% of CRC cases are known to have chromosomal instability, an allelic imbalance at several chromosomal loci, and chromosome amplification and translocation. The aim of this study is to determine the recurrent copy number variant (CNV) regions present in stage II CRC through whole exome sequencing, a rapidly developing targeted next-generation sequencing (NGS) technology that provides an accurate alternative approach for assessing genomic variation. Forty-two normal-tumor paired samples were sequenced on an Illumina Genome Analyzer. Data were analyzed with Varscan2, and segmentation was performed with the R package R-GADA. The segments were summarized across all samples, and the result was overlapped with DEG data from the same samples obtained in a previous study by the group [1]. The major and most recurrent CNV segments were gains of chromosomes 7pq (13%), 13q (31%) and 20q (75%) and losses of 8p (25%), 17p (23%), and 18pq (27%). These results are consistent with the known literature on CNV in CRC and other cancers, but our methodology should be validated by array comparative genomic hybridisation (aCGH) profiling, which is currently the gold standard for genetic diagnosis of CNV.
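The recurrence percentages reported above correspond to counting, for each chromosome arm, the fraction of tumor samples that carry a gain or a loss. A minimal Python sketch of that summary step follows. It does not use Varscan2 or R-GADA, the `(sample, arm, state)` input layout and the function name `recurrence` are assumptions made for illustration, and the toy calls are invented data, not the study's results.

```python
from collections import defaultdict

def recurrence(calls, n_samples):
    """Percentage of samples carrying each (arm, state) event at least once.

    `calls` is an iterable of (sample_id, chromosome_arm, state) tuples, where
    state is "gain", "loss" or anything else (ignored, e.g. "neutral").
    A sample with several segments on the same arm is counted only once.
    """
    carriers = defaultdict(set)
    for sample, arm, state in calls:
        if state in ("gain", "loss"):
            carriers[(arm, state)].add(sample)
    return {event: 100.0 * len(samples) / n_samples
            for event, samples in carriers.items()}

# Invented toy calls for four hypothetical tumor samples T1..T4.
toy_calls = [
    ("T1", "20q", "gain"), ("T2", "20q", "gain"), ("T3", "20q", "gain"),
    ("T1", "8p", "loss"), ("T1", "8p", "loss"),   # two segments, one sample
    ("T2", "13q", "neutral"),                      # not a CNV event
]
toy_summary = recurrence(toy_calls, n_samples=4)
```

With these toy calls, `toy_summary` reports a 20q gain in 75% of samples and an 8p loss in 25%, and the neutral 13q segment is ignored.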
Abstract:
Objective: In Southern European countries up to one-third of the patients with hereditary hemochromatosis (HH) do not present the common HFE risk genotype. To investigate the molecular basis of these cases, we designed a gene panel for rapid and simultaneous analysis of 6 HH-related genes (HFE, TFR2, HJV, HAMP, SLC40A1 and FTL) by next-generation sequencing (NGS). Materials and Methods: Eighty-eight Portuguese patients with iron overload, negative for the common HFE mutations, were analysed. A TruSeq Custom Amplicon kit (TSCA, by Illumina) was designed to generate 97 amplicons covering exons, intron/exon junctions and UTRs of the mentioned genes, with a cumulative target sequence of 12,115 bp. Amplicons were sequenced on the MiSeq instrument (Illumina) using 250 bp paired-end reads. Sequences were aligned against the human genome reference hg19 using the alignment and variant-calling algorithms in the MiSeq Reporter software. Novel variants were validated by Sanger sequencing and their pathogenic significance was assessed by in silico studies. Results: We found a total of 55 different genetic variants. These include novel pathogenic missense and splicing variants (in HFE and TFR2), a very rare variant in the IRE of FTL, and a variant that creates a novel translation initiation codon in the HAMP gene, among others. Conclusion: The merging of TSCA methodology and NGS technology appears to be an appropriate tool for simultaneous and fast analysis of HH-related genes in a large number of samples. However, establishing the clinical relevance of NGS-detected variants for HH development remains a laborious task, requiring further functional studies.
Abstract:
The quality and speed of genome sequencing have advanced as the boundaries of technology have been pushed. This progress has so far been divided into three generations. First-generation methods enabled sequencing of clonal DNA populations. Second-generation methods massively increased throughput by parallelizing many reactions, while third-generation methods allow direct sequencing of single DNA molecules. The first techniques for sequencing DNA were not developed until the mid-1970s, when two distinct methods appeared almost simultaneously, one by Allan Maxam and Walter Gilbert and the other by Frederick Sanger. The first is a chemical method that cleaves DNA at specific points; the second uses ddNTPs to terminate the synthesis of a copy of the DNA template strand. Both methods generate fragments of varying lengths that are then separated by electrophoresis. Until the 1990s, DNA sequencing was relatively expensive and was seen as a slow process. The use of radiolabeled nucleotides compounded the problem by raising safety concerns and preventing automation. Advances within the first generation include the replacement of radioactive labels with fluorescently labeled ddNTPs and cycle sequencing with thermostable DNA polymerase, which allow automation and signal amplification and make the process cheaper, safer and faster. Another method is pyrosequencing, which is based on the “sequencing by synthesis” principle. It differs from Sanger sequencing in that it relies on detecting the pyrophosphate released on nucleotide incorporation. By the end of the last millennium, parallelization of this method launched next-generation sequencing (NGS), with 454 as the first of many methods that can process multiple samples; this is known as second-generation sequencing. Here electrophoresis was eliminated completely.
One method that is sometimes used is SOLiD, based on sequencing by ligation of fluorescently labeled di-base probes that compete to ligate to the sequencing primer. Specificity of the di-base probe is achieved by interrogating every first and second base in each ligation reaction. The widely used Solexa/Illumina method uses modified dNTPs containing so-called “reversible terminators”, which block further polymerization. The terminator also carries a fluorescent label, which can be detected by a camera. The last step before the third generation was taken by Ion Torrent, which developed a technique that is also based on “sequencing by synthesis”. Its main feature is the detection of the hydrogen ions released during base incorporation. The third generation, in turn, draws on advances in nanotechnology to process single DNA molecules, as in real-time sequencing-by-synthesis systems such as PacBio. Finally, nanopore sequencing, envisioned since 1995, uses nanosensors that form channels, derived from bacteria, which conduct the sample to a sensor that detects each nucleotide residue in the DNA strand. Technological progress has been so rapid that it makes one wonder: how do we imagine the next generation?