993 resultados para De novo peptide sequencing


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Pós-graduação em Medicina Veterinária - FMVZ

Relevância:

30.00% 30.00%

Publicador:

Resumo:

High throughput sequencing (HTS) provides new research opportunities for work on non-model organisms, such as differential expression studies between populations exposed to different environmental conditions. However, such transcriptomic studies first require the production of a reference assembly. The choice of sampling procedure, sequencing strategy and assembly workflow is crucial. To develop a reliable reference transcriptome for Triatoma brasiliensis, the major Chagas disease vector in Northeastern Brazil, different de novo assembly protocols were generated using various datasets and software. Both 454 and Illumina sequencing technologies were applied on RNA extracted from antennae and mouthparts from single or pooled individuals. The 454 library yielded 278 Mb. Fifteen Illumina libraries were constructed and yielded nearly 360 million RNA-seq single reads and 46 million RNA-seq paired-end reads for nearly 45 Gb. For the 454 reads, we used three assemblers, Newbler, CAP3 and/or MIRA and for the Illumina reads, the Trinity assembler. Ten assembly workflows were compared using these programs separately or in combination. To compare the assemblies obtained, quantitative and qualitative criteria were used, including contig length, N50, contig number and the percentage of chimeric contigs. Completeness of the assemblies was estimated using the CEGMA pipeline. The best assembly (57,657 contigs, completeness of 80 %, < 1 % chimeric contigs) was a hybrid assembly leading to recommend the use of (1) a single individual with large representation of biological tissues, (2) merging both long reads and short paired-end Illumina reads, (3) several assemblers in order to combine the specific advantages of each.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

High Throughput Sequencing capabilities have made the process of assembling a transcriptome easier, whether or not there is a reference genome. But the quality of a transcriptome assembly must be good enough to capture the most comprehensive catalog of transcripts and their variations, and to carry out further experiments on transcriptomics. There is currently no consensus on which of the many sequencing technologies and assembly tools are the most effective. Many non-model organisms lack a reference genome to guide the transcriptome assembly. One question, therefore, is whether or not a reference-based genome assembly gives better results than de novo assembly. The blood-sucking insect Rhodnius prolixus-a vector for Chagas disease-has a reference genome. It is therefore a good model on which to compare reference-based and de novo transcriptome assemblies. In this study, we compared de novo and reference-based genome assembly strategies using three datasets (454, Illumina, 454 combined with Illumina) and various assembly software. We developed criteria to compare the resulting assemblies: the size distribution and number of transcripts, the proportion of potentially chimeric transcripts, how complete the assembly was (completeness evaluated both through CEGMA software and R. prolixus proteome fraction retrieved). Moreover, we looked for the presence of two chemosensory gene families (Odorant-Binding Proteins and Chemosensory Proteins) to validate the assembly quality. The reference-based assemblies after genome annotation were clearly better than those generated using de novo strategies alone. Reference-based strategies revealed new transcripts, including new isoforms unpredicted by automatic genome annotation. However, a combination of both de novo and reference-based strategies gave the best result, and allowed us to assemble fragmented transcripts.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Snake venom proteomes/peptidomes are highly complex and maintenance of their integrity within the gland lumen is crucial for the expression of toxin activities. There has been considerable progress in the field of venom proteomics, however, peptidomics does not progress as fast, because of the lack of comprehensive venom sequence databases for analysis of MS data. Therefore, in many cases venom peptides have to be sequenced manually by MS/MS analysis or Edman degradation. This is critical for rare snake species, as is the case of Bothrops cotiara (BC) and B. fonsecai (BF), which are regarded as near threatened with extinction. In this study we conducted a comprehensive analysis of the venom peptidomes of BC, BF, and B. jararaca (BJ) using a combination of solid-phase extraction and reversed-phase HPLC to fractionate the peptides, followed by nano-liquid chromatography-tandem MS (LC-MS/MS) or direct infusion electrospray ionization-(ESI)-MS/MS or MALDI-MS/MS analyses. We detected marked differences in the venom peptidomes and identified peptides ranging from 7 to 39 residues in length by de novo sequencing. Forty-four unique sequences were manually identified, out of which 30 are new peptides, including 17 bradykinin-potentiating peptides, three poly-histidine-poly-glycine peptides and interestingly, 10 L-amino acid oxidase fragments. Some of the new bradykinin-potentiating peptides display significant bradykinin potentiating activity. Automated database search revealed fragments from several toxins in the peptidomes, mainly from L-amino acid oxidase, and allowed the determination of the peptide bond specificity of proteinases and amino acid occurrences for the P4-P4' sites. We also demonstrate that the venom lyophilization/resolubilization process greatly increases the complexity of the peptidome because of the imbalance caused to the venom proteome and the consequent activity of proteinases on venom components. The use of proteinase inhibitors clearly showed different outcomes in the peptidome characterization and suggested that degradomic-peptidomic analysis of snake venoms is highly sensitive to the conditions of sampling procedures. Molecular & Cellular Proteomics 11: 10.1074/mcp.M112.019331, 1245-1262, 2012.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: Black pepper (Piper nigrum L.) is one of the most popular spices in the world. It is used in cooking and the preservation of food and even has medicinal properties. Losses in production from disease are a major limitation in the culture of this crop. The major diseases are root rot and foot rot, which are results of root infection by Fusarium solani and Phytophtora capsici, respectively. Understanding the molecular interaction between the pathogens and the host's root region is important for obtaining resistant cultivars by biotechnological breeding. Genetic and molecular data for this species, though, are limited. In this paper, RNA-Seq technology has been employed, for the first time, to describe the root transcriptome of black pepper. Results: The root transcriptome of black pepper was sequenced by the NGS SOLiD platform and assembled using the multiple-k method. Blast2Go and orthoMCL methods were used to annotate 10338 unigenes. The 4472 predicted proteins showed about 52% homology with the Arabidopsis proteome. Two root proteomes identified 615 proteins, which seem to define the plant's root pattern. Simple-sequence repeats were identified that may be useful in studies of genetic diversity and may have applications in biotechnology and ecology. Conclusions: This dataset of 10338 unigenes is crucially important for the biotechnological breeding of black pepper and the ecogenomics of the Magnoliids, a major group of basal angiosperms.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Sea anemones are known to contain a wide diversity of biologically active peptides, mostly unexplored according to recent peptidomic and transcriptomic studies. In the present work, the neurotoxic fractions from the exudates of Stichodactyla helianthus and Bunodosoma granulifera were analyzed by reversed-phase chromatography and mass spectrometry. The first peptide fingerprints of these sea anemones were assessed, revealing the largest number of peptide components (156) so far found in sea anemone species, as well as the richer peptide diversity of B. granulifera in relation to S. helianthus. The transcriptomic analysis of B. granulifera, performed by massive cDNA sequencing with 454 pyrosequencing approach allowed the discovery of five new APETx-like peptides (U-AITX-Bg1a-e - including the full sequences of their precursors for four of them), which together with type 1 sea anemone sodium channel toxins constitute a very distinguishable feature of studied sea anemone species belonging to genus Bunodosoma. The molecular modeling of these new APETx-like peptides showed a distribution of positively charged and aromatic residues in putative contact surfaces as observed in other animal toxins. On the other hand, they also showed variable electrostatic potentials, thus suggesting a docking onto their targeted channels in different spatial orientations. Moreover several crab paralyzing toxins (other than U-AITX-Bg1a-e), which induce a variety of symptoms in crabs, were isolated. Some of them presumably belong to new classes of crab-paralyzing peptide toxins, especially those with molecular masses below 2 kDa, which represent the smallest peptide toxins found in sea anemones. (C) 2011 Elsevier Inc. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We report a 26-year-old female patient who was diagnosed within 4 years with chest sarcoma, lung adenocarcinoma, and breast cancer. While her family history was unremarkable, DNA sequencing of TP53 revealed a germline de novo non-sense mutation in exon 6 p.Arg213X. One year later, she further developed a contralateral ductal carcinoma in situ, and 18 months later a jaw osteosarcoma. This case illustrates the therapeutic pitfalls in the care of a young cancer patient with TP53 de novo germline mutations and the complications related to her first-line therapy. Suggestion is made to use the less stringent Chompret criteria for germline TP53 mutation screening. Our observation underlines the possibly negative effect of radiotherapy in generating second tumors in patients with a TP53 mutation. We also present a review of six previously reported cases, comparing their cancer phenotypes with those generally produced by TP53 mutations.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

With the advent of high through-put sequencing (HTS), the emerging science of metagenomics is transforming our understanding of the relationships of microbial communities with their environments. While metagenomics aims to catalogue the genes present in a sample through assessing which genes are actively expressed, metatranscriptomics can provide a mechanistic understanding of community inter-relationships. To achieve these goals, several challenges need to be addressed from sample preparation to sequence processing, statistical analysis and functional annotation. Here we use an inbred non-obese diabetic (NOD) mouse model in which germ-free animals were colonized with a defined mixture of eight commensal bacteria, to explore methods of RNA extraction and to develop a pipeline for the generation and analysis of metatranscriptomic data. Applying the Illumina HTS platform, we sequenced 12 NOD cecal samples prepared using multiple RNA-extraction protocols. The absence of a complete set of reference genomes necessitated a peptide-based search strategy. Up to 16% of sequence reads could be matched to a known bacterial gene. Phylogenetic analysis of the mapped ORFs revealed a distribution consistent with ribosomal RNA, the majority from Bacteroides or Clostridium species. To place these HTS data within a systems context, we mapped the relative abundance of corresponding Escherichia coli homologs onto metabolic and protein-protein interaction networks. These maps identified bacterial processes with components that were well-represented in the datasets. In summary this study highlights the potential of exploiting the economy of HTS platforms for metatranscriptomics.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

CONTEXT: Thyroid transcription factor 1 (TITF1/NKX2.1) is expressed in the thyroid, lung, ventral forebrain, and pituitary. In the lung, TITF1/NKX2.1 activates the expression of genes critical for lung development and function. Titf/Nkx2.1(-/-) mice have pituitary and thyroid aplasia but also impairment of pulmonary branching. Humans with heterozygous TITF1/NKX2.1 mutations present with various combinations of primary hypothyroidism, respiratory distress, and neurological disorders. OBJECTIVE: The objective of the study was to report clinical and molecular studies of the first patient with lethal neonatal respiratory distress from a novel heterozygous TITF1/NKX2.1 mutation. Participant: This girl, the first child of healthy nonconsanguineous French-Canadian parents, was born at 41 wk. Birth weight was 3,460 g and Apgar scores were normal. Soon after birth, she developed acute respiratory failure with pulmonary hypertension. At neonatal screening on the second day of life, TSH was 31 mU/liter (N <15) and total T(4) 245 nmol/liter (N = 120-350). Despite mechanical ventilation, thyroxine, surfactant, and pulmonary vasodilators, the patient died on the 40th day. RESULTS: Histopathology revealed pulmonary tissue with low alveolar counts. The thyroid was normal. Sequencing of the patient's lymphocyte DNA revealed a novel heterozygous TITF1/NKX2.1 mutation (I207F). This mutation was not found in either parent. In vitro, the mutant TITF-1 had reduced DNA binding and transactivation capacity. CONCLUSION: This is the first reported case of a heterozygous TITF1/NKX2.1 mutation leading to neonatal death from respiratory failure. The association of severe unexplained respiratory distress in a term neonate with mild primary hypothyroidism is the clue that led to the diagnosis.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Yeast prions are a group of non-Mendelian genetic elements transmitted as altered and self-propagating conformations. Extensive studies in the last decade have provided valuable information on the mechanisms responsible for yeast prion propagation. How yeast prions are formed de novo and what cellular factors are required for determining prion "strains" or variants--a single polypeptide capable of existing in multiple conformations to result in distinct heritable phenotypes--continue to defy our understanding. We report here that Sse1, the yeast ortholog of the mammalian heat-shock protein 110 (Hsp110) and a nucleotide exchange factor for Hsp70 proteins, plays an important role in regulating [PSI+] de novo formation and variant determination. Overproduction of the Sse1 chaperone dramatically enhanced [PSI+] formation whereas deletion of SSE1 severely inhibited it. Only an unstable weak [PSI+] variant was formed in SSE1 disrupted cells whereas [PSI+] variants ranging from very strong to very weak were formed in isogenic wild-type cells under identical conditions. Thus, Sse1 is essential for the generation of multiple [PSI+] variants. Mutational analysis further demonstrated that the physical association of Sse1 with Hsp70 but not the ATP hydrolysis activity of Sse1 is required for the formation of multiple [PSI+] variants. Our findings establish a novel role for Sse1 in [PSI+] de novo formation and variant determination, implying that the mammalian Hsp110 may likewise be involved in the etiology of protein-folding diseases.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Oligonucleotides comprising unnatural building blocks, which interfere with the translation machinery, have gained increased attention for the treatment of gene-related diseases (e.g. antisense, RNAi). Due to structural modifications, synthetic oligonucleotides exhibit increased biostability and bioavailability upon administration. Consequently, classical enzyme-based sequencing methods are not applicable to their sequence elucidation and verification. Tandem mass spectrometry is the method of choice for performing such tasks, since gas-phase dissociation is not restricted to natural nucleic acids. However, tandem mass spectrometric analysis can generate product ion spectra of tremendous complexity, as the number of possible fragments grows rapidly with increasing sequence length. The fact that structural modifications affect the dissociation pathways greatly increases the variety of analytically valuable fragment ions. The gas-phase dissociation of oligonucleotides is characterized by the cleavage of one of the four bonds along the phosphodiester chain, by the accompanying loss of nucleases, and by the generation of internal fragments due to secondary backbone cleavage. For example, an 18-mer oligonucleotide yields a total number of 272’920 theoretical fragment ions. In contrast to the processing of peptide product ion spectra, which nowadays is highly automated, there is a lack of tools assisting the interpretation of oligonucleotide data. The existing web-based and stand-alone software applications are primarily designed for the sequence analysis of natural nucleic acids, but do not account for chemical modifications and adducts. Consequently, we developed a software to support the interpretation of mass spectrometric data of natural and modified nucleic acids and their adducts with chemotherapeutic agents.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Epileptic encephalopathies are a phenotypically and genetically heterogeneous group of severe epilepsies accompanied by intellectual disability and other neurodevelopmental features. Using next-generation sequencing, we identified four different de novo mutations in KCNA2, encoding the potassium channel KV1.2, in six isolated patients with epileptic encephalopathy (one mutation recurred three times independently). Four individuals presented with febrile and multiple afebrile, often focal seizure types, multifocal epileptiform discharges strongly activated by sleep, mild to moderate intellectual disability, delayed speech development and sometimes ataxia. Functional studies of the two mutations associated with this phenotype showed almost complete loss of function with a dominant-negative effect. Two further individuals presented with a different and more severe epileptic encephalopathy phenotype. They carried mutations inducing a drastic gain-of-function effect leading to permanently open channels. These results establish KCNA2 as a new gene involved in human neurodevelopmental disorders through two different mechanisms, predicting either hyperexcitability or electrical silencing of KV1.2-expressing neurons.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: Tef (Eragrostis tef), an indigenous cereal critical to food security in the Horn of Africa, is rich in minerals and protein, resistant to many biotic and abiotic stresses and safe for diabetics as well as sufferers of immune reactions to wheat gluten. We present the genome of tef, the first species in the grass subfamily Chloridoideae and the first allotetraploid assembled de novo. We sequenced the tef genome for marker-assisted breeding, to shed light on the molecular mechanisms conferring tef's desirable nutritional and agronomic properties, and to make its genome publicly available as a community resource. Results: The draft genome contains 672 Mbp representing 87% of the genome size estimated from flow cytometry. We also sequenced two transcriptomes, one from a normalized RNA library and another from unnormalized RNASeq data. The normalized RNA library revealed around 38000 transcripts that were then annotated by the SwissProt group. The CoGe comparative genomics platform was used to compare the tef genome to other genomes, notably sorghum. Scaffolds comprising approximately half of the genome size were ordered by syntenic alignment to sorghum producing tef pseudo-chromosomes, which were sorted into A and B genomes as well as compared to the genetic map of tef. The draft genome was used to identify novel SSR markers, investigate target genes for abiotic stress resistance studies, and understand the evolution of the prolamin family of proteins that are responsible for the immune response to gluten. Conclusions: It is highly plausible that breeding targets previously identified in other cereal crops will also be valuable breeding targets in tef. The draft genome and transcriptome will be of great use for identifying these targets for genetic improvement of this orphan crop that is vital for feeding 50 million people in the Horn of Africa.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Next-generation sequencing (NGS) technology has become a prominent tool in biological and biomedical research. However, NGS data analysis, such as de novo assembly, mapping and variants detection is far from maturity, and the high sequencing error-rate is one of the major problems. . To minimize the impact of sequencing errors, we developed a highly robust and efficient method, MTM, to correct the errors in NGS reads. We demonstrated the effectiveness of MTM on both single-cell data with highly non-uniform coverage and normal data with uniformly high coverage, reflecting that MTM’s performance does not rely on the coverage of the sequencing reads. MTM was also compared with Hammer and Quake, the best methods for correcting non-uniform and uniform data respectively. For non-uniform data, MTM outperformed both Hammer and Quake. For uniform data, MTM showed better performance than Quake and comparable results to Hammer. By making better error correction with MTM, the quality of downstream analysis, such as mapping and SNP detection, was improved. SNP calling is a major application of NGS technologies. However, the existence of sequencing errors complicates this process, especially for the low coverage (

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Nowadays, there is a great amount of genomic and transcriptomic data available about forest species, including ambitious projects looking for complete sequencing and annotation of different gymnosperm genomes [1, 2]. Pinus canariensis is an endemic conifer of the Canary Islands with re-sprouting capability and resilience against fire and mechanical damage, as result of an adaptation to volcanic environments. Additionally, this species has a high proportion of axial parenchyma compared with other conifers, and this tissue connects with radial parenchyma allowing transport of reserves. The most internal tracheids stop accumulating water [3], and get filled of resins and polyphenols synthesized by the axial parenchyma; this is the so-called ?torch-heartwood? [4], which avoids decay. This wood achieves very high prices due to its particular resistance to rot. These features make P. canariensis an interesting model species for the analysis of these developmental processes in conifers. In this study we aim to perform a complete transcriptome annotation during xylogenesis in Pinus canariensis, using next-generation sequencing (NGS) -Roche 454 pyrosequencing-, in order to provide a genomic resource for further analysis, including expression profiling and the identification of candidate genes for important adaptive features.