912 resultados para sequencing error
Resumo:
This work develops a computational approach for boundary and initial-value problems by using operational matrices, in order to run an evolutive process in a Hilbert space. Besides, upper bounds for errors in the solutions and in their derivatives can be estimated providing accuracy measures.
Resumo:
Background: Black pepper (Piper nigrum L.) is one of the most popular spices in the world. It is used in cooking and the preservation of food and even has medicinal properties. Losses in production from disease are a major limitation in the culture of this crop. The major diseases are root rot and foot rot, which are results of root infection by Fusarium solani and Phytophtora capsici, respectively. Understanding the molecular interaction between the pathogens and the host's root region is important for obtaining resistant cultivars by biotechnological breeding. Genetic and molecular data for this species, though, are limited. In this paper, RNA-Seq technology has been employed, for the first time, to describe the root transcriptome of black pepper. Results: The root transcriptome of black pepper was sequenced by the NGS SOLiD platform and assembled using the multiple-k method. Blast2Go and orthoMCL methods were used to annotate 10338 unigenes. The 4472 predicted proteins showed about 52% homology with the Arabidopsis proteome. Two root proteomes identified 615 proteins, which seem to define the plant's root pattern. Simple-sequence repeats were identified that may be useful in studies of genetic diversity and may have applications in biotechnology and ecology. Conclusions: This dataset of 10338 unigenes is crucially important for the biotechnological breeding of black pepper and the ecogenomics of the Magnoliids, a major group of basal angiosperms.
Resumo:
A survey of Microsporum gypseum was conducted in soil samples in different geographical regions of Brazil. The isolation of dermatophyte from soil samples was performed by hair baiting technique and the species were identified by morphology studies. We analyzed 692 soil samples and the recuperating rate was 19.2%. The activities of keratinase and elastase were quantitatively performed in 138 samples. The sequencing of the ITS region of rDNA was performed in representatives samples. M. gypseum isolates showed significant quantitative differences in the expression of both keratinase and elastase, but no significant correlation was observed between these enzymes. The sequencing of the representative samples revealed the presence of two teleomorphic species of M. gypseum (Arthroderma gypseum and A. incurvatum). The enzymatic activities may play an important role in the pathogenicity and a probable adaptation of this fungus to the animal parasitism. Using the phenotypical and molecular analysis, the Microsporum identification and their teleomorphic states will provide a useful and reliable identification system.
Resumo:
Estimates of evapotranspiration on a local scale is important information for agricultural and hydrological practices. However, equations to estimate potential evapotranspiration based only on temperature data, which are simple to use, are usually less trustworthy than the Food and Agriculture Organization (FAO)Penman-Monteith standard method. The present work describes two correction procedures for potential evapotranspiration estimates by temperature, making the results more reliable. Initially, the standard FAO-Penman-Monteith method was evaluated with a complete climatologic data set for the period between 2002 and 2006. Then temperature-based estimates by Camargo and Jensen-Haise methods have been adjusted by error autocorrelation evaluated in biweekly and monthly periods. In a second adjustment, simple linear regression was applied. The adjusted equations have been validated with climatic data available for the Year 2001. Both proposed methodologies showed good agreement with the standard method indicating that the methodology can be used for local potential evapotranspiration estimates.
Resumo:
The scope of this study was to estimate calibrated values for dietary data obtained by the Food Frequency Questionnaire for Adolescents (FFQA) and illustrate the effect of this approach on food consumption data. The adolescents were assessed on two occasions, with an average interval of twelve months. In 2004, 393 adolescents participated, and 289 were then reassessed in 2005. Dietary data obtained by the FFQA were calibrated using the regression coefficients estimated from the average of two 24-hour recalls (24HR) of the subsample. The calibrated values were similar to the the 24HR reference measurement in the subsample. In 2004 and 2005 a significant difference was observed between the average consumption levels of the FFQA before and after calibration for all nutrients. With the use of calibrated data the proportion of schoolchildren who had fiber intake below the recommended level increased. Therefore, it is seen that calibrated data can be used to obtain adjusted associations due to reclassification of subjects within the predetermined categories.
Resumo:
Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction.
Resumo:
Workplace accidents involving machines are relevant for their magnitude and their impacts on worker health. Despite consolidated critical statements, explanation centered on errors of operators remains predominant with industry professionals, hampering preventive measures and the improvement of production-system reliability. Several initiatives were adopted by enforcement agencies in partnership with universities to stimulate production and diffusion of analysis methodologies with a systemic approach. Starting from one accident case that occurred with a worker who operated a brake-clutch type mechanical press, the article explores cognitive aspects and the existence of traps in the operation of this machine. It deals with a large-sized press that, despite being endowed with a light curtain in areas of access to the pressing zone, did not meet legal requirements. The safety devices gave rise to an illusion of safety, permitting activation of the machine when a worker was still found within the operational zone. Preventive interventions must stimulate the tailoring of systems to the characteristics of workers, minimizing the creation of traps and encouraging safety policies and practices that replace judgments of behaviors that participate in accidents by analyses of reasons that lead workers to act in that manner.
Translocation capture sequencing: A method for high throughput mapping of chromosomal rearrangements
Resumo:
Chromosomal translocations require formation and joining of DNA double strand breaks (DSBs). These events disrupt the integrity of the genome and are involved in producing leukemias, lymphomas and sarcomas. Translocations are frequent, clonal and recurrent in mature B cell lymphomas, which bear a particularly high DNA damage burden by virtue of activation-induced cytidine deaminase (AID) expression. Despite the ubiquity of genomic rearrangements, the forces that underlie their genesis are not well understood. Here, we provide a detailed description of a new method for studying these events, translocation capture sequencing (TC-Seq). TC-Seq provides the means to document chromosomal rearrangements genome-wide in primary cells, and to discover recombination hotspots. Demonstrating its effectiveness, we successfully estimate the frequency of c-myc/IgH translocations in primary B cells, and identify hotspots of AID-mediated recombination. Furthermore. TC-Seq can be adapted to generate genome-wide rearrangement maps in any cell type and under any condition. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
Abstract Background An important challenge for transcript counting methods such as Serial Analysis of Gene Expression (SAGE), "Digital Northern" or Massively Parallel Signature Sequencing (MPSS), is to carry out statistical analyses that account for the within-class variability, i.e., variability due to the intrinsic biological differences among sampled individuals of the same class, and not only variability due to technical sampling error. Results We introduce a Bayesian model that accounts for the within-class variability by means of mixture distribution. We show that the previously available approaches of aggregation in pools ("pseudo-libraries") and the Beta-Binomial model, are particular cases of the mixture model. We illustrate our method with a brain tumor vs. normal comparison using SAGE data from public databases. We show examples of tags regarded as differentially expressed with high significance if the within-class variability is ignored, but clearly not so significant if one accounts for it. Conclusion Using available information about biological replicates, one can transform a list of candidate transcripts showing differential expression to a more reliable one. Our method is freely available, under GPL/GNU copyleft, through a user friendly web-based on-line tool or as R language scripts at supplemental web-site.
Resumo:
Abstract Background One goal of gene expression profiling is to identify signature genes that robustly distinguish different types or grades of tumors. Several tumor classifiers based on expression profiling have been proposed using microarray technique. Due to important differences in the probabilistic models of microarray and SAGE technologies, it is important to develop suitable techniques to select specific genes from SAGE measurements. Results A new framework to select specific genes that distinguish different biological states based on the analysis of SAGE data is proposed. The new framework applies the bolstered error for the identification of strong genes that separate the biological states in a feature space defined by the gene expression of a training set. Credibility intervals defined from a probabilistic model of SAGE measurements are used to identify the genes that distinguish the different states with more reliability among all gene groups selected by the strong genes method. A score taking into account the credibility and the bolstered error values in order to rank the groups of considered genes is proposed. Results obtained using SAGE data from gliomas are presented, thus corroborating the introduced methodology. Conclusion The model representing counting data, such as SAGE, provides additional statistical information that allows a more robust analysis. The additional statistical information provided by the probabilistic model is incorporated in the methodology described in the paper. The introduced method is suitable to identify signature genes that lead to a good separation of the biological states using SAGE and may be adapted for other counting methods such as Massive Parallel Signature Sequencing (MPSS) or the recent Sequencing-By-Synthesis (SBS) technique. Some of such genes identified by the proposed method may be useful to generate classifiers.
Resumo:
Abstract Background Leishmania (Leishmania) amazonensis infection in man results in a clinical spectrum of disease manifestations ranging from cutaneous to mucosal or visceral involvement. In the present study, we have investigated the genetic variability of 18 L. amazonensis strains isolated in northeastern Brazil from patients with different clinical manifestations of leishmaniasis. Parasite DNA was analyzed by sequencing of the ITS flanking the 5.8 S subunit of the ribosomal RNA genes, by RAPD and SSR-PCR and by PFGE followed by hybridization with gene-specific probes. Results ITS sequencing and PCR-based methods revealed genetic heterogeneity among the L. amazonensis isolates examined and molecular karyotyping also showed variation in the chromosome size of different isolates. Unrooted genetic trees separated strains into different groups. Conclusion These results indicate that L. amazonensis strains isolated from leishmaniasis patients from northeastern Brazil are genetically diverse, however, no correlation between genetic polymorphism and phenotype were found.
Resumo:
Abstract Background From shotgun libraries used for the genomic sequencing of the phytopathogenic bacterium Xanthomonas axonopodis pv. citri (XAC), clones that were representative of the largest possible number of coding sequences (CDSs) were selected to create a DNA microarray platform on glass slides (XACarray). The creation of the XACarray allowed for the establishment of a tool that is capable of providing data for the analysis of global genome expression in this organism. Findings The inserts from the selected clones were amplified by PCR with the universal oligonucleotide primers M13R and M13F. The obtained products were purified and fixed in duplicate on glass slides specific for use in DNA microarrays. The number of spots on the microarray totaled 6,144 and included 768 positive controls and 624 negative controls per slide. Validation of the platform was performed through hybridization of total DNA probes from XAC labeled with different fluorophores, Cy3 and Cy5. In this validation assay, 86% of all PCR products fixed on the glass slides were confirmed to present a hybridization signal greater than twice the standard deviation of the deviation of the global median signal-to-noise ration. Conclusions Our validation of the XACArray platform using DNA-DNA hybridization revealed that it can be used to evaluate the expression of 2,365 individual CDSs from all major functional categories, which corresponds to 52.7% of the annotated CDSs of the XAC genome. As a proof of concept, we used this platform in a previously work to verify the absence of genomic regions that could not be detected by sequencing in related strains of Xanthomonas.
Resumo:
Background Genotyping of hepatitis C virus (HCV) has become an essential tool for prognosis and prediction of treatment duration. The aim of this study was to compare two HCV genotyping methods: reverse hybridization line probe assay (LiPA v.1) and partial sequencing of the NS5B region. Methods Plasma of 171 patients with chronic hepatitis C were screened using both a commercial method (LiPA HCV Versant, Siemens, Tarrytown, NY, USA) and different primers targeting the NS5B region for PCR amplification and sequencing analysis. Results Comparison of the HCV genotyping methods showed no difference in the classification at the genotype level. However, a total of 82/171 samples (47.9%) including misclassification, non-subtypable, discrepant and inconclusive results were not classified by LiPA at the subtype level but could be discriminated by NS5B sequencing. Of these samples, 34 samples of genotype 1a and 6 samples of genotype 1b were classified at the subtype level using sequencing of NS5B. Conclusions Sequence analysis of NS5B for genotyping HCV provides precise genotype and subtype identification and an accurate epidemiological representation of circulating viral strains.
Resumo:
The performance of an anaerobic sequencing-batch biofilm reactor (ASBBR- laboratory scale- 14L )containing biomass immobilized on coal was evaluated for the removal of elevated concentrations of sulfate (between 200 and 3,000 mg SO4-2·L-1) from industrial wastewater effluents. The ASBBR was shown to be efficient for removal of organic material (between 90% and 45%) and sulfate (between 95% and 85%). The microbiota adhering to the support medium was analyzed by amplified ribosomal DNA restriction analysis (ARDRA). The ARDRA profiles for the Bacteria and Archaea domains proved to be sensitive for the determination of microbial diversity and were consistent with the physical-chemical monitoring analysis of the reactor. At 3,000 mg SO4-2·L-1, there was a reduction in the microbial diversity of both domains and also in the removal efficiencies of organic material and sulfate.
Resumo:
A survey of Microsporum gypseum was conducted in soil samples in different geographical regions of Brazil. The isolation of dermatophyte from soil samples was performed by hair baiting technique and the species were identified by morphology studies. We analyzed 692 soil samples and the recuperating rate was 19.2%. The activities of keratinase and elastase were quantitatively performed in 138 samples. The sequencing of the ITS region of rDNA was performed in representatives samples. M. gypseum isolates showed significant quantitative differences in the expression of both keratinase and elastase, but no significant correlation was observed between these enzymes. The sequencing of the representative samples revealed the presence of two teleomorphic species of M. gypseum (Arthroderma gypseum and A. incurvatum). The enzymatic activities may play an important role in the pathogenicity and a probable adaptation of this fungus to the animal parasitism. Using the phenotypical and molecular analysis, the Microsporum identification and their teleomorphic states will provide a useful and reliable identification system.