974 resultados para RNA sequence analysis
Resumo:
Background: High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, denominated sequence tags. These techniques have improved our understanding about the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is made through a series of evaluations based on a defined rule set. S3T allows the identification/selection of tags, considered more reliable for further gene expression analysis. Results: This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed in samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidences suggesting that it is possible to find more congruous clusters after using S3T scoring system. Conclusion: These results substantiate the proposed application to generate more reliable data. This is a significant contribution for determination of global gene expression profiles. The library analysis with S3T is freely available at http://gdm.fmrp.usp.br/s3t/.S3T source code and datasets can also be downloaded from the aforementioned website.
Resumo:
Liver samples from rabbits killed by RHDV, collected from five States in Australia in 1996 and 1997 were analysed by RT-PCR. A 398 bp fragment of the capsid protein (VP60) gene was amplified by PCR and directly sequenced. The alignment of the nucleotide and amino acid sequences and their comparison with the original strain of the virus released in Australia indicated genetic changes after two years have been small with 98.2% to 100% identity. The constructed phylogenetic tree suggests slight differences in nucleotide substitutions in various States but there is no clear evidence of clustering of sequences according to their geographic origin. In practical terms, sequencing of viral RNA provides a means of testing the efficacy of further releases and subsequent spread of the virus if such a strategy is employed as a means of enhancing RHD as a biological control of the wild rabbit in Australia.
Resumo:
Dissertação de mestrado em Plant Molecular Biology, Biotechnology and Bioentrepreneurship
Resumo:
We improved, evaluated, and used Sanger sequencing for quantification of single nucleotide polymorphism (SNP) variants in transcripts and gDNA samples. This improved assay resulted in highly reproducible relative allele frequencies (e.g., for a heterozygous gDNA 50.0+/-1.4%, and for a missense mutation-bearing transcript 46.9+/-3.7%) with a lower detection limit of 3-9%. It provided excellent accuracy and linear correlation between expected and observed relative allele frequencies. This sequencing assay, which can also be used for the quantification of copy number variations (CNVs), methylations, mosaicisms, and DNA pools, enabled us to analyze transcripts of the FBN1 gene in fibroblasts and blood samples of patients with suspected Marfan syndrome not only qualitatively but also quantitatively. We report a total of 18 novel and 19 known FBN1 sequence variants leading to a premature termination codon (PTC), 26 of which we analyzed by quantitative sequencing both at gDNA and cDNA levels. The relative amounts of PTC-containing FBN1 transcripts in fresh and PAXgene-stabilized blood samples were significantly higher (33.0+/-3.9% to 80.0+/-7.2%) than those detected in affected fibroblasts with inhibition of nonsense-mediated mRNA decay (NMD) (11.0+/-2.1% to 25.0+/-1.8%), whereas in fibroblasts without NMD inhibition no mutant alleles could be detected. These results provide evidence for incomplete NMD in leukocytes and have particular importance for RNA-based analyses not only in FBN1 but also in other genes.
Resumo:
Apple stem grooving virus (ASGV) is one of the most important viruses infecting fruit trees. This study aimed at the molecular characterization of ASGV infecting apple (Malus domestica) plants in Santa Catarina (SC). RNA extracted from plants infected with isolate UV01 was used as a template for RT-PCR using specific primers. An amplified DNA fragment of 755 bp was sequenced. The coat protein gene of ASGV isolate UV01 contains 714 nucleotides, coding for a protein of 237 amino acids with a predicted Mr of approximately 27 kDa. The nucleotide and the deduced amino acid sequences of the coat protein gene showed identities of 90.9% and 97.9%, respectively, with a Japanese isolate of ASGV. Very high amino acid homologies (98.7%) were also found with Citrus tatter leaf capillovirus (CTLV), a very close relative of ASGV. These results indicate low coat protein gene variability among Capillovirus isolates from distinct regions. In a restricted survey, mother stocks in orchards and plants introduced into the country for large scale fruit production were indexed and shown to be infected by ASGV (20%), usually in a complex with other (latent) apple viruses (80%).
Resumo:
Measles virus is a highly contagious agent which causes a major health problem in developing countries. The viral genomic RNA is single-stranded, nonsegmented and of negative polarity. Many live attenuated vaccines for measles virus have been developed using either the prototype Edmonston strain or other locally isolated measles strains. Despite the diverse geographic origins of the vaccine viruses and the different attenuation methods used, there was remarkable sequence similarity of H, F and N genes among all vaccine strains. CAM-70 is a Japanese measles attenuated vaccine strain widely used in Brazilian children and produced by Bio-Manguinhos since 1982. Previous studies have characterized this vaccine biologically and genomically. Nevertheless, only the F, H and N genes have been sequenced. In the present study we have sequenced the remaining P, M and L genes (approximately 1.6, 1.4 and 6.5 kb, respectively) to complete the genomic characterization of CAM-70 and to assess the extent of genetic relationship between CAM-70 and other current vaccines. These genes were amplified using long-range or standard RT-PCR techniques, and the cDNA was cloned and automatically sequenced using the dideoxy chain-termination method. The sequence analysis comparing previously sequenced genotype A strains with the CAM-70 Bio-Manguinhos strain showed a low divergence among them. However, the CAM-70 strains (CAM-70 Bio-Manguinhos and a recently sequenced CAM-70 submaster seed strain) were assigned to a specific group by phylogenetic analysis using the neighbor-joining method. Information about our product at the genomic level is important for monitoring vaccination campaigns and for future studies of measles virus attenuation.
Resumo:
The present study examined the distribution of hepatitis C virus (HCV) genotypes and subtypes in a hemodialysis population in Goiás State, Central Brazil, and evaluated the efficiency of two genotyping methods: line probe assay (LiPA) based on the 5' noncoding region and nucleotide sequencing of the nonstructural 5B (NS5B) region of the genome. A total of 1095 sera were tested for HCV RNA by RT-nested PCR of the 5' noncoding region. The LiPA assay was able to genotype all 131 HCV RNA-positive samples. Genotypes 1 (92.4%) and 3 (7.6%) were found. Subtype 1a (65.7%) was the most prevalent, followed by subtypes 1b (26.7%) and 3a (7.6%). Direct nucleotide sequencing of 340 bp from the NS5B region was performed in 106 samples. The phylogenetic tree showed that 98 sequences (92.4%) were classified as genotype 1, subtypes 1a (72.6%) and 1b (19.8%), and 8 sequences (7.6%) as subtype 3a. The two genotyping methods gave concordant results within HCV genotypes and subtypes in 100 and 96.2% of cases, respectively. Only four samples presented discrepant results, with LiPA not distinguishing subtypes 1a and 1b. Therefore, HCV genotype 1 (subtype 1a) is predominant in hemodialysis patients in Central Brazil. By using sequence analysis of the NS5B region as a reference standard method for HCV genotyping, we found that LiPA was efficient at the genotype level, although some discrepant results were observed at the subtype level (sensitivity of 96.1% for subtype 1a and 95.2% for subtype 1b). Thus, analysis of the NS5B region permitted better discrimination between HCV subtypes, as required in epidemiological investigations.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Cytochrome P450 enzymes (CYP450s) represent a superfamily of haem-thiolate proteins. CYP450s are most abundant in the liver, a major site of drug metabolism, and play key roles in the metabolism of a variety of substrates, including drugs and environmental contaminants. Interaction of two or more different drugs with the same enzyme can account for adverse effects and failure of therapy. Human CYP3A4 metabolizes about 50% of all known drugs, but little is known about the orthologous CYP450s in horses. We report here the genomic organization of the equine CYP3A gene cluster as well as a comparative analysis with the human CYP3A gene cluster. The equine CYP450 genes of the 3A family are located on ECA 13 between 6.97-7.53 Mb, in a region syntenic to HSA 7 99.05-99.35 Mb. Seven potential, closely linked equine CYP3A genes were found, in contrast to only four genes in the human genome. RNA was isolated from an equine liver sample, and the approximately 1.5-kb coding sequence of six CYP3A genes could be amplified by RT-PCR. Sequencing of the RT-PCR products revealed numerous hitherto unknown single nucleotide polymorphisms (SNPs) in these six CYP3A genes, and one 6-bp deletion compared to the reference sequence (EquCab2.0). The presence of the variants was confirmed in a sample of genomic DNA from the same horse. In conclusion, orthologous genes for the CYP3A family exist in horses, but their number differs from those of the human CYP3A gene family. CYP450 genes of the same family show high homology within and between mammalian species, but can be highly polymorphic.
Resumo:
We improved, evaluated, and used Sanger sequencing for quantification of single nucleotide polymorphism (SNP) variants in transcripts and gDNA samples. This improved assay resulted in highly reproducible relative allele frequencies (e.g., for a heterozygous gDNA 50.0+/-1.4%, and for a missense mutation-bearing transcript 46.9+/-3.7%) with a lower detection limit of 3-9%. It provided excellent accuracy and linear correlation between expected and observed relative allele frequencies. This sequencing assay, which can also be used for the quantification of copy number variations (CNVs), methylations, mosaicisms, and DNA pools, enabled us to analyze transcripts of the FBN1 gene in fibroblasts and blood samples of patients with suspected Marfan syndrome not only qualitatively but also quantitatively. We report a total of 18 novel and 19 known FBN1 sequence variants leading to a premature termination codon (PTC), 26 of which we analyzed by quantitative sequencing both at gDNA and cDNA levels. The relative amounts of PTC-containing FBN1 transcripts in fresh and PAXgene-stabilized blood samples were significantly higher (33.0+/-3.9% to 80.0+/-7.2%) than those detected in affected fibroblasts with inhibition of nonsense-mediated mRNA decay (NMD) (11.0+/-2.1% to 25.0+/-1.8%), whereas in fibroblasts without NMD inhibition no mutant alleles could be detected. These results provide evidence for incomplete NMD in leukocytes and have particular importance for RNA-based analyses not only in FBN1 but also in other genes.
Resumo:
We have developed a novel way to assess the mutagenicity of environmentally important metal carcinogens, such as nickel, by creating a positive selection system based upon the conditional expression of a retroviral transforming gene. The target gene is the v-mos gene in MuSVts110, a murine retrovirus possessing a growth temperature dependent defect in expression of the transforming gene due to viral RNA splicing. In normal rat kidney cells infected with MuSVts110 (6m2 cells), splicing of the MuSVts110 RNA to form the mRNA from which the transforming protein, p85$\sp{\rm gag-mos}$, is translated is growth-temperature dependent, occurring at 33 C and below but not at 39 C and above. This splicing "defect" is mediated by cis-acting viral sequences. Nickel chloride treatment of 6m2 cells followed by growth at 39 C, allowed the selection of "revertant" cells which constitutively express p85$\sp{\rm gag-mos}$ due to stable changes in the viral RNA splicing phenotype, suggesting that nickel, a carcinogen whose mutagenicity has not been well established, could induce mutations in mammalian genes. We also show by direct sequencing of PCR-amplified integrated MuSVts110 DNA from a 6m2 nickel-revertant cell line that the nickel-induced mutation affecting the splicing phenotype is a cis-acting 70-base duplication of a region of the viral DNA surrounding the 3$\sp\prime$ splice site. These findings provide the first example of the molecular basis for a nickel-induced DNA lesion and establish the mutagenicity of this potent carcinogen. ^
Resumo:
The cell matrix adhesion regulator (CMAR) gene has been suggested to be a signal transduction molecule influencing cell adhesion to collagen and, through this, possibly involved in tumor suppression. The originally reported CMAR cDNA was 464 bp long with a tyrosine phosphorylation site at the extreme 3′ end, which mutagenesis studies had shown to be central to the function of this gene. Since the discovery of a 4-bp insertion polymorphism within the originally reported coding region, further sequence information has been obtained. The cDNA has been extended 5′ by ≈2 kb revealing a 559-bp region showing strong homology to the proposed 5′ untranslated sequence of a murine protein kinase receptor family member, variant in kinase (vik). CMAR genomic sequencing has shown the presence of an intron, the intron/exon boundary lying within this region of homology. An RNA transcript for CMAR of ≈2.5 kb has also been identified. The data suggest complex mechanisms for control of expression of two closely associated genes, CMAR and the vik- associated sequence.
Resumo:
The human prion gene contains five copies of a 24 nt repeat that is highly conserved among species. An analysis of folding free energies of the human prion mRNA, in particular in the repeat region, suggested biased codon selection and the presence of RNA patterns. In particular, pseudoknots, similar to the one predicted by Wills in the human prion mRNA, were identified in the repeat region of all available prion mRNAs available in GenBank, but not those of birds and the red slider turtle. An alignment of these mRNAs, which share low sequence homology, shows several co-variations that maintain the pseudoknot pattern. The presence of pseudoknots in yeast Sup35p and Rnq1 suggests acquisition in the prokaryotic era. Computer generated three-dimensional structures of the human prion pseudoknot highlight protein and RNA interaction domains, which suggest a possible effect in prion protein translation. The role of pseudoknots in prion diseases is discussed as individuals with extra copies of the 24 nt repeat develop the familial form of Creutzfeldt–Jakob disease.