17 resultados para Sequence Analysis
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo
Resumo:
Intron splicing is one of the most important steps involved in the maturation process of a pre-mRNA. Although the sequence profiles around the splice sites have been studied extensively, the levels of sequence identity between the exonic sequences preceding the donor sites and the intronic sequences preceding the acceptor sites has not been examined as thoroughly. In this study we investigated identity patterns between the last 15 nucleotides of the exonic sequence preceding the 5' splice site and the intronic sequence preceding the 3' splice site in a set of human protein-coding genes that do not exhibit intron retention. We found that almost 60% of consecutive exons and introns in human protein-coding genes share at least two identical nucleotides at their 3' ends and, on average, the sequence identity length is 2.47 nucleotides. Based on our findings we conclude that the 3' ends of exons and introns tend to have longer identical sequences within a gene than when being taken from different genes. Our results hold even if the pairs are non-consecutive in the transcription order. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
Surveys were conducted in Brazil, Benin and Tanzania to collect predatory mites as candidates for control of the coconut mite Aceria guerreronis Keifer, a serious pest of coconut fruits. At all locations surveyed, one of the most dominant predators on infested coconut fruits was identified as Neoseiulus baraki Athias-Henriot, based on morphological similarity with regard to taxonomically relevant characters. However, scrutiny of our own and published descriptions suggests that consistent morphological differences may exist between the Benin population and those from the other geographic origins. In this study, we combined three methods to assess whether these populations belong to one species or a few distinct, yet closely related species. First, multivariate analysis of 32 morphological characters showed that the Benin population differed from the other three populations. Second, DNA sequence analysis based on the mitochondrial cytochrome oxidase subunit I (COI) showed the same difference between these populations. Third, cross-breeding between populations was unsuccessful in all combinations. These data provide evidence for the existence of cryptic species. Subsequent morphological research showed that the Benin population can be distinguished from the others by a new character (not included in the multivariate analysis), viz. the number of teeth on the fixed digit of the female chelicera.
Resumo:
Peptides derived from cytosolic, mitochondrial, and nuclear proteins have been detected in extracts of animal tissues and cell lines. To test whether the proteasome is involved in their formation, HEK293T cells were treated with epoxomicin (0.2 or 2 mu M) for 1 h and quantitative peptidomics analysis was performed. Altogether, 147 unique peptides were identified by mass spectrometry sequence analysis. Epoxomicin treatment decreased the levels of the majority of intracellular peptides, consistent with inhibition of the proteasome beta-2 and beta-5 subunits. Treatment with the higher concentration of epoxomicin elevated the levels of some peptides. Most of the elevated peptides resulted from cleavages at acidic residues, suggesting that epoxomicin increased the processing of proteins through the beta-1 subunit. Interestingly, some of the peptides that were elevated by the epoxomicin treatment had hydrophobic residues in P1 cleavage sites. Taken together, these findings suggest that, while the proteasome is the major source of intracellular peptides, other peptide-generating mechanisms exist. Because intracellular peptides are likely to perform intracellular functions, studies using proteasome inhibitors need to be interpreted with caution, as it is possible that the effects of these inhibitors are due to a change in the peptide levels rather than inhibition of protein degradation.
Resumo:
Abstract Background A large number of probabilistic models used in sequence analysis assign non-zero probability values to most input sequences. To decide when a given probability is sufficient the most common way is bayesian binary classification, where the probability of the model characterizing the sequence family of interest is compared to that of an alternative probability model. We can use as alternative model a null model. This is the scoring technique used by sequence analysis tools such as HMMER, SAM and INFERNAL. The most prevalent null models are position-independent residue distributions that include: the uniform distribution, genomic distribution, family-specific distribution and the target sequence distribution. This paper presents a study to evaluate the impact of the choice of a null model in the final result of classifications. In particular, we are interested in minimizing the number of false predictions in a classification. This is a crucial issue to reduce costs of biological validation. Results For all the tests, the target null model presented the lowest number of false positives, when using random sequences as a test. The study was performed in DNA sequences using GC content as the measure of content bias, but the results should be valid also for protein sequences. To broaden the application of the results, the study was performed using randomly generated sequences. Previous studies were performed on aminoacid sequences, using only one probabilistic model (HMM) and on a specific benchmark, and lack more general conclusions about the performance of null models. Finally, a benchmark test with P. falciparum confirmed these results. Conclusions Of the evaluated models the best suited for classification are the uniform model and the target model. However, the use of the uniform model presents a GC bias that can cause more false positives for candidate sequences with extreme compositional bias, a characteristic not described in previous studies. In these cases the target model is more dependable for biological validation due to its higher specificity.
Resumo:
Snake venom proteomes/peptidomes are highly complex and maintenance of their integrity within the gland lumen is crucial for the expression of toxin activities. There has been considerable progress in the field of venom proteomics, however, peptidomics does not progress as fast, because of the lack of comprehensive venom sequence databases for analysis of MS data. Therefore, in many cases venom peptides have to be sequenced manually by MS/MS analysis or Edman degradation. This is critical for rare snake species, as is the case of Bothrops cotiara (BC) and B. fonsecai (BF), which are regarded as near threatened with extinction. In this study we conducted a comprehensive analysis of the venom peptidomes of BC, BF, and B. jararaca (BJ) using a combination of solid-phase extraction and reversed-phase HPLC to fractionate the peptides, followed by nano-liquid chromatography-tandem MS (LC-MS/MS) or direct infusion electrospray ionization-(ESI)-MS/MS or MALDI-MS/MS analyses. We detected marked differences in the venom peptidomes and identified peptides ranging from 7 to 39 residues in length by de novo sequencing. Forty-four unique sequences were manually identified, out of which 30 are new peptides, including 17 bradykinin-potentiating peptides, three poly-histidine-poly-glycine peptides and interestingly, 10 L-amino acid oxidase fragments. Some of the new bradykinin-potentiating peptides display significant bradykinin potentiating activity. Automated database search revealed fragments from several toxins in the peptidomes, mainly from L-amino acid oxidase, and allowed the determination of the peptide bond specificity of proteinases and amino acid occurrences for the P4-P4' sites. We also demonstrate that the venom lyophilization/resolubilization process greatly increases the complexity of the peptidome because of the imbalance caused to the venom proteome and the consequent activity of proteinases on venom components. The use of proteinase inhibitors clearly showed different outcomes in the peptidome characterization and suggested that degradomic-peptidomic analysis of snake venoms is highly sensitive to the conditions of sampling procedures. Molecular & Cellular Proteomics 11: 10.1074/mcp.M112.019331, 1245-1262, 2012.
Resumo:
Oropouche fever is the second most frequent arboviral infection in Brazil, surpassed only by dengue. Oropouche virus (OROV) causes large and explosive outbreaks of acute febrile illness in cities and villages in the Amazon and Central-Plateau regions. Cerebrospinal fluid (CSF) samples from 110 meningoencephalitis patients were analyzed. The RNA extracted from fluid was submitted to reverse transcription-polymerase chain reaction and sequencing to identify OROV. Three CSF samples showed the presence of OROV causing infection in the central nervous system (CNS). These patients are adults. Two of the patients had other diseases affecting CNS and immune systems: neurocysticercosis and acquired immunodeficiency syndrome, respectively. Nucleotide sequence analysis showed that the OROV from the CSF of these patients belonged to genotype I. We show here that severe Oropouche disease is occurring during outbreaks of this virus in Brazil
Resumo:
Osteogenesis imperfecta (OI) is a Mendelian disease with genetic heterogeneity characterized by bone fragility, recurrent fractures, blue sclerae, and short stature, caused mostly by mutations in COL1A1 or COL1A2 genes, which encode the pro-alpha 1(I) and pro-alpha 2(I) chains of type I collagen, respectively. A Brazilian family that showed variable expression of autosomal dominant OI was identified and characterized. Scanning for mutations was carried out using SSCP and DNA sequence analysis. The missense mutation c.3235G>A was identified within exon 45 of the COL1A1 gene in a 16-year-old girl diagnosed as having OI type I; it resulted in substitution of a glycine residue (G) by a serine (S) at codon 1079 (p.G1079S). The proband's mother had the disease signs, but without bone fractures, as did five of nine uncles and aunts of the patient. All of them carried the mutation, which was excluded in four healthy brothers of the patient's mother. This is the first description in a Brazilian family with OI showing variable expression; only one among seven carriers for the c.3235G>A mutation developed bone fractures, the most striking clinical feature of this disease. This finding has a significant implication for prenatal diagnosis in OI disease.
Resumo:
The taxonomic position of a streptomycete isolated from soil collected from Cockle Park Experimental Farm, Northumberland, UK, was determined by using a polyphasic approach. The organism had chemical and morphological features consistent with its classification in the genus Streptomyces. 16S rRNA gene sequence analysis supported classification of the strain in the genus Streptomyces and showed that it formed a distinct phyletic line loosely associated with members of the Streptomyces yeochonensis Glade. It was related most closely to Streptomyces paucisporeus 1413(T) (98.6%16S rRNA gene sequence similarity), but could be distinguished from the latter based on the low level of DNA DNA relatedness (40%). It was readily distinguished from the type strains of all species assigned to the S. yeochonensis clade based on a combination of phenotypic properties. Strain BK168(T) (=KACC 20908(T)=NCIMB 14704(T)) should therefore be classified as the type strain of a novel species of the genus Streptomyces, for which the name Streptomyces cocklensis sp. nov. is proposed. The organism produces the antibiotic dioxamycin.
Resumo:
The structures and functional activities of metalloproteinases from snake venoms have been widely studied because of the importance of these molecules in envenomation. Batroxase, which is a metalloproteinase isolated from Bothrops atrox (Para) snake venom, was obtained by gel filtration and anion exchange chromatography. The enzyme is a single protein chain composed of 202 amino acid residues with a molecular mass of 22.9 kDa, as determined by mass spectrometry analysis, showing an isoelectric point of 7.5. The primary sequence analysis indicates that the proteinase contains a zinc ligand motif (HELGHNLGISH) and a sequence C164I165M166 motif that is associated with a "Met-turn" structure. The protein lacks N-glycosylation sites and contains seven half cystine residues, six of which are conserved as pairs to form disulfide bridges. The three-dimensional structure of Batroxase was modeled based on the crystal structure of BmooMP alpha-I from Bothrops moojeni. The model revealed that the zinc binding site has a high structural similarity to the binding site of other metalloproteinases. Batroxase presented weak hemorrhagic activity, with a MHD of 10 mu g, and was able to hydrolyze extracellular matrix components, such as type IV collagen and fibronectin. The toxin cleaves both a and beta-chains of the fibrinogen molecule, and it can be inhibited by EDTA. EGTA and beta-mercaptoethanol. Batroxase was able to dissolve fibrin clots independently of plasminogen activation. These results demonstrate that Batroxase is a zinc-dependent hemorrhagic metalloproteinase with fibrin(ogen)olytic and thrombolytic activity. Published by Elsevier Ltd.
Resumo:
Fumarate hydratases (FHs; EC 4.2.1.2) are enzymes that catalyze the reversible hydration of fumarate to S-malate. Parasitic protists that belong to the genus Leishmania and are responsible for a complex of vector-borne diseases named leishmaniases possess two genes that encode distinct putative FH enzymes. Genome sequence analysis of Leishmania major Friedlin reveals the existence of genes LmjF24.0320 and LmjF29.1960 encoding the putative enzymes LmFH-1 and LmFH-2, respectively. In the present work, the FH activity of both L. major enzymes has been confirmed. Circular dichroism studies suggest important differences in terms of secondary structure content when comparing LmFH isoforms and even larger differences when comparing them to the homologous human enzyme. CD melting experiments revealed that both LmFH isoforms are thermolabile enzymes. The catalytic efficiency under aerobic and anaerobic environments suggests that they are both highly sensitive to oxidation and damaged by oxygen. Intracellular localization studies located LmFH-1 in the mitochondrion, whereas LmFH-2 was found predominantly in the cytosol with possibly also some in glycosomes. The high degree of sequence conservation in different Leishmania species, together with the relevance of FH activity for the energy metabolism in these parasites suggest that FHs might be exploited as targets for broad-spectrum antileishmanial drugs. (c) 2012 Elsevier B.V. All rights reserved.
Resumo:
A 39-year-old woman with autosomal dominant polycystic kidney disease (ADPKD) presented with acromegaly and a pituitary macroadenoma. There was a family history of this renal disorder. She had undergone surgery for pituitary adenoma 6 years prior. Physical examination disclosed bitemporal hemianopsia and elevation of both basal growth hormone (GH) 106 ng/mL (normal 0-5) and insulin-like growth factor (IGF-1) 811 ng/mL (normal 48-255) blood levels. A magnetic resonance imaging scan disclosed a 3.0 cm sellar and suprasellar mass with both optic chiasm compression and left cavernous sinus invasion. Pathologic, cytogenetic, molecular and in silico analysis was undertaken. Histologic, immunohistochemical and ultrastructural studies of the lesion disclosed a sparsely granulated somatotroph adenoma. Standard chromosome analysis on the blood sample showed no abnormality. Sequence analysis of the coding regions of PKD1 and PKD2 employing DNA from both peripheral leukocytes and the tumor revealed the most common PKD1 mutation, 5014_5015delAG. Analysis of the entire SSTR5 gene disclosed the variant c.142C > A (p.L48M, rs4988483) in the heterozygous state in both blood and tumor, while no pathogenic mutations were noted in the MEN1, AIP, p27Kip1 and SSTR2 genes. To our knowledge, this is the fourth reported case of a GH-producing pituitary adenoma associated with ADPKD, but the first subjected to extensive morphological, ultrastructural, cytogenetic and molecular studies. The physical proximity of the PKD1 and SSTR5 genes on chromosome 16 suggests a causal relationship between ADPKD and somatotroph adenoma.
Resumo:
Madrepora is one of the most ecologically important genera of reef-building scleractinians in the deep sea, occurring from tropical to high-latitude regions. Despite this, the taxonomic affinities and relationships within the genus Madrepora remain unclear. To clarify these issues, we sequenced the mitochondrial (mt) genome of the most widespread Madrepora species, M. oculata, and compared this with data for other scleractinians. The architecture of the M. oculara mt genome was very similar to that of other scleractinians, except for a novel gene rearrangement affecting only cox2 and cox3. This pattern of gene organization was common to four geographically distinct M. oculata individuals as well as the congeneric species M. minutiseptum, but was not shared by other genera that are closely related on the basis of cox1 sequence analysis nor other oculinids, suggesting that it might be unique to Madrepora. (C) 2012 Elsevier Inc. All rights reserved.
Resumo:
The Xylella fastidiosa comparative genomic database is a scientific resource with the aim to provide a user-friendly interface for accessing high-quality manually curated genomic annotation and comparative sequence analysis, as well as for identifying and mapping prophage-like elements, a marked feature of Xylella genomes. Here we describe a database and tools for exploring the biology of this important plant pathogen. The hallmarks of this database are the high quality genomic annotation, the functional and comparative genomic analysis and the identification and mapping of prophage-like elements. It is available from web site http://www.xylella.lncc.br.
Resumo:
Abstract Background Sugarcane is an increasingly economically and environmentally important C4 grass, used for the production of sugar and bioethanol, a low-carbon emission fuel. Sugarcane originated from crosses of Saccharum species and is noted for its unique capacity to accumulate high amounts of sucrose in its stems. Environmental stresses limit enormously sugarcane productivity worldwide. To investigate transcriptome changes in response to environmental inputs that alter yield we used cDNA microarrays to profile expression of 1,545 genes in plants submitted to drought, phosphate starvation, herbivory and N2-fixing endophytic bacteria. We also investigated the response to phytohormones (abscisic acid and methyl jasmonate). The arrayed elements correspond mostly to genes involved in signal transduction, hormone biosynthesis, transcription factors, novel genes and genes corresponding to unknown proteins. Results Adopting an outliers searching method 179 genes with strikingly different expression levels were identified as differentially expressed in at least one of the treatments analysed. Self Organizing Maps were used to cluster the expression profiles of 695 genes that showed a highly correlated expression pattern among replicates. The expression data for 22 genes was evaluated for 36 experimental data points by quantitative RT-PCR indicating a validation rate of 80.5% using three biological experimental replicates. The SUCAST Database was created that provides public access to the data described in this work, linked to tissue expression profiling and the SUCAST gene category and sequence analysis. The SUCAST database also includes a categorization of the sugarcane kinome based on a phylogenetic grouping that included 182 undefined kinases. Conclusion An extensive study on the sugarcane transcriptome was performed. Sugarcane genes responsive to phytohormones and to challenges sugarcane commonly deals with in the field were identified. Additionally, the protein kinases were annotated based on a phylogenetic approach. The experimental design and statistical analysis applied proved robust to unravel genes associated with a diverse array of conditions attributing novel functions to previously unknown or undefined genes. The data consolidated in the SUCAST database resource can guide further studies and be useful for the development of improved sugarcane varieties.
Resumo:
A severely immune-suppressed AIDS patient was suspected of suffering from BK virus (BKV) meningoencephalitis, after being studied for common causes of neurological complications of co-infectious origin. Polymerase chain reaction (PCR) and sequence analysis of cerebrospinal fluid and brain samples, confirmed the presence of BKV. His clinical condition improved along with the regression of brain lesions, after modifications on his antiretroviral regime. Five months after discharge, the patient was readmitted because of frequent headaches, and a marked inflammatory reaction was evidenced by a new magnetic resonance imaging (MRI). The symptoms paralleled a rising CD4+ lymphocyte count, and immune reconstitution syndrome was suspected. This is the first non-postmortem report of BKV meningoencephalitis in an AIDS patient, showing clinical and radiographic improvement solely under HAART.