9 resultados para Protein Sequence Analysis
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo
Resumo:
Cowpea aphid-borne mosaic virus (CABMV) causes major diseases in cowpea and passion flower plants in Brazil and also in other countries. CABMV has also been isolated from leguminous species including, Cassia hoffmannseggii, Canavalia rosea, Crotalaria juncea and Arachis hypogaea in Brazil. The virus seems to be adapted to two distinct families, the Passifloraceae and Fabaceae. Aiming to identify CABMV and elucidate a possible host adaptation of this virus species, isolates from cowpea, passion flower and C.hoffmannseggii collected in the states of Pernambuco and Rio Grande do Norte were analysed by sequencing the complete coat protein genes. A phylogenetic tree was constructed based on the obtained sequences and those available in public databases. Major Brazilian isolates from passion flower, independently of the geographical distances among them, were grouped in three different clusters. The possible host adaptation was also observed in fabaceous-infecting CABMV Brazilian isolates. These host adaptations possibly occurred independently within Brazil, so all these clusters belong to a bigger Brazilian cluster. Nevertheless, African passion flower or cowpea-infecting isolates formed totally different clusters. These results showed that host adaptation could be one factor for CABMV evolution, although geographical isolation is a stronger factor.
Resumo:
Intron splicing is one of the most important steps involved in the maturation process of a pre-mRNA. Although the sequence profiles around the splice sites have been studied extensively, the levels of sequence identity between the exonic sequences preceding the donor sites and the intronic sequences preceding the acceptor sites has not been examined as thoroughly. In this study we investigated identity patterns between the last 15 nucleotides of the exonic sequence preceding the 5' splice site and the intronic sequence preceding the 3' splice site in a set of human protein-coding genes that do not exhibit intron retention. We found that almost 60% of consecutive exons and introns in human protein-coding genes share at least two identical nucleotides at their 3' ends and, on average, the sequence identity length is 2.47 nucleotides. Based on our findings we conclude that the 3' ends of exons and introns tend to have longer identical sequences within a gene than when being taken from different genes. Our results hold even if the pairs are non-consecutive in the transcription order. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
Bananas (Musa spp.) are highly perishable fruit of notable economic and nutritional relevance. Because the identification of proteins involved in metabolic pathways could help to extend green-life and improve the quality of the fruit, this study aimed to compare the proteins of banana pulp at the pre-climacteric and climacteric stages. The use of two-dimensional fluorescence difference gel electrophoresis (2D-DIGE) revealed 50 differentially expressed proteins, and comparing those proteins to the Mass Spectrometry Protein Sequence Database (MSDB) identified 26 known proteins. Chitinases were the most abundant types of proteins in unripe bananas, and two isoforms in the ripe fruit have been implicated in the stress/defense response. In this regard, three heat shock proteins and isoflavone reductase were also abundant at the climacteric stage. Concerning fruit quality, pectate lyase, malate dehydrogenase, and starch phosphorylase accumulated during ripening. In addition to the ethylene formation enzyme amino cyclo carboxylic acid oxidase, the accumulation of S-adenosyl-L-homocysteine hydrolase was needed because of the increased ethylene synthesis and DNA methylation that occurred in ripening bananas. Differential analysis provided information on the ripening-associated changes that occurred in proteins involved in banana flavor, texture, defense, synthesis of ethylene, regulation of expression, and protein folding, and this analysis validated previous data on the transcripts during ripening. In this regard, the differential proteomics of fruit pulp enlarged our understanding of the process of banana ripening. (C) 2012 Elsevier B.V. All rights reserved.
Resumo:
Peptides derived from cytosolic, mitochondrial, and nuclear proteins have been detected in extracts of animal tissues and cell lines. To test whether the proteasome is involved in their formation, HEK293T cells were treated with epoxomicin (0.2 or 2 mu M) for 1 h and quantitative peptidomics analysis was performed. Altogether, 147 unique peptides were identified by mass spectrometry sequence analysis. Epoxomicin treatment decreased the levels of the majority of intracellular peptides, consistent with inhibition of the proteasome beta-2 and beta-5 subunits. Treatment with the higher concentration of epoxomicin elevated the levels of some peptides. Most of the elevated peptides resulted from cleavages at acidic residues, suggesting that epoxomicin increased the processing of proteins through the beta-1 subunit. Interestingly, some of the peptides that were elevated by the epoxomicin treatment had hydrophobic residues in P1 cleavage sites. Taken together, these findings suggest that, while the proteasome is the major source of intracellular peptides, other peptide-generating mechanisms exist. Because intracellular peptides are likely to perform intracellular functions, studies using proteasome inhibitors need to be interpreted with caution, as it is possible that the effects of these inhibitors are due to a change in the peptide levels rather than inhibition of protein degradation.
Resumo:
Abstract Background A large number of probabilistic models used in sequence analysis assign non-zero probability values to most input sequences. To decide when a given probability is sufficient the most common way is bayesian binary classification, where the probability of the model characterizing the sequence family of interest is compared to that of an alternative probability model. We can use as alternative model a null model. This is the scoring technique used by sequence analysis tools such as HMMER, SAM and INFERNAL. The most prevalent null models are position-independent residue distributions that include: the uniform distribution, genomic distribution, family-specific distribution and the target sequence distribution. This paper presents a study to evaluate the impact of the choice of a null model in the final result of classifications. In particular, we are interested in minimizing the number of false predictions in a classification. This is a crucial issue to reduce costs of biological validation. Results For all the tests, the target null model presented the lowest number of false positives, when using random sequences as a test. The study was performed in DNA sequences using GC content as the measure of content bias, but the results should be valid also for protein sequences. To broaden the application of the results, the study was performed using randomly generated sequences. Previous studies were performed on aminoacid sequences, using only one probabilistic model (HMM) and on a specific benchmark, and lack more general conclusions about the performance of null models. Finally, a benchmark test with P. falciparum confirmed these results. Conclusions Of the evaluated models the best suited for classification are the uniform model and the target model. However, the use of the uniform model presents a GC bias that can cause more false positives for candidate sequences with extreme compositional bias, a characteristic not described in previous studies. In these cases the target model is more dependable for biological validation due to its higher specificity.
Resumo:
Background: Proteinaceous toxins are observed across all levels of inter-organismal and intra-genomic conflicts. These include recently discovered prokaryotic polymorphic toxin systems implicated in intra-specific conflicts. They are characterized by a remarkable diversity of C-terminal toxin domains generated by recombination with standalone toxin-coding cassettes. Prior analysis revealed a striking diversity of nuclease and deaminase domains among the toxin modules. We systematically investigated polymorphic toxin systems using comparative genomics, sequence and structure analysis. Results: Polymorphic toxin systems are distributed across all major bacterial lineages and are delivered by at least eight distinct secretory systems. In addition to type-II, these include type-V, VI, VII (ESX), and the poorly characterized "Photorhabdus virulence cassettes (PVC)", PrsW-dependent and MuF phage-capsid-like systems. We present evidence that trafficking of these toxins is often accompanied by autoproteolytic processing catalyzed by HINT, ZU5, PrsW, caspase-like, papain-like, and a novel metallopeptidase associated with the PVC system. We identified over 150 distinct toxin domains in these systems. These span an extraordinary catalytic spectrum to include 23 distinct clades of peptidases, numerous previously unrecognized versions of nucleases and deaminases, ADP-ribosyltransferases, ADP ribosyl cyclases, RelA/SpoT-like nucleotidyltransferases, glycosyltranferases and other enzymes predicted to modify lipids and carbohydrates, and a pore-forming toxin domain. Several of these toxin domains are shared with host-directed effectors of pathogenic bacteria. Over 90 families of immunity proteins might neutralize anywhere between a single to at least 27 distinct types of toxin domains. In some organisms multiple tandem immunity genes or immunity protein domains are organized into polyimmunity loci or polyimmunity proteins. Gene-neighborhood-analysis of polymorphic toxin systems predicts the presence of novel trafficking-related components, and also the organizational logic that allows toxin diversification through recombination. Domain architecture and protein-length analysis revealed that these toxins might be deployed as secreted factors, through directed injection, or via inter-cellular contact facilitated by filamentous structures formed by RHS/YD, filamentous hemagglutinin and other repeats. Phyletic pattern and life-style analysis indicate that polymorphic toxins and polyimmunity loci participate in cooperative behavior and facultative 'cheating' in several ecosystems such as the human oral cavity and soil. Multiple domains from these systems have also been repeatedly transferred to eukaryotes and their viruses, such as the nucleo-cytoplasmic large DNA viruses. Conclusions: Along with a comprehensive inventory of toxins and immunity proteins, we present several testable predictions regarding active sites and catalytic mechanisms of toxins, their processing and trafficking and their role in intra-specific and inter-specific interactions between bacteria. These systems provide insights regarding the emergence of key systems at different points in eukaryotic evolution, such as ADP ribosylation, interaction of myosin VI with cargo proteins, mediation of apoptosis, hyphal heteroincompatibility, hedgehog signaling, arthropod toxins, cell-cell interaction molecules like teneurins and different signaling messengers.
Resumo:
The structures and functional activities of metalloproteinases from snake venoms have been widely studied because of the importance of these molecules in envenomation. Batroxase, which is a metalloproteinase isolated from Bothrops atrox (Para) snake venom, was obtained by gel filtration and anion exchange chromatography. The enzyme is a single protein chain composed of 202 amino acid residues with a molecular mass of 22.9 kDa, as determined by mass spectrometry analysis, showing an isoelectric point of 7.5. The primary sequence analysis indicates that the proteinase contains a zinc ligand motif (HELGHNLGISH) and a sequence C164I165M166 motif that is associated with a "Met-turn" structure. The protein lacks N-glycosylation sites and contains seven half cystine residues, six of which are conserved as pairs to form disulfide bridges. The three-dimensional structure of Batroxase was modeled based on the crystal structure of BmooMP alpha-I from Bothrops moojeni. The model revealed that the zinc binding site has a high structural similarity to the binding site of other metalloproteinases. Batroxase presented weak hemorrhagic activity, with a MHD of 10 mu g, and was able to hydrolyze extracellular matrix components, such as type IV collagen and fibronectin. The toxin cleaves both a and beta-chains of the fibrinogen molecule, and it can be inhibited by EDTA. EGTA and beta-mercaptoethanol. Batroxase was able to dissolve fibrin clots independently of plasminogen activation. These results demonstrate that Batroxase is a zinc-dependent hemorrhagic metalloproteinase with fibrin(ogen)olytic and thrombolytic activity. Published by Elsevier Ltd.
Resumo:
Surveys were conducted in Brazil, Benin and Tanzania to collect predatory mites as candidates for control of the coconut mite Aceria guerreronis Keifer, a serious pest of coconut fruits. At all locations surveyed, one of the most dominant predators on infested coconut fruits was identified as Neoseiulus baraki Athias-Henriot, based on morphological similarity with regard to taxonomically relevant characters. However, scrutiny of our own and published descriptions suggests that consistent morphological differences may exist between the Benin population and those from the other geographic origins. In this study, we combined three methods to assess whether these populations belong to one species or a few distinct, yet closely related species. First, multivariate analysis of 32 morphological characters showed that the Benin population differed from the other three populations. Second, DNA sequence analysis based on the mitochondrial cytochrome oxidase subunit I (COI) showed the same difference between these populations. Third, cross-breeding between populations was unsuccessful in all combinations. These data provide evidence for the existence of cryptic species. Subsequent morphological research showed that the Benin population can be distinguished from the others by a new character (not included in the multivariate analysis), viz. the number of teeth on the fixed digit of the female chelicera.
Resumo:
Abstract Background Sugarcane is an increasingly economically and environmentally important C4 grass, used for the production of sugar and bioethanol, a low-carbon emission fuel. Sugarcane originated from crosses of Saccharum species and is noted for its unique capacity to accumulate high amounts of sucrose in its stems. Environmental stresses limit enormously sugarcane productivity worldwide. To investigate transcriptome changes in response to environmental inputs that alter yield we used cDNA microarrays to profile expression of 1,545 genes in plants submitted to drought, phosphate starvation, herbivory and N2-fixing endophytic bacteria. We also investigated the response to phytohormones (abscisic acid and methyl jasmonate). The arrayed elements correspond mostly to genes involved in signal transduction, hormone biosynthesis, transcription factors, novel genes and genes corresponding to unknown proteins. Results Adopting an outliers searching method 179 genes with strikingly different expression levels were identified as differentially expressed in at least one of the treatments analysed. Self Organizing Maps were used to cluster the expression profiles of 695 genes that showed a highly correlated expression pattern among replicates. The expression data for 22 genes was evaluated for 36 experimental data points by quantitative RT-PCR indicating a validation rate of 80.5% using three biological experimental replicates. The SUCAST Database was created that provides public access to the data described in this work, linked to tissue expression profiling and the SUCAST gene category and sequence analysis. The SUCAST database also includes a categorization of the sugarcane kinome based on a phylogenetic grouping that included 182 undefined kinases. Conclusion An extensive study on the sugarcane transcriptome was performed. Sugarcane genes responsive to phytohormones and to challenges sugarcane commonly deals with in the field were identified. Additionally, the protein kinases were annotated based on a phylogenetic approach. The experimental design and statistical analysis applied proved robust to unravel genes associated with a diverse array of conditions attributing novel functions to previously unknown or undefined genes. The data consolidated in the SUCAST database resource can guide further studies and be useful for the development of improved sugarcane varieties.