987 results for sequence identity
Abstract:
DBMODELING is a relational database of annotated comparative protein structure models and their metabolic pathway characterization. It focuses on enzymes identified in the genomes of Mycobacterium tuberculosis and Xylella fastidiosa. The main goal of the database is to provide structural models for use in docking simulations and drug design. However, since the accuracy of structural models is highly dependent on the sequence identity between template and target, it is necessary to make clear to the user that only models of high structural quality should be used in such efforts. Molecular modeling of these genomes generated a database in which all structural models were built using alignments with more than 30% sequence identity, yielding models of medium and high accuracy. All models in the database are publicly accessible at http://www.biocristalografia.df.ibilce.unesp.br/tools. The DBMODELING user interface provides user-friendly menus, so that all information can be printed in one step from any web browser. Furthermore, DBMODELING provides a docking interface that allows the user to carry out geometric docking simulations against the molecular models available in the database. There are three other important homology model databases: MODBASE, SWISSMODEL, and GTOP. The main applications of these databases are described in the present article. © 2007 Bentham Science Publishers Ltd.
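The 30% cutoff mentioned in this abstract can be illustrated with a minimal sketch. It assumes the two sequences are already aligned (equal length, `-` for gaps) and uses one common convention, identity over columns where neither sequence has a gap; other denominators are also in use. The function names are hypothetical and are not taken from DBMODELING.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # gapped columns are skipped under this convention
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


def above_modeling_cutoff(identity: float, cutoff: float = 30.0) -> bool:
    """True when identity exceeds the cutoff (30% is the figure quoted in the abstract)."""
    return identity > cutoff


# Toy alignment: 6 ungapped columns, 5 of them identical.
template = "ACDE-FG"
target = "ACDQWFG"
identity = percent_identity(template, target)
print(round(identity, 1), above_modeling_cutoff(identity))
```

The choice of denominator (ungapped columns, alignment length, or shorter sequence length) changes the reported value, so a threshold is only meaningful together with the convention used.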
Abstract:
Lettuce big vein associated virus (LBVaV) and Mirafiori lettuce big vein virus (MLBVV) have been found in mixed infections in Brazil, causing lettuce big vein disease. Analysis of part of the coat protein (CP) gene of Brazilian LBVaV isolates collected from lettuce showed at least 93% amino acid sequence identity with other LBVaV isolates. Genetic diversity among MLBVV CP sequences was higher than among LBVaV CP sequences, with amino acid sequence identity ranging from 91% to 100%. Brazilian isolates of MLBVV belong to subgroup A, with one RsaI restriction site in the coat protein gene. There is no indication of a possible geographical origin for the Brazilian isolates of LBVaV and MLBVV.
Abstract:
We have determined the structure of the fatty acid-binding protein 6 (fabp6) gene and the tissue-specific distribution of its transcripts in embryos, larvae and adult zebrafish (Danio rerio). Like most members of the vertebrate FABP multigene family, the zebrafish fabp6 gene contains four exons separated by three introns. The coding region of the gene and expressed sequence tags code for a polypeptide of 131 amino acids (14 kDa, pI 6.59). The putative zebrafish Fabp6 protein shared greatest sequence identity with human FABP6 (55.3%) compared to other orthologous mammalian FABPs and paralogous zebrafish Fabps. Phylogenetic analysis showed that the zebrafish Fabp6 formed a distinct clade with the mammalian FABP6s. The zebrafish fabp6 gene was assigned to linkage group (chromosome) 21 by radiation hybrid mapping. Conserved gene synteny was evident between the zebrafish fabp6 gene on chromosome 21 and the FABP6/Fabp6 genes on human chromosome 5, rat chromosome 10 and mouse chromosome 11. Zebrafish fabp6 transcripts were first detected in the distal region of the intestine of embryos at 72 h postfertilization. This spatial distribution remained constant to 7-day-old larvae, the last stage assayed during larval development. In adult zebrafish, fabp6 transcripts were detected by RT-PCR in RNA extracted from liver, heart, intestine, ovary and kidney (most likely adrenal tissue), but not in RNA from skin, brain, gill, eye or muscle. In situ hybridization of a fabp6 riboprobe to adult zebrafish sections revealed intense hybridization signals in the adrenal homolog of the kidney and the distal region of the intestine, and to a lesser extent in ovary and liver, a transcript distribution that is similar, but not identical, to that seen for the mammalian FABP6/Fabp6 gene. © 2008 The Authors.
Abstract:
Papillomaviruses (PVs) infect a wide range of animal species and show great genetic diversity. To date, excluding equine sarcoids, only three PV species have been identified in association with lesions in horses: Equus caballus papillomavirus 1 (EcPV1, cutaneous), EcPV2 (genital) and EcPV3 (aural plaques). In this study, we identified a novel equine PV from aural plaques, which we designated EcPV4. Cutaneous samples from horses with lesions that were microscopically diagnosed as aural plaques were subjected to DNA extraction, amplification and sequencing. Rolling circle amplification and inverse PCR with specific primers confirmed the presence of an approximately 8 kb circular genome. The full-length EcPV4 L1 major capsid protein sequence has 1488 nucleotides (495 amino acids). EcPV4 had a sequence identity of only 53.3%, 60.2% and 51.7% when compared with the published sequences for EcPV1, EcPV2 and EcPV3, respectively. A Bayesian phylogenetic analysis indicated that EcPV4 clusters with EcPV2, but not with EcPV1 and EcPV3. Using the current PV classification system, which is based on the nucleotide sequence of L1, we could not define the genus of the newly identified virus. Therefore, a structural analysis of the L1 protein was carried out to aid in this classification, because EcPV4 causes lesions similar to those caused by EcPV3. A comparison of the superficial loops demonstrated a distinct amino acid conservation pattern between EcPV4/EcPV2 and EcPV4/EcPV3. These results demonstrate the presence of a new equine PV species and show that structural studies could be useful in the classification of PVs. © 2012 Elsevier B.V.
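The relation between the two L1 lengths quoted in this abstract (1488 nucleotides, 495 amino acids) can be checked with a short sketch. It assumes, as an interpretation the abstract does not state, that the nucleotide count includes the terminal stop codon. The function name is hypothetical.

```python
def residues_from_cds_length(nucleotides: int, includes_stop_codon: bool = True) -> int:
    """Number of encoded amino acids for a coding sequence of the given length."""
    if nucleotides < 0 or nucleotides % 3 != 0:
        raise ValueError("coding length must be a non-negative multiple of 3")
    codons = nucleotides // 3
    if includes_stop_codon and codons > 0:
        return codons - 1  # the stop codon does not encode a residue
    return codons


# 1488 nt = 496 codons; minus one stop codon gives 495 residues.
print(residues_from_cds_length(1488))
```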
Abstract:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Abstract:
Graduate Program in Biotechnology - IQ
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Abstract:
Scorpion toxins targeting voltage-gated sodium (NaV) channels are peptides that comprise 60 to 76 amino acid residues cross-linked by four disulfide bridges. These toxins can be divided into two groups (alpha- and beta-toxins) according to their binding properties and mode of action. The scorpion alpha-toxin Ts2, previously described as a beta-toxin, was purified from the venom of Tityus serrulatus, the most dangerous Brazilian scorpion. In this study, seven mammalian NaV channel isoforms (rNaV1.2, rNaV1.3, rNaV1.4, hNaV1.5, mNaV1.6, rNaV1.7 and rNaV1.8) and one insect NaV channel isoform (DmNaV1) were used to investigate the subtype specificity and selectivity of Ts2. The electrophysiology assays showed that Ts2 inhibits rapid inactivation of NaV1.2, NaV1.3, NaV1.5, NaV1.6 and NaV1.7, but does not affect NaV1.4, NaV1.8 or DmNaV1. Interestingly, Ts2 significantly shifts the voltage dependence of activation of NaV1.3 channels. The 3D structure of this toxin was modeled based on the high sequence identity (72%) shared with Ts1, another T. serrulatus toxin. The overall fold of the Ts2 model consists of three beta-strands and one alpha-helix, arranged in a triangular shape forming a cysteine-stabilized alpha-helix/beta-sheet (CS alpha/beta) motif.
Abstract:
The alpha-KTx toxin Tc32, from the Amazonian scorpion Tityus cambridgei, lacks the dyad motif, including Lys27, that is characteristic of the family and generally associated with channel blockage. The toxin has been cloned and expressed for the first time. Electrophysiological experiments showed that the recombinant form blocks Kv1.3 channels of olfactory bulb periglomerular cells, as the natural Tc32 toxin does when tested on the Kv1.3 channel of human T lymphocytes, confirming that it is in an active fold. The nuclear magnetic resonance-derived structure revealed an alpha/beta scaffold typical of the members of the alpha-KTx family. TdK2 and TdK3, which belong to the same alpha-KTx 18 subfamily, share significant sequence identity with Tc32 but differ in selectivity and affinity for Kv1.3 and Kv1.1 channels. To gain insight into the structural features that may explain those differences, we used the recombinant Tc32 nuclear magnetic resonance-derived structure to model the other two toxins, for which no experimental structure is available. Their interaction with Kv1.3 and Kv1.1 was investigated by means of docking simulations. The results suggest that differences in the electrostatic features of the toxins and channels, in their contact surfaces, and in their total dipole moment orientations govern the affinity and selectivity of the toxins. In addition, we found that, regardless of whether the dyad motif is present, it is always a Lys side chain that physically blocks the channels, irrespective of its position in the toxin sequence.
Abstract:
Background: Sugarcane (Saccharum spp.) has become an increasingly important crop because of its leading role in biofuel production. The high-sugar-content species S. officinarum is an octoploid without known diploid or tetraploid progenitors. Commercial sugarcane cultivars are hybrids between S. officinarum and the wild species S. spontaneum, with ploidy at ~12×. The complex autopolyploid sugarcane genome has not been characterized at the DNA sequence level. Results: The microsynteny between sugarcane and sorghum was assessed by comparing 454 pyrosequences of 20 sugarcane bacterial artificial chromosomes (BACs) with sorghum sequences. These 20 BACs were selected by hybridization of 1961 single-copy sorghum overgo probes to the sugarcane BAC library, with one sugarcane BAC corresponding to each of the 20 sorghum chromosome arms. The genic regions of the sugarcane BACs shared an average of 95.2% sequence identity with sorghum, and the sorghum genome was used as a template to order sequence contigs covering 78.2% of the 20 BAC sequences. About 53.1% of the sugarcane BAC sequences aligned with sorghum sequence. The unaligned regions contain non-coding and repetitive sequences. Within the aligned sequences, 209 genes were annotated in sugarcane and 202 in sorghum. Seventeen genes appeared to be sugarcane-specific, and all were validated by sugarcane ESTs, while 12 appeared sorghum-specific but only one was validated by sorghum ESTs. Twelve of the 17 sugarcane-specific genes have no match in the non-redundant protein database in GenBank, perhaps encoding proteins for sugarcane-specific processes. The sorghum orthologous regions appeared to have expanded relative to sugarcane, mostly through an increase in retrotransposons. Conclusions: The sugarcane and sorghum genomes are mostly collinear in the genic regions, and the sorghum genome can be used as a template for assembling much of the genic DNA of the autopolyploid sugarcane genome.
The comparable gene density between sugarcane BACs and the corresponding sorghum sequences defied the notion that polyploid species might have a faster pace of gene loss due to the redundancy of multiple alleles at each locus.
Abstract:
Background: The mitochondrial DNA of kinetoplastid flagellates is distinctive in the eukaryotic world due to its massive size, complex form and large sequence content. Composed of catenated maxicircles that contain rRNA and protein-coding genes and thousands of heterogeneous minicircles encoding small guide RNAs, the kinetoplast network has evolved along with an extreme form of mRNA processing in the form of uridine insertion and deletion RNA editing. Many maxicircle-encoded mRNAs cannot be translated without this post-transcriptional sequence modification. Results: We present the complete sequence and annotation of the Trypanosoma cruzi maxicircles for the CL Brener and Esmeraldo strains. Gene order is syntenic with Trypanosoma brucei and Leishmania tarentolae maxicircles. The non-coding components have strain-specific repetitive regions and a variable region that is unique for each strain with the exception of a conserved sequence element that may serve as an origin of replication, but shows no sequence identity with L. tarentolae or T. brucei. Alternative assemblies of the variable region demonstrate intra-strain heterogeneity of the maxicircle population. The extent of mRNA editing required for particular genes approximates that seen in T. brucei. Extensively edited genes were more divergent among the genera than non-edited and rRNA genes. Esmeraldo contains a unique 236-bp deletion that removes the 5'-ends of ND4 and CR4 and the intergenic region. Esmeraldo shows additional insertions and deletions outside of areas edited in other species in ND5, MURF1, and MURF2, while CL Brener has a distinct insertion in MURF2. Conclusion: The CL Brener and Esmeraldo maxicircles represent two of three previously defined maxicircle clades and promise utility as taxonomic markers. Restoration of the disrupted reading frames might be accomplished by strain-specific RNA editing.
Elements in the non-coding region may be important for replication, transcription, and anchoring of the maxicircle within the kinetoplast network.
Abstract:
Background: Ferredoxin-NADP(H) reductases (FNRs) are flavoenzymes that catalyze electron transfer between NADP(H) and the proteins ferredoxin or flavodoxin. A number of structural features distinguish plant and bacterial FNRs, one of which is the mode of binding of the cofactor FAD. Leptospira interrogans is a parasitic spirochaete bacterium capable of infecting humans and mammals in general. Leptospira interrogans FNR (LepFNR) displays low sequence identity with plant (34% with Zea mays) and bacterial (31% with Escherichia coli) FNRs. However, LepFNR contains all the consensus sequences that define the plastidic class of FNRs. Results: The crystal structures of the FAD-containing LepFNR and of the complex of the enzyme with NADP+ were solved and compared with known FNRs. The comparison reveals significant structural similarities between the enzyme and the plastidic-type FNRs, and differences from the bacterial enzymes. Our small-angle X-ray scattering experiments show that LepFNR is a monomeric enzyme. Moreover, our biochemical data demonstrate that LepFNR has an enzymatic activity similar to those reported for the plastidic enzymes and significantly different from that of bacterial flavoenzymes, which display lower turnover rates. Conclusion: LepFNR is the first plastidic-type FNR found in bacteria and, despite its low sequence similarity with plastidic FNRs, still displays high catalytic turnover rates. The typical structural and biochemical characteristics of plant FNRs revealed for LepFNR support the notion of a putative lateral gene transfer, which presumably offers Leptospira interrogans evolutionary advantages. The wealth of structural information about LepFNR provides a molecular basis for advanced drug development against leptospirosis.
Abstract:
Beet soil-borne mosaic virus (BSBMV) and Beet necrotic yellow vein virus (BNYVV) are members of the genus Benyvirus. BSBMV has been reported only in the United States, while BNYVV has a worldwide distribution. Both viruses are vectored by Polymyxa betae and have similar host ranges, particle numbers and morphology. The two viruses are not serologically related but have similar genomic organizations. Field isolates consist of four RNA species, but some BNYVV isolates contain a fifth RNA. RNAs 1 and 2 are essential for infection and replication, while RNAs 3 and 4 play important roles in plant and vector interactions, respectively. Nucleotide and amino acid analyses revealed that BSBMV and BNYVV are different enough to be classified as two different species. Additionally, in BNYVV/BSBMV mixed infections, a competition was previously described in sugar beet, where BNYVV infection reduces BSBMV accumulation in both susceptible and resistant cultivars. Considering all these observations, we hypothesized that a cross-study of BNYVV and BSBMV, exploiting their similarities and divergences, can improve the investigation of molecular interactions between sugar beets and Benyviruses. The main achievement of our research is the production of a collection of biologically active cDNA clones of BNYVV and BSBMV RNAs, from which synthetic copies of both Benyviruses can be transcribed. Moreover, through recombination experiments we demonstrated, for the first time, the capability of BNYVV RNAs 1 and 2 to trans-replicate and encapsidate BSBMV RNAs 3 and 4, as well as the capability of BSBMV RNAs 1 and 2 to replicate BNYVV RNA 2 in planta. We also demonstrated that BSBMV RNA 3 supports long-distance movement of BNYVV RNAs 1 and 2 in B. macrocarpa and that foreign sequences such as p29HA, GFP and RFP are successfully expressed in C. quinoa by a BSBMV RNA 3-based replicon (RepIII), also produced in our research. These results confirm the close correlation between the two viruses. Interestingly, the symptoms induced by BSBMV RNA-3 on C. quinoa leaves are more similar to the necrotic local lesions caused by BNYVV RNA-5 p26 than to the strongly chlorotic local lesions or yellow spots induced by BNYVV RNA-3-encoded p25. As previously reported, BSBMV p29 shares 23% amino acid sequence identity with BNYVV p25, but identity increases to 43% when it is compared with the sequence of BNYVV RNA-5 p26. Based on our results, the essential sequence (core region) for the long-distance movement of BSBMV and BNYVV in B. macrocarpa is not carried only by the RNA 3 species; other regions, perhaps located on RNAs 1 and 2, could play a fundamental role in this matter. Finally, a chimeric RNA, composed of the 5' region of RNA 4 and the 3' region of RNA 3 of BSBMV, was produced after 21 serial mechanical inoculations of wild-type BSBMV on C. quinoa plants. The chimera seems unable to express any protein, but it is replicated and transcribed in planta. It could represent an important tool for studying the interactions between Benyviruses and the plant host. In conclusion, different tools have been produced, including a method to study synthetic viruses under natural conditions of inoculum through P. betae, and new knowledge has been acquired that will support future investigation of the molecular interactions between sugar beets and Benyviruses.
Abstract:
Motivation: A current issue of great interest, from both a theoretical and an applied perspective, is the analysis of biological sequences to disclose the information they encode. The development of new genome sequencing technologies in recent years has opened fundamental new problems, since huge amounts of biological data still await interpretation. Indeed, sequencing is only the first step of the genome annotation process, which consists of assigning biological information to each sequence. Hence, given the large amount of available data, in silico methods have become useful and necessary for extracting relevant information from sequences. The availability of data from genome projects has given rise to new strategies for tackling the basic problems of computational biology, such as determining the three-dimensional structures of proteins, their biological function and their reciprocal interactions. Results: The aim of this work was the implementation of predictive methods that allow the extraction of information on the properties of genomes and proteins starting from the nucleotide and amino acid sequences, by taking advantage of the information provided by the comparison of genome sequences from different species. In the first part of the work, a comprehensive large-scale genome comparison of 599 organisms is described. A total of 2.6 million sequences from 551 prokaryotic and 48 eukaryotic genomes were aligned and clustered on the basis of their sequence identity. This procedure led to the identification of classes of proteins that are peculiar to the different groups of organisms. Moreover, the adopted similarity threshold produced clusters that are homogeneous from a structural point of view and that can be used for structural annotation of uncharacterized sequences.
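Clustering by sequence identity, as mentioned in this abstract, can be illustrated with a small greedy sketch. This is not the method used in the work, which the abstract does not specify. It assumes toy sequences that are already aligned to equal length, and every name and the 0.8 threshold are hypothetical.

```python
from typing import Dict, List


def identity(a: str, b: str) -> float:
    """Fraction of identical positions between two equal-length, pre-aligned strings."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / len(a)


def greedy_cluster(seqs: Dict[str, str], threshold: float) -> List[List[str]]:
    """Assign each sequence to the first cluster whose representative it matches."""
    representatives: List[str] = []
    clusters: List[List[str]] = []
    for name, seq in seqs.items():
        for i, rep in enumerate(representatives):
            if identity(seq, seqs[rep]) >= threshold:
                clusters[i].append(name)
                break
        else:
            # No representative is similar enough: start a new cluster.
            representatives.append(name)
            clusters.append([name])
    return clusters


toy = {"p1": "MKTAYIAK", "p2": "MKTAYIAR", "p3": "GGGSLLLP"}
print(greedy_cluster(toy, threshold=0.8))
```

A greedy scheme depends on input order and on the threshold chosen; it is shown only because it makes the link between an identity cutoff and cluster membership easy to see.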
The second part of the work focuses on the characterization of thermostable proteins and on the development of tools able to predict the thermostability of a protein starting from its sequence. The codon composition of a non-redundant database comprising 116 prokaryotic genomes was analyzed by means of Principal Component Analysis, and it was shown that a cross-genomic approach allows the extraction of common determinants of thermostability at the genome level, leading to an overall accuracy of 95% in discriminating thermophilic coding sequences. This result outperforms those obtained in previous studies. Moreover, we investigated the effect of multiple mutations on protein thermostability. This issue is of great importance in the field of protein engineering, since thermostable proteins are generally more suitable than their mesostable counterparts in technological applications. A Support Vector Machine-based method was trained to predict whether a set of mutations can enhance the thermostability of a given protein sequence. The developed predictor achieves 88% accuracy.
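The codon composition that serves as input to the Principal Component Analysis can be pictured as a frequency vector per coding sequence. The sketch below is an assumption about what such a feature might look like, not the pipeline used in the work. The function name is hypothetical, and trailing bases that do not form a full codon are ignored.

```python
from collections import Counter
from typing import Dict


def codon_frequencies(cds: str) -> Dict[str, float]:
    """Relative frequency of each codon in a coding sequence read in frame 0."""
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # drop an incomplete trailing codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return {}
    counts = Counter(codons)
    total = len(codons)
    return {codon: n / total for codon, n in counts.items()}


# Toy coding sequence: ATG AAA AAA TAA
print(codon_frequencies("ATGAAAAAATAA"))
```

Vectors like this, computed per genome or per gene, would form the rows of the matrix on which a dimensionality reduction such as PCA operates.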