8 resultados para sequence identity
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo
Resumo:
Intron splicing is one of the most important steps involved in the maturation process of a pre-mRNA. Although the sequence profiles around the splice sites have been studied extensively, the levels of sequence identity between the exonic sequences preceding the donor sites and the intronic sequences preceding the acceptor sites has not been examined as thoroughly. In this study we investigated identity patterns between the last 15 nucleotides of the exonic sequence preceding the 5' splice site and the intronic sequence preceding the 3' splice site in a set of human protein-coding genes that do not exhibit intron retention. We found that almost 60% of consecutive exons and introns in human protein-coding genes share at least two identical nucleotides at their 3' ends and, on average, the sequence identity length is 2.47 nucleotides. Based on our findings we conclude that the 3' ends of exons and introns tend to have longer identical sequences within a gene than when being taken from different genes. Our results hold even if the pairs are non-consecutive in the transcription order. (C) 2012 Elsevier Ltd. All rights reserved.
Resumo:
Scorpion toxins targeting voltage-gated sodium (NaV) channels are peptides that comprise 6076 amino acid residues cross-linked by four disulfide bridges. These toxins can be divided in two groups (a and beta toxins), according to their binding properties and mode of action. The scorpion a-toxin Ts2, previously described as a beta-toxin, was purified from the venom of Tityus serrulatus, the most dangerous Brazilian scorpion. In this study, seven mammalian NaV channel isoforms (rNaV1.2, rNaV1.3, rNaV1.4, hNaV1.5, mNaV1.6, rNaV1.7 and rNaV1.8) and one insect NaV channel isoform (DmNaV1) were used to investigate the subtype specificity and selectivity of Ts2. The electrophysiology assays showed that Ts2 inhibits rapid inactivation of NaV1.2, NaV1.3, NaV1.5, NaV1.6 and NaV1.7, but does not affect NaV1.4, NaV1.8 or DmNaV1. Interestingly, Ts2 significantly shifts the voltage dependence of activation of NaV1.3 channels. The 3D structure of this toxin was modeled based on the high sequence identity (72%) shared with Ts1, another T. serrulatus toxin. The overall fold of the Ts2 model consists of three beta-strands and one a-helix, and is arranged in a triangular shape forming a cysteine-stabilized a-helix/beta-sheet (CSa beta) motif.
Resumo:
alpha-KTx toxin Tc32, from the Amazonian scorpion Tityus cambridgei, lacks the dyad motif; including Lys27, characteristic of the family and generally associated with channel blockage. The toxin has been cloned and expressed for the first time. Electrophysiological experiments, by showing that the recombinant form blocks Kv1.3 channels of olfactory bulb periglomerular cells like the natural Tc32 toxin, when tested on the Kv1.3 channel of human T lymphocytes, confirmed it is in an active fold. The nuclear magnetic resonance-derived structure revealed it exhibits an alpha/beta scaffold typical of the members of the alpha-KTx family. TdK2 and TdK3, all belonging to the same alpha-KTx 18 subfamily, share significant sequence identity with Tc32 but diverse selectivity and affinity for Kv1.3 and Kv1.1 channels. To gain insight into the structural features that may justify those differences, we used the recombinant Tc32 nuclear magnetic resonance-derived structure to model the other two toxins, for which no experimental structure is available. Their interaction with Kv1.3 and Kv1.1 has been investigated by means of docking simulations. The results suggest that differences in the electrostatic features of the toxins and channels, in their contact surfaces, and in their total dipole moment orientations govern the affinity and selectivity of toxins. In addition, we found that, regardless of whether the dyad motif is present, it is always a Lys side chain that physically blocks the channels, irrespective of its position in the toxin sequence.
Resumo:
Abstract Background Sugarcane (Saccharum spp.) has become an increasingly important crop for its leading role in biofuel production. The high sugar content species S. officinarum is an octoploid without known diploid or tetraploid progenitors. Commercial sugarcane cultivars are hybrids between S. officinarum and wild species S. spontaneum with ploidy at ~12×. The complex autopolyploid sugarcane genome has not been characterized at the DNA sequence level. Results The microsynteny between sugarcane and sorghum was assessed by comparing 454 pyrosequences of 20 sugarcane bacterial artificial chromosomes (BACs) with sorghum sequences. These 20 BACs were selected by hybridization of 1961 single copy sorghum overgo probes to the sugarcane BAC library with one sugarcane BAC corresponding to each of the 20 sorghum chromosome arms. The genic regions of the sugarcane BACs shared an average of 95.2% sequence identity with sorghum, and the sorghum genome was used as a template to order sequence contigs covering 78.2% of the 20 BAC sequences. About 53.1% of the sugarcane BAC sequences are aligned with sorghum sequence. The unaligned regions contain non-coding and repetitive sequences. Within the aligned sequences, 209 genes were annotated in sugarcane and 202 in sorghum. Seventeen genes appeared to be sugarcane-specific and all validated by sugarcane ESTs, while 12 appeared sorghum-specific but only one validated by sorghum ESTs. Twelve of the 17 sugarcane-specific genes have no match in the non-redundant protein database in GenBank, perhaps encoding proteins for sugarcane-specific processes. The sorghum orthologous regions appeared to have expanded relative to sugarcane, mostly by the increase of retrotransposons. Conclusions The sugarcane and sorghum genomes are mostly collinear in the genic regions, and the sorghum genome can be used as a template for assembling much of the genic DNA of the autopolyploid sugarcane genome. The comparable gene density between sugarcane BACs and corresponding sorghum sequences defied the notion that polyploidy species might have faster pace of gene loss due to the redundancy of multiple alleles at each locus.
Resumo:
Abstract Background The mitochondrial DNA of kinetoplastid flagellates is distinctive in the eukaryotic world due to its massive size, complex form and large sequence content. Comprised of catenated maxicircles that contain rRNA and protein-coding genes and thousands of heterogeneous minicircles encoding small guide RNAs, the kinetoplast network has evolved along with an extreme form of mRNA processing in the form of uridine insertion and deletion RNA editing. Many maxicircle-encoded mRNAs cannot be translated without this post-transcriptional sequence modification. Results We present the complete sequence and annotation of the Trypanosoma cruzi maxicircles for the CL Brener and Esmeraldo strains. Gene order is syntenic with Trypanosoma brucei and Leishmania tarentolae maxicircles. The non-coding components have strain-specific repetitive regions and a variable region that is unique for each strain with the exception of a conserved sequence element that may serve as an origin of replication, but shows no sequence identity with L. tarentolae or T. brucei. Alternative assemblies of the variable region demonstrate intra-strain heterogeneity of the maxicircle population. The extent of mRNA editing required for particular genes approximates that seen in T. brucei. Extensively edited genes were more divergent among the genera than non-edited and rRNA genes. Esmeraldo contains a unique 236-bp deletion that removes the 5'-ends of ND4 and CR4 and the intergenic region. Esmeraldo shows additional insertions and deletions outside of areas edited in other species in ND5, MURF1, and MURF2, while CL Brener has a distinct insertion in MURF2. Conclusion The CL Brener and Esmeraldo maxicircles represent two of three previously defined maxicircle clades and promise utility as taxonomic markers. Restoration of the disrupted reading frames might be accomplished by strain-specific RNA editing. Elements in the non-coding region may be important for replication, transcription, and anchoring of the maxicircle within the kinetoplast network.
Resumo:
Abstract Background Ferredoxin-NADP(H) reductases (FNRs) are flavoenzymes that catalyze the electron transfer between NADP(H) and the proteins ferredoxin or flavodoxin. A number of structural features distinguish plant and bacterial FNRs, one of which is the mode of the cofactor FAD binding. Leptospira interrogans is a spirochaete parasitic bacterium capable of infecting humans and mammals in general. Leptospira interrogans FNR (LepFNR) displays low sequence identity with plant (34% with Zea mays) and bacterial (31% with Escherichia coli) FNRs. However, LepFNR contains all consensus sequences that define the plastidic class FNRs. Results The crystal structures of the FAD-containing LepFNR and the complex of the enzyme with NADP+, were solved and compared to known FNRs. The comparison reveals significant structural similarities of the enzyme with the plastidic type FNRs and differences with the bacterial enzymes. Our small angle X-ray scattering experiments show that LepFNR is a monomeric enzyme. Moreover, our biochemical data demonstrate that the LepFNR has an enzymatic activity similar to those reported for the plastidic enzymes and that is significantly different from bacterial flavoenzymes, which display lower turnover rates. Conclusion LepFNR is the first plastidic type FNR found in bacteria and, despite of its low sequence similarity with plastidic FNRs still displays high catalytic turnover rates. The typical structural and biochemical characteristics of plant FNRs unveiled for LepFNR support a notion of a putative lateral gene transfer which presumably offers Leptospira interrogans evolutionary advantages. The wealth of structural information about LepFNR provides a molecular basis for advanced drugs developments against leptospirosis.
Resumo:
Germline and early embryo development constitute ideal model systems to study the establishment of polarity, cell identity, and asymmetric cell divisions (ACDs) in plants. We describe here the function of the MATH-BTB domain protein MAB1 that is exclusively expressed in the germ lineages and the zygote of maize (Zea mays). mab1 (RNA interference [RNAi]) mutant plants display chromosome segregation defects and short spindles during meiosis that cause insufficient separation and migration of nuclei. After the meiosis-to-mitosis transition, two attached nuclei of similar identity are formed in mab1 (RNAi) mutants leading to an arrest of further germline development. Transient expression studies of MAB1 in tobacco (Nicotiana tabacum) Bright Yellow-2 cells revealed a cell cycle-dependent nuclear localization pattern but no direct colocalization with the spindle apparatus. MAB1 is able to form homodimers and interacts with the E3 ubiquitin ligase component Cullin 3a (CUL3a) in the cytoplasm, likely as a substrate-specific adapter protein. The microtubule-severing subunit p60 of katanin was identified as a candidate substrate for MAB1, suggesting that MAB1 resembles the animal key ACD regulator Maternal Effect Lethal 26 (MEL-26). In summary, our findings provide further evidence for the importance of posttranslational regulation for asymmetric divisions and germline progression in plants and identified an unstable key protein that seems to be involved in regulating the stability of a spindle apparatus regulator(s).
Resumo:
Abstract Background Plasmodium vivax is the most widely distributed human malaria, responsible for 70–80 million clinical cases each year and large socio-economical burdens for countries such as Brazil where it is the most prevalent species. Unfortunately, due to the impossibility of growing this parasite in continuous in vitro culture, research on P. vivax remains largely neglected. Methods A pilot survey of expressed sequence tags (ESTs) from the asexual blood stages of P. vivax was performed. To do so, 1,184 clones from a cDNA library constructed with parasites obtained from 10 different human patients in the Brazilian Amazon were sequenced. Sequences were automatedly processed to remove contaminants and low quality reads. A total of 806 sequences with an average length of 586 bp met such criteria and their clustering revealed 666 distinct events. The consensus sequence of each cluster and the unique sequences of the singlets were used in similarity searches against different databases that included P. vivax, Plasmodium falciparum, Plasmodium yoelii, Plasmodium knowlesi, Apicomplexa and the GenBank non-redundant database. An E-value of <10-30 was used to define a significant database match. ESTs were manually assigned a gene ontology (GO) terminology Results A total of 769 ESTs could be assigned a putative identity based upon sequence similarity to known proteins in GenBank. Moreover, 292 ESTs were annotated and a GO terminology was assigned to 164 of them. Conclusion These are the first ESTs reported for P. vivax and, as such, they represent a valuable resource to assist in the annotation of the P. vivax genome currently being sequenced. Moreover, since the GC-content of the P. vivax genome is strikingly different from that of P. falciparum, these ESTs will help in the validation of gene predictions for P. vivax and to create a gene index of this malaria parasite.