77 results for sequence identity
Abstract:
The RegA proteins from bacteriophages T4 and RB69 are translational repressors that control the expression of multiple phage mRNAs. RegA proteins from the two phages share 78% sequence identity; however, in vivo expression studies have suggested that the RB69 RegA protein binds target RNAs with a higher affinity than T4 RegA protein. To study the RNA binding properties of T4 and RB69 RegA proteins more directly, the binding sites of RB69 RegA protein on synthetic RNAs corresponding to the translation initiation region of two RB69 target genes were mapped by RNase protection assays. These assays revealed that RB69 RegA protein protects nucleotides –9 to –3 (relative to the start codon) on RB69 gene 44, which contains the sequence GAAAAUU. On RB69 gene 45, the protected site (nucleotides –8 to –3) contains a similar purine-rich sequence: GAAAUA. Interestingly, T4 RegA protein protected the same nucleotides on these RNAs. To examine the specificity of RNA binding, quantitative RNA gel shift assays were performed with synthetic RNAs corresponding to recognition elements (REs) in three T4 and three RB69 mRNAs. Comparative gel shift assays demonstrated that RB69 RegA protein has an ∼7-fold higher affinity for T4 gene 44 RE RNA than T4 RegA protein. RB69 RegA protein also binds RB69 gene 44 RE RNA with a 4-fold higher affinity than T4 RegA protein. On the other hand, T4 RegA exhibited a higher affinity than RB69 RegA protein for RB69 gene 45 RE RNA. With respect to their affinities for cognate RNAs, both RegA proteins exhibited the following hierarchy of affinities: gene 44 > gene 45 > regA. Interestingly, T4 RegA exhibited the highest affinity towards RB69 gene 45 RE RNA, whereas RB69 RegA protein had the highest affinity for T4 gene 44 RE RNA. The helix–loop groove RNA binding motif of T4 RegA protein is fully conserved in RB69 RegA protein. However, homology modeling of the structure of RB69 RegA protein reveals that the divergent residues are clustered in two areas of the surface, and that there are two large areas of high conservation near the helix–loop groove, which may also play a role in RNA binding.
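Note on the term used throughout these results: a figure such as "78% sequence identity" is usually obtained by aligning two sequences and counting the aligned positions that carry the same residue. The following is a minimal sketch of that calculation, assuming the two sequences are already aligned and that gapped columns are excluded from the denominator. Other conventions exist (for example, dividing by the full alignment length), and the abstracts listed here do not state which one was used. The function name and the toy sequences are hypothetical and are not RegA sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0  # columns where both sequences have a residue
    matches = 0   # compared columns where the residues are identical
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # assumption: gapped columns are ignored
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical toy alignment, for illustration only.
seq_a = "MKTAYIAKQR"
seq_b = "MKSAYL-KQR"
print(f"{percent_identity(seq_a, seq_b):.1f}% identity")
```

In this toy alignment, 7 of the 9 ungapped columns match, so the reported value depends on the choice of denominator as much as on the sequences themselves.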
Abstract:
The Dali Domain Dictionary (http://www.ebi.ac.uk/dali/domain) is a numerical taxonomy of all known structures in the Protein Data Bank (PDB). The taxonomy is derived fully automatically from measurements of structural, functional and sequence similarities. Here, we report the extension of the classification to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families). The computational definitions of attractors and functional families are new. In September 2000, the Dali classification contained 10 531 PDB entries comprising 17 101 chains, which were partitioned into five attractor regions, 1375 fold types, 2582 functional families and 3724 domain sequence families. Sequence families were further associated with 99 582 unique homologous sequences in the HSSP database, which increases the number of effectively known structures several-fold. The resulting database contains the description of protein domain architecture, the definition of structural neighbours around each known structure, the definition of structurally conserved cores and a comprehensive library of explicit multiple alignments of distantly related protein families.
Abstract:
The downstream prion-like protein (doppel, or Dpl) is a paralog of the cellular prion protein, PrPC. The two proteins have ≈25% sequence identity, but seem to have distinct physiologic roles. Unlike PrPC, Dpl does not support prion replication; instead, overexpression of Dpl in the brain seems to cause a completely different neurodegenerative disease. We report the solution structure of a fragment of recombinant mouse Dpl (residues 26–157) containing a globular domain with three helices and a small amount of β-structure. Overall, the topology of Dpl is very similar to that of PrPC. Significant differences include a marked kink in one of the helices in Dpl, and a different orientation of the two short β-strands. Although the two proteins most likely arose through duplication of a single ancestral gene, the relationship is now so distant that only the structures retain similarity; the functions have diversified along with the sequence.
Abstract:
We introduce a quantitative framework for assessing the generation of crossovers in DNA shuffling experiments. The approach uses free energy calculations and complete sequence information to model the annealing process. Statistics obtained for the annealing events then are combined with a reassembly algorithm to infer crossover allocation in the reassembled sequences. The fraction of reassembled sequences containing zero, one, two, or more crossovers and the probability that a given nucleotide position in a reassembled sequence is the site of a crossover event are estimated. Comparisons of the predictions against experimental data for five example systems demonstrate good agreement despite the fact that no adjustable parameters are used. An in silico case study of a set of 12 subtilases examines the effect of fragmentation length, annealing temperature, sequence identity and number of shuffled sequences on the number, type, and distribution of crossovers. A computational verification of crossover aggregation in regions of near-perfect sequence identity and the presence of synergistic reassembly in family DNA shuffling is obtained.
Abstract:
Two cellular retinol-binding proteins (CRBP I and II) with distinct tissue distributions and retinoid-binding properties have been recognized thus far in mammals. Here, we report the identification of a human retinol-binding protein resembling type I (55.6% identity) and type II (49.6% identity) CRBPs, but with a unique H residue in the retinoid-binding site and a distinctively different tissue distribution. Additionally, this binding protein (CRBP III) exhibits a remarkable sequence identity (62.2%) with the recently identified ι-crystallin/CRBP of the diurnal gecko Lygodactylus picturatus [Werten, P. J. L., Röll, B., van Alten, D. M. F. & de Jong, W. W. (2000) Proc. Natl. Acad. Sci. USA 97, 3282–3287 (First Published March 21, 2000; 10.1073/pnas.050500597)]. CRBP III and all-trans-retinol form a complex (Kd ≈ 60 nM), the absorption spectrum of which is characterized by the peculiar fine structure typical of the spectra of holo-CRBP I and II. As revealed by a 2.3-Å x-ray molecular model of apo-CRBP III, the amino acid residues that line the retinol-binding site in CRBP I and II are positioned nearly identically in the structure of CRBP III. At variance with the human CRBP I and II mRNAs, which are most abundant in ovary and intestine, respectively, the CRBP III mRNA is expressed at the highest levels in kidney and liver thus suggesting a prominent role for human CRBP III as an intracellular mediator of retinol metabolism in these tissues.
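Note on the reported dissociation constant: for a simple 1:1 binding equilibrium, the fraction of protein occupied by ligand is [L] / (Kd + [L]), where [L] is the free ligand concentration. The sketch below evaluates that expression with the Kd of about 60 nM quoted above as the default. It assumes single-site, non-cooperative binding and is not the fitting procedure used in the study; the function name is hypothetical.

```python
def fraction_bound(free_ligand_nM: float, kd_nM: float = 60.0) -> float:
    """Fraction of protein bound for 1:1 binding at a given free ligand concentration."""
    if free_ligand_nM < 0 or kd_nM <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    # Single-site binding isotherm: theta = [L] / (Kd + [L])
    return free_ligand_nM / (kd_nM + free_ligand_nM)


# Half of the protein is occupied when the free ligand equals Kd.
for conc in (6.0, 60.0, 540.0):
    print(f"[L] = {conc:6.1f} nM -> fraction bound = {fraction_bound(conc):.2f}")
```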
Abstract:
The flavoprotein (R)-(+)-mandelonitrile lyase (MDL; EC 4.1.2.10), which plays a key role in cyanogenesis in rosaceous stone fruits, occurs in black cherry (Prunus serotina Ehrh.) homogenates as several closely related isoforms. Biochemical and molecular biological methods were used to investigate MDL microheterogeneity and function in this species. Three novel MDL cDNAs of high sequence identity (designated MDL2, MDL4, and MDL5) were isolated. Like MDL1 and MDL3 cDNAs (Z. Hu, J.E. Poulton [1997] Plant Physiol 115: 1359–1369), they had open reading frames that predicted a flavin adenine dinucleotide-binding site, multiple N-glycosylation sites, and an N-terminal signal sequence. The N terminus of an MDL isoform purified from seedlings matched the derived amino acid sequence of the MDL4 cDNA. Genomic sequences corresponding to the MDL1, MDL2, and MDL4 cDNAs were obtained by polymerase chain reaction amplification of genomic DNA. Like the previously reported mdl3 gene, these genes are interrupted at identical positions by three short, conserved introns. Given their overall similarity, we conclude that the genes mdl1, mdl2, mdl3, mdl4, and mdl5 are derived from a common ancestral gene and constitute members of a gene family. Genomic Southern-blot analysis showed that this family has approximately eight members. Northern-blot analysis using gene-specific probes revealed differential expression of the genes mdl1, mdl2, mdl3, mdl4, and mdl5.
Abstract:
A previously unidentified gonadotropin-regulated long chain acyl-CoA synthetase (GR-LACS) was cloned and characterized as a 79-kDa cytoplasmic protein expressed in Leydig cells of the rat testis. GR-LACS shares sequence identity with two conserved regions of the LACS and luciferase families, including the ATP/AMP binding domain and the 25-aa fatty acyl-CoA synthetase signature motif, but displays low overall amino acid similarities (23–28%). GR-LACS mRNA is expressed abundantly in Leydig cells of the adult testis and to a lesser degree in the seminiferous tubules in spermatogonia and Sertoli cells. It is also observed in ovary and brain. Immunoreactive protein expression was observed mainly in Leydig cells and minimally in the tubules but was not detected in other tissues. In vivo, treatment with a desensitizing dose of human chorionic gonadotropin caused transcriptional down-regulation of GR-LACS expression in Leydig cells. The expressed protein present in the cytoplasm of transfected cells displayed acyl-CoA synthetase activity for long chain fatty acid substrates. GR-LACS may contribute to the provision of energy requirements and to the biosynthesis of steroid precursors and could participate through acyl-CoA's multiple functions in the regulation of the male gonad.
Abstract:
The determination of complete genome sequences provides us with an opportunity to describe and analyze evolution at the comprehensive level of genomes. Here we compare nine genomes with respect to their protein coding genes at two levels: (i) we compare genomes as “bags of genes” and measure the fraction of orthologs shared between genomes and (ii) we quantify correlations between genes with respect to their relative positions in genomes. Distances between the genomes are related to their divergence times, measured as the number of amino acid substitutions per site in a set of 34 orthologous genes that are shared among all the genomes compared. We establish a hierarchy of rates at which genomes have changed during evolution. Protein sequence identity is the most conserved, followed by the complement of genes within the genome. Next is the degree of conservation of the order of genes, whereas gene regulation appears to evolve at the highest rate. Finally, we show that some genomes are more highly organized than others: they show a higher degree of the clustering of genes that have orthologs in other genomes.
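Note on "substitutions per site": observed percent identity underestimates evolutionary distance because repeated changes at the same site are counted once at most. One common correction is the Poisson distance d = -ln(1 - p), where p is the fraction of aligned sites that differ. The sketch below applies it as an illustration only. The abstract does not say which distance estimator the authors used, and the function name is hypothetical.

```python
import math


def poisson_distance(percent_identity: float) -> float:
    """Estimated substitutions per site from percent identity (Poisson correction)."""
    p = 1.0 - percent_identity / 100.0  # fraction of differing sites
    if not 0.0 <= p < 1.0:
        raise ValueError("percent identity must be in the range (0, 100]")
    return -math.log(1.0 - p)


# The correction grows as identity falls.
for identity in (90.0, 78.0, 50.0, 25.0):
    print(f"{identity:5.1f}% identity -> d = {poisson_distance(identity):.3f}")
```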
Abstract:
The corticotropin-releasing factor (CRF) family of neuropeptides includes the mammalian peptides CRF, urocortin, and urocortin II, as well as piscine urotensin I and frog sauvagine. The mammalian peptides signal through two G protein-coupled receptor types to modulate endocrine, autonomic, and behavioral responses to stress, as well as a range of peripheral (cardiovascular, gastrointestinal, and immune) activities. The three previously known ligands are differentially distributed anatomically and have distinct specificities for the two major receptor types. Here we describe the characterization of an additional CRF-related peptide, urocortin III, in the human and mouse. In searching the public human genome databases we found a partial expressed sequence tagged (EST) clone with significant sequence identity to mammalian and fish urocortin-related peptides. By using primers based on the human EST sequence, a full-length human clone was isolated from genomic DNA that encodes a protein that includes a predicted putative 38-aa peptide structurally related to other known family members. With a human probe, we then cloned the mouse ortholog from a genomic library. Human and mouse urocortin III share 90% identity in the 38-aa putative mature peptide. In the peptide coding region, both human and mouse urocortin III are 76% identical to pufferfish urocortin-related peptide and more distantly related to urocortin II, CRF, and urocortin from other mammalian species. Mouse urocortin III mRNA expression is found in areas of the brain including the hypothalamus, amygdala, and brainstem, but is not evident in the cerebellum, pituitary, or cerebral cortex; it is also expressed peripherally in small intestine and skin. Urocortin III is selective for type 2 CRF receptors and thus represents another potential endogenous ligand for these receptors.
Abstract:
It was reported previously that enolase enzyme activity and ENO1 transcript levels are induced by anaerobic stress in maize (Zea mays). Here we show that not all isoforms of maize enolase are anaerobically induced. We cloned and sequenced a second enolase cDNA clone (pENO2) from maize. Sequence analysis showed that pENO2 shares 75.6% nucleotide and 89.5% deduced amino acid sequence identity with pENO1 and is encoded by a distinct gene. Expression of ENO2 is constitutive under aerobic conditions, whereas ENO1 levels are induced 10-fold in maize roots after 24 h of anaerobic treatment. Western-blot analysis and N-terminal sequencing of in vivo-labeled maize roots identified two major proteins selectively synthesized upon anaerobic stress as isozymes of enolase. We describe the expression of enolase in maize roots under anaerobic stress.
Abstract:
We studied aquaporins in maize (Zea mays), an important crop in which numerous studies on plant water relations have been carried out. A maize cDNA, ZmTIP1, was isolated by reverse transcription-coupled PCR using conserved motifs from plant aquaporins. The derived amino acid sequence of ZmTIP1 shows 76% sequence identity with the tonoplast aquaporin γ-TIP (tonoplast intrinsic protein) from Arabidopsis. Expression of ZmTIP1 in Xenopus laevis oocytes showed that it increased the osmotic water permeability of oocytes 5-fold; this water transport was inhibited by mercuric chloride. A cross-reacting antiserum made against bean α-TIP was used for immunocytochemical localization of ZmTIP1. These results indicate that this and/or other aquaporins are abundantly present in the small vacuoles of meristematic cells. Northern analysis demonstrated that ZmTIP1 is expressed in all plant organs. In situ hybridization showed high ZmTIP1 expression in meristems and zones of cell enlargement: tips of primary and lateral roots, leaf primordia, and male and female inflorescence meristems. The high ZmTIP1 expression in meristems and expanding cells suggests that ZmTIP1 is needed (a) for vacuole biogenesis and (b) to support the rapid influx of water into vacuoles during cell expansion.
Abstract:
Previously we reported that oxalate oxidase activity increases in extracts of barley (Hordeum vulgare) leaves in response to the powdery mildew fungus (Blumeria [syn. Erysiphe] graminis f.sp. hordei) and proposed this as a source of H2O2 during plant-pathogen interactions. In this paper we show that the N terminus of the major pathogen-response oxalate oxidase has a high degree of sequence identity to previously characterized germin-like oxalate oxidases. Two cDNAs were isolated, pHvOxOa, which represents this major enzyme, and pHvOxOb', representing a closely related enzyme. Our data suggest the presence of only two oxalate oxidase genes in the barley genome, i.e. a gene encoding HvOxOa, which possibly exists in several copies, and a single-copy gene encoding HvOxOb. The use of 3′ end gene-specific probes has allowed us to demonstrate that the HvOxOa transcript accumulates to 6 times the level of the HvOxOb transcript in response to the powdery mildew fungus. The transcripts were detected in both compatible and incompatible interactions with a similar accumulation pattern. The oxalate oxidase is found exclusively in the leaf mesophyll, where it is cell wall located. A model for a signal transduction pathway in which oxalate oxidase plays a central role is proposed for the regulation of the hypersensitive response.
Abstract:
Phosphorus is a major nutrient acquired by roots via high-affinity inorganic phosphate (Pi) transporters. In this paper, we describe the tissue-specific regulation of tomato (Lycopersicon esculentum L.) Pi-transporter genes by Pi. The encoded peptides of the LePT1 and LePT2 genes belong to a family of 12 membrane-spanning domain proteins and show a high degree of sequence identity to known high-affinity Pi transporters. Both genes are highly expressed in roots, although there is some expression of LePT1 in leaves. Their expression is markedly induced by Pi starvation but not by starvation of nitrogen, potassium, or iron. The transcripts are primarily localized in root epidermis under Pi starvation. Accumulation of LePT1 message was also observed in palisade parenchyma cells of Pi-starved leaves. Our data suggest that the epidermally localized Pi transporters may play a significant role in acquiring the nutrient under natural conditions. Divided root-system studies support the hypothesis that signal(s) for the Pi-starvation response may arise internally because of the changes in cellular concentration of phosphorus.
Abstract:
Structural studies of viral membrane fusion proteins suggest that a “trimer-of-hairpins” motif plays a critical role in the membrane fusion process of many enveloped viruses. In this motif, a coiled coil (formed by homotrimeric association of the N-terminal regions of the protein) is surrounded by three C-terminal regions that pack against the coiled coil in an oblique antiparallel manner. The resulting trimer-of-hairpins structure serves to bring the viral and cellular membranes together for fusion. LearnCoil-VMF, a computational program developed to recognize coiled coil-like regions that form the trimer-of-hairpins motif, predicts these regions in the membrane fusion protein of the Visna virus. Peptides corresponding to the computationally identified sequences were synthesized, and the soluble core of the Visna membrane fusion protein was reconstituted in solution. Its crystal structure at 1.5-Å resolution demonstrates that a trimer-of-hairpins structure is formed. Remarkably, despite less than 23% sequence identity, the ectodomains in Visna and HIV-1 envelope glycoproteins show detailed structural conservation, especially within the area of a hydrophobic pocket in the central coiled coil currently being targeted for the development of new anti-HIV drugs.
Abstract:
The absence of the fragile X mental retardation protein (FMRP), encoded by the FMR1 gene, is responsible for pathologic manifestations in the Fragile X Syndrome, the most frequent cause of inherited mental retardation. FMRP is an RNA-binding protein associated with polysomes as part of a messenger ribonucleoprotein (mRNP) complex. Although its function is poorly understood, various observations suggest a role in local protein translation at neuronal dendrites and in dendritic spine maturation. We present here the identification of CYFIP1/2 (Cytoplasmic FMRP Interacting Proteins) as FMRP interactors. CYFIP1/2 share 88% amino acid sequence identity and represent the two members in humans of a highly conserved protein family. Remarkably, whereas CYFIP2 also interacts with the FMRP-related proteins FXR1P/2P, CYFIP1 interacts exclusively with FMRP. FMRP–CYFIP interaction involves the domain of FMRP also mediating homo- and heteromerization, thus suggesting a competition between interaction among the FXR proteins and interaction with CYFIP. CYFIP1/2 are proteins of unknown function, but CYFIP1 has recently been shown to interact with the small GTPase Rac1, which is implicated in development and maintenance of neuronal structures. Consistent with FMRP and Rac1 localization in dendritic fine structures, CYFIP1/2 are present in synaptosomal extracts.