160 results for protein sequence classification
Abstract:
An approach was developed for the isolation and characterization of soybean plasma membrane-associated proteins by immunoscreening of a cDNA expression library. An antiserum was raised against purified plasma membrane vesicles. In a differential screening of approximately 500,000 plaque-forming units with the anti-(plasma membrane) serum and DNA probes derived from highly abundant clones isolated in a preliminary screening, 261 clones were selected from approximately 1,200 antiserum-positive plaques. These clones were classified into 40 groups by hybridization analysis and 5'- and 3'-terminal sequencing. By searching nucleic acid and protein sequence data bases, 11 groups of cDNAs were identified, among which valosin-containing protein (VCP), clathrin heavy chain, phospholipase C, and S-adenosylmethionine:delta 24-sterol-C-methyltransferase have not to date been cloned from plants. The remaining 29 groups did not match any current data base entries and may, therefore, represent additional or yet uncharacterized genes. A full-length cDNA encoding the soybean VCP was sequenced. The high level of amino acid identity with vertebrate VCP and yeast CDC48 protein indicates that the soybean protein is a plant homolog of vertebrate VCP and yeast CDC48 protein.
Abstract:
To discover genes involved in von Hippel-Lindau (VHL)-mediated carcinogenesis, we used renal cell carcinoma cell lines stably transfected with wild-type VHL-expressing transgenes. Large-scale RNA differential display technology applied to these cell lines identified several differentially expressed genes, including an alpha carbonic anhydrase gene, termed CA12. The deduced protein sequence was classified as a one-pass transmembrane CA possessing an apparently intact catalytic domain in the extracellular CA module. Reintroduced wild-type VHL strongly inhibited the overexpression of the CA12 gene in the parental renal cell carcinoma cell lines. Similar results were obtained with CA9, encoding another transmembrane CA with an intact catalytic domain. Although both domains of the VHL protein contribute to regulation of CA12 expression, the elongin binding domain alone could effectively regulate CA9 expression. We mapped CA12 and CA9 loci to chromosome bands 15q22 and 17q21.2 respectively, regions prone to amplification in some human cancers. Additional experiments are needed to define the role of CA IX and CA XII enzymes in the regulation of pH in the extracellular microenvironment and its potential impact on cancer cell growth.
Abstract:
In an attempt to quantify the rates of protein sequence divergence in Drosophila, we have devised a screen to differentiate between slow and fast evolving genes. We find that over one-third of randomly drawn cDNAs from a Drosophila melanogaster library do not cross-hybridize with Drosophila virilis DNA, indicating that they evolve with a very high rate. To determine the evolutionary characteristics of such protein sequences, we sequenced their homologs from a more closely related species (Drosophila yakuba). The amino acid substitution rates among these cDNAs are among the fastest known and several are only about 2-fold lower than the corresponding values for silent substitutions. An analysis of within-species polymorphisms for one of these sequences reveals an exceptionally high number of polymorphic amino acid positions, indicating that the protein is not under strong negative selection. We conclude that the Drosophila genome harbors a substantial proportion of genes with a very high divergence rate.
Abstract:
A crucial step in exploiting the information inherent in genome sequences is to assign to each protein sequence its three-dimensional fold and biological function. Here we describe fold assignment for the proteins encoded by the small genome of Mycoplasma genitalium. The assignment was carried out by our computer server (http://www.doe-mbi.ucla.edu/people/frsvr/frsvr.html), which assigns folds to amino acid sequences by comparing sequence-derived predictions with known structures. Of the total of 468 protein ORFs, 103 (22%) can be assigned a known protein fold with high confidence, as cross-validated with tests on known structures. Of these sequences, 75 (16%) show enough sequence similarity to proteins of known structure that they can also be detected by traditional sequence–sequence comparison methods. That is, the remaining 28 sequences (6%) are assignable by the sequence–structure method of the server but not by current sequence–sequence methods. Of the remaining 78% of sequences in the genome, 18% belong to membrane proteins, and the other 60% cannot be assigned, either because these sequences correspond to no presently known fold or because of insensitivity of the method. At the current rate of determination of new folds by x-ray and NMR methods, extrapolation suggests that folds will be assigned to most soluble proteins in the next decade.
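The counts and percentages quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is only a consistency check on the published figures; the variable and function names are illustrative and do not come from the paper or its server.

```python
# Counts quoted in the abstract (Mycoplasma genitalium fold assignment).
total_orfs = 468
fold_assigned = 103     # folds assigned with high confidence
seq_detectable = 75     # also detectable by sequence-sequence comparison


def pct(n, total=total_orfs):
    """Share of all ORFs, rounded to a whole percent."""
    return round(100 * n / total)


# Assignable only by the sequence-structure method.
structure_only = fold_assigned - seq_detectable

# Percent of ORFs with no high-confidence fold assignment.
unassigned_pct = 100 - pct(fold_assigned)

print(structure_only, pct(fold_assigned), pct(seq_detectable),
      pct(structure_only), unassigned_pct)
```

The unassigned share then splits into the 18% membrane proteins and the 60% of sequences without an assignable fold reported in the abstract.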
Abstract:
Abscisic acid (ABA), an apocarotenoid synthesized from cleavage of carotenoids, regulates seed maturation and stress responses in plants. The viviparous seed mutants of maize identify genes involved in synthesis and perception of ABA. Two alleles of a new mutant, viviparous14 (vp14), were identified by transposon mutagenesis. Mutant embryos had normal sensitivity to ABA, and detached leaves of mutant seedlings showed markedly higher rates of water loss than those of wild type. The ABA content of developing mutant embryos was 70% lower than that of wild type, indicating a defect in ABA biosynthesis. vp14 embryos were not deficient in epoxy-carotenoids, and extracts of vp14 embryos efficiently converted the carotenoid cleavage product, xanthoxin, to ABA, suggesting a lesion in the cleavage reaction. vp14 was cloned by transposon tagging. The VP14 protein sequence is similar to bacterial lignostilbene dioxygenases (LSD). LSD catalyzes a double-bond cleavage reaction that is closely analogous to the carotenoid cleavage reaction of ABA biosynthesis. Southern blots indicated a family of four to six related genes in maize. The Vp14 mRNA is expressed in embryos and roots and is strongly induced in leaves by water stress. A family of Vp14-related genes evidently controls the first committed step of ABA biosynthesis. These genes are likely to play a key role in the developmental and environmental control of ABA synthesis in plants.
Abstract:
The Arabidopsis PAD4 gene previously was found to be required for expression of multiple defense responses including camalexin synthesis and PR-1 gene expression in response to infection by the bacterial pathogen Pseudomonas syringae pv. maculicola. This report describes the isolation of PAD4. The predicted PAD4 protein sequence displays similarity to triacyl glycerol lipases and other esterases. The PAD4 transcript was found to accumulate after P. syringae infection or treatment with salicylic acid (SA). PAD4 transcript levels were very low in infected pad4 mutants. Treatment with SA induced expression of PAD4 mRNA in pad4–1, pad4–3, and pad4–4 plants but not in pad4–2 plants. Induction of PAD4 expression by P. syringae was independent of the regulatory factor NPR1 but induction by SA was NPR1-dependent. Taken together with the previous observation that pad4 mutants have a defect in accumulation of SA upon pathogen infection, these results suggest that PAD4 participates in a positive regulatory loop that increases SA levels, thereby activating SA-dependent defense responses.
Abstract:
S-adenosyl-l-methionine (SAM)-dependent O-methyltransferases (OMTs) catalyze the methylation of hydroxycinnamic acid derivatives for the synthesis of methylated plant polyphenolics, including lignin. The distinction in the extent of methylation of lignins in angiosperms and gymnosperms, mediated by substrate-specific OMTs, represents one of the fundamental differences in lignin biosynthesis between these two classes of plants. In angiosperms, two types of structurally and functionally distinct lignin pathway OMTs, caffeic acid 3-O-methyltransferases (CAOMTs) and caffeoyl CoA 3-O-methyltransferases (CCoAOMTs), have been reported and extensively studied. However, little is known about lignin pathway OMTs in gymnosperms. We report here the first cloning of a loblolly pine (Pinus taeda) xylem cDNA encoding a multifunctional enzyme, SAM:hydroxycinnamic Acids/hydroxycinnamoyl CoA Esters OMT (AEOMT). The deduced protein sequence of AEOMT is partially similar to, but clearly distinguishable from, that of CAOMTs and does not exhibit any significant similarity with CCoAOMT protein sequences. However, functionally, yeast-expressed AEOMT enzyme catalyzed the methylation of CAOMT substrates, caffeic and 5-hydroxyferulic acids, as well as CCoAOMT substrates, caffeoyl CoA and 5-hydroxyferuloyl CoA esters, with similar specific activities and was completely inactive with substrates associated with flavonoid synthesis. The lignin-related substrates were also efficiently methylated in crude extracts of loblolly pine secondary xylem. Our results support the notion that, in the context of amino acid sequence and biochemical function, AEOMT represents a novel SAM-dependent OMT, with both CAOMT and CCoAOMT activities and thus the potential to mediate a dual methylation pathway in lignin biosynthesis in loblolly pine xylem.
Abstract:
Pyrrolizidine alkaloids are preformed plant defense compounds with sporadic phylogenetic distribution. They are thought to have evolved in response to the selective pressure of herbivory. The first pathway-specific intermediate of these alkaloids is the rare polyamine homospermidine, which is synthesized by homospermidine synthase (HSS). The HSS gene from Senecio vernalis was cloned and shown to be derived from the deoxyhypusine synthase (DHS) gene, which is highly conserved among all eukaryotes and archaebacteria. DHS catalyzes the first step in the activation of translation initiation factor 5A (eIF5A), which is essential for eukaryotic cell proliferation and which acts as a cofactor of the HIV-1 Rev regulatory protein. Sequence comparison provides direct evidence for the evolutionary recruitment of an essential gene of primary metabolism (DHS) for the origin of the committing step (HSS) in the biosynthesis of pyrrolizidine alkaloids.
Abstract:
All but two genes involved in the ergosterol biosynthetic pathway in Saccharomyces cerevisiae have been cloned, and their corresponding mutants have been described. The remaining genes encode the C-3 sterol dehydrogenase (C-4 decarboxylase) and the 3-keto sterol reductase and, in concert with the C-4 sterol methyloxidase (ERG25), catalyze the sequential removal of the two methyl groups at the sterol C-4 position. The protein sequence of the Nocardia sp. NAD(P)-dependent cholesterol dehydrogenase responsible for the conversion of cholesterol to its 3-keto derivative shows 30% similarity to a 329-aa Saccharomyces ORF, YGL001c, suggesting a possible role of YGL001c in sterol decarboxylation. The YGL001c ORF was disrupted in a diploid strain, and the segregants were plated onto sterol-supplemented media under anaerobic growth conditions. Segregants containing the YGL001c disruption were not viable after transfer to fresh, sterol-supplemented media. However, one segregant was able to grow, and genetic analysis indicated that it contained a hem3 mutation. The YGL001c (ERG26) disruption also was viable in a hem1Δ strain grown in the presence of ergosterol. A strain carrying the erg26 mutation in an erg1 (squalene epoxidase) background also was viable in ergosterol-supplemented media. We demonstrated that erg26 mutants grown on various sterol and heme-supplemented media accumulate nonesterified carboxylic acid sterols such as 4β,14α-dimethyl-4α-carboxy-cholesta-8,24-dien-3β-ol and 4β-methyl-4α-carboxy-cholesta-8,24-dien-3β-ol, the predicted substrates for the C-3 sterol dehydrogenase. Accumulation of these sterol molecules in a heme-competent erg26 strain results in an accumulation of toxic oxygenated sterol intermediates that prevent growth, even in the presence of exogenously added sterol.
Abstract:
hDlg, the human homologue of the Drosophila Discs-large (Dlg) tumor suppressor protein, is known to interact with the tumor suppressor protein APC and the human papillomavirus E6 transforming protein. In a two-hybrid screen, we identified a 322-aa serine/threonine kinase that binds to the PDZ2 domain of hDlg. The mRNA for this PDZ-binding kinase, or PBK, is most abundant in placenta and absent from adult brain tissue. The protein sequence of PBK has all the characteristic protein kinase subdomains and a C-terminal PDZ-binding T/SXV motif. In vitro, PBK binds specifically to PDZ2 of hDlg through its C-terminal T/SXV motif. PBK and hDlg are phosphorylated at mitosis in HeLa cells, and the mitotic phosphorylation of PBK is required for its kinase activity. In vitro, cdc2/cyclin B phosphorylates PBK. This evidence shows how PBK could link hDlg or other PDZ-containing proteins to signal transduction pathways regulating the cell cycle or cellular proliferation.
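The C-terminal T/SXV motif mentioned in this abstract (Thr or Ser, any residue, then Val at the very end of the chain) is simple enough to test with a regular expression. The sketch below is illustrative only: the function name is hypothetical and the example sequences are made up, not taken from PBK.

```python
import re

# T/SXV: Thr or Ser, then any residue, then Val, anchored at the C terminus.
TSXV_AT_C_TERMINUS = re.compile(r"[TS].V$")


def has_c_terminal_tsxv(protein_seq: str) -> bool:
    """Return True if a one-letter protein sequence ends in a T/SXV motif."""
    cleaned = protein_seq.strip().upper()
    return TSXV_AT_C_TERMINUS.search(cleaned) is not None


# Invented sequences for illustration (not real proteins).
print(has_c_terminal_tsxv("MKLAETDV"))  # ends in T-D-V
print(has_c_terminal_tsxv("MKLAETDA"))  # last residue is not Val
print(has_c_terminal_tsxv("MTEVKLA"))   # motif is internal, not C-terminal
```

Anchoring the pattern with `$` matters here, because the abstract describes binding through the C-terminal motif, so an internal T/SXV triplet should not count.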
Abstract:
We report DNA and predicted protein sequence similarities, implying homology, among genes of double-stranded DNA (dsDNA) bacteriophages and prophages spanning a broad phylogenetic range of host bacteria. The sequence matches reported here establish genetic connections, not always direct, among the lambdoid phages of Escherichia coli, phage φC31 of Streptomyces, phages of Mycobacterium, a previously unrecognized cryptic prophage, φflu, in the Haemophilus influenzae genome, and two small prophage-like elements, φRv1 and φRv2, in the genome of Mycobacterium tuberculosis. The results imply that these phage genes, and very possibly all of the dsDNA tailed phages, share common ancestry. We propose a model for the genetic structure and dynamics of the global phage population in which all dsDNA phage genomes are mosaics with access, by horizontal exchange, to a large common genetic pool but in which access to the gene pool is not uniform for all phage.
Abstract:
A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most probable protein sequence, without distinguishing Leu/Ile. Minimum human input is necessary for the data reduction and interpretation. No extra chemistry is necessary to distinguish N- and C-terminal fragments in the mass spectra, as this is determined from the electron capture dissociation data. With parts-per-million mass accuracy (now available by using higher field Fourier transform MS instruments), the complete sequences of ubiquitin (8.6 kDa) and melittin (2.8 kDa) were predicted correctly by the program. The data available also provided 91% of the cytochrome c (12.4 kDa) sequence (essentially complete except for the tandem MS-resistant region K13–V20 that contains the cyclic heme). Uncorrected mass values from a 6-T instrument still gave 86% of the sequence for ubiquitin, except for distinguishing Gln/Lys. Extensive sequencing of larger proteins should be possible by applying the algorithm to pieces of ≈10-kDa size, such as products of limited proteolysis.
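The core idea of reading a sequence from fragment masses can be sketched as follows: consecutive fragments from the same terminus differ by one residue mass, so each mass difference is looked up in a table of residue masses. This is a minimal illustration under stated assumptions, not the program described in the abstract. The function names, the fixed tolerance in daltons, and the example ladder are all hypothetical. The masses are standard monoisotopic residue masses. Leu and Ile share one entry because they are isobaric, and Gln and Lys differ by only about 0.036 Da, which is why the abstract notes that uncorrected masses could not separate them.

```python
# Monoisotopic residue masses in Da; Leu and Ile are isobaric.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L/I": 113.08406,
    "N": 114.04293, "D": 115.02694, "Q": 128.05858, "K": 128.09496,
    "E": 129.04259, "M": 131.04049, "H": 137.05891, "F": 147.06841,
    "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}


def candidates(delta, tol_da):
    """Residues whose mass lies within tol_da of a fragment mass difference."""
    return [res for res, mass in RESIDUE_MASS.items()
            if abs(mass - delta) <= tol_da]


def read_ladder(fragment_masses, tol_da):
    """Call one residue per step of a sorted same-terminus fragment ladder."""
    calls = []
    for lower, upper in zip(fragment_masses, fragment_masses[1:]):
        hits = candidates(upper - lower, tol_da)
        calls.append("|".join(hits) if hits else "?")
    return calls


# Hypothetical ladder: an arbitrary starting fragment extended by K, L, Q.
ladder = [1000.0]
for res in ("K", "L/I", "Q"):
    ladder.append(ladder[-1] + RESIDUE_MASS[res])

tight = read_ladder(ladder, tol_da=0.01)   # tight mass tolerance
loose = read_ladder(ladder, tol_da=0.05)   # looser tolerance
print(tight)
print(loose)
```

With the tighter tolerance each step resolves to a single residue, while the looser one leaves Gln and Lys ambiguous. For scale, 1 ppm on an 8.6-kDa ion is roughly 0.009 Da.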
Abstract:
The proliferation of various tumors is inhibited by the antagonists of growth hormone-releasing hormone (GHRH) in vitro and in vivo, but the receptors mediating the effects of GHRH antagonists have not been identified so far. Using an approach based on PCR, we detected two major splice variants (SVs) of mRNA for human GHRH receptor (GHRH-R) in human cancer cell lines, including LNCaP prostatic, MiaPaCa-2 pancreatic, MDA-MB-468 breast, OV-1063 ovarian, and H-69 small-cell lung carcinomas. In addition, high-affinity, low-capacity binding sites for GHRH antagonists were found on the membranes of cancer cell lines such as MiaPaCa-2 that are negative for the vasoactive intestinal peptide/pituitary adenylate cyclase-activating polypeptide receptor (VPAC-R) or lines such as LNCaP that are positive for VPAC-R. Sequence analysis of cDNAs revealed that the first three exons in SV1 and SV2 are replaced by a fragment of retained intron 3 having a new putative in-frame start codon. The rest of the coding region of SV1 is identical to that of human pituitary GHRH-R, whereas in SV2 exon 7 is spliced out, resulting in a 1-nt upstream frameshift, which leads to a premature stop codon in exon 8. The intronic sequence may encode a distinct 25-aa fragment of the N-terminal extracellular domain, which could serve as a proposed signal peptide. The continuation of the deduced protein sequence coded by exons 4–13 in SV1 is identical to that of pituitary GHRH-R. SV2 may encode a GHRH-R isoform truncated after the second transmembrane domain. Thus SVs of GHRH-Rs have now been identified in human extrapituitary cells. The findings support the view that distinct receptors are expressed on human cancer cells, which may mediate the antiproliferative effect of GHRH antagonists.
Abstract:
Ubiquitin is a highly conserved protein that is encoded by a multigene family. It is generally believed that this gene family is subject to concerted evolution, which homogenizes the member genes of the family. However, protein homogeneity can be attained also by strong purifying selection. We therefore studied the proportion (pS) of synonymous nucleotide differences between members of the ubiquitin gene family from 28 species of fungi, plants, and animals. The results have shown that pS is generally very high and is often close to the saturation level, although the protein sequence is virtually identical for all ubiquitins from fungi, plants, and animals. A small proportion of species showed a low level of pS values, but these values appeared to be caused by recent gene duplication. It was also found that the number of repeat copies of the gene family varies considerably with species, and some species harbor pseudogenes. These observations suggest that the members of this gene family evolve almost independently by silent nucleotide substitution and are subjected to birth-and-death evolution at the DNA level.
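The contrast this abstract draws, many silent nucleotide differences alongside an unchanged protein, can be illustrated by classifying codon differences between two aligned coding sequences. The sketch below is a simplified codon-level count and not the pS statistic itself, which also normalizes by the number of synonymous sites. The function name and both nucleotide fragments are invented for illustration, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def codon_differences(seq_a, seq_b):
    """Count differing codons as (synonymous, nonsynonymous).

    Both inputs must be aligned, gap-free, in-frame DNA of equal length.
    """
    if len(seq_a) != len(seq_b) or len(seq_a) % 3 != 0:
        raise ValueError("sequences must be equal length and a multiple of 3")
    synonymous = nonsynonymous = 0
    for i in range(0, len(seq_a), 3):
        codon_a, codon_b = seq_a[i:i + 3], seq_b[i:i + 3]
        if codon_a == codon_b:
            continue
        if CODON_TABLE[codon_a] == CODON_TABLE[codon_b]:
            synonymous += 1
        else:
            nonsynonymous += 1
    return synonymous, nonsynonymous


# Two made-up fragments that both translate to M-Q-I-F-V-K.
copy_1 = "ATGCAGATCTTCGTGAAG"
copy_2 = "ATGCAAATTTTTGTCAAA"
print(codon_differences(copy_1, copy_2))
```

In this toy pair five of six codons differ, yet none changes the encoded amino acid, which is the pattern the abstract reports for ubiquitin gene copies.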
Abstract:
Aldose reductase (ALR2), a NADPH-dependent aldo-keto reductase (AKR), is widely distributed in mammalian tissues and has been implicated in complications of diabetes, including diabetic nephropathy. To identify a renal-specific reductase belonging to the AKR family, representational difference analyses of cDNA from diabetic mouse kidney were performed. A full-length cDNA with an ORF of 855 nt and yielding a ≈1.5-kb mRNA transcript was isolated from a mouse kidney library. Human and rat homologues also were isolated, and they had ≈91% and ≈97% amino acid identity with mouse protein. In vitro translation of the cDNA yielded a protein product of ≈33 kDa. Northern and Western blot analyses, using the cDNA and antirecombinant protein antibody, revealed its expression exclusively confined to the kidney. Like ALR2, the expression was up-regulated in diabetic kidneys. Its mRNA and protein expression was restricted to renal proximal tubules. The gene codistributed with neither Tamm–Horsfall protein nor aquaporin-2. The deduced protein sequence revealed an AKR-3 motif located near the N terminus, unlike the other AKR family members where it is confined to the C terminus. Fluorescence quenching and reactive blue agarose chromatography studies revealed that it binds to NADPH with high affinity (Kd for NADPH = 66.9 ± 2.3 nM). This binding domain is a tetrapeptide (Met-Ala-Lys-Ser) located within the AKR-3 motif that is similar to the other AKR members. The identified protein is designated as RSOR because it is renal-specific with properties of an oxido-reductase, and like ALR2 it may be relevant in the renal complications of diabetes mellitus.