993 resultados para conserved noncoding sequence
Resumo:
The involvement of a conserved serine (Ser196 at the mu-, Ser177 at the delta-, and Ser187 at the kappa-opioid receptor) in receptor activation is demonstrated by site-directed mutagenesis. It was initially observed during our functional screening of a mu/delta-opioid chimeric receptor, mu delta2, that classical opioid antagonists such as naloxone, naltrexone, naltriben, and H-Tyr-Tic[psi,CH2NH]Phe-Phe-OH (TIPPpsi; Tic = 1,2,3,4-tetrahydroisoquinoline-3-carboxylic acid) could inhibit forskolin-stimulated adenylyl cyclase activity in CHO cells stably expressing the chimeric receptor. Antagonists also activated the G protein-coupled inward rectifying potassium channel (GIRK1) in Xenopus oocytes coexpressing the mu delta2 opioid receptor and the GIRK1 channel. By sequence analysis and back mutation, it was determined that the observed antagonist activity was due to the mutation of a conserved serine to leucine in the fourth transmembrane domain (S196L). The importance of this serine was further demonstrated by analogous mutations created in the mu-opioid receptor (MORS196L) and delta-opioid receptor (DORS177L), in which classical opioid antagonists could inhibit forskolin-stimulated adenylyl cyclase activity in CHO cells stably expressing either MORS196L or DORS177L. Again, antagonists could activate the GIRK1 channel coexpressed with either MORS196L or DORS177L in Xenopus oocytes. These data taken together suggest a crucial role for this serine residue in opioid receptor activation.
Resumo:
For mammals beta2-microglobulin (beta2m), the light chain of major histocompatibility complex (MHC) class I molecules, is invariant (or highly conserved) and is encoded by a single gene unlinked to the MHC. We find that beta2m of a salmonid fish, the rainbow trout (Oncorhynchus mykiss), does not conform to the mammalian paradigm. Ten of 12 randomly selected beta2m cDNA clones from an individual fish have different nucleotide sequences. A complex restriction fragment length polymorphism pattern is observed with rainbow trout, suggesting multiple beta2m genes in the genome, in excess of the two genes expected from the ancestral salmonid tetraploidy. Additional duplication and diversification of the beta2m genes might have occurred subsequently. Variation in the beta2m cDNA sequences is mainly at sites that do not perturb the structure of the mature beta2m protein, showing that the observed diversity of the trout beta2m genes is not primarily a result of pathogen selection.
Resumo:
Multiubiquitin chain attachment is a key step leading to the selective degradation of abnormal polypeptides and many important regulatory proteins by the eukaryotic 26S proteasome. However, the mechanism by which the 26S complex recognizes this posttranslational modification is unknown. Using synthetic multiubiquitin chains to probe an expression library for interacting proteins, we have isolated an Arabidopsis cDNA, designated MBP1, that encodes a 41-kDa acidic protein exhibiting high affinity for chains, especially those containing four or more ubiquitins. Based on similar physical and immunological properties, multiubiquitin binding affinities, and peptide sequence, MBP1 is homologous to subunit 5a of the human 26S proteasome. Structurally related proteins also exist in yeast, Caenorhabditis, and other plant species. Given their binding properties, association with the 26S proteasome, and widespread distribution, MBP1, S5a, and related proteins likely function as essential ubiquitin recognition components of the 26S proteasome.
Resumo:
Using the yeast two-hybrid system we have identified a human protein, GAIP (G Alpha Interacting Protein), that specifically interacts with the heterotrimeric GTP-binding protein G alpha i3. Interaction was verified by specific binding of in vitro-translated G alpha i3 with a GAIP-glutathione S-transferase fusion protein. GAIP is a small protein (217 amino acids, 24 kDa) that contains two potential phosphorylation sites for protein kinase C and seven for casein kinase 2. GAIP shows high homology to two previously identified human proteins, GOS8 and 1R20, two Caenorhabditis elegans proteins, CO5B5.7 and C29H12.3, and the FLBA gene product in Aspergillus nidulans--all of unknown function. Significant homology was also found to the SST2 gene product in Saccharomyces cerevisiae that is known to interact with a yeast G alpha subunit (Gpa1). A highly conserved core domain of 125 amino acids characterizes this family of proteins. Analysis of deletion mutants demonstrated that the core domain is the site of GAIP's interaction with G alpha i3. GAIP is likely to be an early inducible phosphoprotein, as its cDNA contains the TTTTGT sequence characteristic of early response genes in its 3'-untranslated region. By Northern analysis GAIP's 1.6-kb mRNA is most abundant in lung, heart, placenta, and liver and is very low in brain, skeletal muscle, pancreas, and kidney. GAIP appears to interact exclusively with G alpha i3, as it did not interact with G alpha i2 and G alpha q. The fact that GAIP and Sst2 interact with G alpha subunits and share a common domain suggests that other members of the GAIP family also interact with G alpha subunits through the 125-amino-acid core domain.
Sequence similarity analysis of Escherichia coli proteins: functional and evolutionary implications.
Resumo:
A computer analysis of 2328 protein sequences comprising about 60% of the Escherichia coli gene products was performed using methods for database screening with individual sequences and alignment blocks. A high fraction of E. coli proteins--86%--shows significant sequence similarity to other proteins in current databases; about 70% show conservation at least at the level of distantly related bacteria, and about 40% contain ancient conserved regions (ACRs) shared with eukaryotic or Archaeal proteins. For > 90% of the E. coli proteins, either functional information or sequence similarity, or both, are available. Forty-six percent of the E. coli proteins belong to 299 clusters of paralogs (intraspecies homologs) defined on the basis of pairwise similarity. Another 10% could be included in 70 superclusters using motif detection methods. The majority of the clusters contain only two to four members. In contrast, nearly 25% of all E. coli proteins belong to the four largest superclusters--namely, permeases, ATPases and GTPases with the conserved "Walker-type" motif, helix-turn-helix regulatory proteins, and NAD(FAD)-binding proteins. We conclude that bacterial protein sequences generally are highly conserved in evolution, with about 50% of all ACR-containing protein families represented among the E. coli gene products. With the current sequence databases and methods of their screening, computer analysis yields useful information on the functions and evolutionary relationships of the vast majority of genes in a bacterial genome. Sequence similarity with E. coli proteins allows the prediction of functions for a number of important eukaryotic genes, including several whose products are implicated in human diseases.
Resumo:
The adenovirus type 2/5 E1A proteins transform primary baby rat kidney (BRK) cells in cooperation with the activated Ras (T24 ras) oncoprotein. The N-terminal half of E1A (exon 1) is essential for this transformation activity. While the C-terminal half of E1A (exon 2) is dispensable, a region located between residues 225 and 238 of the 243R E1A protein negatively modulates in vitro T24 ras cooperative transformation as well as the tumorigenic potential of E1A/T24 ras-transformed cells. The same C-terminal domain is also required for binding of a cellular 48-kDa phosphoprotein, C-terminal binding protein (CtBP). We have cloned the cDNA for CtBP via yeast two-hybrid interaction cloning. The cDNA encodes a 439-amino acid (48 kDa) protein that specifically interacts with exon 2 in yeast two-hybrid, in vitro protein binding, and in vivo coimmunoprecipitation analyses. This protein requires residues 225-238 of the 243R E1A protein for interaction. The predicted protein sequence of the isolated cDNA is identical to amino acid sequences obtained from peptides prepared from biochemically purified CtBP. Fine mapping of the CtBP-binding domain revealed that a 6-amino acid motif highly conserved among the E1A proteins of various human and animal adenoviruses is required for this interaction. These results suggest that interaction of CtBP with the E1A proteins may play a critical role in adenovirus replication and oncogenic transformation.
Resumo:
An extensive sequence comparison of the chloroplast ndhF gene from all major clades of the largest flowering plant family (Asteraceae) shows that this gene provides approximately 3 times more phylogenetic information than rbcL. This is because it is substantially longer and evolves twice as fast. The 5' region (1380 bp) of ndhF is very different from the 3' region (855 bp) and is similar to rbcL in both the rate and the pattern of sequence change. The 3' region is more A+T-rich, has higher levels of nonsynonymous base substitution, and shows greater transversion bias at all codon positions. These differences probably reflect different functional constraints on the 5' and 3' regions of ndhF. The two patterns of base substitutions of ndhF are particularly advantageous for phylogenetic reconstruction because the conserved and variable segments can be used for older and recent groups, respectively. Phylogenetic analyses of 94 ndhF sequences provided much better resolution of relationships than previous molecular and morphological phylogenies of the Asteraceae. The ndhF tree identified five major clades: (i) the Calyceraceae is the sister family of Asteraceae; (ii) the Barnadesioideae is monophyletic and is the sister group to the rest of the family; (iii) the Cichorioideae and its two basal tribes Mutisieae and Cardueae are paraphyletic; (iv) four tribes of Cichorioideae (Lactuceae, Arctoteae, Liabeae, and Vernonieae) form a monophyletic group, and these are the sister clade of the Asteroideae; and (v) the Asteroideae is monophyletic and includes three major clades.
Resumo:
The correspondence between the transversion/transition ratio and the neighboring base composition in chloroplast DNA is examined. For 18 noncoding regions of the chloroplast genome, alignments between rice (Oryza sativa) and maize (Zea mays) were generated by two different methods. Difficulties of aligning noncoding DNA are discussed, and the alignments are analyzed in a manner that reduces alignment artifacts. Sequence divergence is < 10%, so multiple substitutions at a site are assumed to be rare. Observed substitutions were analyzed with respect to the A+T content of the two immediately flanking bases. It is shown that as this content increases, the proportion of transversions also increases. When both the 5'- and 3'-flanking nucleotides are G or C (A+T content of 0), only 25% of the observed substitutions are transversions. However, when both the 5'- and 3'-flanking nucleotides are A or T (A+T content of 2), 57% of the observed substitutions are transversions. Therefore, the influence of flanking base composition on substitutions, previously reported for a single noncoding region, is a general feature of the chloroplast genome.
Resumo:
The guinea pig estrogen sulfotransferase gene has been cloned and compared to three other cloned steroid and phenol sulfotransferase genes (human estrogen sulfotransferase, human phenol sulfotransferase, and guinea pig 3 alpha-hydroxysteroid sulfotransferase). The four sulfotransferase genes demonstrate a common outstanding feature: the splice sites for their 3'-terminal exons are identically located. That is, the 3'-terminal exon splice sites involve a glycine that constitutes the N-terminal glycine of an invariably conserved GXXGXXK motif present in all steroid and phenol sulfotransferases for which primary structures are known. This consistency strongly suggests that all steroid and phenol sulfotransferase genes will be similarly spliced. The GXXGXXK motif forms the active binding site for the universal sulfonate donor 3'-phosphoadenosine 5'-phosphosulfate. Amino acid sequence alignment of 19 cloned steroid and phenol sulfotransferases starting with the GXXGXXK motif indicates that the 3'-terminal exon for each steroid and phenol sulfotransferase gene encodes a similarly sized C-terminal fragment of the protein. Interestingly, on further analysis of the alignment, three distinct amino acid sequence patterns emerge. The presence of the conserved functional GXXGXXK motif suggests that the protein domains encoded by steroid and phenol sulfotransferase 3'-terminal exons have evolved from a common ancestor. Furthermore, it is hypothesized that during the course of evolution, the 3'-terminal exon further diverged into at least three sulfotransferase subdivisions: a phenol or aryl group, an estrogen or phenolic steroid group, and a neutral steroid group.
Resumo:
Clones encoding pro-phenol oxidase [pro-PO; zymogen of phenol oxidase (monophenol, L-dopa:oxygen oxidoreductase, EC 1.14.18.1)] A1 were isolated from a lambda gt10 library that originated from Drosophila melanogaster strain Oregon-R male adults. The 2294 bp of the cDNA included a 13-bp 5'-noncoding region, a 2070-bp encoding open reading frame of 690 amino acids, and a 211-bp 3'-noncoding region. A hydrophobic NH2-terminal sequence for a signal peptide is absent in the protein. Furthermore, there are six potential N-glycosylation sites in the sequence, but no amino sugar was detected in the purified protein by amino acid analysis, indicating the lack of an N-linked sugar chain. The potential copper-binding sites, amino acids 200-248 and 359-414, are highly homologous to the corresponding sites of hemocyanin of the tarantula Eurypelma californicum, the horseshoe crab Limulus polyphemus, and the spiny lobster Panulirus interruptus. On the basis of the phylogenetic tree constructed by the neighbor-joining method, vertebrate tyrosinases and molluscan hemocyanins constitute one family, whereas pro-POs and arthropod hemocyanins group with another family. It seems, therefore, likely that pro-PO originates from a common ancestor with arthropod hemocyanins, independently to the vertebrate and microbial tyrosinases.
Resumo:
The Fc gamma receptor-associated gamma and zeta subunits contain a conserved cytoplasmic motif, termed the immunoglobulin gene tyrosine activation motif, which contains a pair of YXXL sequences. The tyrosine residues within these YXXL sequences have been shown to be required for transduction of a phagocytic signal. We have previously reported that the gamma subunit of the type IIIA Fc gamma receptor (Fc gamma RIIIA) is approximately 6 times more efficient in mediating phagocytosis than the zeta subunit of Fc gamma RIIIA. By exchanging regions of the cytoplasmic domains of the homologous gamma and zeta chains, we observed that the cytoplasmic area of the gamma chain bearing a pair of the conserved YXXL sequences is important in phagocytic signaling. Further specificity of phagocytic signaling is largely determined by the two internal XX amino acids in the YXXL sequences. In contrast, the flanking amino acids of the YXXL sequences including the seven intervening amino acids between the two YXXL sequences do not significantly affect the phagocytic signal. Furthermore, the protein-tyrosine kinase Syk, but not the related kinase ZAP-70, stimulated Fc gamma RIIIA-mediated phagocytosis. ZAP-70, however, increased phagocytosis when coexpressed with the Src family kinase Fyn. These data demonstrate the importance of the two specific amino acids within the gamma subunit YXXL cytoplasmic sequences in phagocytic signaling and explain the difference in phagocytic efficiency of the gamma and zeta chains. These results indicate the importance of Syk in Fc gamma RIIIA-mediated phagocytosis and demonstrate that ZAP-70 and syk differ in their requirement for a Src-related kinase in signal transduction.
Resumo:
Transcription factor TFIIIB plays a central role in transcription initiation by RNA polymerase III on genes encoding tRNA, 5S rRNA, and other small structural RNAs. We report the purification of a human TFIIIB-derived complex containing only the TATA-binding polypeptide (TBP) and a 90-kDa subunit (TFIIIB90) and the isolation of a cDNA clone encoding the 90-kDa subunit. The N-terminal half of TFIIIB90 exhibits sequence similarity to the yeast TFIIIB70 (BRF) and the class II transcription factor TFIIB and interacts weakly with TBP. The C-terminal half of TFIIIB90 contains a high-mobility-group protein 2 (HMG2)-related domain and interacts strongly with TBP. Recombinant TFIIIB90 plus recombinant human TBP substitute for human TFIIIB in a complementation assay for transcription of 5S, tRNA, and VA1 RNA genes, and both the TFIIB-related domain and the HMG2-related domain are required for this activity. TFIIIB90 is also required for transcription of human 7SK and U6 RNA genes by RNA polymerase III, but apparently within a complex distinct from the TBP/TFIIIB90 complex.
Resumo:
The regions surrounding the catalytic amino acids previously identified in a few "retaining" O-glycosyl hydrolases (EC 3.2.1) have been analyzed by hydrophobic cluster analysis and have been used to define sequence motifs. These motifs have been found in more than 150 glycosyl hydrolase sequences representing at least eight established protein families that act on a large variety of substrates. This allows the localization and the precise role of the catalytic residues (nucleophile and acid catalyst) to be predicted for each of these enzymes, including several lysosomal glycosidases. An identical arrangement of the catalytic nucleophile was also found for S-glycosyl hydrolases (myrosinases; EC 3.2.3.1) for which the acid catalyst is lacking. A (beta/alpha)8 barrel structure has been reported for two of the eight families of proteins that have been grouped. It is suggested that the six other families also share this fold at their catalytic domain. These enzymes illustrate how evolutionary events led to a wide diversification of substrate specificity with a similar disposition of identical catalytic residues onto the same ancestral (beta/alpha)8 barrel structure.
Resumo:
Open reading frames in the Plasmodium falciparum genome encode domains homologous to the adhesive domains of the P. falciparum EBA-175 erythrocyte-binding protein (eba-175 gene product) and those of the Plasmodium vivax and Plasmodium knowlesi Duffy antigen-binding proteins. These domains are referred to as Duffy binding-like (DBL), after the receptor that determines P. vivax invasion of Duffy blood group-positive human erythrocytes. Using oligonucleotide primers derived from short regions of conserved sequence, we have developed a reverse transcription-PCR method that amplifies sequences encoding the DBL domains of expressed genes. Products of these reverse transcription-PCR amplifications include sequences of single-copy genes (including eba-175) and variably transcribed genes that cross-hybridize to multiple regions of the genome. Restriction patterns of the multicopy genes show a high degree of polymorphism among different parasite lines, whereas single-copy genes are generally conserved. Characterization of the single-copy genes has identified a gene (ebl-1) that is related to eba-175 and is likely to be involved in erythrocyte invasion.
Resumo:
Inositol polyphosphate 1-phosphatase, inositol monophosphate phosphatase, and fructose 1,6-bisphosphatase share a sequence motif, Asp-Pro-(Ile or Leu)-Asp-(Gly or Ser)-(Thr or Ser), that has been shown by crystallographic and mutagenesis studies to bind metal ions and participate in catalysis. We compared the six alpha-carbon coordinates of this motif from the crystal structures of these three phosphatases and found that they are superimposable with rms deviations ranging from 0.27 to 0.60 A. Remarkably, when these proteins were aligned by this motif a common core structure emerged, defined by five alpha-helices and 11 beta-strands comprising 155 residues having rms deviations ranging from 1.48 to 2.66 A. We used the superimposed structures to align the sequences within the common core, and a distant relationship was observed suggesting a common ancestor. The common core was used to align the sequences of several other proteins that share significant similarity to inositol monophosphate phosphatase, including proteins encoded by fungal qa-X and qutG, bacterial suhB and cysQ (identical to amtA), and yeast met22 (identical to hal2). Evolutionary comparison of the core sequences indicate that five distinct branches exist within this family. These proteins share metal-dependent/Li(+)-sensitive phosphomonoesterase activity, and each predicted tree branch exhibits unique substrate specificity. Thus, these proteins define an ancient structurally conserved family involved in diverse metabolic pathways including inositol signaling, gluconeogenesis, sulfate assimilation, and possibly quinone metabolism. Furthermore, we suggest that this protein family identifies candidate enzymes to account for both the therapeutic and toxic actions of Li+ as it is used in patients treated for manic depressive disease.