154 resultados para Protein structures
Resumo:
Eukaryotic translation initiation factor 5A (eIF-5A) is a ubiquitous protein found in all eukaryotic cells. The protein is closely associated with cell proliferation in the G1–S stage of the cell cycle. Recent findings show that the eIF-5A proteins are highly expressed in tumor cells and act as a cofactor of the Rev protein in HIV-1-infected cells. The mature eIF is the only protein known to have the unusual amino acid hypusine, a post-translationally modified lysine. The crystal structure of eIF-5A from Methanococcus jannaschii (MJ eIF-5A) has been determined at 1.9 Å and 1.8 Å resolution in two crystal forms by using the multiple isomorphous replacement method and the multiwavelength anomalous diffraction method for the first crystal form and the molecular replacement method for the second crystal form. The structure consists of two folding domains, one of which is similar to the oligonucleotide-binding domain found in the prokaryotic cold shock protein and the translation initiation factor IF1 despite the absence of any significant sequence similarities. The 12 highly conserved amino acid residues found among eIF-5As include the hypusine site and form a long protruding loop at one end of the elongated molecule.
Resumo:
Many small bacterial, archaebacterial, and eukaryotic genomes have been sequenced, and the larger eukaryotic genomes are predicted to be completely sequenced within the next decade. In all genomes sequenced to date, a large portion of these organisms’ predicted protein coding regions encode polypeptides of unknown biochemical, biophysical, and/or cellular functions. Three-dimensional structures of these proteins may suggest biochemical or biophysical functions. Here we report the crystal structure of one such protein, MJ0577, from a hyperthermophile, Methanococcus jannaschii, at 1.7-Å resolution. The structure contains a bound ATP, suggesting MJ0577 is an ATPase or an ATP-mediated molecular switch, which we confirm by biochemical experiments. Furthermore, the structure reveals different ATP binding motifs that are shared among many homologous hypothetical proteins in this family. This result indicates that structure-based assignment of molecular function is a viable approach for the large-scale biochemical assignment of proteins and for discovering new motifs, a basic premise of structural genomics.
Resumo:
Transcriptional regulation in papillomaviruses depends on sequence-specific binding of the regulatory protein E2 to several sites in the viral genome. Crystal structures of bovine papillomavirus E2 DNA targets reveal a conformational variant of B-DNA characterized by a roll-induced writhe and helical repeat of 10.5 bp per turn. A comparison between the free and the protein-bound DNA demonstrates that the intrinsic structure of the DNA regions contacted directly by the protein and the deformability of the DNA region that is not contacted by the protein are critical for sequence-specific protein/DNA recognition and hence for gene-regulatory signals in the viral system. We show that the selection of dinucleotide or longer segments with appropriate conformational characteristics, when positioned at correct intervals along the DNA helix, can constitute a structural code for DNA recognition by regulatory proteins. This structural code facilitates the formation of a complementary protein–DNA interface that can be further specified by hydrogen bonds and nonpolar interactions between the protein amino acids and the DNA bases.
Resumo:
Computer models were used to examine whether and under what conditions the multimeric protein complex is inhibited by high concentrations of one of its components—an effect analogous to the prozone phenomenon in precipitin tests. A series of idealized simple “ball-and-stick” structures representing small oligomeric complexes of protein molecules formed by reversible binding reactions were analyzed to determine the binding steps leading to each structure. The equilibrium state of each system was then determined over a range of starting concentrations and Kds and the steady-state concentration of structurally complete oligomer calculated for each situation. A strong inhibitory effect at high concentrations was shown by any protein molecule forming a bridge between two or more separable parts of the complex. By contrast, proteins linked to the outside of the complex by a single bond showed no inhibition whatsoever at any concentration. Nonbridging, multivalent proteins in the body of the complex could show an inhibitory effect or not depending on the structure of the complex and the strength of its bonds. On the basis of this study, we suggest that the prozone phenomenon will occur widely in living cells and that it could be a crucial factor in the regulation of protein complex formation.
Resumo:
Several unanswered questions in T cell immunobiology relating to intracellular processing or in vivo antigen presentation could be approached if convenient, specific, and sensitive reagents were available for detecting the peptide–major histocompatibility complex (MHC) class I or class II ligands recognized by αβ T cell receptors. For this reason, we have developed a method using homogeneously loaded peptide–MHC class II complexes to generate and select specific mAb reactive with these structures using hen egg lysozyme (HEL) and I-Ak as a model system. mAbs specific for either HEL-(46–61)–Ak or HEL-(116–129)–Ak have been isolated. They cross-react with a small subset of I-Ak molecules loaded with self peptides but can nonetheless be used for flow cytometry, immunoprecipitation, Western blotting, and intracellular immunofluorescence to detect specific HEL peptide–MHC class II complexes formed by either peptide exposure or natural processing of native HEL. An example of the utility of these reagents is provided herein by using one of the anti-HEL-(46–61)–Ak specific mAbs to visualize intracellular compartments where I-Ak is loaded with HEL-derived peptides early after antigen administration. Other uses, especially for in vivo tracking of specific ligand-bearing antigen-presenting cells, are discussed.
Resumo:
We report here the crystal structure of the RuvB motor protein from Thermus thermophilus HB8, which drives branch migration of the Holliday junction during homologous recombination. RuvB has a crescent-like architecture consisting of three consecutive domains, the first two of which are involved in ATP binding and hydrolysis. DNA is likely to interact with a large basic cleft, which encompasses the ATP-binding pocket and domain boundaries, whereas the junction-recognition protein RuvA may bind a flexible β-hairpin protruding from the N-terminal domain. The structures of two subunits, related by a noncrystallographic pseudo-2-fold axis, imply that conformational changes of motor protein coupled with ATP hydrolysis may reflect motility essential for its translocation around double-stranded DNA.
Resumo:
We have determined the structure of a DEAD box putative RNA helicase from the hyperthermophile Methanococcus jannaschii. Like other helicases, the protein contains two α/β domains, each with a recA-like topology. Unlike other helicases, the protein exists as a dimer in the crystal. Through an interaction that resembles the dimer interface of insulin, the amino-terminal domain's 7-strand β-sheet is extended to 14 strands across the two molecules. Motifs conserved in the DEAD box family cluster in the cleft between domains, and many of their functions can be deduced by mutational data and by comparison with other helicase structures. Several lines of evidence suggest that motif III Ser-Ala-Thr may be involved in binding RNA.
Resumo:
Single-stranded DNA binding proteins (SSBs) play central roles in cellular and viral processes involving the generation of single-stranded DNA. These include DNA replication, homologous recombination and DNA repair pathways. SSBs bind DNA using four ‘OB-fold’ (oligonucleotide/oligosaccharide binding fold) domains that can be organised in a variety of overall quaternary structures. Thus eubacterial SSBs are homotetrameric whilst the eucaryal RPA protein is a heterotrimer and euryarchaeal proteins vary significantly in their subunit compositions. We demonstrate that the crenarchaeal SSB protein is an abundant protein with a unique structural organisation, existing as a monomer in solution and multimerising on DNA binding. The protein binds single-stranded DNA distributively with a binding site size of ~5 nt per monomer. Sulfolobus SSB lacks the zinc finger motif found in the eucaryal and euryarchaeal proteins, possessing instead a flexible C-terminal tail, sensitive to trypsin digestion, that is not required for DNA binding. In comparison with Escherichia coli SSB, the tail may play a role in protein–protein interactions during DNA replication and repair.
Resumo:
Replication protein A (RPA), the nuclear single-stranded DNA binding protein is involved in DNA replication, nucleotide excision repair (NER) and homologous recombination. It is a stable heterotrimer consisting of subunits with molecular masses of 70, 32 and 14 kDa (p70, p32 and p14, respectively). Gapped DNA structures are common intermediates during DNA replication and NER. To analyze the interaction of RPA and its subunits with gapped DNA we designed structures containing 9 and 30 nucleotide gaps with a photoreactive arylazido group at the 3′-end of the upstream oligonucleotide or at the 5′-end of the downstream oligonucleotide. UV crosslinking and subsequent analysis showed that the p70 subunit mainly interacts with the 5′-end of DNA irrespective of DNA structure, while the subunit orientation towards the 3′-end of DNA in the gap structures strongly depends on the gap size. The results are compared with the data obtained previously with the primer–template systems containing 5′- or 3′-protruding DNA strands. Our results suggest a model of polar RPA binding to the gapped DNA.
Resumo:
The HIV-1 transcript is alternatively spliced to over 30 different mRNAs. Whether RNA secondary structure can influence HIV-1 RNA alternative splicing has not previously been examined. Here we have determined the secondary structure of the HIV-1/BRU RNA segment, containing the alternative A3, A4a, A4b, A4c and A5 3′ splice sites. Site A3, required for tat mRNA production, is contained in the terminal loop of a stem–loop structure (SLS2), which is highly conserved in HIV-1 and related SIVcpz strains. The exon splicing silencer (ESS2) acting on site A3 is located in a long irregular stem–loop structure (SLS3). Two SLS3 domains were protected by nuclear components under splicing condition assays. One contains the A4c branch points and a putative SR protein binding site. The other one is adjacent to ESS2. Unexpectedly, only the 3′ A residue of ESS2 was protected. The suboptimal A3 polypyrimidine tract (PPT) is base paired. Using site-directed mutagenesis and transfection of a mini-HIV-1 cDNA into HeLa cells, we found that, in a wild-type PPT context, a mutation of the A3 downstream sequence that reinforced SLS2 stability decreased site A3 utilization. This was not the case with an optimized PPT. Hence, sequence and secondary structure of the PPT may cooperate in limiting site A3 utilization.
Resumo:
PDBsum is a web-based database providing a largely pictorial summary of the key information on each macromolecular structure deposited at the Protein Data Bank (PDB). It includes images of the structure, annotated plots of each protein chain’s secondary structure, detailed structural analyses generated by the PROMOTIF program, summary PROCHECK results and schematic diagrams of protein–ligand and protein–DNA interactions. RasMol scripts highlight key aspects of the structure, such as the protein’s domains, PROSITE patterns and protein–ligand interactions, for interactive viewing in 3D. Numerous links take the user to related sites. PDBsum is updated whenever any new structures are released by the PDB and is freely accessible via http://www.biochem.ucl.ac.uk/bsm/pdbsum.
Resumo:
The Dali Domain Dictionary (http://www.ebi.ac.uk/dali/domain) is a numerical taxonomy of all known structures in the Protein Data Bank (PDB). The taxonomy is derived fully automatically from measurements of structural, functional and sequence similarities. Here, we report the extension of the classification to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families). The computational definitions of attractors and functional families are new. In September 2000, the Dali classification contained 10 531 PDB entries comprising 17 101 chains, which were partitioned into five attractor regions, 1375 fold types, 2582 functional families and 3724 domain sequence families. Sequence families were further associated with 99 582 unique homologous sequences in the HSSP database, which increases the number of effectively known structures several-fold. The resulting database contains the description of protein domain architecture, the definition of structural neighbours around each known structure, the definition of structurally conserved cores and a comprehensive library of explicit multiple alignments of distantly related protein families.
Resumo:
GlycoSuiteDB is a relational database that curates information from the scientific literature on glycoprotein derived glycan structures, their biological sources, the references in which the glycan was described and the methods used to determine the glycan structure. To date, the database includes most published O-linked oligosaccharides from the last 50 years and most N-linked oligosaccharides that were published in the 1990s. For each structure, information is available concerning the glycan type, linkage and anomeric configuration, mass and composition. Detailed information is also provided on native and recombinant sources, including tissue and/or cell type, cell line, strain and disease state. Where known, the proteins to which the glycan structures are attached are reported, and cross-references to the SWISS-PROT/TrEMBL protein sequence databases are given if applicable. The GlycoSuiteDB annotations include literature references which are linked to PubMed, and detailed information on the methods used to determine each glycan structure are noted to help the user assess the quality of the structural assignment. GlycoSuiteDB has a user-friendly web interface which allows the researcher to query the database using monoisotopic or average mass, monosaccharide composition, glycosylation linkages (e.g. N- or O-linked), reducing terminal sugar, attached protein, taxonomy, tissue or cell type and GlycoSuiteDB accession number. Advanced queries using combinations of these parameters are also possible. GlycoSuiteDB can be accessed on the web at http://www.glycosuite.com.
Resumo:
The RESID Database is a comprehensive collection of annotations and structures for protein post-translational modifications including N-terminal, C-terminal and peptide chain cross-link modifications. The RESID Database includes systematic and frequently observed alternate names, Chemical Abstracts Service registry numbers, atomic formulas and weights, enzyme activities, taxonomic range, keywords, literature citations with database cross-references, structural diagrams and molecular models. The NRL-3D Sequence–Structure Database is derived from the three-dimensional structure of proteins deposited with the Research Collaboratory for Structural Bioinformatics Protein Data Bank. The NRL-3D Database includes standardized and frequently observed alternate names, sources, keywords, literature citations, experimental conditions and searchable sequences from model coordinates. These databases are freely accessible through the National Cancer Institute–Frederick Advanced Biomedical Computing Center at these web sites: http://www.ncifcrf.gov/RESID, http://www.ncifcrf.gov/ NRL-3D; or at these National Biomedical Research Foundation Protein Information Resource web sites: http://pir.georgetown.edu/pirwww/dbinfo/resid.html, http://pir.georgetown.edu/pirwww/dbinfo/nrl3d.html
Resumo:
The iProClass database is an integrated resource that provides comprehensive family relationships and structural and functional features of proteins, with rich links to various databases. It is extended from ProClass, a protein family database that integrates PIR superfamilies and PROSITE motifs. The iProClass currently consists of more than 200 000 non-redundant PIR and SWISS-PROT proteins organized with more than 28 000 superfamilies, 2600 domains, 1300 motifs, 280 post-translational modification sites and links to more than 30 databases of protein families, structures, functions, genes, genomes, literature and taxonomy. Protein and family summary reports provide rich annotations, including membership information with length, taxonomy and keyword statistics, full family relationships, comprehensive enzyme and PDB cross-references and graphical feature display. The database facilitates classification-driven annotation for protein sequence databases and complete genomes, and supports structural and functional genomic research. The iProClass is implemented in Oracle 8i object-relational system and available for sequence search and report retrieval at http://pir.georgetow n.edu/iproclass/.