945 results for Protein structures
Abstract:
Transcriptional regulation in papillomaviruses depends on sequence-specific binding of the regulatory protein E2 to several sites in the viral genome. Crystal structures of bovine papillomavirus E2 DNA targets reveal a conformational variant of B-DNA characterized by a roll-induced writhe and helical repeat of 10.5 bp per turn. A comparison between the free and the protein-bound DNA demonstrates that the intrinsic structure of the DNA regions contacted directly by the protein and the deformability of the DNA region that is not contacted by the protein are critical for sequence-specific protein/DNA recognition and hence for gene-regulatory signals in the viral system. We show that the selection of dinucleotide or longer segments with appropriate conformational characteristics, when positioned at correct intervals along the DNA helix, can constitute a structural code for DNA recognition by regulatory proteins. This structural code facilitates the formation of a complementary protein–DNA interface that can be further specified by hydrogen bonds and nonpolar interactions between the protein amino acids and the DNA bases.
Abstract:
Computer models were used to examine whether and under what conditions a multimeric protein complex is inhibited by high concentrations of one of its components, an effect analogous to the prozone phenomenon in precipitin tests. A series of idealized simple “ball-and-stick” structures representing small oligomeric complexes of protein molecules formed by reversible binding reactions were analyzed to determine the binding steps leading to each structure. The equilibrium state of each system was then determined over a range of starting concentrations and Kd values, and the steady-state concentration of structurally complete oligomer was calculated for each situation. A strong inhibitory effect at high concentrations was shown by any protein molecule forming a bridge between two or more separable parts of the complex. By contrast, proteins linked to the outside of the complex by a single bond showed no inhibition whatsoever at any concentration. Nonbridging, multivalent proteins in the body of the complex could show an inhibitory effect or not depending on the structure of the complex and the strength of its bonds. On the basis of this study, we suggest that the prozone phenomenon will occur widely in living cells and that it could be a crucial factor in the regulation of protein complex formation.
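The bridging case described above can be illustrated with a minimal equilibrium sketch. It is not the authors' model: it assumes a hypothetical three-part complex A–B–C in which the bridge B has two independent, non-cooperative sites, one for A and one for C, and A and C do not bind each other. Under those assumptions the free ligand concentration follows from a quadratic mass balance, and the complete complex is the bridge total times the two site occupancies. All function and parameter names are illustrative.

```python
import math


def free_ligand(total, bridge_total, kd):
    """Free concentration of a ligand binding one independent site on the bridge.

    Solves total = free + bridge_total * free / (kd + free) for free >= 0.
    Assumes kd > 0.
    """
    b = kd + bridge_total - total
    c = kd * total
    disc = math.sqrt(b * b + 4.0 * c)
    # Pick the algebraically equivalent form that avoids cancellation.
    return 2.0 * c / (b + disc) if b >= 0 else (disc - b) / 2.0


def complete_complex(a_total, c_total, bridge_total, kd_a, kd_c):
    """Equilibrium concentration of bridge molecules carrying both A and C."""
    a_free = free_ligand(a_total, bridge_total, kd_a)
    c_free = free_ligand(c_total, bridge_total, kd_c)
    occupancy_a = a_free / (kd_a + a_free)
    occupancy_c = c_free / (kd_c + c_free)
    # Independent sites: joint occupancy is the product of the two.
    return bridge_total * occupancy_a * occupancy_c


if __name__ == "__main__":
    # Scan the bridge concentration at fixed A and C (arbitrary units).
    for bridge in (0.01, 0.1, 1.0, 10.0, 100.0):
        abc = complete_complex(1.0, 1.0, bridge, kd_a=0.01, kd_c=0.01)
        print(f"bridge={bridge:>6}: complete complex={abc:.4f}")
```

In this toy system the complete complex rises with the bridge concentration until the bridge is roughly stoichiometric with A and C, then falls as excess bridge molecules split A and C onto separate copies, which is the prozone-like behavior the abstract reports for bridging proteins.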
Abstract:
Several unanswered questions in T cell immunobiology relating to intracellular processing or in vivo antigen presentation could be approached if convenient, specific, and sensitive reagents were available for detecting the peptide–major histocompatibility complex (MHC) class I or class II ligands recognized by αβ T cell receptors. For this reason, we have developed a method using homogeneously loaded peptide–MHC class II complexes to generate and select specific mAb reactive with these structures using hen egg lysozyme (HEL) and I-Ak as a model system. mAbs specific for either HEL-(46–61)–Ak or HEL-(116–129)–Ak have been isolated. They cross-react with a small subset of I-Ak molecules loaded with self peptides but can nonetheless be used for flow cytometry, immunoprecipitation, Western blotting, and intracellular immunofluorescence to detect specific HEL peptide–MHC class II complexes formed by either peptide exposure or natural processing of native HEL. An example of the utility of these reagents is provided herein by using one of the anti-HEL-(46–61)–Ak specific mAbs to visualize intracellular compartments where I-Ak is loaded with HEL-derived peptides early after antigen administration. Other uses, especially for in vivo tracking of specific ligand-bearing antigen-presenting cells, are discussed.
Abstract:
We report here the crystal structure of the RuvB motor protein from Thermus thermophilus HB8, which drives branch migration of the Holliday junction during homologous recombination. RuvB has a crescent-like architecture consisting of three consecutive domains, the first two of which are involved in ATP binding and hydrolysis. DNA is likely to interact with a large basic cleft, which encompasses the ATP-binding pocket and domain boundaries, whereas the junction-recognition protein RuvA may bind a flexible β-hairpin protruding from the N-terminal domain. The structures of two subunits, related by a noncrystallographic pseudo-2-fold axis, imply that conformational changes of motor protein coupled with ATP hydrolysis may reflect motility essential for its translocation around double-stranded DNA.
Abstract:
We have determined the structure of a DEAD box putative RNA helicase from the hyperthermophile Methanococcus jannaschii. Like other helicases, the protein contains two α/β domains, each with a recA-like topology. Unlike other helicases, the protein exists as a dimer in the crystal. Through an interaction that resembles the dimer interface of insulin, the amino-terminal domain's 7-strand β-sheet is extended to 14 strands across the two molecules. Motifs conserved in the DEAD box family cluster in the cleft between domains, and many of their functions can be deduced by mutational data and by comparison with other helicase structures. Several lines of evidence suggest that motif III Ser-Ala-Thr may be involved in binding RNA.
Abstract:
Single-stranded DNA binding proteins (SSBs) play central roles in cellular and viral processes involving the generation of single-stranded DNA. These include DNA replication, homologous recombination and DNA repair pathways. SSBs bind DNA using four ‘OB-fold’ (oligonucleotide/oligosaccharide binding fold) domains that can be organised in a variety of overall quaternary structures. Thus eubacterial SSBs are homotetrameric whilst the eucaryal RPA protein is a heterotrimer and euryarchaeal proteins vary significantly in their subunit compositions. We demonstrate that the crenarchaeal SSB protein is an abundant protein with a unique structural organisation, existing as a monomer in solution and multimerising on DNA binding. The protein binds single-stranded DNA distributively with a binding site size of ~5 nt per monomer. Sulfolobus SSB lacks the zinc finger motif found in the eucaryal and euryarchaeal proteins, possessing instead a flexible C-terminal tail, sensitive to trypsin digestion, that is not required for DNA binding. In comparison with Escherichia coli SSB, the tail may play a role in protein–protein interactions during DNA replication and repair.
Abstract:
Replication protein A (RPA), the nuclear single-stranded DNA binding protein, is involved in DNA replication, nucleotide excision repair (NER) and homologous recombination. It is a stable heterotrimer consisting of subunits with molecular masses of 70, 32 and 14 kDa (p70, p32 and p14, respectively). Gapped DNA structures are common intermediates during DNA replication and NER. To analyze the interaction of RPA and its subunits with gapped DNA we designed structures containing 9 and 30 nucleotide gaps with a photoreactive arylazido group at the 3′-end of the upstream oligonucleotide or at the 5′-end of the downstream oligonucleotide. UV crosslinking and subsequent analysis showed that the p70 subunit mainly interacts with the 5′-end of DNA irrespective of DNA structure, while the subunit orientation towards the 3′-end of DNA in the gap structures strongly depends on the gap size. The results are compared with the data obtained previously with the primer–template systems containing 5′- or 3′-protruding DNA strands. Our results suggest a model of polar RPA binding to the gapped DNA.
Abstract:
The HIV-1 transcript is alternatively spliced to over 30 different mRNAs. Whether RNA secondary structure can influence HIV-1 RNA alternative splicing has not previously been examined. Here we have determined the secondary structure of the HIV-1/BRU RNA segment, containing the alternative A3, A4a, A4b, A4c and A5 3′ splice sites. Site A3, required for tat mRNA production, is contained in the terminal loop of a stem–loop structure (SLS2), which is highly conserved in HIV-1 and related SIVcpz strains. The exon splicing silencer (ESS2) acting on site A3 is located in a long irregular stem–loop structure (SLS3). Two SLS3 domains were protected by nuclear components under splicing condition assays. One contains the A4c branch points and a putative SR protein binding site. The other one is adjacent to ESS2. Unexpectedly, only the 3′ A residue of ESS2 was protected. The suboptimal A3 polypyrimidine tract (PPT) is base paired. Using site-directed mutagenesis and transfection of a mini-HIV-1 cDNA into HeLa cells, we found that, in a wild-type PPT context, a mutation of the A3 downstream sequence that reinforced SLS2 stability decreased site A3 utilization. This was not the case with an optimized PPT. Hence, sequence and secondary structure of the PPT may cooperate in limiting site A3 utilization.
Abstract:
PDBsum is a web-based database providing a largely pictorial summary of the key information on each macromolecular structure deposited at the Protein Data Bank (PDB). It includes images of the structure, annotated plots of each protein chain’s secondary structure, detailed structural analyses generated by the PROMOTIF program, summary PROCHECK results and schematic diagrams of protein–ligand and protein–DNA interactions. RasMol scripts highlight key aspects of the structure, such as the protein’s domains, PROSITE patterns and protein–ligand interactions, for interactive viewing in 3D. Numerous links take the user to related sites. PDBsum is updated whenever any new structures are released by the PDB and is freely accessible via http://www.biochem.ucl.ac.uk/bsm/pdbsum.
Abstract:
The Dali Domain Dictionary (http://www.ebi.ac.uk/dali/domain) is a numerical taxonomy of all known structures in the Protein Data Bank (PDB). The taxonomy is derived fully automatically from measurements of structural, functional and sequence similarities. Here, we report the extension of the classification to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families). The computational definitions of attractors and functional families are new. In September 2000, the Dali classification contained 10 531 PDB entries comprising 17 101 chains, which were partitioned into five attractor regions, 1375 fold types, 2582 functional families and 3724 domain sequence families. Sequence families were further associated with 99 582 unique homologous sequences in the HSSP database, which increases the number of effectively known structures several-fold. The resulting database contains the description of protein domain architecture, the definition of structural neighbours around each known structure, the definition of structurally conserved cores and a comprehensive library of explicit multiple alignments of distantly related protein families.
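The 25% sequence-identity cutoff used for the sequence-family level can be made concrete with a small sketch. This is an illustration only, not the Dali implementation: it assumes the two sequences are already aligned with `-` as the gap character, and it counts identity over aligned, non-gap positions, which is one of several conventions in use. The function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over positions where neither aligned sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip gapped columns (an assumed convention)
        compared += 1
        if res_a.upper() == res_b.upper():
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def same_sequence_family(aligned_a, aligned_b, threshold=25.0):
    """True when identity exceeds the 25% cutoff quoted in the abstract."""
    return percent_identity(aligned_a, aligned_b) > threshold
```

For example, `percent_identity("ACDE-F", "ACDQGF")` compares five ungapped columns, four of which match.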
Abstract:
GlycoSuiteDB is a relational database that curates information from the scientific literature on glycoprotein derived glycan structures, their biological sources, the references in which the glycan was described and the methods used to determine the glycan structure. To date, the database includes most published O-linked oligosaccharides from the last 50 years and most N-linked oligosaccharides that were published in the 1990s. For each structure, information is available concerning the glycan type, linkage and anomeric configuration, mass and composition. Detailed information is also provided on native and recombinant sources, including tissue and/or cell type, cell line, strain and disease state. Where known, the proteins to which the glycan structures are attached are reported, and cross-references to the SWISS-PROT/TrEMBL protein sequence databases are given if applicable. The GlycoSuiteDB annotations include literature references which are linked to PubMed, and detailed information on the methods used to determine each glycan structure are noted to help the user assess the quality of the structural assignment. GlycoSuiteDB has a user-friendly web interface which allows the researcher to query the database using monoisotopic or average mass, monosaccharide composition, glycosylation linkages (e.g. N- or O-linked), reducing terminal sugar, attached protein, taxonomy, tissue or cell type and GlycoSuiteDB accession number. Advanced queries using combinations of these parameters are also possible. GlycoSuiteDB can be accessed on the web at http://www.glycosuite.com.
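A query by monoisotopic mass and monosaccharide composition relies on the fact that the mass of a free glycan follows from its composition. The sketch below illustrates that arithmetic under stated assumptions: it is not GlycoSuiteDB code, the residue names and the function are hypothetical, and it covers only four common residue types in underivatized, unreduced form. Each residue mass is the monosaccharide minus one water, and one water is added back for the free reducing end.

```python
# Monoisotopic residue masses in Da (monosaccharide minus H2O).
RESIDUE_MASS = {
    "Hex": 162.052823,     # hexose, C6H10O5
    "HexNAc": 203.079372,  # N-acetylhexosamine, C8H13NO5
    "dHex": 146.057909,    # deoxyhexose such as fucose, C6H10O4
    "NeuAc": 291.095416,   # N-acetylneuraminic acid, C11H17NO8
}
WATER = 18.010565


def glycan_monoisotopic_mass(composition):
    """Monoisotopic mass of a free glycan given residue counts, e.g. {"Hex": 3}."""
    mass = WATER  # one water for the free reducing end
    for residue, count in composition.items():
        if residue not in RESIDUE_MASS:
            raise KeyError(f"unknown residue type: {residue}")
        mass += RESIDUE_MASS[residue] * count
    return mass


if __name__ == "__main__":
    # Trimannosyl N-glycan core: three hexoses and two N-acetylhexosamines.
    print(round(glycan_monoisotopic_mass({"Hex": 3, "HexNAc": 2}), 4))
```

A mass search would then compare such a computed value against an observed mass within a chosen tolerance.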
Abstract:
The RESID Database is a comprehensive collection of annotations and structures for protein post-translational modifications including N-terminal, C-terminal and peptide chain cross-link modifications. The RESID Database includes systematic and frequently observed alternate names, Chemical Abstracts Service registry numbers, atomic formulas and weights, enzyme activities, taxonomic range, keywords, literature citations with database cross-references, structural diagrams and molecular models. The NRL-3D Sequence–Structure Database is derived from the three-dimensional structure of proteins deposited with the Research Collaboratory for Structural Bioinformatics Protein Data Bank. The NRL-3D Database includes standardized and frequently observed alternate names, sources, keywords, literature citations, experimental conditions and searchable sequences from model coordinates. These databases are freely accessible through the National Cancer Institute–Frederick Advanced Biomedical Computing Center at these web sites: http://www.ncifcrf.gov/RESID, http://www.ncifcrf.gov/NRL-3D; or at these National Biomedical Research Foundation Protein Information Resource web sites: http://pir.georgetown.edu/pirwww/dbinfo/resid.html, http://pir.georgetown.edu/pirwww/dbinfo/nrl3d.html.
Abstract:
The iProClass database is an integrated resource that provides comprehensive family relationships and structural and functional features of proteins, with rich links to various databases. It is extended from ProClass, a protein family database that integrates PIR superfamilies and PROSITE motifs. iProClass currently consists of more than 200 000 non-redundant PIR and SWISS-PROT proteins organized with more than 28 000 superfamilies, 2600 domains, 1300 motifs, 280 post-translational modification sites and links to more than 30 databases of protein families, structures, functions, genes, genomes, literature and taxonomy. Protein and family summary reports provide rich annotations, including membership information with length, taxonomy and keyword statistics, full family relationships, comprehensive enzyme and PDB cross-references and graphical feature display. The database facilitates classification-driven annotation for protein sequence databases and complete genomes, and supports structural and functional genomic research. iProClass is implemented in the Oracle 8i object-relational system and is available for sequence search and report retrieval at http://pir.georgetown.edu/iproclass/.
Abstract:
The Homeodomain Resource is an annotated collection of non-redundant protein sequences, three-dimensional structures and genomic information for the homeodomain protein family. Release 3.0 contains 795 full-length homeodomain-containing sequences, 32 experimentally-derived structures and 143 homeobox loci implicated in human genetic disorders. Entries are fully hyperlinked to facilitate easy retrieval of the original records from source databases. A simple search engine with a graphical user interface is provided to query the component databases and assemble customized data sets. A new feature for this release is the addition of DNA recognition sites for all human homeodomain proteins described in the literature. The Homeodomain Resource is freely available through the World Wide Web at http://genome.nhgri.nih.gov/homeodomain.
Abstract:
Recent improvements of a hierarchical ab initio or de novo approach for predicting both α and β structures of proteins are described. The united-residue energy function used in this procedure includes multibody interactions from a cumulant expansion of the free energy of polypeptide chains, with their relative weights determined by Z-score optimization. The critical initial stage of the hierarchical procedure involves a search of conformational space by the conformational space annealing (CSA) method, followed by optimization of an all-atom model. The procedure was assessed in a recent blind test of protein structure prediction (CASP4). The resulting lowest-energy structures of the target proteins (ranging in size from 70 to 244 residues) agreed with the experimental structures in many respects. The entire experimental structure of a cyclic α-helical protein of 70 residues was predicted to within 4.3 Å α-carbon (Cα) rms deviation (rmsd) whereas, for other α-helical proteins, fragments of roughly 60 residues were predicted to within 6.0 Å Cα rmsd. Whereas β structures can now be predicted with the new procedure, the success rate for α/β- and β-proteins is lower than that for α-proteins at present. For the β portions of α/β structures, the Cα rmsd's are less than 6.0 Å for contiguous fragments of 30–40 residues; for one target, three fragments (of length 10, 23, and 28 residues, respectively) formed a compact part of the tertiary structure with a Cα rmsd less than 6.0 Å. Overall, these results constitute an important step toward the ab initio prediction of protein structure solely from the amino acid sequence.
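The Cα rmsd figures quoted above measure the root-mean-square distance between matched α-carbon positions in a predicted and an experimental structure. The sketch below shows only that final arithmetic and rests on a loud assumption: the two coordinate sets are already optimally superimposed and listed in the same residue order. Real evaluations first superimpose the structures (for example with a least-squares fit), a step this sketch omits. The function name is hypothetical.

```python
import math


def ca_rmsd(coords_a, coords_b):
    """RMSD in the input units between two equal-length lists of (x, y, z) tuples.

    Assumes the coordinates are pre-superimposed and paired residue by residue.
    """
    if len(coords_a) != len(coords_b):
        raise ValueError("coordinate lists must have equal length")
    if not coords_a:
        raise ValueError("coordinate lists must not be empty")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))
```

With coordinates in Å, a return value below 6.0 for a fragment would correspond to the "within 6.0 Å Cα rmsd" criterion used in the abstract, provided the superposition assumption holds.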