9 results for Data-bank

in National Center for Biotechnology Information - NCBI


Relevance:

100.00%

Publisher:

Abstract:

PDB-REPRDB is a database of representative protein chains from the Protein Data Bank (PDB). The previous version of PDB-REPRDB provided 48 representative sets, whose similarity criteria were predetermined, on the WWW. The current version is designed so that the user may obtain a quick selection of representative chains from PDB. The selection of representative chains can be dynamically configured according to the user’s requirement. The WWW interface provides a large degree of freedom in setting parameters, such as cut-off scores of sequence and structural similarity. One can obtain a representative list and classification data of protein chains from the system. The current database includes 20 457 protein chains from PDB entries (August 6, 2000). The system for PDB-REPRDB is available at the Parallel Protein Information Analysis system (PAPIA) WWW server (http://www.rwcp.or.jp/papia/).
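The idea of choosing representative chains under a user-set similarity cut-off can be illustrated with a minimal greedy sketch. The abstract does not describe the actual PDB-REPRDB algorithm, so everything here is an assumption for illustration: the chain identifiers and sequences are made up, `difflib.SequenceMatcher` stands in for a real sequence or structural similarity score, and the input list is assumed to be ordered from most to least preferred chain.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Stand-in similarity score in [0, 1]; not a real sequence alignment."""
    return SequenceMatcher(None, a, b).ratio()


def select_representatives(chains, cutoff):
    """Greedy selection: keep a chain only if its similarity to every
    representative kept so far is below `cutoff`.

    `chains` is a list of (chain_id, sequence) pairs, assumed to be
    ordered from most to least preferred.
    """
    representatives = []
    for chain_id, seq in chains:
        if all(similarity(seq, rep_seq) < cutoff for _, rep_seq in representatives):
            representatives.append((chain_id, seq))
    return [chain_id for chain_id, _ in representatives]


# Hypothetical chains; the identifiers and sequences are invented.
chains = [
    ("chainA", "MKTAYIAKQRQISFVKSHFSRQ"),
    ("chainB", "MKTAYIAKQRQISFVKSHFSRQ"),  # identical to chainA
    ("chainC", "GSHMLEDPVWWNQCCPPLLYYTT"),
]
reps = select_representatives(chains, cutoff=0.9)
print(reps)
```

Raising or lowering `cutoff` changes how many chains survive, which is the kind of parameter freedom the abstract attributes to the WWW interface.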

Relevance:

70.00%

Publisher:

Abstract:

The Protein Data Bank (PDB; http://www.rcsb.org/pdb/) is the single worldwide archive of structural data of biological macromolecules. This paper describes the data uniformity project that is underway to address the inconsistency in PDB data.

Relevance:

60.00%

Publisher:

Abstract:

The crystal structure of Escherichia coli ornithine transcarbamoylase (OTCase, EC 2.1.3.3) complexed with the bisubstrate analog N-(phosphonacetyl)-l-ornithine (PALO) has been determined at 2.8-Å resolution. This research on the structure of a transcarbamoylase catalytic trimer with a substrate analog bound provides new insights into the linkages between substrate binding, protein–protein interactions, and conformational change. The structure was solved by molecular replacement with the Pseudomonas aeruginosa catabolic OTCase catalytic trimer (Villeret, V., Tricot, C., Stalon, V. & Dideberg, O. (1995) Proc. Natl. Acad. Sci. USA 92, 10762–10766; Protein Data Bank reference pdb 1otc) as the model and refined to a crystallographic R value of 21.3%. Each polypeptide chain folds into two domains, a carbamoyl phosphate binding domain and an l-ornithine binding domain. The bound inhibitor interacts with the side chains and/or backbone atoms of Lys-53, Ser-55, Thr-56, Arg-57, Thr-58, Arg-106, His-133, Asn-167, Asp-231, Met-236, Leu-274, Arg-319 as well as Gln-82 and Lys-86 from an adjacent chain. Comparison with the unligated P. aeruginosa catabolic OTCase structure indicates that binding of the substrate analog results in closure of the two domains of each chain. As in E. coli aspartate transcarbamoylase, the 240s loop undergoes the largest conformational change upon substrate binding. The clinical implications for human OTCase deficiency are discussed.

Relevance:

60.00%

Publisher:

Abstract:

The parasitic bacterium Mycoplasma genitalium has a small, reduced genome with close to a basic set of genes. As a first step toward determining the families of protein domains that form the products of these genes, we have used the multiple sequence programs psi-blast and geanfammer to match the sequences of the 467 gene products of M. genitalium to the sequences of the domains that form proteins of known structure [Protein Data Bank (PDB) sequences]. PDB sequences (274) match all of 106 M. genitalium sequences and some parts of another 85; thus, 41% of its total sequences are matched in all or part. The evolutionary relationships of the PDB domains that match M. genitalium are described in the structural classification of proteins (SCOP) database. Using this information, we show that the domains in the matched M. genitalium sequences come from 114 superfamilies and that 58% of them have arisen by gene duplication. This level of duplication is more than twice that found by using pairwise sequence comparisons. The PDB domain matches also describe the domain structure of the matched sequences: just over a quarter contain one domain and the rest have combinations of two or more domains.
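The 41% figure follows directly from the counts quoted in the abstract, as this short check shows. The variable names are chosen here for illustration only.

```python
# Counts quoted in the abstract above.
total_gene_products = 467
fully_matched = 106   # sequences matched over their whole length
partly_matched = 85   # sequences matched over some parts only

matched = fully_matched + partly_matched
percent_matched = 100 * matched / total_gene_products

print(matched, round(percent_matched))
```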

Relevance:

60.00%

Publisher:

Abstract:

The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/) is maintained at the European Bioinformatics Institute (EBI) in an international collaboration with the DNA Data Bank of Japan (DDBJ) and GenBank at the NCBI (USA). Data is exchanged amongst the collaborating databases on a daily basis. The major contributors to the EMBL database are individual authors and genome project groups. Webin is the preferred web-based submission system for individual submitters, whilst automatic procedures allow incorporation of sequence data from large-scale genome sequencing centres and from the European Patent Office (EPO). Database releases are produced quarterly. Network services allow free access to the most up-to-date data collection via ftp, email and World Wide Web interfaces. EBI’s Sequence Retrieval System (SRS), a network browser for databanks in molecular biology, integrates and links the main nucleotide and protein databases plus many specialized databases. For sequence similarity searching a variety of tools (e.g. Blitz, Fasta, BLAST) are available which allow external users to compare their own sequences against the latest data in the EMBL Nucleotide Sequence Database and SWISS-PROT.

Relevance:

60.00%

Publisher:

Abstract:

PDBsum is a web-based database providing a largely pictorial summary of the key information on each macromolecular structure deposited at the Protein Data Bank (PDB). It includes images of the structure, annotated plots of each protein chain’s secondary structure, detailed structural analyses generated by the PROMOTIF program, summary PROCHECK results and schematic diagrams of protein–ligand and protein–DNA interactions. RasMol scripts highlight key aspects of the structure, such as the protein’s domains, PROSITE patterns and protein–ligand interactions, for interactive viewing in 3D. Numerous links take the user to related sites. PDBsum is updated whenever any new structures are released by the PDB and is freely accessible via http://www.biochem.ucl.ac.uk/bsm/pdbsum.

Relevance:

60.00%

Publisher:

Abstract:

The Dali Domain Dictionary (http://www.ebi.ac.uk/dali/domain) is a numerical taxonomy of all known structures in the Protein Data Bank (PDB). The taxonomy is derived fully automatically from measurements of structural, functional and sequence similarities. Here, we report the extension of the classification to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families). The computational definitions of attractors and functional families are new. In September 2000, the Dali classification contained 10 531 PDB entries comprising 17 101 chains, which were partitioned into five attractor regions, 1375 fold types, 2582 functional families and 3724 domain sequence families. Sequence families were further associated with 99 582 unique homologous sequences in the HSSP database, which increases the number of effectively known structures several-fold. The resulting database contains the description of protein domain architecture, the definition of structural neighbours around each known structure, the definition of structurally conserved cores and a comprehensive library of explicit multiple alignments of distantly related protein families.

Relevance:

60.00%

Publisher:

Abstract:

The RESID Database is a comprehensive collection of annotations and structures for protein post-translational modifications including N-terminal, C-terminal and peptide chain cross-link modifications. The RESID Database includes systematic and frequently observed alternate names, Chemical Abstracts Service registry numbers, atomic formulas and weights, enzyme activities, taxonomic range, keywords, literature citations with database cross-references, structural diagrams and molecular models. The NRL-3D Sequence–Structure Database is derived from the three-dimensional structure of proteins deposited with the Research Collaboratory for Structural Bioinformatics Protein Data Bank. The NRL-3D Database includes standardized and frequently observed alternate names, sources, keywords, literature citations, experimental conditions and searchable sequences from model coordinates. These databases are freely accessible through the National Cancer Institute–Frederick Advanced Biomedical Computing Center at these web sites: http://www.ncifcrf.gov/RESID, http://www.ncifcrf.gov/NRL-3D; or at these National Biomedical Research Foundation Protein Information Resource web sites: http://pir.georgetown.edu/pirwww/dbinfo/resid.html, http://pir.georgetown.edu/pirwww/dbinfo/nrl3d.html.

Relevance:

60.00%

Publisher:

Abstract:

It is generally accepted that globular proteins fold with a hydrophobic core and a hydrophilic exterior. Might the spatial distribution of amino acid hydrophobicity exhibit common features? The hydrophobic profile detailing this distribution from the protein interior to exterior has been examined for 30 relatively diverse structures obtained from the Protein Data Bank, for 3 proteins of the 30S ribosomal subunit, and for a simple set of 14 decoys. A second-order hydrophobic moment has provided a simple measure of the spatial variation. Shapes of the calculated spatial profiles of all native structures have been found to be comparable. Consequently, profile shapes as well as particular profile features should assist in validating predicted protein structures and in discriminating between different protein-folding pathways. The spatial profiles of the 14 decoys are clearly distinguished from the profiles of their native structures.
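To make the notion of a second-order hydrophobic moment profile concrete, here is a minimal sketch. The abstract gives no formula, so this assumes a simplified definition: the cumulative sum of h_i * r_i^2 over residues within distance d of the centroid, where h_i is a residue's hydrophobicity and r_i its distance from the centroid. The coordinates, hydrophobicity values and function name are all hypothetical and are not taken from the paper.

```python
import math


def second_order_profile(coords, hydrophobicity, radii):
    """Cumulative second-order moment sum(h_i * r_i**2) over residues
    lying within each distance d (from `radii`) of the centroid.

    Assumed, simplified definition using plain spherical distances.
    """
    n = len(coords)
    centroid = tuple(sum(c[k] for c in coords) / n for k in range(3))
    dists = [math.dist(c, centroid) for c in coords]
    return [
        sum(h * r * r for h, r in zip(hydrophobicity, dists) if r <= d)
        for d in radii
    ]


# Toy "protein": two hydrophobic residues near the centre,
# two hydrophilic residues further out (invented values).
coords = [(1, 0, 0), (-1, 0, 0), (0, 3, 0), (0, -3, 0)]
hydrophobicity = [1.0, 1.0, -1.0, -1.0]

profile = second_order_profile(coords, hydrophobicity, radii=[1.5, 3.5])
print(profile)
```

In this toy case the profile is positive while only the hydrophobic core is enclosed and turns negative once the hydrophilic exterior is included, which is the qualitative core-to-surface behaviour the abstract describes.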