12 resultados para Structure Alignment

em National Center for Biotechnology Information - NCBI


Relevância:

70.00% 70.00%

Publicador:

Resumo:

In this study, we estimate the statistical significance of structure prediction by threading. We introduce a single parameter ɛ that serves as a universal measure determining the probability that the best alignment is indeed a native-like analog. Parameter ɛ takes into account both length and composition of the query sequence and the number of decoys in threading simulation. It can be computed directly from the query sequence and potential of interactions, eliminating the need for sequence reshuffling and realignment. Although our theoretical analysis is general, here we compare its predictions with the results of gapless threading. Finally we estimate the number of decoys from which the native structure can be found by existing potentials of interactions. We discuss how this analysis can be extended to determine the optimal gap penalties for any sequence-structure alignment (threading) method, thus optimizing it to maximum possible performance.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The database reported here is derived using the Combinatorial Extension (CE) algorithm which compares pairs of protein polypeptide chains and provides a list of structurally similar proteins along with their structure alignments. Using CE, structurestructure alignments can provide insights into biological function. When a protein of known function is shown to be structurally similar to a protein of unknown function, a relationship might be inferred; a relationship not necessarily detectable from sequence comparison alone. Establishing structurestructure relationships in this way is of great importance as we enter an era of structural genomics where there is a likelihood of an increasing number of structures with unknown functions being determined. Thus the CE database is an example of a useful tool in the annotation of protein structures of unknown function. Comparisons can be performed on the complete PDB or on a structurally representative subset of proteins. The source protein(s) can be from the PDB (updated monthly) or uploaded by the user. CE provides sequence alignments resulting from structural alignments and Cartesian coordinates for the aligned structures, which may be analyzed using the supplied Compare3D Java applet, or downloaded for further local analysis. Searches can be run from the CE web site, http://cl.sdsc.edu/ce.html, or the database and software downloaded from the site for local use.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

High-resolution physical maps of the genomes of three Rhodobacter capsulatus strains, derived from ordered cosmid libraries, were aligned. The 1.2-Mb segment of the SB1003 genome studied here is adjacent to a 1-Mb region analyzed previously [Fonstein, M., Nikolskaya, T. & Haselkorn, H. (1995) J. Bacteriol. 177, 2368-2372]. Probes derived from the ordered cosmid set of R. capsulatus SB1003 were used to link cosmids from the St. Louis and 2.3.1 strain libraries. Cosmids selected this way did not merge into a single contig but formed several unlinked groups. EcoRV restriction maps of the ordered cosmids were then constructed using lambda terminase and fused to derive fragments of the chromosomal map. In order to link these fragments, their ends were transcribed to produce secondary probes for hybridization to gridded cosmid libraries of the same strains. This linking reduced the number of subcontigs to three for the St. Louis strain and one for the 2.3.1 strain. Hybridization of the same probes back to the ordered cosmid set of SB1003 positioned the subcontigs on the high-resolution physical map of SB1003. The final alignment of the restriction maps shows numerous large and small translocations in this 1.2-Mb chromosomal region of the three Rhodobacter strains. In addition, the chromosomes of the three strains, whose fine-structure maps can now be compared over 2.2 Mb, are seen to contain regions of 15-80 kb in which restriction sites are highly polymorphic, interspersed among regions in which the positions of restriction sites are highly conserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Microsomal NADPH–cytochrome P450 reductase (CPR) is one of only two mammalian enzymes known to contain both FAD and FMN, the other being nitric-oxide synthase. CPR is a membrane-bound protein and catalyzes electron transfer from NADPH to all known microsomal cytochromes P450. The structure of rat liver CPR, expressed in Escherichia coli and solubilized by limited trypsinolysis, has been determined by x-ray crystallography at 2.6 Å resolution. The molecule is composed of four structural domains: (from the N- to C- termini) the FMN-binding domain, the connecting domain, and the FAD- and NADPH-binding domains. The FMN-binding domain is similar to the structure of flavodoxin, whereas the two C-terminal dinucleotide-binding domains are similar to those of ferredoxin–NADP+ reductase (FNR). The connecting domain, situated between the FMN-binding and FNR-like domains, is responsible for the relative orientation of the other domains, ensuring the proper alignment of the two flavins necessary for efficient electron transfer. The two flavin isoalloxazine rings are juxtaposed, with the closest distance between them being about 4 Å. The bowl-shaped surface near the FMN-binding site is likely the docking site of cytochrome c and the physiological redox partners, including cytochromes P450 and b5 and heme oxygenase.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Mammalian electron transfer flavoproteins (ETF) are heterodimers containing a single equivalent of flavin adenine dinucleotide (FAD). They function as electron shuttles between primary flavoprotein dehydrogenases involved in mitochondrial fatty acid and amino acid catabolism and the membrane-bound electron transfer flavoprotein ubiquinone oxidoreductase. The structure of human ETF solved to 2.1-Å resolution reveals that the ETF molecule is comprised of three distinct domains: two domains are contributed by the α subunit and the third domain is made up entirely by the β subunit. The N-terminal portion of the α subunit and the majority of the β subunit have identical polypeptide folds, in the absence of any sequence homology. FAD lies in a cleft between the two subunits, with most of the FAD molecule residing in the C-terminal portion of the α subunit. Alignment of all the known sequences for the ETF α subunits together with the putative FixB gene product shows that the residues directly involved in FAD binding are conserved. A hydrogen bond is formed between the N5 of the FAD isoalloxazine ring and the hydroxyl side chain of αT266, suggesting why the pathogenic mutation, αT266M, affects ETF activity in patients with glutaric acidemia type II. Hydrogen bonds between the 4′-hydroxyl of the ribityl chain of FAD and N1 of the isoalloxazine ring, and between αH286 and the C2-carbonyl oxygen of the isoalloxazine ring, may play a role in the stabilization of the anionic semiquinone. With the known structure of medium chain acyl-CoA dehydrogenase, we hypothesize a possible structure for docking the two proteins.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

PALI (release 1.2) contains three-dimensional (3-D) structure-dependent sequence alignments as well as structure-based phylogenetic trees of homologous protein domains in various families. The data set of homologous protein structures has been derived by consulting the SCOP database (release 1.50) and the data set comprises 604 families of homologous proteins involving 2739 protein domain structures with each family made up of at least two members. Each member in a family has been structurally aligned with every other member in the same family (pairwise alignment) and all the members in the family are also aligned using simultaneous super­position (multiple alignment). The structural alignments are performed largely automatically, with manual interventions especially in the cases of distantly related proteins, using the program STAMP (version 4.2). Every family is also associated with two dendrograms, calculated using PHYLIP (version 3.5), one based on a structural dissimilarity metric defined for every pairwise alignment and the other based on similarity of topologically equivalent residues. These dendrograms enable easy comparison of sequence and structure-based relationships among the members in a family. Structure-based alignments with the details of structural and sequence similarities, superposed coordinate sets and dendrograms can be accessed conveniently using a web interface. The database can be queried for protein pairs with sequence or structural similarities falling within a specified range. Thus PALI forms a useful resource to help in analysing the relationship between sequence and structure variation at a given level of sequence similarity. PALI also contains over 653 ‘orphans’ (single member families). Using the web interface involving PSI_BLAST and PHYLIP it is possible to associate the sequence of a new protein with one of the families in PALI and generate a phylogenetic tree combining the query sequence and proteins of known 3-D structure. The database with the web interfaced search and dendrogram generation tools can be accessed at http://pa uling.mbu.iisc.ernet.in/~pali.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Many persistent viruses have evolved the ability to subvert MHC class I antigen presentation. Indeed, human cytomegalovirus (HCMV) encodes at least four proteins that down-regulate cell-surface expression of class I. The HCMV unique short (US)2 glycoprotein binds newly synthesized class I molecules within the endoplasmic reticulum (ER) and subsequently targets them for proteasomal degradation. We report the crystal structure of US2 bound to the HLA-A2/Tax peptide complex. US2 associates with HLA-A2 at the junction of the peptide-binding region and the α3 domain, a novel binding surface on class I that allows US2 to bind independently of peptide sequence. Mutation of class I heavy chains confirms the importance of this binding site in vivo. Available data on class I-ER chaperone interactions indicate that chaperones would not impede US2 binding. Unexpectedly, the US2 ER-luminal domain forms an Ig-like fold. A US2 structure-based sequence alignment reveals that seven HCMV proteins, at least three of which function in immune evasion, share the same fold as US2. The structure allows design of further experiments to determine how US2 targets class I molecules for degradation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We present an approach for assessing the significance of sequence and structure comparisons by using nearly identical statistical formalisms for both sequence and structure. Doing so involves an all-vs.-all comparison of protein domains [taken here from the Structural Classification of Proteins (scop) database] and then fitting a simple distribution function to the observed scores. By using this distribution, we can attach a statistical significance to each comparison score in the form of a P value, the probability that a better score would occur by chance. As expected, we find that the scores for sequence matching follow an extreme-value distribution. The agreement, moreover, between the P values that we derive from this distribution and those reported by standard programs (e.g., blast and fasta validates our approach. Structure comparison scores also follow an extreme-value distribution when the statistics are expressed in terms of a structural alignment score (essentially the sum of reciprocated distances between aligned atoms minus gap penalties). We find that the traditional metric of structural similarity, the rms deviation in atom positions after fitting aligned atoms, follows a different distribution of scores and does not perform as well as the structural alignment score. Comparison of the sequence and structure statistics for pairs of proteins known to be related distantly shows that structural comparison is able to detect approximately twice as many distant relationships as sequence comparison at the same error rate. The comparison also indicates that there are very few pairs with significant similarity in terms of sequence but not structure whereas many pairs have significant similarity in terms of structure but not sequence.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The three-dimensional structure of Aspergillus niger pectin lyase B (PLB) has been determined by crystallographic techniques at a resolution of 1.7 Å. The model, with all 359 amino acids and 339 water molecules, refines to a final crystallographic R factor of 16.5%. The polypeptide backbone folds into a large right-handed cylinder, termed a parallel β helix. Loops of various sizes and conformations protrude from the central helix and probably confer function. The largest loop of 53 residues folds into a small domain consisting of three antiparallel β strands, one turn of an α helix, and one turn of a 310 helix. By comparison with the structure of Erwinia chrysanthemi pectate lyase C (PelC), the primary sequence alignment between the pectate and pectin lyase subfamilies has been corrected and the active site region for the pectin lyases deduced. The substrate-binding site in PLB is considerably less hydrophilic than the comparable PelC region and consists of an extensive network of highly conserved Trp and His residues. The PLB structure provides an atomic explanation for the lack of a catalytic requirement for Ca2+ in the pectin lyase family, in contrast to that found in the pectate lyase enzymes. Surprisingly, however, the PLB site analogous to the Ca2+ site in PelC is filled with a positive charge provided by a conserved Arg in the pectin lyases. The significance of the finding with regard to the enzymatic mechanism is discussed.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Gene recognition is one of the most important problems in computational molecular biology. Previous attempts to solve this problem were based on statistics, and applications of combinatorial methods for gene recognition were almost unexplored. Recent advances in large-scale cDNA sequencing open a way toward a new approach to gene recognition that uses previously sequenced genes as a clue for recognition of newly sequenced genes. This paper describes a spliced alignment algorithm and software tool that explores all possible exon assemblies in polynomial time and finds the multiexon structure with the best fit to a related protein. Unlike other existing methods, the algorithm successfully recognizes genes even in the case of short exons or exons with unusual codon usage; we also report correct assemblies for genes with more than 10 exons. On a test sample of human genes with known mammalian relatives, the average correlation between the predicted and actual proteins was 99%. The algorithm correctly reconstructed 87% of genes and the rare discrepancies between the predicted and real exon-intron structures were caused either by short (less than 5 amino acids) initial/terminal exons or by alternative splicing. Moreover, the algorithm predicts human genes reasonably well when the homologous protein is nonvertebrate or even prokaryotic. The surprisingly good performance of the method was confirmed by extensive simulations: in particular, with target proteins at 160 accepted point mutations (PAM) (25% similarity), the correlation between the predicted and actual genes was still as high as 95%.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The x-ray crystal structures of the sulfide oxidase antibody 28B4 and of antibody 28B4 complexed with hapten have been solved at 2.2-angstrom and 1.9-angstrom resolution, respectively. To our knowledge, these structures are the highest resolution catalytic antibody structures to date and provide insight into the molecular mechanism of this antibody-catalyzed monooxygenation reaction. Specifically, the data suggest that entropic restriction plays a fundamental role in catalysis through the precise alignment of the thioether substrate and oxidant. The antibody active site also stabilizes developing charge on both sulfur and periodate in the transition state via cation-pi and electrostatic interactions, respectively. In addition to demonstrating that the active site of antibody 28B4 does indeed reflect the mechanistic information programmed in the aminophosphonic acid hapten, these high-resolution structures provide a basis for enhancing turnover rates through mutagenesis and improved hapten design.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Average hepatic expression (mRNA per cell per gene) of a metallothionein-rat growth hormone (rGH) gene with its natural introns was about 15-fold higher than an intronless version when tested in transgenic mice. We examined the idea that intron removal leads to an alteration in chromatin structure that might be responsible for this effect. Using an in vitro chromatin assembly system, we observed that nucleosomes were aligned in a characteristic ordered array over the gene and promoter when all introns were present. Linker histones were necessary for this alignment to occur. In contrast, nucleosome alignment was perturbed in constructs lacking some or all of the introns. A similar disruption of nucleosome alignment was observed when comparing chromatin from livers of transgenic mice carrying rGH transgenes with or without introns. In vitro, sequences at the 3' end of the rGH gene position nucleosomes and facilitate nucleosome alignment upstream; however, nucleosome alignment does not occur on the approximately 3 kb of downstream flanking rat sequence. These observations suggest that signals present in genomic rGH DNA may serve to establish appropriate nucleosome alignment during development and, possibly, to restore nucleosome alignment to the transcribed region after disruption incurred by the passage of an RNA polymerase molecule, thereby facilitating subsequent rounds of transcription.