34 results for Alignments.


Relevance:

10.00%

Publisher:

Abstract:

Multiple lipoxygenase sequence alignments and structural modeling of the enzyme/substrate interaction of the cucumber lipid body lipoxygenase suggested histidine 608 as the primary determinant of positional specificity. Replacement of this amino acid by a less-space-filling valine altered the positional specificity of this linoleate 13-lipoxygenase in favor of 9-lipoxygenation. These alterations may be explained by the H608V mutation unmasking the positively charged guanidino group of R758, which, in turn, may force an inverse head-to-tail orientation of the fatty acid substrate. The R758L+H608V double mutant exhibited a strongly reduced reaction rate and a random positional specificity. Trilinolein, which lacks free carboxylic groups, was oxygenated to the corresponding (13S)-hydro(pero)xy derivatives by both the wild-type enzyme and the linoleate 9-lipoxygenating H608V mutant. These data indicate the complete conversion of a linoleate 13-lipoxygenase to a 9-lipoxygenating species by a single point mutation. It is hypothesized that the H608V exchange may alter the orientation of the substrate at the active site and/or its steric configuration in such a way that a stereospecific dioxygen insertion at C-9 may exclusively take place.

Relevance:

10.00%

Publisher:

Abstract:

Alignments of homologous genes typically reveal a great diversity of intron locations, far more than could fit comfortably in a single gene. Thus, a minority of these intron positions could be inherited from a single ancestral gene, but the larger share must be attributed to subsequent events of intron gain or intron “sliding” (movement from one position to another within a gene). Intron sliding has been argued from cases of discordant introns and from putative spatial clustering of intron positions. A list of 32 cases of discordant introns is presented here. Most of these cases are found to be artefactual. The spatial and phylogenetic distributions of intron positions from five published compilations of gene data, comprising 205 intron positions, have been examined systematically for evidence of intron sliding. The results suggest that sliding, if it occurs at all, has contributed little to the diversity of intron positions.

Relevance:

10.00%

Publisher:

Abstract:

The discovery of cyanobacterial phytochrome histidine kinases, together with the evidence that phytochromes from higher plants display protein kinase activity, bind ATP analogs, and possess C-terminal domains similar to bacterial histidine kinases, has fueled the controversial hypothesis that the eukaryotic phytochrome family of photoreceptors are light-regulated enzymes. Here we demonstrate that purified recombinant phytochromes from a higher plant and a green alga exhibit serine/threonine kinase activity similar to that of phytochrome isolated from dark grown seedlings. Phosphorylation of recombinant oat phytochrome is a light- and chromophore-regulated intramolecular process. Based on comparative protein sequence alignments and biochemical cross-talk experiments with the response regulator substrate of the cyanobacterial phytochrome Cph1, we propose that eukaryotic phytochromes are histidine kinase paralogs with serine/threonine specificity whose enzymatic activity diverged from that of a prokaryotic ancestor after duplication of the transmitter module.

Relevance:

10.00%

Publisher:

Abstract:

The pore-forming α subunit of large conductance voltage- and Ca2+-sensitive K (MaxiK) channels is regulated by a β subunit that has two membrane-spanning regions separated by an extracellular loop. To investigate the structural determinants in the pore-forming α subunit necessary for β-subunit modulation, we made chimeric constructs between a human MaxiK channel and the Drosophila homologue, which we show is insensitive to β-subunit modulation, and analyzed the topology of the α subunit. A comparison of multiple sequence alignments with hydrophobicity plots revealed that MaxiK channel α subunits have a unique hydrophobic segment (S0) at the N terminus. This segment is in addition to the six putative transmembrane segments (S1–S6) usually found in voltage-dependent ion channels. The transmembrane nature of this unique S0 region was demonstrated by in vitro translation experiments. Moreover, normal functional expression of signal sequence fusions and in vitro N-linked glycosylation experiments indicate that S0 leads to an exoplasmic N terminus. Therefore, we propose a new model where MaxiK channels have a seventh transmembrane segment at the N terminus (S0). Chimeric exchange of 41 N-terminal amino acids, including S0, from the human MaxiK channel to the Drosophila homologue transfers β-subunit regulation to the otherwise unresponsive Drosophila channel. Both the unique S0 region and the exoplasmic N terminus are necessary for this gain of function.
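The hydrophobicity-plot comparison mentioned in this abstract can be illustrated with a sliding-window hydropathy profile. This is a minimal sketch only: the abstract does not say which scale or window the authors used, so the Kyte-Doolittle scale, the 19-residue window, the 1.6 threshold and the function names `hydropathy_profile` and `candidate_segments` are all assumptions made for illustration.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the abstract does not name one).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=19):
    """Return the mean hydropathy of every full window along seq."""
    seq = seq.upper()
    if window < 1 or window > len(seq):
        return []
    values = [KD[aa] for aa in seq]
    total = sum(values[:window])
    profile = [total / window]
    # Slide the window one residue at a time, updating the running sum.
    for i in range(window, len(values)):
        total += values[i] - values[i - window]
        profile.append(total / window)
    return profile


def candidate_segments(profile, threshold=1.6):
    """Return (first, last) window-start indices of runs scoring above threshold."""
    segments, start = [], None
    for i, score in enumerate(profile):
        if score > threshold and start is None:
            start = i
        elif score <= threshold and start is not None:
            segments.append((start, i - 1))
            start = None
    if start is not None:
        segments.append((start, len(profile) - 1))
    return segments
```

Runs of windows above the threshold are only candidates for membrane-spanning segments; as the abstract notes, the transmembrane nature of S0 was established experimentally, not from the plot alone.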

Relevance:

10.00%

Publisher:

Abstract:

SNARE [soluble NSF (N-ethylmaleimide-sensitive fusion protein) attachment protein receptor] proteins are essential for membrane fusion and are conserved from yeast to humans. Sequence alignments of the most conserved regions were mapped onto the recently solved crystal structure of the heterotrimeric synaptic fusion complex. The association of the four α-helices in the synaptic fusion complex structure produces highly conserved layers of interacting amino acid side chains in the center of the four-helix bundle. Mutations in these layers reduce complex stability and cause defects in membrane traffic even in distantly related SNAREs. When syntaxin-4 is modeled into the synaptic fusion complex as a replacement of syntaxin-1A, no major steric clashes arise and the most variable amino acids localize to the outer surface of the complex. We conclude that the main structural features of the neuronal complex are highly conserved during evolution. On the basis of these features we have reclassified SNARE proteins into Q-SNAREs and R-SNAREs, and we propose that fusion-competent SNARE complexes generally consist of four-helix bundles composed of three Q-SNAREs and one R-SNARE.

Relevance:

10.00%

Publisher:

Abstract:

The distribution of optimal local alignment scores of random sequences plays a vital role in evaluating the statistical significance of sequence alignments. These scores can be well described by an extreme-value distribution. The distribution’s parameters depend upon the scoring system employed and the random letter frequencies; in general they cannot be derived analytically, but must be estimated by curve fitting. For obtaining accurate parameter estimates, a form of the recently described ‘island’ method has several advantages. We describe this method in detail, and use it to investigate the functional dependence of these parameters on finite-length edge effects.
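To make the extreme-value description concrete, the sketch below uses the standard form of that distribution for local alignment scores, in which the expected number of chance alignments scoring at least x between sequences of lengths m and n is E = K·m·n·exp(−λx) and the P-value is 1 − exp(−E). The function names are hypothetical, and the fitting routine is a simple method-of-moments estimate added for illustration; it is not the 'island' method that the abstract describes.

```python
import math
import statistics

EULER_GAMMA = 0.5772156649015329


def expected_count(score, m, n, lam, K):
    """Expected number of chance local alignments with score >= `score`."""
    return K * m * n * math.exp(-lam * score)


def p_value(score, m, n, lam, K):
    """P(S >= score) under the extreme-value distribution: 1 - exp(-E)."""
    return -math.expm1(-expected_count(score, m, n, lam, K))


def fit_gumbel_moments(scores, m, n):
    """Estimate (lam, K) from optimal scores of random sequence pairs.

    Method of moments (an assumption of this sketch, not the island method):
    the variance of the distribution is pi^2 / (6 lam^2), its mean is
    u + gamma / lam, and the location u equals ln(K m n) / lam.
    """
    lam = math.pi / (statistics.stdev(scores) * math.sqrt(6.0))
    u = statistics.mean(scores) - EULER_GAMMA / lam
    K = math.exp(lam * u) / (m * n)
    return lam, K
```

In practice `scores` would come from aligning many pairs of random sequences under the scoring system of interest; any numeric values passed for `lam` and `K` are placeholders until estimated that way.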

Relevance:

10.00%

Publisher:

Abstract:

Chromosome 7q22 has been the focus of many cytogenetic and molecular studies aimed at delineating regions commonly deleted in myeloid leukemias and myelodysplastic syndromes. We have compared a gene-dense, GC-rich sub-region of 7q22 with the orthologous region on mouse chromosome 5. A physical map of 640 kb of genomic DNA from mouse chromosome 5 was derived from a series of overlapping bacterial artificial chromosomes. A 296 kb segment from the physical map, spanning Ache to Tfr2, was compared with 267 kb of human sequence. We identified a conserved linkage of 12 genes including an open reading frame flanked by Ache and Asr2, a novel cation-chloride cotransporter interacting protein Cip1, Ephb4, Zan and Perq1. While some of these genes have been previously described, in each case we present new data derived from our comparative sequence analysis. Adjacent unfinished sequence data from the mouse contains an orthologous block of 10 additional genes including three novel cDNA sequences that we subsequently mapped to human 7q22. Methods for displaying comparative genomic information, including unfinished sequence data, are becoming increasingly important. We supplement our printed comparative analysis with a new, Web-based program called Laj (local alignments with java). Laj provides interactive access to archived pairwise sequence alignments via the WWW. It displays synchronized views of a dot-plot, a percent identity plot, a nucleotide-level local alignment and a variety of relevant annotations. Our mouse–human comparison can be viewed at http://web.uvic.ca/~bioweb/laj.html. Laj is available at http://bio.cse.psu.edu/, along with online documentation and additional examples of annotated genomic regions.
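The percent identity plot mentioned above is built from the percent identity of aligned segments. The sketch below shows one way to compute that quantity for a single gapped pairwise alignment. It is not code from Laj; the function name and the choice to ignore gap columns in the denominator are assumptions, and other conventions exist.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of gap-free columns in which both sequences carry the same letter."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # gap columns are skipped under this convention
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0
```

For example, `percent_identity("ACGT-A", "ACCTTA")` compares five gap-free columns, four of which match.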

Relevance:

10.00%

Publisher:

Abstract:

Ligand-Gated Ion Channels (LGIC) are polymeric transmembrane proteins involved in the fast response to numerous neurotransmitters. All these receptors are formed by homologous subunits and the last two decades revealed an unexpected wealth of genes coding for these subunits. The Ligand-Gated Ion Channel database (LGICdb) has been developed to handle this increasing amount of data. The database aims to provide only one entry for each gene, containing annotated nucleic acid and protein sequences. The repository is carefully structured and the entries can be retrieved by various criteria. In addition to the sequences, the LGICdb provides multiple sequence alignments, phylogenetic analyses and atomic coordinates when available. The database is accessible via the World Wide Web (http://www.pasteur.fr/recherche/banques/LGIC/LGIC.html), where it is continuously updated. The version 16 (September 2000) available for download contained 333 entries covering 34 species.

Relevance:

10.00%

Publisher:

Abstract:

The tmRNA database (tmRDB) is maintained at the University of Texas Health Science Center at Tyler, Texas, and accessible on the World Wide Web at the URL http://psyche.uthct.edu/dbs/tmRDB/tmRDB.html. Mirror sites are located at Auburn University, Auburn, Alabama (http://www.ag.auburn.edu/mirror/tmRDB/) and the Institute of Biological Sciences, Aarhus, Denmark (http://www.bioinf.au.dk/tmRDB/). The tmRDB provides information and citation links about tmRNA, a molecule that combines functions of tRNA and mRNA in trans-translation. tmRNA is likely to be present in all bacteria and has been found in algae chloroplasts, the cyanelle of Cyanophora paradoxa and the mitochondrion of the flagellate Reclinomonas americana. This release adds 26 new sequences and corresponding predicted tmRNA-encoded tag peptides for a total of 86 tmRNAs, ordered alphabetically and phylogenetically. Secondary structures and three-dimensional models in PDB format for representative molecules are being made available. tmRNA alignments prove individual base pairs and are generated manually assisted by computational tools. The alignments with their corresponding structural annotation can be obtained in various formats, including a new column format designed to improve and simplify computational usability of the data.

Relevance:

10.00%

Publisher:

Abstract:

Signal recognition particle (SRP) is a stable cytoplasmic ribonucleoprotein complex that serves to translocate secretory proteins across membranes during translation. The SRP Database (SRPDB) provides compilations of SRP components, ordered alphabetically and phylogenetically. Alignments emphasize phylogenetically-supported base pairs in SRP RNA and conserved residues in the proteins. Data are provided in various formats including a column arrangement for improved access and simplified computational usability. Included are motifs for identification of new sequences, SRP RNA secondary structure diagrams, 3-D models and links to high-resolution structures. This release includes 11 new SRP RNA sequences (total of 129), two protein SRP9 sequences (total of seven), two protein SRP14 sequences (total of 10), two protein SRP19 sequences (total of 16), 10 new SRP54 (ffh) sequences (total of 66), two protein SRP68 sequences (total of seven) and two protein SRP72 sequences (total of nine). Seven sequences of the SRP receptor α-subunit and its FtsY homolog (total of 51) are new. Also considered are β-subunit of SRP receptor, Flhf, Hbsu, CaM kinase II and cpSRP43. Access to SRPDB is at http://psyche.uthct.edu/dbs/SRPDB/SRPDB.html and the European mirror http://www.medkem.gu.se/dbs/SRPDB/SRPDB.html

Relevance:

10.00%

Publisher:

Abstract:

In order to support the structural genomic initiatives, both by rapidly classifying newly determined structures and by suggesting suitable targets for structure determination, we have recently developed several new protocols for classifying structures in the CATH domain database (http://www.biochem.ucl.ac.uk/bsm/cath). These aim to increase the speed of classification of new structures using fast algorithms for structure comparison (GRATH) and to improve the sensitivity in recognising distant structural relatives by incorporating sequence information from relatives in the genomes (DomainFinder). In order to ensure the integrity of the database given the expected increase in data, the CATH Protein Family Database (CATH-PFDB), which currently includes 25 320 structural domains and a further 160 000 sequence relatives, has now been installed in a relational ORACLE database. This was essential for developing more rigorous validation procedures and for allowing efficient querying of the database, particularly for genome analysis. The associated Dictionary of Homologous Superfamilies [Bray,J.E., Todd,A.E., Pearl,F.M.G., Thornton,J.M. and Orengo,C.A. (2000) Protein Eng., 13, 153–165], which provides multiple structural alignments and functional information to assist in assigning new relatives, has also been expanded recently and now includes information for 903 homologous superfamilies. In order to improve coverage of known structures, preliminary classification levels are now provided for new structures at interim stages in the classification protocol. Since a large proportion of new structures can be rapidly classified using profile-based sequence analysis [e.g. PSI-BLAST: Altschul,S.F., Madden,T.L., Schaffer,A.A., Zhang,J., Zhang,Z., Miller,W. and Lipman,D.J. (1997) Nucleic Acids Res., 25, 3389–3402], this provides preliminary classification for easily recognisable homologues, which in the latest release of CATH (version 1.7) represented nearly three-quarters of the non-identical structures.

Relevance:

10.00%

Publisher:

Abstract:

BAliBASE is specifically designed to serve as an evaluation resource to address all the problems encountered when aligning complete sequences. The database contains high quality, manually constructed multiple sequence alignments together with detailed annotations. The alignments are all based on three-dimensional structural superpositions, with the exception of the transmembrane sequences. The first release provided sets of reference alignments dealing with the problems of high variability, unequal repartition and large N/C-terminal extensions and internal insertions. Here we describe version 2.0 of the database, which incorporates three new reference sets of alignments containing structural repeats, transmembrane sequences and circular permutations to evaluate the accuracy of detection/prediction and alignment of these complex sequences. BAliBASE can be viewed at the web site http://www-igbmc.u-strasbg.fr/BioInfo/BAliBASE2/index.html or can be downloaded from ftp://ftp-igbmc.u-strasbg.fr/pub/BAliBASE2/.
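A reference alignment is typically used by scoring how much of it a test alignment reproduces. The abstract does not describe a scoring procedure, so the following is only a sketch of one common measure, a sum-of-pairs score: the fraction of residue pairs aligned in the reference that are also aligned in the test alignment. The function names are hypothetical, and both alignments are assumed to contain the same sequences in the same order.

```python
from itertools import combinations


def aligned_pairs(alignment):
    """Return the set of residue pairs that share a column.

    Each pair is (seq_i, residue_index_i, seq_j, residue_index_j), where
    residue indices count non-gap characters from 0 within each sequence.
    """
    counters = [0] * len(alignment)
    pairs = set()
    for column in zip(*alignment):
        present = []
        for s, ch in enumerate(column):
            if ch != "-":
                present.append((s, counters[s]))
                counters[s] += 1
        for (i, a), (j, b) in combinations(present, 2):
            pairs.add((i, a, j, b))
    return pairs


def sum_of_pairs_score(test, reference):
    """Fraction of reference residue pairs reproduced by the test alignment."""
    ref_pairs = aligned_pairs(reference)
    if not ref_pairs:
        return 0.0
    return len(aligned_pairs(test) & ref_pairs) / len(ref_pairs)
```

A score of 1.0 means every residue pair aligned in the reference is also aligned in the test alignment; shifting a single gap lowers the score in proportion to the pairs it displaces.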

Relevance:

10.00%

Publisher:

Abstract:

The Conserved Key Amino Acid Positions DataBase (CKAAPs DB) provides access to an analysis of structurally similar proteins with dissimilar sequences where key residues within a common fold are identified. The derivation and significance of CKAAPs starting from pairwise structure alignments is described fully in Reddy et al. [Reddy,B.V.B., Li,W.W., Shindyalov,I.N. and Bourne,P.E. (2000) Proteins, in press]. The CKAAPs identified from this theoretical analysis are provided to experimentalists and theoreticians for potential use in protein engineering and modeling. It has been suggested that CKAAPs may be crucial features for protein folding, structural stability and function. Over 170 substructures, as defined by the Combinatorial Extension (CE) database, which are found in approximately 3000 representative polypeptide chains have been analyzed and are available in the CKAAPs DB. CKAAPs DB also provides CKAAPs of the representative set of proteins derived from the CE and FSSP databases. Thus the database contains over 5000 representative polypeptide chains, covering all known structures in the PDB. A web interface to a relational database permits fast retrieval of structure-sequence alignments, CKAAPs and associated statistics. Users may query by PDB ID, protein name, function and Enzyme Classification number. Users may also submit protein alignments of their own to obtain CKAAPs. An interface to display CKAAPs on each structure from a web browser is also being implemented. CKAAPs DB is maintained by the San Diego Supercomputer Center and accessible at the URL http://ckaaps.sdsc.edu.

Relevance:

10.00%

Publisher:

Abstract:

The Dali Domain Dictionary (http://www.ebi.ac.uk/dali/domain) is a numerical taxonomy of all known structures in the Protein Data Bank (PDB). The taxonomy is derived fully automatically from measurements of structural, functional and sequence similarities. Here, we report the extension of the classification to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families). The computational definitions of attractors and functional families are new. In September 2000, the Dali classification contained 10 531 PDB entries comprising 17 101 chains, which were partitioned into five attractor regions, 1375 fold types, 2582 functional families and 3724 domain sequence families. Sequence families were further associated with 99 582 unique homologous sequences in the HSSP database, which increases the number of effectively known structures several-fold. The resulting database contains the description of protein domain architecture, the definition of structural neighbours around each known structure, the definition of structurally conserved cores and a comprehensive library of explicit multiple alignments of distantly related protein families.

Relevance:

10.00%

Publisher:

Abstract:

The objective of the AsMamDB database is to facilitate the systematic study of alternatively spliced genes of mammals. Version 1.0 of AsMamDB contains 1563 alternatively spliced genes of human, mouse and rat, each associated with a cluster of nucleotide sequences. The main information provided by AsMamDB includes gene alternative splicing patterns, gene structures, chromosomal locations, gene products and the tissues in which the genes are expressed. Alternative splicing patterns are represented by multiple alignments of various gene transcripts and by graphs of their topological structures. Gene structures are illustrated by the distributions of exons, introns and various regulatory elements. There are 4204 DNAs, 3977 mRNAs, 8989 CDSs and 126 931 ESTs in the current database. More than 130 000 GenBank entries are covered and 4443 MEDLINE records are linked. DNA, mRNA, exon, intron and relevant regulatory element sequences are provided in FASTA format. More information can be obtained by using the web-based multiple alignment tool Asalign and various category lists. AsMamDB can be accessed at http://166.111.30.65/ASMAMDB.html.
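Because the sequences are distributed in FASTA format, a short reader is often the first step in working with them. The sketch below parses FASTA text into (header, sequence) pairs; the function name `parse_fasta` and the sample records in the usage note are hypothetical and are not taken from AsMamDB.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a list of (header, sequence) tuples."""
    records = []
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # ignore blank lines between records
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:].strip()
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records
```

For instance, `parse_fasta(">exon1 demo\nACGT\nAC\n>intron1\nGTAAGT\n")` joins the wrapped sequence lines of the first record and returns two records.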