981 results for ACID-SEQUENCES


Relevance:

100.00% 100.00%

Publisher:

Abstract:

Plant seeds contain a large number of protease inhibitors of animal, fungal, and bacterial origin. One of the well-studied families of these inhibitors is the Bowman-Birk family (BBI). The BBIs from dicotyledonous seeds are 8K, double-headed proteins. In contrast, the 8K inhibitors from monocotyledonous seeds are single-headed. Monocots also have a 16K, double-headed inhibitor. We have determined the primary structure of a Bowman-Birk inhibitor from a dicot, horsegram, by sequential Edman analysis of the intact protein and peptides derived from enzymatic and chemical cleavage. The 76-residue-long inhibitor is very similar to that of Macrotyloma axillare. An analysis of this inhibitor along with 26 other Bowman-Birk inhibitor domains (MW 8K) available in the SWISSPROT databank revealed that the proteins from monocots and dicots belong to related but distinct families. Inhibitors from monocots show larger variation in sequence. Sequence comparison shows that a crucial disulphide which connects the amino and carboxy termini of the active site loop is lost in monocots. The loss of a reactive site in monocots seems to be correlated with this. However, it appears that this disulphide is not absolutely essential for retention of inhibitory function. Our analysis suggests that gene duplication leading to a 16K inhibitor in monocots has occurred, probably after the divergence of monocots and dicots, and also after the loss of the second reactive site in monocots.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino-acid sequence of a protein to fit to known protein 3D structures using a structural alphabet, known as Protein Blocks (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. They are used to encode 3D protein structures into 1D PB sequences and to capture sequence-to-structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods, nor does it explicitly incorporate the hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivity of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales up well and offers promising perspectives for structural annotations at genomic level. It has been implemented in the form of a web server that is freely available at http://www.bo-protscience.fr/forsa.
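
The scoring idea in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes one single-position log-odds table per PB letter (the published matrices may cover a window of positions), gapless global threading, and a Z-score computed against shuffled copies of the query. All function names and the toy training data are hypothetical.

```python
import math
import random
from collections import Counter, defaultdict

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def build_log_odds(training_pairs, pseudocount=1.0):
    """Build one amino-acid log-odds table per PB letter.

    training_pairs: iterable of (amino_acid_sequence, pb_sequence) of equal length.
    Assumption: one table per PB, single position only.
    """
    counts = defaultdict(Counter)
    background = Counter()
    for aa_seq, pb_seq in training_pairs:
        for aa, pb in zip(aa_seq, pb_seq):
            counts[pb][aa] += 1
            background[aa] += 1
    total_bg = sum(background.values()) + pseudocount * len(AMINO_ACIDS)
    tables = {}
    for pb, ctr in counts.items():
        total_pb = sum(ctr.values()) + pseudocount * len(AMINO_ACIDS)
        tables[pb] = {
            aa: math.log(((ctr[aa] + pseudocount) / total_pb)
                         / ((background[aa] + pseudocount) / total_bg))
            for aa in AMINO_ACIDS
        }
    return tables

def thread_score(aa_seq, pb_seq, tables):
    """Gapless score of a query sequence placed on a PB-encoded template."""
    if len(aa_seq) != len(pb_seq):
        raise ValueError("gapless threading needs equal lengths")
    return sum(tables[pb][aa] for aa, pb in zip(aa_seq, pb_seq))

def z_score(aa_seq, pb_seq, tables, n_shuffles=200, seed=0):
    """Z-score of the observed score against shuffled versions of the query."""
    rng = random.Random(seed)
    observed = thread_score(aa_seq, pb_seq, tables)
    residues = list(aa_seq)
    null_scores = []
    for _ in range(n_shuffles):
        rng.shuffle(residues)
        null_scores.append(thread_score(residues, pb_seq, tables))
    mean = sum(null_scores) / len(null_scores)
    var = sum((s - mean) ** 2 for s in null_scores) / len(null_scores)
    return (observed - mean) / math.sqrt(var) if var > 0 else 0.0

# Toy data (hypothetical): alanine co-occurs with PB 'm', leucine with PB 'd'.
toy_training = [("AAAALLLL", "mmmmdddd"), ("ALALLLAA", "mdmdddmm")]
toy_tables = build_log_odds(toy_training)
```

A template would be accepted only if its Z-score exceeded a cutoff calibrated for the desired specificity; the cutoff itself is not given in the abstract and is left out here.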

Relevance:

100.00% 100.00%

Publisher:

Abstract:

IntFOLD is an independent web server that integrates our leading methods for structure and function prediction. The server provides a simple unified interface that aims to make complex protein modelling data more accessible to life scientists. The server web interface is designed to be intuitive and integrates a complex set of quantitative data, so that 3D modelling results can be viewed on a single page and interpreted by non-expert modellers at a glance. The only required input to the server is an amino acid sequence for the target protein. Here we describe major performance and user interface updates to the server, which comprises an integrated pipeline of methods for: tertiary structure prediction, global and local 3D model quality assessment, disorder prediction, structural domain prediction, function prediction and modelling of protein-ligand interactions. The server has been independently validated during numerous CASP (Critical Assessment of Techniques for Protein Structure Prediction) experiments, as well as being continuously evaluated by the CAMEO (Continuous Automated Model Evaluation) project. The IntFOLD server is available at: http://www.reading.ac.uk/bioinf/IntFOLD/

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Phenylamidine cationic groups linked by a furan ring (furamidine) and related compounds bind as monomers to AT sequences of DNA. An unsymmetric derivative (DB293) with one of the phenyl rings of furamidine replaced with a benzimidazole has been found by quantitative footprinting analyses to bind to GC-containing sites on DNA more strongly than to pure AT sequences. NMR structural analysis and surface plasmon resonance binding results clearly demonstrate that DB293 binds in the minor groove at specific GC-containing sequences of DNA in a highly cooperative manner as a stacked dimer. Neither the symmetric bisphenyl nor the bisbenzimidazole analog of DB293 binds significantly to the GC-containing sequences. DB293 provides a paradigm for design of compounds for specific recognition of mixed DNA sequences and extends the boundaries for small molecule-DNA recognition.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Evolutionary selection of sequences is studied with a knowledge-based Hamiltonian to find the design principle for folding to a model protein structure. With sequences selected by naive energy minimization, the model structure tends to be unstable and the folding ability is low. Sequences with high folding ability have not only the low-lying energy minimum but also an energy landscape which is similar to that found for the native sequence over a wide region of the conformation space. Though there is a large fluctuation in foldable sequences, the hydrophobicity pattern and the glycine locations are preserved among them. Implications of the design principle for the molecular mechanism of folding are discussed.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

1976 ed. issued under title: Variable regions of immunoglobulin chains.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

A single-tube RT-PCR technique generated a 387 bp or 300 bp cDNA amplicon covering the F0 cleavage site or the carboxyl (C)-terminus of the HN gene, respectively, of Newcastle disease virus (NDV) strain 1-2. Sequence analysis was used to deduce the amino acid sequences of the cleavage site of F protein and the C-terminus of HN protein, which were then compared with sequences for other NDV strains. The cleavage site of NDV strain 1-2 had a sequence motif of (112)RKQGRLIG(119), consistent with an avirulent phenotype. Nucleotide sequencing and deduction of amino acids at the C-terminus of HN revealed that strain 1-2 had a 7-amino-acid extension (VEILKDGVREARSSR). This differs from the virulent viruses that caused outbreaks of Newcastle disease in Australia in the 1930s and 1990s, which have HN extensions of 0 and 9 amino acids, respectively. Amino acid sequence analyses of the F and HN genes of strain 1-2 confirmed its avirulent nature and its Australian origin.
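
How a cleavage-site motif relates to a predicted phenotype can be sketched in Python. The rule used here is an assumption that does not come from the abstract: it is the commonly cited molecular criterion that virulent strains carry at least three basic residues (R or K) at positions 113-116 together with phenylalanine at position 117. The function name and labels are hypothetical, and a sequence-based call does not replace in vivo pathogenicity testing.

```python
BASIC_RESIDUES = set("RK")

def classify_cleavage_site(motif, start=112):
    """Classify an F protein cleavage-site motif.

    motif: residues numbered consecutively from `start` (112 in the abstract).
    Assumed rule (not from the abstract): >= 3 basic residues at 113-116
    plus F at 117 indicates a virulent-like site.
    """
    by_position = {start + i: aa for i, aa in enumerate(motif.upper())}
    if any(p not in by_position for p in range(113, 118)):
        raise ValueError("motif must cover residues 113-117")
    n_basic = sum(by_position[p] in BASIC_RESIDUES for p in range(113, 117))
    if n_basic >= 3 and by_position[117] == "F":
        return "virulent-like"
    return "avirulent-like"

# Motif reported in the abstract for strain 1-2, residues 112-119.
strain_call = classify_cleavage_site("RKQGRLIG")
```

Under this assumed rule the reported motif has two basic residues at 113-116 and leucine at 117, which matches the avirulent call made in the abstract.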

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Motivation: In any macromolecular polyprotic system (for example protein, DNA or RNA) the isoelectric point, commonly referred to as the pI, can be defined as the point of singularity in a titration curve, corresponding to the solution pH value at which the net overall surface charge, and thus the electrophoretic mobility, of the ampholyte sums to zero. Different modern analytical biochemistry and proteomics methods depend on the isoelectric point as a principal feature for protein and peptide characterization. Protein separation by isoelectric point is a critical part of 2-D gel electrophoresis, a key precursor of proteomics, where discrete spots can be digested in-gel, and proteins subsequently identified by analytical mass spectrometry. Peptide fractionation according to pI is also widely used in current proteomics sample preparation procedures prior to LC-MS/MS analysis. Therefore accurate theoretical prediction of pI would expedite such analysis. While such pI calculation is widely used, it remains largely untested, motivating our efforts to benchmark pI prediction methods. Results: Using data from the database PIP-DB and one publicly available dataset as our reference gold standard, we have undertaken the benchmarking of pI calculation methods. We find that methods vary in their accuracy and are highly sensitive to the choice of basis set. The machine-learning algorithms, especially the SVM-based algorithm, showed a superior performance when studying peptide mixtures. In general, learning-based pI prediction methods (such as Cofactor, SVM and Branca) require a large training dataset and their resulting performance depends strongly on the quality of that data. In contrast to iterative methods, machine-learning algorithms have the advantage of being able to add new features to improve the accuracy of prediction.
Contact: yperez@ebi.ac.uk Availability and Implementation: The software and data are freely available at https://github.com/ypriverol/pIR. Supplementary information: Supplementary data are available at Bioinformatics online.
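
An iterative pI calculation of the kind benchmarked here can be sketched in a few lines of Python. This is a generic Henderson-Hasselbalch bisection, not code from the pIR package. The pKa values below are illustrative approximations chosen for the example and are an assumption; published pKa sets differ, which is one reason the abstract reports sensitivity to the choice of set.

```python
# Illustrative pKa values (assumption, not a specific published scale).
POSITIVE_PKA = {"K": 10.5, "R": 12.5, "H": 6.0}
NEGATIVE_PKA = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}
N_TERM_PKA = 9.0
C_TERM_PKA = 2.3

def net_charge(sequence, ph):
    """Net charge of a peptide at a given pH (Henderson-Hasselbalch)."""
    charge = 1.0 / (1.0 + 10 ** (ph - N_TERM_PKA))    # free N-terminus
    charge -= 1.0 / (1.0 + 10 ** (C_TERM_PKA - ph))   # free C-terminus
    for aa in sequence.upper():
        if aa in POSITIVE_PKA:
            charge += 1.0 / (1.0 + 10 ** (ph - POSITIVE_PKA[aa]))
        elif aa in NEGATIVE_PKA:
            charge -= 1.0 / (1.0 + 10 ** (NEGATIVE_PKA[aa] - ph))
    return charge

def isoelectric_point(sequence, tol=1e-4):
    """Find the pH where net charge crosses zero by bisection on [0, 14].

    Works because net charge decreases monotonically as pH rises.
    """
    low, high = 0.0, 14.0
    while high - low > tol:
        mid = (low + high) / 2.0
        if net_charge(sequence, mid) > 0:
            low = mid    # still positive: pI lies at higher pH
        else:
            high = mid   # negative or zero: pI lies at lower pH
    return (low + high) / 2.0
```

Swapping in a different pKa dictionary changes the predicted pI without touching the search loop, which makes it easy to compare pKa sets on the same peptides.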

Relevance:

70.00% 70.00%

Publisher:

Abstract:

Background: Designing novel proteins with site-directed recombination has enormous prospects. By locating effective recombination sites for swapping sequence parts, the probability that hybrid sequences have the desired properties is increased dramatically. The prohibitive requirements for applying current tools led us to investigate machine learning to assist in finding useful recombination sites from amino acid sequence alone. Results: We present STAR, Site Targeted Amino acid Recombination predictor, which produces a score indicating the structural disruption caused by recombination, for each position in an amino acid sequence. Example predictions, contrasted with those of alternative tools, illustrate STAR's utility in determining useful recombination sites. Overall, the correlation coefficient between the output of the experimentally validated protein design algorithm SCHEMA and the prediction of STAR is very high (0.89). Conclusion: STAR allows the user to explore useful recombination sites in amino acid sequences with unknown structure and unknown evolutionary origin. The predictor service is available from http://pprowler.itee.uq.edu.au/star.

Relevance:

70.00% 70.00%

Publisher:

Abstract:

Amino acid sequences of proteinaceous proteinase inhibitors have been extensively analysed for deriving information regarding the molecular evolution and functional relationship of these proteins. These sequences have been grouped into several well-defined families. It was found that the phylogeny constructed with the sequences corresponding to the exposed loop responsible for inhibition has several branches that resemble those obtained from comparisons using the entire sequence. The major branches of the unrooted tree corresponded to the families to which the inhibitors belonged. Further branching is related to the enzyme specificity of the inhibitor. Examination of the active site loop sequences of trypsin inhibitors revealed that there are strong preferences for specific amino acids at different positions of the loop. These preferences are inhibitor-class specific. Inhibitors active against more than one enzyme occur within a class and conform to the class-specific sequence in their loops. Hence, only a few positions in the loop seem to determine the specificity. The ability to inhibit the same enzyme by inhibitors that belong to different classes appears to be a result of convergent evolution.

Relevance:

70.00% 70.00%

Publisher:

Abstract:

Hir/Hira (histone regulation) genes were first identified in yeast as negative regulators of histone gene expression. It has been confirmed that HIRA is a conserved family of proteins present in various animals and plants. In this paper, cDNAs of the Hira homologs, named CagHira and CaHira, were isolated from gynogenetic gibel carp (gyno-carp) and gonochoristic color crucian carp (gono-carp), respectively. The full-length CagHira is 3,860 bp in length with an open reading frame (ORF) of 3,033 bp that encodes 1,011 amino acids, while the full-length CaHira is 3,748 bp in length and also has an ORF of 3,033 bp. The deduced amino acid sequences of both Hira homologs contain seven WD domains and show high identity with other HIRA family members. RT-PCR analyses revealed strong expression of Hira in the ovaries, whereas no expression was detected in the testes of either fish. Hira transcription was not detected in the liver of gyno-carp, but a high level of Hira mRNA was observed in gono-carp. The temporal expression pattern showed that the Hira mRNA is consistently expressed during all embryonic development stages in gyno-carp. However, the abundance of CaHira mRNA significantly decreased (P < 0.05) shortly after fertilization and then increased again and remained stable from gastrula until hatching. The varying spatiotemporal expression patterns of Hira genes in gyno-carp and gono-carp may be associated with the differing reproductive modes used by these two closely related fishes. Our results suggest that Hira may play a role not only in the decondensation of the sperm nucleus and the formation of the pronucleus during fertilization, but also in gastrulation and the subsequent development of embryos.

Relevance:

70.00% 70.00%

Publisher:

Abstract:

Growth hormone (GH), prolactin (PRL) and somatolactin (SL) were purified simultaneously under alkaline conditions (pH 9.0) from pituitary glands of sea perch (Lateolabrax japonicus) by a two-step procedure involving gel filtration on Sephadex G-100 and reverse-phase high-performance liquid chromatography (rpHPLC). At each step of purification, fractions were monitored by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) and by immunoblotting with chum salmon GH, PRL and SL antisera. The yields of sea perch GH, PRL and SL were 4.2, 1.0 and 0.28 mg/g wet tissue, respectively. Molecular weights of 19,200 and 20,370 Da were estimated by SDS-PAGE for sea perch GH and PRL, respectively. Two forms of sea perch SL were found: one (28,400 Da) is probably glycosylated, while the other one (23,200 Da) is believed to be deglycosylated. GH bioactivity was examined by an in vivo assay. Intraperitoneal injection of sea perch GH at doses of 0.01 and 0.1 μg/g body weight at 7-day intervals resulted in a significant increase in body weight and length of juvenile rainbow trout. The complete sea-perch GH amino acid sequence of 187 residues was determined by sequencing fragments cleaved by chemicals and enzymes. Alignment of sea-perch GH with other fish GHs revealed that sea-perch GH is most similar to those of advanced marine fish, such as tuna, gilthead sea bream, yellowfin porgy, red sea bream, bonito and yellow tail, with 98.4%, 96.2%, 95.7%, 95.2%, 94.1% and 91% sequence identity, respectively. Sea-perch GH has low identity to those of Atlantic cod (76.5%), hardtail (73.3%), flounder (68.4%), chum salmon (66.3%), carp (54%) and blue shark (38%). A partial amino-acid sequence of 127 residues of sea-perch PRL and the N-terminal 16-residue sequence of sea-perch SL have been determined. The data show that sea-perch PRL has a slightly higher sequence identity with tilapia PRL (73.2%) than with chum salmon PRL (70%) over this 127-residue sequence. © 2001 Elsevier Science B.V. All rights reserved.
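
Percent identity figures such as those quoted in this abstract depend on how the denominator is defined. The Python sketch below shows one common convention, counting only alignment columns in which neither sequence has a gap; the abstract does not state which convention was used, so this choice is an assumption, and the function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: columns containing a gap in either sequence are skipped,
    so the denominator is the number of gap-free columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared
```

As a purely arithmetic illustration, 184 identical positions out of 187 compared would round to 98.4%; the abstract does not report the underlying counts.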

Relevance:

70.00% 70.00%

Publisher:

Abstract:

The molecular networks regulating the G1-S transition in budding yeast and mammals are strikingly similar in network structure. However, many of the individual proteins performing similar network roles appear to have unrelated amino acid sequences, suggesting either extremely rapid sequence evolution, or true polyphyly of proteins carrying out identical network roles. A yeast/mammal comparison suggests that network topology, and its associated dynamic properties, rather than regulatory proteins themselves may be the most important elements conserved through evolution. However, recent deep phylogenetic studies show that fungal and animal lineages are relatively closely related in the opisthokont branch of eukaryotes. The presence in plants of cell cycle regulators such as Rb, E2F and cyclins A and D, that appear lost in yeast, suggests cell cycle control in the last common ancestor of the eukaryotes was implemented with this set of regulatory proteins. Forward genetics in non-opisthokonts, such as plants or their green algal relatives, will provide direct information on cell cycle control in these organisms, and may elucidate the potentially more complex cell cycle control network of the last common eukaryotic ancestor.

Relevance:

70.00% 70.00%

Publisher:

Abstract:

Wzx belongs to a family of membrane proteins involved in the translocation of isoprenoid lipid-linked glycans, which is loosely related to members of the major facilitator superfamily. Despite Wzx homologs performing a conserved function, it has been difficult to pinpoint specific motifs of functional significance in their amino acid sequences. Here, we elucidate the topology of the Escherichia coli O157 Wzx (Wzx(EcO157)) by a combination of bioinformatics and substituted cysteine scanning mutagenesis, as well as targeted deletion-fusions to green fluorescent protein and alkaline phosphatase. We conclude that Wzx(EcO157) consists of 12 transmembrane (TM) helices and six periplasmic and five cytosolic loops, with N and C termini facing the cytoplasm. Four TM helices (II, IV, X, and XI) contain polar residues (aspartic acid or lysine), and they may form part of a relatively hydrophilic core. Thirty-five amino acid replacements to alanine or serine were targeted to five native cysteines and most of the aspartic acid, arginine, and lysine residues. From these, only replacements of aspartic acid-85, aspartic acid-326, arginine-298, and lysine-419 resulted in a protein unable to support O-antigen production. Aspartic acid-85 and lysine-419 are located in TM helices II and XI, while arginine-298 and aspartic acid-326 are located in periplasmic and cytosolic loops 4, respectively. Further analysis revealed that the charge at these positions is required for Wzx function since conservative substitutions maintaining the same charge polarity resulted in a functional protein, whereas those reversing or eliminating polarity abolished function. We propose that the functional requirement of charged residues at both sides of the membrane and in two TM helices could be important to allow the passage of the Und-PP-linked saccharide substrate across the membrane.
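
The loop counts reported in this abstract follow from the helix count and the location of the N terminus, since each transmembrane helix crosses the membrane once. The short Python sketch below is only a bookkeeping check of that arithmetic; the function name is hypothetical and nothing here models the protein itself.

```python
def membrane_topology(n_helices, n_terminus="cytoplasm"):
    """Return (loop locations in order, C-terminus location).

    Each transmembrane helix flips the chain to the other side of the
    membrane, so loops alternate sides starting opposite the N terminus.
    """
    flip = {"cytoplasm": "periplasm", "periplasm": "cytoplasm"}
    side = n_terminus
    loops = []
    for helix in range(1, n_helices + 1):
        side = flip[side]          # chain has crossed the membrane
        if helix < n_helices:
            loops.append(side)     # loop between this helix and the next
    return loops, side

# Values from the abstract: 12 TM helices, N terminus in the cytoplasm.
loops, c_terminus = membrane_topology(12, "cytoplasm")
n_periplasmic = loops.count("periplasm")
n_cytosolic = loops.count("cytoplasm")
```

With 12 helices and a cytoplasmic N terminus this gives six periplasmic loops, five cytosolic loops and a cytoplasmic C terminus, consistent with the topology described above.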