965 resultados para protein sequence classification


Relevância:

80.00% 80.00%

Publicador:

Resumo:

Sequence-structure correlation studies are important in deciphering the relationships between various structural aspects, which may shed light on the protein-folding problem. The first step of this process is the prediction of secondary structure for a protein sequence of unknown three-dimensional structure. To this end, a web server has been created to predict the consensus secondary structure using well known algorithms from the literature. Furthermore, the server allows users to see the occurrence of predicted secondary structural elements in other structure and sequence databases and to visualize predicted helices as a helical wheel plot. The web server is accessible at http://bioserver1.physics.iisc.ernet.in/cssp/.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Large-scale gene discovery has been performed for the grass fungal endophytes Neotyphodium coenophialum, Neotyphodium lolii, and Epichloë festucae. The resulting sequences have been annotated by comparison with public DNA and protein sequence databases and using intermediate gene ontology annotation tools. Endophyte sequences have also been analysed for the presence of simple sequence repeat and single nucleotide polymorphism molecular genetic markers. Sequences and annotation are maintained within a MySQL database that may be queried using a custom web interface. Two cDNA-based microarrays have been generated from this genome resource. They permit the interrogation of 3806 Neotyphodium genes (NchipTM microarray), and 4195 Neotyphodium and 920 Epichloë genes (EndoChipTM microarray), respectively. These microarrays provide tools for high-throughput transcriptome analysis, including genome-specific gene expression studies, profiling of novel endophyte genes, and investigation of the host grass–symbiont interaction. Comparative transcriptome analysis in Neotyphodium and Epichloë was performed

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Aims: To examine the prevalence of bacteriocin production in Streptococcus bovis isolates from Australian ruminants and the feasibility of industrial production of bacteriocin. Methods and Results: Streptococcus bovis strains were tested for production of bacteriocin-like inhibitory substances (BLIS) by antagonism assay against Lactococcus lactis. BLIS production was associated with source animal location (i.e. proximity of other bacteriocin-positive source animals) rather than ruminant species/breed or diet. One bacteriocin showing strong inhibitory activity (Sb15) was isolated and examined. Protein sequence, stability and activity spectrum of this bovicin were very similar to bovicin HC5. Production could be increased through serial culturing, and increased productivity could be partially maintained during cold storage of cultures. Conclusions: BLIS production is geographically widely distributed in Eastern Australia, and it appears that the bacteriocin+ trait is maintained in animals at the same location. The HC5-like bacteriocin, originally identified in North America, is also found in Australia. Production of bacteriocin can be increased through serial culturing. Significance and Impact of the Study: The HC5-like bacteriocins appear to have a broad global distribution. Serial culturing may provide a route towards commercial manufacturing for use in industrial applications, and purified bacteriocin from S. bovis Sb15 could potentially be used to prevent food spoilage or as a feed additive to promote growth in ruminant species.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The work covered in this thesis is focused on the development of technology for bioconversion of glucose into D-erythorbic acid (D-EA) and 5-ketogluconic acid (5-KGA). The task was to show on proof-of-concept level the functionality of the enzymatic conversion or one-step bioconversion of glucose to these acids. The feasibility of both studies to be further developed for production processes was also evaluated. The glucose - D-EA bioconversion study was based on the use of a cloned gene encoding a D-EA forming soluble flavoprotein, D-gluconolactone oxidase (GLO). GLO was purified from Penicillium cyaneo-fulvum and partially sequenced. The peptide sequences obtained were used to isolate a cDNA clone encoding the enzyme. The cloned gene (GenBank accession no. AY576053) is homologous to the other known eukaryotic lactone oxidases and also to some putative prokaryotic lactone oxidases. Analysis of the deduced protein sequence of GLO indicated the presence of a typical secretion signal sequence at the N-terminus of the enzyme. No other targeting/anchoring signals were found, suggesting that GLO is the first known lactone oxidase that is secreted rather than targeted to the membranes of the endoplasmic reticulum or mitochondria. Experimental evidence supports this analysis, as near complete secretion of GLO was observed in two different yeast expression systems. Highest expression levels of GLO were obtained using Pichia pastoris as an expression host. Recombinant GLO was characterised and the suitability of purified GLO for the production of D-EA was studied. Immobilised GLO was found to be rapidly inactivated during D-EA production. The feasibility of in vivo glucose - D-EA conversion using a P. pastoris strain co-expressing the genes of GLO and glucose oxidase (GOD, E.C. 1.1.3.4) of A. niger was demonstrated. The glucose - 5-KGA bioconversion study followed a similar strategy to that used in the D-EA production research. The rationale was based on the use of a cloned gene encoding a membrane-bound pyrroloquinoline quinone (PQQ)-dependent gluconate 5-dehydrogenase (GA 5-DH). GA 5-DH was purified to homogeneity from the only source of this enzyme known in literature, Gluconobacter suboxydans, and partially sequenced. Using the amino acid sequence information, the GA 5-DH gene was cloned from a genomic library of G. suboxydans. The cloned gene was sequenced (GenBank accession no. AJ577472) and found to be an operon of two adjacent genes encoding two subunits of GA 5-DH. It turned out that GA 5-DH is a rather close homologue of a sorbitol dehydrogenase from another G. suboxydans strain. It was also found that GA 5-DH has significant polyol dehydrogenase activity. The G. suboxydans GA 5-DH gene was poorly expressed in E. coli. Under optimised conditions maximum expression levels of GA 5-DH did not exceed the levels found in wild-type G. suboxydans. Attempts to increase expression levels resulted in repression of growth and extensive cell lysis. However, the expression levels were sufficient to demonstrate the possibility of bioconversion of glucose and gluconate into 5-KGA using recombinant strains of E. coli. An uncharacterised homologue of GA 5-DH was identified in Xanthomonas campestris using in silico screening. This enzyme encoded by chromosomal locus NP_636946 was found by a sequencing project of X. campestris and named as a hypothetical glucose dehydrogenase. The gene encoding this uncharacterised enzyme was cloned, expressed in E. coli and found to encode a gluconate/polyol dehydrogenase without glucose dehydrogenase activity. Moreover, the X. campestris GA 5-DH gene was expressed in E. coli at nearly 30 times higher levels than the G. suboxydans GA 5-DH gene. Good expressability of the X. campestris GA-5DH gene makes it a valuable tool not only for 5-KGA production in the tartaric acid (TA) bioprocess, but possibly also for other bioprocesses (e.g. oxidation of sorbitol into L-sorbose). In addition to glucose - 5-KGA bioconversion, a preliminary study of the feasibility of enzymatic conversion of 5-KGA into TA was carried out. Here, the efficacy of the first step of a prospective two-step conversion route including a transketolase and a dehydrogenase was confirmed. It was found that transketolase convert 5-KGA into TA semialdehyde. A candidate for the second step was suggested to be succinic dehydrogenase, but this was not tested. The analysis of the two subprojects indicated that bioconversion of glucose to TA using X. campestris GA 5-DH should be prioritised first and the process development efforts in future should be focused on development of more efficient GA 5-DH production strains by screening a more suitable production host and by protein engineering.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

In this paper, we present numerical evidence that supports the notion of minimization in the sequence space of proteins for a target conformation. We use the conformations of the real proteins in the Protein Data Bank (PDB) and present computationally efficient methods to identify the sequences with minimum energy. We use edge-weighted connectivity graph for ranking the residue sites with reduced amino acid alphabet and then use continuous optimization to obtain the energy-minimizing sequences. Our methods enable the computation of a lower bound as well as a tight upper bound for the energy of a given conformation. We validate our results by using three different inter-residue energy matrices for five proteins from protein data bank (PDB), and by comparing our energy-minimizing sequences with 80 million diverse sequences that are generated based on different considerations in each case. When we submitted some of our chosen energy-minimizing sequences to Basic Local Alignment Search Tool (BLAST), we obtained some sequences from non-redundant protein sequence database that are similar to ours with an E-value of the order of 10(-7). In summary, we conclude that proteins show a trend towards minimizing energy in the sequence space but do not seem to adopt the global energy-minimizing sequence. The reason for this could be either that the existing energy matrices are not able to accurately represent the inter-residue interactions in the context of the protein environment or that Nature does not push the optimization in the sequence space, once it is able to perform the function.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The non-oxidative decarboxylation of aromatic acids is a poorly understood reaction. The transformation of 2,3-dihydroxybenzoic acid to catechol in the fungal metabolism of indole is a prototype of such a reaction. 2,3-Dihydroxybenzoic acid decarboxylase (EC 4.1.1.46) which catalyzes this reaction was purified to homogeneity from anthranilate induced cultures of Aspergillus oryzae using affinity chromatography. The enzyme did not require cofactors like NAD(+), PLP, TPP or metal ions for its activity. There was no spectral evidence for the presence of enzyme bound cofactors. The preparation, which was adjudged homogeneous by the criteria of SDS-PAGE, sedimentation analysis and N-terminal analysis, was characterized for its physicochemical and kinetic parameters. The enzyme was inactivated by group-specific modifiers like diethyl pyrocarbonate (DEPC) and N-ethylmaleimide (NEM). The kinetics of inactivation by DEPC suggested the presence of a single class of essential histidine residues, the second order rate constant of inactivation for which was 12.5 M(-1) min(-1). A single class of cysteine residues was modified by NEM with a second order rate constant of 33 M(-1) min(-1). Substrate analogues protected the enzyme against inactivation by both DEPC and NEM, suggesting the Location of the essential histidine and cysteine to be at the active site of the enzyme. The incorporation of radiolabelled NEM in a differential labelling experiment was 0.73 mol per mol subunit confirming the presence of a single essential cysteine per active-site. Differentially labelled enzyme was enzymatically cleaved and the peptide bearing the label was purified and sequenced. The active-site peptide LLGLAETCK and the N-terminal sequence MLGKIALEEAFALPRFEEKT did not bear any similarity to sequences reported in the Swiss-Prot Protein Sequence Databank, a reflection probably of the unique primary structure of this novel enzyme. The sequences reported in this study will appear in the Swiss-Prot Protein Sequence Databank under the accession number P80402.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The Basic Local Alignment Search Tool (BLAST) is one of the most widely used sequence alignment programs with which similarity searches, for both protein and nucleic acid sequences, can be performed against large databases at high speed. A large number of tools exist for processing BLAST output, but none of them provide three-dimensional structure visualization. This shortcoming has been addressed in the proposed tool BLAST Server for Structural Biologists (BSSB), which maps a BLAST output onto the three-dimensional structure of the subject protein. The three-dimensional structure of the subject protein is represented using a three-color coding scheme (identical: red; similar: yellow; and mismatch: white) based on the pairwise alignment obtained. Thus, the user will be able to visualize a possible three-dimensional structure for the query protein sequence. This information can be used to gain a deeper insight into the sequence-structure correlation. Furthermore, the additional structure-level information enables the user to make coherent and logical decisions regarding the type of input model structure or fragment that can be used for molecular replacement calculations. This tool is freely available to all users at http://bioserver1.physics.iisc.ernet.in/bssb/.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

A palindrome is a set of characters that reads the same forwards and backwards. Since the discovery of palindromic peptide sequences two decades ago, little effort has been made to understand its structural, functional and evolutionary significance. Therefore, in view of this, an algorithm has been developed to identify all perfect palindromes (excluding the palindromic subset and tandem repeats) in a single protein sequence. The proposed algorithm does not impose any restriction on the number of residues to be given in the input sequence. This avant-garde algorithm will aid in the identification of palindromic peptide sequences of varying lengths in a single protein sequence.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Genomic sequences are far from being random but are made up of systematically ordered and information rich patterns. These repeated sequence patterns have been vastly utilized for their fundamental importance in understanding the genome function and organization. To this end, a comprehensive toolkit, RepEx, has been developed which extracts repeat (inverted, everted and mirror) patterns from the given genome sequence(s) without any constraints. The toolkit can also be used to fetch the inverted repeats present in the protein sequence (s). Further, it is capable of extracting exact and degenerate repeats with a user defined spacer intervals. It is remarkably more precise and sensitive when compared to the existing tools. An example with comprehensive case studies and a performance evaluation of the proposed toolkit has been presented to authenticate its efficiency and accuracy. (C) 2013 Elsevier Inc. All rights reserved.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

As a basic tool of modern biology, sequence alignment can provide us useful information in fold, function, and active site of protein. For many cases, the increased quality of sequence alignment means a better performance. The motivation of present work is to increase ability of the existing scoring scheme/algorithm by considering residue–residue correlations better. Based on a coarse-grained approach, the hydrophobic force between each pair of residues is written out from protein sequence. It results in the construction of an intramolecular hydrophobic force network that describes the whole residue–residue interactions of each protein molecule, and characterizes protein's biological properties in the hydrophobic aspect. A former work has suggested that such network can characterize the top weighted feature regarding hydrophobicity. Moreover, for each homologous protein of a family, the corresponding network shares some common and representative family characters that eventually govern the conservation of biological properties during protein evolution. In present work, we score such family representative characters of a protein by the deviation of its intramolecular hydrophobic force network from that of background. Such score can assist the existing scoring schemes/algorithms, and boost up the ability of multiple sequences alignment, e.g. achieving a prominent increase (50%) in searching the structurally alike residue segments at a low identity level. As the theoretical basis is different, the present scheme can assist most existing algorithms, and improve their efficiency remarkably.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The design, synthesis, and characterization of two novel metalloprotein motifs is presented. The first project involved the design and construction of a protein motif which was programmed to form a tetradentate metal complex upon the addition of metal cations. The overall structure of the motif was based on a ββ super-secondary structure consisting of a flexible peptide sequence flanked by metal binding regions located at the carboxy and amino termini. The metal binding region near the amino terminus was constructed from a reverse turn motif with two metal ligating residues, (2R, 3R)-β-methyl-cysteine and histidine. Selection of the peptide sequence for this region was based on the conformational analysis of a series of tetrapeptides designed to form reverse turns in solution.

The stereospecific syntheses of a series of novel bipyridyl- and phenanthrolylsubstituted amino acids was carried out to provide ligands for the carboxy terminus metal binding region. These residues were incorporated into peptide sequences using solid phase peptide synthesis protocols, and metal binding studies indicated that the metal binding properties of these ligands was dictated by the specific regioisomer of the heteroaromatic ring and the peptide primary sequence.

Finally, a peptide containing optimized components for the metal binding regions was prepared to test the ability of the compound to form the desired intramolecular peptide:metal cation complexes. Metal binding studies demonstrated that the peptide formed monomeric complexes with very high metal cation binding affinities and that the two metal binding regions act cooperatively in the metal binding process. The use of these systems in the design of proteins capable of regulating naturally occurring proteins is discussed.

The second project involved the semisynthesis of two horse heart cytochrome c mutants incorporating the bipyridyl-amino acids at position 72 of the protein sequence. Structural studies on the proteins indicated that the bipyridyl amino acids had a neglible effect on the protein structure. One of the mutants was modified with Ru(bpy)_2^(+2) to form a redox-active protein, and the modified protein was found to have enhanced electron transfer properties between the heme and the introduced metal site.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Because so little is known about the structure of membrane proteins, an attempt has been made in this work to develop techniques by which to model them in three dimensions. The procedures devised rely heavily upon the availability of several sequences of a given protein. The modelling procedure is composed of two parts. The first identifies transmembrane regions within the protein sequence on the basis of hydrophobicity, β-turn potential, and the presence of certain amino acid types, specifically, proline and basic residues. The second part of the procedure arranges these transmembrane helices within the bilayer based upon the evolutionary conservation of their residues. Conserved residues are oriented toward other helices and variable residues are positioned to face the surrounding lipids. Available structural information concerning the protein's helical arrangement, including the lengths of interhelical loops, is also taken into account. Rhodopsin, band 3, and the nicotinic acetylcholine receptor have all been modelled using this methodology, and mechanisms of action could be proposed based upon the resulting structures.

Specific residues in the rhodopsin and iodopsin sequences were identified, which may regulate the proteins' wavelength selectivities. A hinge-like motion of helices M3, M4, and M5 with respect to the rest of the protein was proposed to result in the activation of transducin, the G-protein associated with rhodopsin. A similar mechanism is also proposed for signal transduction by the muscarinic acetylcholine and β-adrenergic receptors.

The nicotinic acetylcholine receptor was modelled with four trans-membrane helices per subunit and with the five homologous M2 helices forming the cation channel. Putative channel-lining residues were identified and a mechanism of channel-opening based upon the concerted, tangential rotation of the M2 helices was proposed.

Band 3, the anion exchange protein found in the erythrocyte membrane, was modelled with 14 transmembrane helices. In general the pathway of anion transport can be viewed as a channel composed of six helices that contains a single hydrophobic restriction. This hydrophobic region will not allow the passage of charged species, unless they are part of an ion-pair. An arginine residue located near this restriction is proposed to be responsible for anion transport. When ion-paired with a transportable anion it rotates across the barrier and releases the anion on the other side of the membrane. A similar process returns it to its original position. This proposed mechanism, based on the three-dimensional model, can account for the passive, electroneutral, anion exchange observed for band 3. Dianions can be transported through a similar mechanism with the additional participation of a histidine residue. Both residues are located on M10.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The ability to sense mechanical force is vital to all organisms to interact with and respond to stimuli in their environment. Mechanosensation is critical to many physiological functions such as the senses of hearing and touch in animals, gravitropism in plants and osmoregulation in bacteria. Of these processes, the best understood at the molecular level involve bacterial mechanosensitive channels. Under hypo-osmotic stress, bacteria are able to alleviate turgor pressure through mechanosensitive channels that gate directly in response to tension in the membrane lipid bilayer. A key participant in this response is the mechanosensitive channel of large conductance (MscL), a non-selective channel with a high conductance of ~3 nS that gates at tensions close to the membrane lytic tension.

It has been appreciated since the original discovery by C. Kung that the small subunit size (~130 to 160 residues) and the high conductance necessitate that MscL forms a homo-oligomeric channel. Over the past 20 years of study, the proposed oligomeric state of MscL has ranged from monomer to hexamer. Oligomeric state has been shown to vary between MscL homologues and is influenced by lipid/detergent environment. In this thesis, we report the creation of a chimera library to systematically survey the correlation between MscL sequence and oligomeric state to identify the sequence determinants of oligomeric state. Our results demonstrate that although there is no combination of sequences uniquely associated with a given oligomeric state (or mixture of oligomeric states), there are significant correlations. In the quest to characterize the oligomeric state of MscL, an exciting discovery was made about the dynamic nature of the MscL complex. We found that in detergent solution, under mild heating conditions (37 °C – 60 °C), subunits of MscL can exchange between complexes, and the dynamics of this process are sensitive to the protein sequence.

Extensive efforts were made to produce high diffraction quality crystals of MscL for the determination of a high resolution X-ray crystal structure of a full length channel. The surface entropy reduction strategy was applied to the design of S. aureus MscL variants and while the strategy appears to have improved the crystallizability of S. aureus MscL, unfortunately the diffraction qualities of these crystals were not significantly improved. MscL chimeras were also screened for crystallization in various solubilization detergents, but also failed to yield high quality crystals.

MscL is a fascinating protein and continues to serve as a model system for the study of the structural and functional properties of mechanosensitive channels. Further characterization of the MscL chimera library will offer more insight into the characteristics of the channel. Of particular interest are the functional characterization of the chimeras and the exploration of the physiological relevance of intercomplex subunit exchange.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The creation of novel enzyme activity is a great challenge to protein engineers, but nature has done so repeatedly throughout the process of natural selection. I begin by outlining the multitude of distinct reactions catalyzed by a single enzyme class, cytochrome P450 monooxygenases. I discuss the ability of cytochrome P450 to generate reactive intermediates capable of diverse reactivity, suggesting this enzyme can also be used to generate novel reactive intermediates in the form of metal-carbenoid and nitrenoid species. I then show that cytochrome P450 from Bacillus megaterium (P450BM3) and its isolated cofactor can catalyze metal-nitrenoid transfer in the form of intramolecular C–H bond amination. Mutations to the protein sequence can enhance the reactivity and selectivity of this transformation significantly beyond that of the free cofactor. Next, I demonstrate an intermolecular nitrene transfer reaction catalyzed by P450BM3 in the form of sulfide imidation. Understanding that sulfur heteroatoms are strong nucleophiles, I show that increasing the sulfide nucleophilicity through substituents on the aryl sulfide ring can dramatically increase reaction productivity. To explore engineering nitrenoid transfer in P450BM3, active site mutagenesis is employed to tune the regioselectivity intramolecular C–H amination catalysts. The solution of the crystal structure of a highly selective variant demonstrates that hydrophobic residues in the active site strongly modulate reactivity and regioselectivity. Finally, I use a similar strategy to develop P450-based catalysts for intermolecular olefin aziridination, demonstrating that active site mutagenesis can greatly enhance this nitrene transfer reaction. The resulting variant can catalyze intermolecular aziridination with more than 1000 total turnovers and enantioselectivity of up to 99% ee.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Recent studies showed that nonhuman primate TRIM5 alpha can efficiently block HIV-1 infection in human cell lines. It can also restrict other retroviruses, therefore, suggested as a general defender against retrovirus infection. Here, we present an evolutionary analysis of TRIM5 alpha in primates. Our results demonstrated that TRIM5a has been evolving rapidly in primates, which is likely caused by Darwinian positive selection. The SPRY domain of TRM5 alpha, which may be responsible for recognition of incoming viral capsids showed higher nonsynonymous/synonymous substitution ratios than the non-SPRY domain, indicating that the adaptive evolution of TRIM5a ill primates might be an innate strategy developed in defending retrovirus infection during primate evolution. In addition, the comparative protein sequence analysis suggested that the amino acid substitution pattern at a single site (344R/Q/P) located in the SPRY domain may explain the differences in Susceptibilities of HIV-1 infection in diverse primate species. (c) 2005 Elsevier B.V. All rights reserved.