952 resultados para Protein Structure
Resumo:
The cDNA of Chlamydomonas reinhardtii SE encoding hydrogenase (HydA2) was obtained from the total RNA of C reinhardtii SE by RT-PCR. The DNA of hydrogenase was amplified by PCR from the genomic DNA of C reinhardtii SE. The cDNA and DNA of hydrogenase were sequenced, respectively. The structure of hydrogenase gene was analyzed by biology software. The open reading frame predicts that the hydrogenase is composed of 3584 bp encoding 505 amino acids in length with a predicted M.W. of 53.69 kDa. Ten exons (including 1518 bp) and nine introns (including 2066 bp) have been found in the hydrogenase, and there were two potential N-glycosylate sites, eight protein kinase C phosphorylation site, eight casein kinase H phosphorylation site and one sulphorylation in the sequence. The theory pI was 6.15. Total number of negatively charged residues (Asp + Glu) and positively charged residues (Arg + Lys) were 55 and 61, respectively. (c) 2005 Elsevier Ltd. All rights reserved.
Resumo:
King, R. D. and Wise, P. H. and Clare, A. (2004) Confirmation of Data Mining Based Predictions of Protein Function. Bioinformatics 20(7), 1110-1118
Resumo:
Enot, D. and King, R. D. (2003) Application of Inductive Logic Programming to Structure-Based Drug Design. 7th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD '03). Springer LNAI 2838 p156-167
Resumo:
Chungui Lu, Olga A. Koroleva, John F. Farrar, Joe Gallagher, Chris J. Pollock, and A. Deri Tomos (2002). Rubisco small subunit, chlorophyll a/b-binding protein and sucrose : fructan-6-fructosyl transferase gene expression and sugar status in single barley leaf cells in situ. Cell type specificity and induction by light. Plant Physiology, 130 (3) pp.1335-1348 Sponsorship: BBSRC RAE2008
Resumo:
BACKGROUND:In the current climate of high-throughput computational biology, the inference of a protein's function from related measurements, such as protein-protein interaction relations, has become a canonical task. Most existing technologies pursue this task as a classification problem, on a term-by-term basis, for each term in a database, such as the Gene Ontology (GO) database, a popular rigorous vocabulary for biological functions. However, ontology structures are essentially hierarchies, with certain top to bottom annotation rules which protein function predictions should in principle follow. Currently, the most common approach to imposing these hierarchical constraints on network-based classifiers is through the use of transitive closure to predictions.RESULTS:We propose a probabilistic framework to integrate information in relational data, in the form of a protein-protein interaction network, and a hierarchically structured database of terms, in the form of the GO database, for the purpose of protein function prediction. At the heart of our framework is a factorization of local neighborhood information in the protein-protein interaction network across successive ancestral terms in the GO hierarchy. We introduce a classifier within this framework, with computationally efficient implementation, that produces GO-term predictions that naturally obey a hierarchical 'true-path' consistency from root to leaves, without the need for further post-processing.CONCLUSION:A cross-validation study, using data from the yeast Saccharomyces cerevisiae, shows our method offers substantial improvements over both standard 'guilt-by-association' (i.e., Nearest-Neighbor) and more refined Markov random field methods, whether in their original form or when post-processed to artificially impose 'true-path' consistency. Further analysis of the results indicates that these improvements are associated with increased predictive capabilities (i.e., increased positive predictive value), and that this increase is consistent uniformly with GO-term depth. Additional in silico validation on a collection of new annotations recently added to GO confirms the advantages suggested by the cross-validation study. Taken as a whole, our results show that a hierarchical approach to network-based protein function prediction, that exploits the ontological structure of protein annotation databases in a principled manner, can offer substantial advantages over the successive application of 'flat' network-based methods.
Resumo:
A number of lines of evidence suggest that cross-talk exists between the cellular signal transduction pathways involving tyrosine phosphorylation catalyzed by members of the pp60c-src kinase family and those mediated by guanine nucleotide regulatory proteins (G proteins). In this study, we explore the possibility that direct interactions between pp60c-src and G proteins may occur with functional consequences. Preparations of pp60c-src isolated by immunoprecipitation phosphorylate on tyrosine residues the purified G-protein alpha subunits (G alpha) of several heterotrimeric G proteins. Phosphorylation is highly dependent on G-protein conformation, and G alpha(GDP) uncomplexed by beta gamma subunits appears to be the preferred substrate. In functional studies, phosphorylation of stimulatory G alpha (G alpha s) modestly increases the rate of binding of guanosine 5'-[gamma-[35S]thio]triphosphate to Gs as well as the receptor-stimulated steady-state rate of GTP hydrolysis by Gs. Heterotrimeric G proteins may represent a previously unappreciated class of potential substrates for pp60c-src.
Resumo:
Light-dependent deactivation of rhodopsin as well as homologous desensitization of beta-adrenergic receptors involves receptor phosphorylation that is mediated by the highly specific protein kinases rhodopsin kinase (RK) and beta-adrenergic receptor kinase (beta ARK), respectively. We report here the cloning of a complementary DNA for RK. The deduced amino acid sequence shows a high degree of homology to beta ARK. In a phylogenetic tree constructed by comparing the catalytic domains of several protein kinases, RK and beta ARK are located on a branch close to, but separate from the cyclic nucleotide-dependent protein kinase and protein kinase C subfamilies. From the common structural features we conclude that both RK and beta ARK are members of a newly delineated gene family of guanine nucleotide-binding protein (G protein)-coupled receptor kinases that may function in diverse pathways to regulate the function of such receptors.
Resumo:
Proteins are essential components of cells and are crucial for catalyzing reactions, signaling, recognition, motility, recycling, and structural stability. This diversity of function suggests that nature is only scratching the surface of protein functional space. Protein function is determined by structure, which in turn is determined predominantly by amino acid sequence. Protein design aims to explore protein sequence and conformational space to design novel proteins with new or improved function. The vast number of possible protein sequences makes exploring the space a challenging problem.
Computational structure-based protein design (CSPD) allows for the rational design of proteins. Because of the large search space, CSPD methods must balance search accuracy and modeling simplifications. We have developed algorithms that allow for the accurate and efficient search of protein conformational space. Specifically, we focus on algorithms that maintain provability, account for protein flexibility, and use ensemble-based rankings. We present several novel algorithms for incorporating improved flexibility into CSPD with continuous rotamers. We applied these algorithms to two biomedically important design problems. We designed peptide inhibitors of the cystic fibrosis agonist CAL that were able to restore function of the vital cystic fibrosis protein CFTR. We also designed improved HIV antibodies and nanobodies to combat HIV infections.
Resumo:
X-ray crystallography is the predominant method for obtaining atomic-scale information about biological macromolecules. Despite the success of the technique, obtaining well diffracting crystals still critically limits going from protein to structure. In practice, the crystallization process proceeds through knowledge-informed empiricism. Better physico-chemical understanding remains elusive because of the large number of variables involved, hence little guidance is available to systematically identify solution conditions that promote crystallization. To help determine relationships between macromolecular properties and their crystallization propensity, we have trained statistical models on samples for 182 proteins supplied by the Northeast Structural Genomics consortium. Gaussian processes, which capture trends beyond the reach of linear statistical models, distinguish between two main physico-chemical mechanisms driving crystallization. One is characterized by low levels of side chain entropy and has been extensively reported in the literature. The other identifies specific electrostatic interactions not previously described in the crystallization context. Because evidence for two distinct mechanisms can be gleaned both from crystal contacts and from solution conditions leading to successful crystallization, the model offers future avenues for optimizing crystallization screens based on partial structural information. The availability of crystallization data coupled with structural outcomes analyzed through state-of-the-art statistical models may thus guide macromolecular crystallization toward a more rational basis.
Resumo:
Most biological reactions rely on interplay between binding and changes in both macromolecular structure and dynamics. Practical understanding of this interplay requires detection of critical intermediates and determination of their binding and conformational characteristics. However, many of these species are only transiently present and they have often been overlooked in mechanistic studies of reactions that couple binding to conformational change. We monitored the kinetics of ligand-induced conformational changes in a small protein using six different ligands. We analyzed the kinetic data to simultaneously determine both binding affinities for the conformational states and the rate constants of conformational change. The approach we used is sufficiently robust to determine the affinities of three conformational states and detect even modest differences in the protein's affinities for relatively similar ligands. Ligand binding favors higher-affinity conformational states by increasing forward conformational rate constants and/or decreasing reverse conformational rate constants. The amounts by which forward rate constants increase and reverse rate constants decrease are proportional to the ratio of affinities of the conformational states. We also show that both the affinity ratio and another parameter, which quantifies the changes in conformational rate constants upon ligand binding, are strong determinants of the mechanism (conformational selection and/or induced fit) of molecular recognition. Our results highlight the utility of analyzing the kinetics of conformational changes to determine affinities that cannot be determined from equilibrium experiments. Most importantly, they demonstrate an inextricable link between conformational dynamics and the binding affinities of conformational states.
Resumo:
BACKGROUND: Vesiculation is a ubiquitous secretion process of Gram-negative bacteria, where outer membrane vesicles (OMVs) are small spherical particles on the order of 50 to 250 nm composed of outer membrane (OM) and lumenal periplasmic content. Vesicle functions have been elucidated in some detail, showing their importance in virulence factor secretion, bacterial survival, and biofilm formation in pathogenesis. Furthermore, OMVs serve as an envelope stress response, protecting the secreting bacteria from internal protein misfolding stress, as well as external envelope stressors. Despite their important functional roles very little is known about the regulation and mechanism of vesicle production. Based on the envelope architecture and prior characterization of the hypervesiculation phenotypes for mutants lacking the lipoprotein, Lpp, which is involved in the covalent OM-peptidoglycan (PG) crosslinks, it is expected that an inverse relationship exists between OMV production and PG-crosslinked Lpp. RESULTS: In this study, we found that subtle modifications of PG remodeling and crosslinking modulate OMV production, inversely correlating with bound Lpp levels. However, this inverse relationship was not found in strains in which OMV production is driven by an increase in "periplasmic pressure" resulting from the accumulation of protein, PG fragments, or lipopolysaccharide. In addition, the characterization of an nlpA deletion in backgrounds lacking either Lpp- or OmpA-mediated envelope crosslinks demonstrated a novel role for NlpA in envelope architecture. CONCLUSIONS: From this work, we conclude that OMV production can be driven by distinct Lpp concentration-dependent and Lpp concentration-independent pathways.
Resumo:
Hoogsteen (HG) base pairs (bps) provide an alternative pairing geometry to Watson-Crick (WC) bps and can play unique functional roles in duplex DNA. Here, we use structural features unique to HG bps (syn purine base, HG hydrogen bonds and constricted C1'-C1' distance across the bp) to search for HG bps in X-ray structures of DNA duplexes in the Protein Data Bank. The survey identifies 106 A•T and 34 G•C HG bps in DNA duplexes, many of which are undocumented in the literature. It also uncovers HG-like bps with syn purines lacking HG hydrogen bonds or constricted C1'-C1' distances that are analogous to conformations that have been proposed to populate the WC-to-HG transition pathway. The survey reveals HG preferences similar to those observed for transient HG bps in solution by nuclear magnetic resonance, including stronger preferences for A•T versus G•C bps, TA versus GG steps, and also suggests enrichment at terminal ends with a preference for 5'-purine. HG bps induce small local perturbations in neighboring bps and, surprisingly, a small but significant degree of DNA bending (∼14°) directed toward the major groove. The survey provides insights into the preferences and structural consequences of HG bps in duplex DNA.
Resumo:
With increasing recognition of the roles RNA molecules and RNA/protein complexes play in an unexpected variety of biological processes, understanding of RNA structure-function relationships is of high current importance. To make clean biological interpretations from three-dimensional structures, it is imperative to have high-quality, accurate RNA crystal structures available, and the community has thoroughly embraced that goal. However, due to the many degrees of freedom inherent in RNA structure (especially for the backbone), it is a significant challenge to succeed in building accurate experimental models for RNA structures. This chapter describes the tools and techniques our research group and our collaborators have developed over the years to help RNA structural biologists both evaluate and achieve better accuracy. Expert analysis of large, high-resolution, quality-conscious RNA datasets provides the fundamental information that enables automated methods for robust and efficient error diagnosis in validating RNA structures at all resolutions. The even more crucial goal of correcting the diagnosed outliers has steadily developed toward highly effective, computationally based techniques. Automation enables solving complex issues in large RNA structures, but cannot circumvent the need for thoughtful examination of local details, and so we also provide some guidance for interpreting and acting on the results of current structure validation for RNA.
Resumo:
The attachment of a sugar to a hydrophobic polyisoprenyl carrier is the first step for all extracellular glycosylation processes. The enzymes that perform these reactions, polyisoprenyl-glycosyltransferases (PI-GTs) include dolichol phosphate mannose synthase (DPMS), which generates the mannose donor for glycosylation in the endoplasmic reticulum. Here we report the 3.0 Å resolution crystal structure of GtrB, a glucose-specific PI-GT from Synechocystis, showing a tetramer in which each protomer contributes two helices to a membrane-spanning bundle. The active site is 15 Å from the membrane, raising the question of how water-soluble and membrane-embedded substrates are brought into apposition for catalysis. A conserved juxtamembrane domain harbours disease mutations, which compromised activity in GtrB in vitro and in human DPM1 tested in zebrafish. We hypothesize a role of this domain in shielding the polyisoprenyl-phosphate for transport to the active site. Our results reveal the basis of PI-GT function, and provide a potential molecular explanation for DPM1-related disease.
Resumo:
The second round of the community-wide initiative Critical Assessment of automated Structure Determination of Proteins by NMR (CASD-NMR-2013) comprised ten blind target datasets, consisting of unprocessed spectral data, assigned chemical shift lists and unassigned NOESY peak and RDC lists, that were made available in both curated (i.e. manually refined) or un-curated (i.e. automatically generated) form. Ten structure calculation programs, using fully automated protocols only, generated a total of 164 three-dimensional structures (entries) for the ten targets, sometimes using both curated and un-curated lists to generate multiple entries for a single target. The accuracy of the entries could be established by comparing them to the corresponding manually solved structure of each target, which was not available at the time the data were provided. Across the entire data set, 71 % of all entries submitted achieved an accuracy relative to the reference NMR structure better than 1.5 Å. Methods based on NOESY peak lists achieved even better results with up to 100 % of the entries within the 1.5 Å threshold for some programs. However, some methods did not converge for some targets using un-curated NOESY peak lists. Over 90 % of the entries achieved an accuracy better than the more relaxed threshold of 2.5 Å that was used in the previous CASD-NMR-2010 round. Comparisons between entries generated with un-curated versus curated peaks show only marginal improvements for the latter in those cases where both calculations converged.