945 results for Protein structures
Abstract:
How large is the volume of sequence space that is compatible with a given protein structure? Starting from random sequences, low free energy sequences were generated for 108 protein backbone structures by using a Monte Carlo optimization procedure and a free energy function based primarily on Lennard–Jones packing interactions and the Lazaridis–Karplus implicit solvation model. Remarkably, in the designed sequences 51% of the core residues and 27% of all residues were identical to the amino acids in the corresponding positions in the native sequences. The lowest free energy sequences obtained for ensembles of native-like backbone structures were also similar to the native sequence. Furthermore, both the individual residue frequencies and the covariances between pairs of positions observed in the very large SH3 domain family were recapitulated in core sequences designed for SH3 domain structures. Taken together, these results suggest that the volume of sequence space optimal for a protein structure is surprisingly restricted to a region around the native sequence.
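The Monte Carlo search described above can be illustrated with a minimal sketch. It assumes a toy energy (`burial_energy`, a hypothetical stand-in that only penalizes polar residues at buried positions and hydrophobic residues at exposed ones), not the Lennard–Jones and Lazaridis–Karplus terms used in the study. The `identity` helper shows how a figure such as "51% of the core residues" could be computed once a designed and a native sequence are in hand.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
HYDROPHOBIC = set("AVLIMFWC")  # assumption: one simple binary classification


def burial_energy(seq, buried):
    """Hypothetical toy energy: +1 for each residue whose polarity mismatches its burial."""
    return sum((aa in HYDROPHOBIC) != is_buried for aa, is_buried in zip(seq, buried))


def metropolis_design(energy, length, steps=5000, temperature=0.5, seed=0):
    """Start from a random sequence and apply single-site Metropolis moves."""
    rng = random.Random(seed)
    seq = [rng.choice(AMINO_ACIDS) for _ in range(length)]
    e = energy(seq)
    best_seq, best_e = list(seq), e
    for _ in range(steps):
        pos = rng.randrange(length)
        old = seq[pos]
        seq[pos] = rng.choice(AMINO_ACIDS)
        e_new = energy(seq)
        # Accept downhill moves always, uphill moves with Boltzmann probability.
        if e_new <= e or rng.random() < math.exp(-(e_new - e) / temperature):
            e = e_new
            if e < best_e:
                best_seq, best_e = list(seq), e
        else:
            seq[pos] = old  # reject: restore the previous residue
    return "".join(best_seq), best_e


def identity(designed, native, positions=None):
    """Fraction of identical residues, optionally restricted to given positions (e.g. the core)."""
    idx = list(range(len(designed))) if positions is None else list(positions)
    return sum(designed[i] == native[i] for i in idx) / len(idx)
```

A real design run would replace `burial_energy` with a structure-based free energy and would typically also sample side-chain conformations; the acceptance rule is the only part this sketch shares with such a procedure.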
Abstract:
The question of whether proteins originate from random sequences of amino acids is addressed. A statistical analysis is performed in terms of blocked and random walk values formed by binary hydrophobic assignments of the amino acids along the protein chains. Theoretical expectations of these variables from random distributions of hydrophobicities are compared with those obtained from functional proteins. The results, which are based upon proteins in the SWISS-PROT database, convincingly show that the amino acid sequences in proteins differ from what is expected from random sequences in a statistically significant way. By performing Fourier transforms on the random walks, one obtains additional evidence for nonrandomness of the distributions. We have also analyzed results from a synthetic model containing only two amino acid types, hydrophobic and hydrophilic. With reasonable criteria on good folding properties in terms of thermodynamical and kinetic behavior, sequences that fold well are isolated. Performing the same statistical analysis on the sequences that fold well indicates similar deviations from randomness as for the functional proteins. The deviations from randomness can be interpreted as originating from anticorrelations in terms of an Ising spin model for the hydrophobicities. Our results, which differ from some previous investigations using other methods, might have an impact on how permissive the protein folding process is with respect to sequence specificity: only sequences with nonrandom hydrophobicity distributions fold well. Other distributions give rise to energy landscapes with poor folding properties and hence did not survive evolution.
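The binary hydrophobicity variables mentioned above can be sketched as follows. The hydrophobic residue set, the mean subtraction in the walk, and the non-overlapping block definition are all assumptions made for illustration; the abstract does not specify them.

```python
HYDROPHOBIC = set("AVLIMFWC")  # assumption: one of several binary classifications in use


def binary_hydrophobicity(seq):
    """Map each residue to +1 (hydrophobic) or -1 (hydrophilic), like Ising spins."""
    return [1 if aa in HYDROPHOBIC else -1 for aa in seq]


def random_walk(spins):
    """Cumulative sum of spins after removing the mean, so the walk returns to zero."""
    mean = sum(spins) / len(spins)
    walk, position = [], 0.0
    for s in spins:
        position += s - mean
        walk.append(position)
    return walk


def block_sums(spins, block_size):
    """Sum of spins in consecutive non-overlapping blocks of the given size."""
    last_start = len(spins) - block_size
    return [sum(spins[i:i + block_size]) for i in range(0, last_start + 1, block_size)]
```

For a random sequence the spread of the walk and of the block sums follows from simple binomial statistics, which is what makes these quantities convenient for comparing real proteins against a random null model.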
Abstract:
Recently, a large family of transducer proteins in the Archaeon Halobacterium salinarium was identified. On the basis of the comparison of the predicted structural domains of these transducers, three distinct subfamilies of transducers were proposed. Here we report isolation, complete gene sequences, and analysis of the encoded primary structures of transducer gene htrII, a member of family B, and its blue light receptor gene (sopII) of sensory rhodopsin II (SRII). The start codon ATG of the 714-bp sopII gene is one nucleotide beyond the termination codon TGA of the 2298-bp htrII gene. The deduced protein sequence of HtrII predicts a eubacterial chemotaxis transducer type with two hydrophobic membrane-spanning segments connecting sizable domains in the periplasm and cytoplasm. HtrII has a common feature with HtrI, the sensory rhodopsin I transducer; like HtrI, HtrII possesses a hydrophilic loop structure just after the second transmembrane segment. The C-terminal 299 residues (765 amino acid residues total) of HtrII show strong homology to the signaling and methylation domain of eubacterial transducer Tsr. The hydropathy plot of the primary structure of SRII indicates seven membrane-spanning alpha-helical segments, a characteristic feature of retinylidene proteins ("rhodopsins") from a widespread family of photoactive pigments. SRII shows high identity with SRI (42%), bacteriorhodopsin (BR) (32%), and halorhodopsin (24%). The crucial positions for retinal binding sites in these proteins are nearly identical, with the exception of Met-118 (numbering according to the mature BR sequence), which is replaced by Val in SRII. In BR, residues Asp-85 and Asp-96 are crucial in proton pumping. In SRII, the position corresponding to Asp-85 in BR is conserved, but the corresponding position of Asp-96 is replaced by an aromatic Tyr. 
Coexpression of the htrII and sopII genes restores SRII phototaxis to a mutant (Pho81) that contains a deletion in the htrI/sopI and insertion in htrII/sopII regions. This paper describes the first example that both HtrI and HtrII exist in the same halobacterial cell, confirming that different sensory rhodopsins SRI and SRII in the same organism have their own distinct transducers.
Abstract:
The structure of a multisubunit protein (immunoglobulin light chain) was solved in three crystal forms, differing only in the solvent of crystallization. The three structures were obtained at high ionic strength and low pH, high ionic strength and high pH, and low ionic strength and neutral pH. The three resulting "snapshots" of possible structures show that their variable-domain interactions differ, reflecting their stabilities under specific solvent conditions. In the three crystal forms, the variable domains had different rotational and translational relationships, whereas no alteration of the constant domains was found. The critical residues involved in the observed effect of the solvent are tryptophans and histidines located between the two variable domains in the dimeric structure. Tryptophan residues are commonly found in interfaces between proteins and their subunits, and histidines have been implicated in pH-dependent conformation changes. The quaternary structure observed for a multisubunit protein or protein complex in a crystal may be influenced by the interactions of the constituents within the molecule or complex and/or by crystal packing interactions. The comparison of buried surface areas and hydrogen bonds between the domains forming the molecule and between the molecules forming the crystals suggests that, for this system, the interactions within the molecule are most likely the determining factors.
Abstract:
The human immunodeficiency virus type 1 (HIV-1) matrix protein forms a structural shell associated with the inner viral membrane and performs other essential functions throughout the viral life cycle. The crystal structure of the HIV-1 matrix protein, determined at 2.3 Å resolution, reveals that individual matrix molecules are composed of five major helices capped by a three-stranded mixed beta-sheet. Unexpectedly, the protein assembles into a trimer in three different crystal lattices, burying 1880 Å² of accessible surface area at the trimer interfaces. Trimerization appears to create a large, bipartite membrane binding surface in which exposed basic residues could cooperate with the N-terminal myristoyl groups to anchor the protein on the acidic inner membrane of the virus.
Abstract:
Engineering site-specific amino acid substitutions into the protein-tyrosine phosphatase (PTPase) PTP1 and the dual-specific vaccinia H1-related phosphatase (VHR) has kinetically isolated the two chemical steps of the reaction and provided a rare opportunity for examining transition states and directly observing the phosphoenzyme intermediate. Changing serine to alanine in the active-site sequence motif HCXXGXXRS shifted the rate-limiting step from intermediate formation to intermediate hydrolysis. Using 31P NMR, the covalent thiol-phosphate intermediate was directly observed during catalytic turnover. The importance of the conserved aspartic acid (D92 in VHR and D181 in PTP1) in both chemical steps was established. Kinetic analysis of D92N and D181N mutants indicated that aspartic acid acts as a general acid by protonating the leaving-group phenolic oxygen. Structure-reactivity experiments with native and aspartate mutant enzymes established that proton transfer is concomitant with P-O cleavage, such that no charge develops on the phenolic oxygen. Steady- and presteady-state kinetics, as well as NMR analysis of the double mutant D92N/S131A (VHR), suggested that the conserved aspartic acid functions as a general base during intermediate hydrolysis. As a general base, aspartate would activate a water molecule to facilitate nucleophilic attack. The amino acids involved in transition-state stabilization for cysteinylphosphate hydrolysis were confirmed by the x-ray structure of the Yersinia PTPase complexed with vanadate, a transition-state mimic that binds covalently to the active-site cysteine. Consistent with the NMR, x-ray, biochemical, and kinetic data, a unifying mechanism for catalysis is proposed.
Abstract:
Patients with the M4Eo subtype of acute myeloid leukemia almost invariably are found to have an inversion of chromosome 16 in their leukemic cells, which results in a gene fusion between the transcription factor called core binding factor beta (CBFbeta) on 16q and a smooth muscle myosin heavy chain (SMMHC) gene on 16p. Subcellular localizations of the wild-type CBFbeta and the CBFbeta-SMMHC fusion protein were determined by immunofluorescence of NIH 3T3 cells that overexpress wild-type or fusion protein. Normal CBFbeta showed an unexpected perinuclear pattern consistent with primary localization in the Golgi complex. The CBFbeta-SMMHC fusion protein had a very different pattern. Nuclear staining included rod-like crystalline structures as long as 11 µm. The heterodimeric partner of CBFbeta, CBFalpha, formed part of this complex. Cytoplasmic staining included stress fibers that colocalized with actin, probably as a consequence of the myosin heavy chain component of the fusion protein. Deletion of different regions of the CBFbeta portion of the fusion protein showed that binding to CBFalpha was not required for nuclear translocation. However, deletion of parts of the SMMHC domain of the fusion protein involved in myosin-mediated filament formation resulted in proteins that did not form rod-like structures. These observations confirm previous indirect evidence that the CBFbeta-SMMHC fusion protein is capable of forming macromolecular nuclear aggregates and suggest possible models for the mechanism of leukemic transformation.
Abstract:
Background: The structure of proteins may change as a result of the inherent flexibility of some protein regions. We develop and explore probabilistic machine learning methods for predicting a continuum secondary structure, i.e. assigning probabilities to the conformational states of a residue. We train our methods using data derived from high-quality NMR models. Results: Several probabilistic models not only successfully estimate the continuum secondary structure, but also provide a categorical output on par with models directly trained on categorical data. Importantly, models trained on the continuum secondary structure are also better than their categorical counterparts at identifying the conformational state for structurally ambivalent residues. Conclusion: Cascaded probabilistic neural networks trained on the continuum secondary structure exhibit better accuracy in structurally ambivalent regions of proteins, while sustaining an overall classification accuracy on par with standard, categorical prediction methods.
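As an illustration of what a continuum secondary structure target might look like, the sketch below turns per-model three-state assignments from an NMR ensemble into per-residue probabilities. The three-letter alphabet (H, E, C) and the simple frequency estimate are assumptions; the abstract does not state how the training data were derived from the NMR models.

```python
from collections import Counter

STATES = "HEC"  # assumption: helix, strand, coil


def continuum_secondary_structure(assignments):
    """Per-residue state probabilities from one assignment string per NMR model."""
    n_models = len(assignments)
    length = len(assignments[0])
    if any(len(a) != length for a in assignments):
        raise ValueError("all models must cover the same residues")
    profile = []
    for i in range(length):
        counts = Counter(a[i] for a in assignments)
        profile.append({state: counts[state] / n_models for state in STATES})
    return profile


def categorical(profile):
    """Collapse a probability profile to the most likely state per residue."""
    return "".join(max(STATES, key=lambda s: p[s]) for p in profile)


models = ["HHC", "HEC", "HCC", "HEC"]  # hypothetical ensemble of four models
profile = continuum_secondary_structure(models)
```

In this toy ensemble the middle residue is structurally ambivalent: its probabilities are spread over all three states, which a single categorical label would hide.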
Abstract:
Infection of plant cells by potyviruses induces the formation of cytoplasmic inclusions ranging in size from 200 to 1000 nm. To determine if the ability to form these ordered, insoluble structures is intrinsic to the potyviral cytoplasmic inclusion protein, we have expressed the cytoplasmic inclusion protein from Potato virus Y in tobacco under the control of the chrysanthemum ribulose-1,5-bisphosphate carboxylase small subunit promoter, a highly active, green tissue promoter. No cytoplasmic inclusions were observed in the leaves of transgenic tobacco using transmission electron microscopy, despite being able to clearly visualize these inclusions in Potato virus Y infected tobacco leaves under the same conditions. However, we did observe a wide range of tissue and sub-cellular abnormalities associated with the expression of the Potato virus Y cytoplasmic inclusion protein. These changes included the disruption of normal cell morphology and organization in leaves, mitochondrial and chloroplast internal reorganization, and the formation of atypical lipid accumulations. Despite these significant structural changes, however, transgenic tobacco plants were viable and the results are discussed in the context of potyviral cytoplasmic inclusion protein function.
Abstract:
Genomic and proteomic analyses have attracted a great deal of interest in biological research in recent years. Many methods have been applied to discover useful information contained in the enormous databases of genomic sequences and amino acid sequences. The results of these investigations inspire further research in biological fields in return. These biological sequences, which may be considered as multiscale sequences, have some specific features which need further efforts to characterise using more refined methods. This project aims to study some of these biological challenges with multiscale analysis methods and a stochastic modelling approach. The first part of the thesis aims to cluster some unknown proteins, and classify their families as well as their structural classes. A development in proteomic analysis is concerned with the determination of protein functions. The first step in this development is to classify proteins and predict their families. This motivates us to study some unknown proteins from specific families, and to cluster them into families and structural classes. We select a large number of proteins from the same families or superfamilies, and link them to simulate some unknown large proteins from these families. We use multifractal analysis and the wavelet method to capture the characteristics of these linked proteins. The simulation results show that the method is valid for the classification of large proteins. The second part of the thesis aims to explore the relationship of proteins based on a layered comparison with their components. Many methods are based on homology of proteins because the resemblance at the protein sequence level normally indicates the similarity of functions and structures. However, some proteins may have similar functions with low sequence identity. We consider protein sequences at a detailed level to investigate the problem of comparison of proteins.
The comparison is based on the empirical mode decomposition (EMD), and protein sequences are detected with the intrinsic mode functions. A measure of similarity is introduced with a new cross-correlation formula. The similarity results show that the EMD is useful for detection of functional relationships of proteins. The third part of the thesis aims to investigate the transcriptional regulatory network of the yeast cell cycle via stochastic differential equations. As the investigation of genome-wide gene expression has become a focus in genomic analysis, researchers have tried to understand the mechanisms of the yeast genome for many years. How cells control gene expression still needs further investigation. We use a stochastic differential equation to model the expression profile of a target gene. We modify the model with a Gaussian membership function. For each target gene, a transcriptional rate is obtained, and the estimated transcriptional rate is also calculated with the information from five possible transcriptional regulators. Some regulators of these target genes are verified with the related references. With these results, we construct a transcriptional regulatory network for the genes from the yeast Saccharomyces cerevisiae. The construction of a transcriptional regulatory network is useful for detecting more mechanisms of the yeast cell cycle.
Abstract:
Using six kinds of lattice types (4×4, 5×5, and 6×6 square lattices; 3×3×3 cubic lattice; and 2+3+4+3+2 and 4+5+6+5+4 triangular lattices), three different size alphabets (HP, HNUP, and 20 letters), and two energy functions, the designability of protein structures is calculated based on random samplings of structures and common biased sampling (CBS) of protein sequence space. Then three quantities, stability (average energy gap), foldability, and partnum of the structure, which are defined to elucidate the designability, are calculated. The authors find that whatever the type of lattice, alphabet size, and energy function used, there will be an emergence of highly designable (preferred) structures. For all cases considered, the local interactions reduce degeneracy and make the designability higher. The designability is sensitive to the lattice type, alphabet size, energy function, and sampling method of the sequence space. Compared with the random sampling method, both the CBS and the Metropolis Monte Carlo sampling methods make the designability higher. The correlation coefficients between the designability, stability, and foldability are mostly larger than 0.5, which demonstrates that they are strongly correlated. But the correlation between the designability and the partnum is not so strong because the partnum is independent of the energy. The results are useful in practical applications of the designability principle, such as predicting the protein tertiary structure.
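A minimal sketch of a designability calculation on a small square lattice is given below. It assumes the two-letter HP alphabet, an energy of -1 per non-bonded H-H contact, exhaustive enumeration of compact structures, and uniform random sampling of sequences; the two energy functions and the CBS scheme used in the study are not reproduced here. Designability is taken as the number of sampled sequences for which a structure is the unique lowest-energy state.

```python
import random
from collections import Counter


def compact_paths(n):
    """All directed self-avoiding walks that visit every site of an n x n lattice."""
    paths = []

    def extend(path, visited):
        if len(path) == n * n:
            paths.append(tuple(path))
            return
        x, y = path[-1]
        for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (x + dx, y + dy)
            if 0 <= nxt[0] < n and 0 <= nxt[1] < n and nxt not in visited:
                visited.add(nxt)
                path.append(nxt)
                extend(path, visited)
                path.pop()
                visited.remove(nxt)

    for x in range(n):
        for y in range(n):
            extend([(x, y)], {(x, y)})
    return paths


def contact_map(path):
    """Pairs of chain indices that are lattice neighbours but not bonded."""
    index = {site: i for i, site in enumerate(path)}
    contacts = set()
    for (x, y), i in index.items():
        for neighbour in ((x + 1, y), (x, y + 1)):
            j = index.get(neighbour)
            if j is not None and abs(i - j) > 1:
                contacts.add((min(i, j), max(i, j)))
    return frozenset(contacts)


def designability(n=3, samples=200, seed=0):
    """Count, per structure, the sampled HP sequences with that unique ground state."""
    # Walks related by lattice symmetry share a contact map and count as one structure.
    structures = sorted({contact_map(p) for p in compact_paths(n)}, key=sorted)
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(samples):
        seq = [rng.random() < 0.5 for _ in range(n * n)]  # True means H
        energies = [-sum(seq[i] and seq[j] for i, j in s) for s in structures]
        lowest = min(energies)
        if energies.count(lowest) == 1:  # skip sequences with degenerate ground states
            counts[energies.index(lowest)] += 1
    return structures, counts
```

Calling `designability(n=4, samples=2000)` applies the same procedure to the 4×4 lattice named in the abstract. With only H-H contacts counted, many sequences have degenerate ground states and are skipped, which is one reason richer alphabets and energy functions change the outcome.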
Abstract:
Epidermal growth factor (EGF) activation of the EGF receptor (EGFR) is an important mediator of cell migration, and aberrant signaling via this system promotes a number of malignancies including ovarian cancer. We have identified the cell surface glycoprotein CDCP1 as a key regulator of EGF/EGFR-induced cell migration. We show that signaling via EGF/EGFR induces migration of ovarian cancer Caov3 and OVCA420 cells with concomitant up-regulation of CDCP1 mRNA and protein. Consistent with a role in cell migration CDCP1 relocates from cell-cell junctions to punctate structures on filopodia after activation of EGFR. Significantly, disruption of CDCP1 either by silencing or the use of a function blocking antibody efficiently reduces EGF/EGFR-induced cell migration of Caov3 and OVCA420 cells. We also show that up-regulation of CDCP1 is inhibited by pharmacological agents blocking ERK but not Src signaling, indicating that the RAS/RAF/MEK/ERK pathway is required downstream of EGF/EGFR to induce increased expression of CDCP1. Our immunohistochemical analysis of benign, primary, and metastatic serous epithelial ovarian tumors demonstrates that CDCP1 is expressed during progression of this cancer. These data highlight a novel role for CDCP1 in EGF/EGFR-induced cell migration and indicate that targeting of CDCP1 may be a rational approach to inhibit progression of cancers driven by EGFR signaling including those resistant to anti-EGFR drugs because of activating mutations in the RAS/RAF/MEK/ERK pathway.
Abstract:
Ubiquitination involves the attachment of ubiquitin (Ub) to lysine residues on substrate proteins or itself, which can result in protein monoubiquitination or polyubiquitination. Polyubiquitination through different lysines (seven) or the N-terminus of Ub can generate different protein-Ub structures. These include monoubiquitinated proteins, polyubiquitinated proteins with homotypic chains through a particular lysine on Ub or mixed polyubiquitin chains generated by polymerization through different Ub lysines. The ability of the ubiquitination pathway to generate different protein-Ub structures gives this pathway the versatility to target proteins to different fates. Protein ubiquitination is catalyzed by Ub-conjugating and Ub-ligase enzymes, with different combinations of these enzymes specifying the type of Ub modification on protein substrates. How Ub-conjugating and Ub-ligase enzymes generate this structural diversity is not clearly understood. In the current review, we discuss mechanisms utilized by the Ub-conjugating and Ub-ligase enzymes to generate structural diversity during protein ubiquitination, with a focus on recent mechanistic insights into protein monoubiquitination and polyubiquitination.
Abstract:
Transport between compartments of eukaryotic cells is mediated by coated vesicles. The archetypal protein coats COPI, COPII, and clathrin are conserved from yeast to human. Structural studies of COPII and clathrin coats assembled in vitro without membranes suggest that coat components assemble regular cages with the same set of interactions between components. Detailed three-dimensional structures of coated membrane vesicles have not been obtained. Here, we solved the structures of individual COPI-coated membrane vesicles by cryoelectron tomography and subtomogram averaging of in vitro reconstituted budding reactions. The coat protein complex, coatomer, was observed to adopt alternative conformations to change the number of other coatomers with which it interacts and to form vesicles with variable sizes and shapes. This represents a fundamentally different basis for vesicle coat assembly.