923 resultados para secondary structure analysis
Resumo:
Membrane proteins are a large and important class of proteins. They are responsible for several of the key functions in a living cell, e.g. transport of nutrients and ions, cell-cell signaling, and cell-cell adhesion. Despite their importance it has not been possible to study their structure and organization in much detail because of the difficulty to obtain 3D structures. In this thesis theoretical studies of membrane protein sequences and structures have been carried out by analyzing existing experimental data. The data comes from several sources including sequence databases, genome sequencing projects, and 3D structures. Prediction of the membrane spanning regions by hydrophobicity analysis is a key technique used in several of the studies. A novel method for this is also presented and compared to other methods. The primary questions addressed in the thesis are: What properties are common to all membrane proteins? What is the overall architecture of a membrane protein? What properties govern the integration into the membrane? How many membrane proteins are there and how are they distributed in different organisms? Several of the findings have now been backed up by experiments. An analysis of the large family of G-protein coupled receptors pinpoints differences in length and amino acid composition of loops between proteins with and without a signal peptide and also differences between extra- and intracellular loops. Known 3D structures of membrane proteins have been studied in terms of hydrophobicity, distribution of secondary structure and amino acid types, position specific residue variability, and differences between loops and membrane spanning regions. An analysis of several fully and partially sequenced genomes from eukaryotes, prokaryotes, and archaea has been carried out. Several differences in the membrane protein content between organisms were found, the most important being the total number of membrane proteins and the distribution of membrane proteins with a given number of transmembrane segments. Of the properties that were found to be similar in all organisms, the most obvious is the bias in the distribution of positive charges between the extra- and intracellular loops. Finally, an analysis of homologues to membrane proteins with known topology uncovered two related, multi-spanning proteins with opposite predicted orientations. The predicted topologies were verified experimentally, providing a first example of "divergent topology evolution".
Resumo:
The study of protein fold is a central problem in life science, leading in the last years to several attempts for improving our knowledge of the protein structures. In this thesis this challenging problem is tackled by means of molecular dynamics, chirality and NMR studies. In the last decades, many algorithms were designed for the protein secondary structure assignment, which reveals the local protein shape adopted by segments of amino acids. In this regard, the use of local chirality for the protein secondary structure assignment was demonstreted, trying to correlate as well the propensity of a given amino acid for a particular secondary structure. The protein fold can be studied also by Nuclear Magnetic Resonance (NMR) investigations, finding the average structure adopted from a protein. In this context, the effect of Residual Dipolar Couplings (RDCs) in the structure refinement was shown, revealing a strong improvement of structure resolution. A wide extent of this thesis is devoted to the study of avian prion protein. Prion protein is the main responsible of a vast class of neurodegenerative diseases, known as Bovine Spongiform Encephalopathy (BSE), present in mammals, but not in avian species and it is caused from the conversion of cellular prion protein to the pathogenic misfolded isoform, accumulating in the brain in form of amiloyd plaques. In particular, the N-terminal region, namely the initial part of the protein, is quite different between mammal and avian species but both of them contain multimeric sequences called Repeats, octameric in mammals and hexameric in avians. However, such repeat regions show differences in the contained amino acids, in particular only avian hexarepeats contain tyrosine residues. The chirality analysis of avian prion protein configurations obtained from molecular dynamics reveals a high stiffness of the avian protein, which tends to preserve its regular secondary structure. This is due to the presence of prolines, histidines and especially tyrosines, which form a hydrogen bond network in the hexarepeat region, only possible in the avian protein, and thus probably hampering the aggregation.
Resumo:
Chemists have long sought to extrapolate the power of biological catalysis and recognition to synthetic systems. These efforts have focused largely on low molecular weight catalysts and receptors; however, biological systems themselves rely almost exclusively on polymers, proteins and RNA, to perform complex chemical functions. Proteins and RNA are unique in their ability to adopt compact, well-ordered conformations, and specific folding provides precise spatial orientation of the functional groups that comprise the “active site”. These features suggest that identification of new polymer backbones with discrete and predictable folding propensities (“foldamers”) will provide a basis for design of molecular machines with unique capabilities. The foldamer approach complements current efforts to design unnatural properties into polypeptides and polynucleotides. The aim of this thesis is the synthesis and conformational studies of new classes of foldamers, using a peptidomimetic approach. Moreover their attitude to be utilized as ionophores, catalysts, and nanobiomaterials were analyzed in solution and in the solid state. This thesis is divided in thematically chapters that are reported below. It begins with a very general introduction (page 4) which is useful, but not strictly necessary, to the expert reader. It is worth mentioning that paragraph I.3 (page 22) is the starting point of this work and paragraph I.5 (page 32) isrequired to better understand the results of chapters 4 and 5. In chapter 1 (page 39) is reported the synthesis and conformational analysis of a novel class of foldamers containing (S)-β3-homophenylglycine [(S)-β3-hPhg] and D- 4-carboxy-oxazolidin-2-one (D-Oxd) residues in alternate order is reported. The experimental conformational analysis performed in solution by IR, 1HNMR, and CD spectroscopy unambiguously proved that these oligomers fold into ordered structures with increasing sequence length. Theoretical calculations employing ab initio MO theory suggest a helix with 11-membered hydrogenbonded rings as the preferred secondary structure type. The novel structures enrich the field of peptidic foldamers and might be useful in the mimicry of native peptides. In chapter 2 cyclo-(L-Ala-D-Oxd)3 and cyclo-(L-Ala-DOxd) 4 were prepared in the liquid phase with good overall yields and were utilized for bivalent ions chelation (Ca2+, Mg2+, Cu2+, Zn2+ and Hg2+); their chelation skill was analyzed with ESI-MS, CD and 1HNMR techniques and the best results were obtained with cyclo-(L-Ala-D-Oxd)3 and Mg2+ or Ca2+. Chapter 3 describes an application of oligopeptides as catalysts for aldol reactions. Paragraph 3.1 concerns the use of prolinamides as catalysts of the cross aldol addition of hydroxyacetone to aromatic aldeydes, whereas paragraphs 3.2 and 3.3 are about the catalyzed aldol addition of acetone to isatins. By means of DFT and AIM calculations, the steric and stereoelectronic effects that control the enantioselectivity in the cross-aldol addition of acetone to isatin catalysed by L-proline have been studied, also in the presence of small quantities of water. In chapter 4 is reported the synthesis and the analysis of a new fiber-like material, obtained from the selfaggregation of the dipeptide Boc-L-Phe-D-Oxd-OBn, which spontaneously forms uniform fibers consisting of parallel infinite linear chains arising from singleintermolecular N-H···O=C hydrogen bonds. This is the absolute borderline case of a parallel β-sheet structure. Longer oligomers of the same series with general formula Boc-(L-Phe-D-Oxd)n-OBn (where n = 2-5), are described in chapter 5. Their properties in solution and in the solid state were analyzed, in correlation with their attitude to form intramolecular hydrogen bond. In chapter 6 is reported the synthesis of imidazolidin-2- one-4-carboxylate and (tetrahydro)-pyrimidin-2-one-5- carboxylate, via an efficient modification of the Hofmann rearrangement. The reaction affords the desired compounds from protected asparagine or glutamine in good to high yield, using PhI(OAc)2 as source of iodine(III).
Resumo:
Helicobacter pylori infection is frequently acquired during childhood. This microorganism is known to cause gastritis, and duodenal ulcer in pediatric patients, however most children remain completely asymptomatic to the infection. Currently there is no consensus in favor of treatment of H. pylori infection in asymptomatic children. The firstline of treatment for this population is triple medication therapy including two antibacterial agents and one proton pump inhibitor for a 2 week duration course. Decreased eradication rate of less than 75% has been documented with the use of this first-line therapy but novel tinidazole-containing quadruple sequential therapies seem worth investigating. None of the previous studies on such therapy has been done in the United States of America. As part of an iron deficiency anemia study in asymptomatic H. pylori infected children of El Paso, Texas, we conducted a secondary data analysis of study data collected in this trial to assess the effectiveness of this tinidazole-containing sequential quadruple therapy compared to placebo on clearing the infection. Subjects were selected from a group of asymptomatic children identified through household visits to 11,365 randomly selected dwelling units. After obtaining parental consent and child assent a total of 1,821 children 3-10 years of age were screened and 235 were positive to a novel urine immunoglobulin class G antibodies test for H. pylori infection and confirmed as infected using a 13C urea breath test, using a hydrolysis urea rate >10 μg/min as cut-off value. Out of those, 119 study subjects had a complete physical exam and baseline blood work and were randomly allocated to four groups, two of which received active H. pylori eradication medication alone or in combination with iron, while the other two received iron only or placebo only. Follow up visits to their houses were done to assess compliance and occurrence of adverse events and at 45+ days post-treatment, a second urea breath test was performed to assess their infection status. The effectiveness was primarily assessed on intent to treat basis (i.e., according to their treatment allocation), and the proportion of those who cleared their infection using a cut-off value >10 μg/min of for urea hydrolysis rate, was the primary outcome. Also we conducted analysis on a per-protocol basis and according to the cytotoxin associated gene A product of the H. pylori infection status. Also we compared the rate of adverse events across the two arms. On intent-to-treat and per-protocol analyses, 44.3% and 52.9%, respectively, of the children receiving the novel quadruple sequential eradication cleared their infection compared to 12.2% and 15.4% in the arms receiving iron or placebo only, respectively. Such differences were statistically significant (p<0.001). The study medications were well accepted and safe. In conclusion, we found in this study population, of mostly asymptomatically H. pylori infected children, living in the US along the border with Mexico, that the quadruple sequential eradication therapy cleared the infection in only half of the children receiving this treatment. Research is needed to assess the antimicrobial susceptibility of the strains of H. pylori infecting this population to formulate more effective therapies. ^
Resumo:
The overall folded (global) structure of mRNA may be critical to translation and turnover control mechanisms, but it has received little experimental attention. Presented here is a comparative analysis of the basic features of the global secondary structure of a synthetic mRNA and the same intracellular eukaryotic mRNA by dimethyl sulfate (DMS) structure probing. Synthetic MFA2 mRNA of Saccharomyces cerevisiae first was examined by using both enzymes and chemical reagents to determine single-stranded and hybridized regions; RNAs with and without a poly(A) tail were compared. A folding pattern was obtained with the aid of the mfold program package that identified the model that best satisfied the probing data. A long-range structural interaction involving the 5′ and 3′ untranslated regions and causing a juxtaposition of the ends of the RNA, was examined further by a useful technique involving oligo(dT)-cellulose chromatography and antisense oligonucleotides. DMS chemical probing of A and C nucleotides of intracellular MFA2 mRNA was then done. The modification data support a very similar intracellular structure. When low reactivity of A and C residues is found in the synthetic RNA, ≈70% of the same sites are relatively more resistant to DMS modification in vivo. A slightly higher sensitivity to DMS is found in vivo for some of the A and C nucleotides predicted to be hybridized from the synthetic structural model. With this small mRNA, the translation process and mRNA-binding proteins do not block DMS modifications, and all A and C nucleotides are modified the same or more strongly than with the synthetic RNA.
Resumo:
I attempt to reconcile apparently conflicting factors and mechanisms that have been proposed to determine the rate constant for two-state folding of small proteins, on the basis of general features of the structures of transition states. Φ-Value analysis implies a transition state for folding that resembles an expanded and distorted native structure, which is built around an extended nucleus. The nucleus is composed predominantly of elements of partly or well-formed native secondary structure that are stabilized by local and long-range tertiary interactions. These long-range interactions give rise to connecting loops, frequently containing the native loops that are poorly structured. I derive an equation that relates differences in the contact order of a protein to changes in the length of linking loops, which, in turn, is directly related to the unfavorable free energy of the loops in the transition state. Kinetic data on loop extension mutants of CI2 and α-spectrin SH3 domain fit the equation qualitatively. The rate of folding depends primarily on the interactions that directly stabilize the nucleus, especially those in native-like secondary structure and those resulting from the entropy loss from the connecting loops, which vary with contact order. This partitioning of energy accounts for the success of some algorithms that predict folding rates, because they use these principles either explicitly or implicitly. The extended nucleus model thus unifies the observations of rate depending on both stability and topology.
Resumo:
αB-crystallin, a member of the small heat shock protein family, possesses chaperone-like function. Recently, it has been shown that a missense mutation in αB-crystallin, R120G, is genetically linked to a desmin-related myopathy as well as to cataracts [Vicart, P., Caron, A., Guicheney, P., Li, A., Prevost, M.-C., Faure, A., Chateau, D., Chapon, F., Tome, F., Dupret, J.-M., et al. (1998) Nat. Genet. 20, 92–95]. By using α-lactalbumin, alcohol dehydrogenase, and insulin as target proteins, in vitro assays indicated that R120G αB-crystallin had reduced or completely lost chaperone-like function. The addition of R120G αB-crystallin to unfolding α-lactalbumin enhanced the kinetics and extent of its aggregation. R120G αB-crystallin became entangled with unfolding α-lactalbumin and was a major portion of the resulting insoluble pellet. Similarly, incubation of R120G αB-crystallin with alcohol dehydrogenase and insulin also resulted in the presence of R120G αB-crystallin in the insoluble pellets. Far and near UV CD indicate that R120G αB-crystallin has decreased β-sheet secondary structure and an altered aromatic residue environment compared with wild-type αB-crystallin. The apparent molecular mass of R120G αB-crystallin, as determined by gel filtration chromatography, is 1.4 MDa, which is more than twice the molecular mass of wild-type αB-crystallin (650 kDa). Images obtained from cryoelectron microscopy indicate that R120G αB-crystallin possesses an irregular quaternary structure with an absence of a clear central cavity. The results of this study show, through biochemical analysis, that an altered structure and defective chaperone-like function of αB-crystallin are associated with a point mutation that leads to a desmin-related myopathy and cataracts.
Resumo:
The prion diseases seem to be caused by a conformational change of the prion protein (PrP) from the benign cellular form PrPC to the infectious scrapie form PrPSc; thus, detailed information about PrP structure may provide essential insights into the mechanism by which these diseases develop. In this study, the secondary structure of the recombinant Syrian hamster PrP of residues 29–231 [PrP(29–231)] is investigated by multidimensional heteronuclear NMR. Chemical shift index analysis and nuclear Overhauser effect data show that PrP(29–231) contains three helices and possibly one short β-strand. Most striking is the random-coil nature of chemical shifts for residues 30–124 in the full-length PrP. Although the secondary structure elements are similar to those found in mouse PrP fragment PrP(121–231), the secondary structure boundaries of PrP(29–231) are different from those in mouse PrP(121–231) but similar to those found in the structure of Syrian hamster PrP(90–231). Comparison of resonance assignments of PrP(29–231) and PrP(90–231) indicates that there may be transient interactions between the additional residues and the structured core. Backbone dynamics studies done by using the heteronuclear [1H]-15N nuclear Overhauser effect indicate that almost half of PrP(29–231), residues 29–124, is highly flexible. This plastic region could feature in the conversion of PrPC to PrPSc by template-assisted formation of β-structure.
Resumo:
A detailed computational analysis of 32 protein–RNA complexes is presented. A number of physical and chemical properties of the intermolecular interfaces are calculated and compared with those observed in protein–double-stranded DNA and protein–single-stranded DNA complexes. The interface properties of the protein–RNA complexes reveal the diverse nature of the binding sites. van der Waals contacts played a more prevalent role than hydrogen bond contacts, and preferential binding to guanine and uracil was observed. The positively charged residue, arginine, and the single aromatic residues, phenylalanine and tyrosine, all played key roles in the RNA binding sites. A comparison between protein–RNA and protein–DNA complexes showed that whilst base and backbone contacts (both hydrogen bonding and van der Waals) were observed with equal frequency in the protein–RNA complexes, backbone contacts were more dominant in the protein–DNA complexes. Although similar modes of secondary structure interactions have been observed in RNA and DNA binding proteins, the current analysis emphasises the differences that exist between the two types of nucleic acid binding protein at the atomic contact level.
Resumo:
PCR amplification of template DNAs extracted from mixed, naturally occurring microbial populations, using oligonucleotide primers complementary to highly conserved sequences, was used to obtain a large collection of diverse RNase P RNA-encoding genes. An alignment of these sequences was used in a comparative analysis of RNase P RNA secondary and tertiary structure. The new sequences confirm the secondary structure model based on sequences from cultivated organisms (with minor alterations in helices P12 and P18), providing additional support for nearly every base pair. Analysis of sequence covariation using the entire RNase P RNA data set reveals elements of tertiary structure in the RNA; the third nucleotides (underlined) of the GNRA tetraloops L14 and L18 are seen to interact with adjacent Watson-Crick base pairs in helix P8, forming A:G/C or G:A/U base triples. These experiments demonstrate one way in which the enormous diversity of natural microbial populations can be used to elucidate molecular structure through comparative analysis.
Resumo:
The PotE protein is a putrescine-ornithine antiporter found in many gram-negative bacteria. It is a member of the APA family of transporters and has 12 predicted alpha-helical transmembrane spanning segments (TMS). While the substrate binding site has previously been mapped to a region near the surface of the cytoplasmic lipid layer, no structural feature within the periplasmic domains of PotE have been shown to be important for function. We examined the role of the only large outer loop, situated between transmembrane spanning segment 7 and 8, in putrescine uptake. Deletion of the highly conserved amino acids in the region closest to transmembrane spanning segment 7 produced a protein with little activity. Glycine-scanning mutagenesis of this region showed that Val(249) and Leu(254) were required for optimal transporter function. The V249G mutant transported putrescine at a lower maximal rate compared to wild-type (WT) but with the same substrate binding affinity. In contrast, the L254G mutant had a higher substrate affinity. A series of Val(249) mutants indicated that the hydrophobicity of this residue, which is located at or near the membrane surface, is important for PotE function. Secondary structure predictions of the large outer loop indicated the presence of a hydrophobic alpha-helix in the centre with a hydrophobic region at each end suggesting that the loop was not entirely exposed to the aqueous periplasmic space. The study shows that loop 7-8 is important for PotE function, possibly by forming a re-entrant loop in the channel of the transporter. (C) 2003 Elsevier Ltd. All rights reserved.
Resumo:
Background: Protein tertiary structure can be partly characterized via each amino acid's contact number measuring how residues are spatially arranged. The contact number of a residue in a folded protein is a measure of its exposure to the local environment, and is defined as the number of C-beta atoms in other residues within a sphere around the C-beta atom of the residue of interest. Contact number is partly conserved between protein folds and thus is useful for protein fold and structure prediction. In turn, each residue's contact number can be partially predicted from primary amino acid sequence, assisting tertiary fold analysis from sequence data. In this study, we provide a more accurate contact number prediction method from protein primary sequence. Results: We predict contact number from protein sequence using a novel support vector regression algorithm. Using protein local sequences with multiple sequence alignments (PSI-BLAST profiles), we demonstrate a correlation coefficient between predicted and observed contact numbers of 0.70, which outperforms previously achieved accuracies. Including additional information about sequence weight and amino acid composition further improves prediction accuracies significantly with the correlation coefficient reaching 0.73. If residues are classified as being either contacted or non-contacted, the prediction accuracies are all greater than 77%, regardless of the choice of classification thresholds. Conclusion: The successful application of support vector regression to the prediction of protein contact number reported here, together with previous applications of this approach to the prediction of protein accessible surface area and B-factor profile, suggests that a support vector regression approach may be very useful for determining the structure-function relation between primary sequence and higher order consecutive protein structural and functional properties.
Resumo:
Selection of machine learning techniques requires a certain sensitivity to the requirements of the problem. In particular, the problem can be made more tractable by deliberately using algorithms that are biased toward solutions of the requisite kind. In this paper, we argue that recurrent neural networks have a natural bias toward a problem domain of which biological sequence analysis tasks are a subset. We use experiments with synthetic data to illustrate this bias. We then demonstrate that this bias can be exploitable using a data set of protein sequences containing several classes of subcellular localization targeting peptides. The results show that, compared with feed forward, recurrent neural networks will generally perform better on sequence analysis tasks. Furthermore, as the patterns within the sequence become more ambiguous, the choice of specific recurrent architecture becomes more critical.
Resumo:
The caseins (alpha(s1), alpha(s2), beta, and kappa) are phosphoproteins present in bovine milk that have been studied for over a century and whose structures remain obscure. Here we describe the chemical synthesis and structure elucidation of the N-terminal segment (1-44) of bovine K-casein, the protein which maintains the micellar structure of the caseins. K-Casein (1-44) was synthesised by highly optimised Boc solid-phase peptide chemistry and characterised by mass spectrometry. Structure elucidation was carried out by circular dichroism and nuclear magnetic resonance spectroscopy. CD analysis demonstrated that the segment was ill defined in aqueous medium but in 30% trifluoroethanol it exhibited considerable helical structure. Further, NMR analysis showed the presence of a helical segment containing 26 residues which extends from Pro(8) to Arg(34). This is the first report which demonstrates extensive secondary structure within the casein class of proteins. (c) 2006 Elsevier Inc. All rights reserved.
Resumo:
Conotoxins are small conformationally constrained peptides found in the venom of marine snails of the genus Conus. They are usually cysteine rich and frequently contain a high degree of post-translational modifications such as C-terminal amidation, hydroxylation, carboxylation, bromination, epimerisation and glycosylation. Here we review the role of NMR in determining the three-dimensional structures of conotoxins and also provide a compilation and analysis of H-1 and C-13 chemical shifts of post-translationally modified amino acids and compare them with data from common amino acids. This analysis provides a reference source for chemical shifts of post-translationally modified amino acids. Copyright (C) 2006 John Wiley & Sons, Ltd.