3 resultados para Protein structures
em AMS Tesi di Dottorato - Alm@DL - Università di Bologna
Resumo:
The vast majority of known proteins have not yet been experimentally characterized and little is known about their function. The design and implementation of computational tools can provide insight into the function of proteins based on their sequence, their structure, their evolutionary history and their association with other proteins. Knowledge of the three-dimensional (3D) structure of a protein can lead to a deep understanding of its mode of action and interaction, but currently the structures of <1% of sequences have been experimentally solved. For this reason, it became urgent to develop new methods that are able to computationally extract relevant information from protein sequence and structure. The starting point of my work has been the study of the properties of contacts between protein residues, since they constrain protein folding and characterize different protein structures. Prediction of residue contacts in proteins is an interesting problem whose solution may be useful in protein folding recognition and de novo design. The prediction of these contacts requires the study of the protein inter-residue distances related to the specific type of amino acid pair that are encoded in the so-called contact map. An interesting new way of analyzing those structures came out when network studies were introduced, with pivotal papers demonstrating that protein contact networks also exhibit small-world behavior. In order to highlight constraints for the prediction of protein contact maps and for applications in the field of protein structure prediction and/or reconstruction from experimentally determined contact maps, I studied to which extent the characteristic path length and clustering coefficient of the protein contacts network are values that reveal characteristic features of protein contact maps. Provided that residue contacts are known for a protein sequence, the major features of its 3D structure could be deduced by combining this knowledge with correctly predicted motifs of secondary structure. In the second part of my work I focused on a particular protein structural motif, the coiled-coil, known to mediate a variety of fundamental biological interactions. Coiled-coils are found in a variety of structural forms and in a wide range of proteins including, for example, small units such as leucine zippers that drive the dimerization of many transcription factors or more complex structures such as the family of viral proteins responsible for virus-host membrane fusion. The coiled-coil structural motif is estimated to account for 5-10% of the protein sequences in the various genomes. Given their biological importance, in my work I introduced a Hidden Markov Model (HMM) that exploits the evolutionary information derived from multiple sequence alignments, to predict coiled-coil regions and to discriminate coiled-coil sequences. The results indicate that the new HMM outperforms all the existing programs and can be adopted for the coiled-coil prediction and for large-scale genome annotation. Genome annotation is a key issue in modern computational biology, being the starting point towards the understanding of the complex processes involved in biological networks. The rapid growth in the number of protein sequences and structures available poses new fundamental problems that still deserve an interpretation. Nevertheless, these data are at the basis of the design of new strategies for tackling problems such as the prediction of protein structure and function. Experimental determination of the functions of all these proteins would be a hugely time-consuming and costly task and, in most instances, has not been carried out. As an example, currently, approximately only 20% of annotated proteins in the Homo sapiens genome have been experimentally characterized. A commonly adopted procedure for annotating protein sequences relies on the "inheritance through homology" based on the notion that similar sequences share similar functions and structures. This procedure consists in the assignment of sequences to a specific group of functionally related sequences which had been grouped through clustering techniques. The clustering procedure is based on suitable similarity rules, since predicting protein structure and function from sequence largely depends on the value of sequence identity. However, additional levels of complexity are due to multi-domain proteins, to proteins that share common domains but that do not necessarily share the same function, to the finding that different combinations of shared domains can lead to different biological roles. In the last part of this study I developed and validate a system that contributes to sequence annotation by taking advantage of a validated transfer through inheritance procedure of the molecular functions and of the structural templates. After a cross-genome comparison with the BLAST program, clusters were built on the basis of two stringent constraints on sequence identity and coverage of the alignment. The adopted measure explicity answers to the problem of multi-domain proteins annotation and allows a fine grain division of the whole set of proteomes used, that ensures cluster homogeneity in terms of sequence length. A high level of coverage of structure templates on the length of protein sequences within clusters ensures that multi-domain proteins when present can be templates for sequences of similar length. This annotation procedure includes the possibility of reliably transferring statistically validated functions and structures to sequences considering information available in the present data bases of molecular functions and structures.
Resumo:
The study of protein fold is a central problem in life science, leading in the last years to several attempts for improving our knowledge of the protein structures. In this thesis this challenging problem is tackled by means of molecular dynamics, chirality and NMR studies. In the last decades, many algorithms were designed for the protein secondary structure assignment, which reveals the local protein shape adopted by segments of amino acids. In this regard, the use of local chirality for the protein secondary structure assignment was demonstreted, trying to correlate as well the propensity of a given amino acid for a particular secondary structure. The protein fold can be studied also by Nuclear Magnetic Resonance (NMR) investigations, finding the average structure adopted from a protein. In this context, the effect of Residual Dipolar Couplings (RDCs) in the structure refinement was shown, revealing a strong improvement of structure resolution. A wide extent of this thesis is devoted to the study of avian prion protein. Prion protein is the main responsible of a vast class of neurodegenerative diseases, known as Bovine Spongiform Encephalopathy (BSE), present in mammals, but not in avian species and it is caused from the conversion of cellular prion protein to the pathogenic misfolded isoform, accumulating in the brain in form of amiloyd plaques. In particular, the N-terminal region, namely the initial part of the protein, is quite different between mammal and avian species but both of them contain multimeric sequences called Repeats, octameric in mammals and hexameric in avians. However, such repeat regions show differences in the contained amino acids, in particular only avian hexarepeats contain tyrosine residues. The chirality analysis of avian prion protein configurations obtained from molecular dynamics reveals a high stiffness of the avian protein, which tends to preserve its regular secondary structure. This is due to the presence of prolines, histidines and especially tyrosines, which form a hydrogen bond network in the hexarepeat region, only possible in the avian protein, and thus probably hampering the aggregation.
Resumo:
Organotin compounds are worldwide diffused environmental contaminants, mainly as consequence of their extensive past use as biocides in antifouling paints. In spite of law restrictions, due to unwanted effects, organotin still persist in waters, being poorly degraded, easily resuspended from sediments and bioaccumulated in exposed organisms. The widespread toxicity and the possible threat to humans, likely to be organotin-exposed through contaminated seafood, make organotin interactions with biomolecules an intriguing biochemical topic, apart from a matter of ecotoxicological concern. Among organotins, tributyltin (TBT) is long known as the most dangerous and abundant chemical species in the Mediterranean Sea. Due to its amphiphilic nature, provided by three lipophilic arms and an electrophilic tin core, TBT can be easily incorporated in biomembranes and affect their functionality. Accordingly, it is known as a membrane-active toxicant and a mitochondrial poison. Up to now the molecular action modes of TBT are still partially unclear and poorly explored in bivalve mollusks, even if the latter play a not neglectable role in the marine trophic chain and efficiently accumulate organotins. The bivalve mollusk Mytilus galloprovincialis, selected for all experiments, is widely cultivated in the Mediterranean and currently used in ecotoxicological studies. Most work of this thesis was devoted to TBT effects on mussel mitochondria, but other possible targets of TBT were also considered. A great deal of literature points out TBT as endocrine disrupter and the masculinization of female marine gastropods, the so-called imposex, currently signals environmental organotin contamination. The hormonal status of TBT-exposed mussels and the possible interaction between hormones and contaminants in modulating microsomal hydroxilases, involved in steroid hormone and organotin detoxification, were the research topics in the period spent in Barcelona (Marco Polo fellowship). The variegated experimental approach, which consisted of two exposure experiments and in vitro tests, and the choice of selected tissues of M. galloprovincialis, the midgut gland for mitochondrial and microsomal preparations for subsequent laboratory assays and the gonads for the endocrine evaluations, aimed at drawing a clarifying pattern on the molecular mechanisms involved in organotin toxicity. TBT was promptly incorporated in midgut gland mitochondria of adult mussels exposed to 0.5 and 1.0 μg/L TBT, and partially degraded to DBT. TBT incorporation was accompanied by a decrease in the mitochondrial oligomycin-sensitive Mg-ATPase activity, while the coexistent oligomycin-insensitive fraction was unaffected. Mitochondrial fatty acids showed a clear rise in n-3 polyunsaturated fatty acids after 120 hr of TBT exposure, mainly referable to an increase in 22:6 level. TBT was also shown to inhibit the ATP hydrolytic activity of the mitochondrial F1FO complex in vitro and to promote an apparent loss of oligomycin sensitivity at higher than 1.0 μM concentration. The complex dose-dependent profile of the inhibition curve lead to the hypothesis of multiple TBT binding sites. At lower than 1.0 μM TBT concentrations the non competitive enzyme inhibition by TBT was ascribed to the non covalent binding of TBT to FO subunit. On the other hand the observed drop in oligomycin sensitivity at higher than 1.0 μM TBT could be related to the onset of covalent bonds involving thiolic groups on the enzyme structure, apparently reached only at high TBT levels. The mitochondrial respiratory complexes were in vitro affected by TBT, apart from the cytocrome c oxidase which was apparently refractory to the contaminant. The most striking inhibitory effect was shown on complex I, and ascribed to possible covalent bonds of TBT with –SH groups on the enzyme complexes. This mechanism, shouldered by the progressive decrease of free cystein residues in the presence of increasing TBT concentrations, suggests that the onset of covalent tin-sulphur bonds in distinct protein structures may constitute the molecular basis of widespread TBT effects on mitochondrial complexes. Energy production disturbances, in turn affecting energy consuming mechanisms, could be involved in other cellular changes. Mussels exposed to a wide range of TBT concentrations (20 - 200 and 2000 ng/L respectively) did not show any change in testosterone and estrogen levels in mature gonads. Most hormones were in the non-biologically active esterified form both in control and in TBT-treated mussels. Probably the endocrine status of sexually mature mussels could be refractory even to high TBT doses. In mussel digestive gland the high biological variability of microsomal 7-benzyloxy-4-trifluoromethylcoumarin-O-Debenzyloxylase (BFCOD) activity, taken as a measure of CYP3A-like efficiency, probably concealed any enzyme response to TBT exposure. On the other hand the TBT-driven enhancement of BFCOD activity in vitro was once again ascribed to covalent binding to thiol groups which, in this case, would stimulate the enzyme activity. In mussels from Barcelona harbour, a highly contaminated site, the enzyme showed a decreased affinity for the 7-benzyloxy-4-trifluoromethylcoumarin (BCF) substrate with respect to mussel sampled from Ebro Delta, a non-polluted marine site. Contaminant exposure may thus alter the kinetic features of enzymes involved in detoxification mechanisms. Contaminants and steroid hormones were clearly shown to mutually interact in the modulation of detoxification mechanisms. The xenoestrogen 17α-ethylenyl estradiol (EE2) displayed a non-competitive mixed inhibition of CYP3A-like activity by a preferential bond to the free enzyme both in Barcelona harbour and Ebro Delta mussels. The possible interaction with co-present contaminants in Barcelona harbour mussels apparently lessened the formation of the ternary complex enzyme-EE2-BCF. The whole of data confirms TBT as membrane toxicant in mussels as in other species and stresses TBT covalent binding to protein thiols as a widespread mechanism of membrane-bound-enzyme activity modulation by the contaminant.