979 resultados para protein evolution
Resumo:
Protein engineering is a powerful tool, which correlates protein structure with specific functions, both in applied biotechnology and in basic research. Here, we present a practical teaching course for engineering the green fluorescent protein (GFP) from Aequorea victoria by a random mutagenesis strategy using error-prone polymerase chain reaction. Screening of bacterial colonies transformed with random mutant libraries identified GFP variants with increased fluorescence yields. Mapping the three-dimensional structure of these mutants demonstrated how alterations in structural features such as the environment around the fluorophore and properties of the protein surface can influence functional properties such as the intensity of fluorescence and protein solubility.
Resumo:
L’évolution des protéines est un domaine important de la recherche en bioinformatique et catalyse l'intérêt de trouver des outils d'alignement qui peuvent être utilisés de manière fiable et modéliser avec précision l'évolution d'une famille de protéines. TM-Align (Zhang and Skolnick, 2005) est considéré comme l'outil idéal pour une telle tâche, en termes de rapidité et de précision. Par conséquent, dans cette étude, TM-Align a été utilisé comme point de référence pour faciliter la détection des autres outils d'alignement qui sont en mesure de préciser l'évolution des protéines. En parallèle, nous avons élargi l'actuel outil d'exploration de structures secondaires de protéines, Helix Explorer (Marrakchi, 2006), afin qu'il puisse également être utilisé comme un outil pour la modélisation de l'évolution des protéines.
Resumo:
A lattice model is used to study mutations and compacting effects on protein folding rates and folding temperature. In the context of protein evolution, we address the question regarding the best scenario for a polypeptide chain to fold: either a fast nonspecific collapse followed by a slow rearrangement to form the native structure or a specific collapse from the unfolded state with the simultaneous formation of the native state. This question is investigated for optimized sequences, whose native state has no frustrated contacts between monomers, and also for mutated sequences, whose native state has some degree of frustration. It is found that the best scenario for folding may depend on the amount of frustration of the native structure. The implication of this result on protein evolution is discussed. (c) 2006 American Institute of Physics.
Resumo:
We introduce a computational method to optimize the in vitro evolution of proteins. Simulating evolution with a simple model that statistically describes the fitness landscape, we find that beneficial mutations tend to occur at amino acid positions that are tolerant to substitutions, in the limit of small libraries and low mutation rates. We transform this observation into a design strategy by applying mean-field theory to a structure-based computational model to calculate each residue's structural tolerance. Thermostabilizing and activity-increasing mutations accumulated during the experimental directed evolution of subtilisin E and T4 lysozyme are strongly directed to sites identified by using this computational approach. This method can be used to predict positions where mutations are likely to lead to improvement of specific protein properties.
Resumo:
The prion protein displays a unique structural ambiguity in that it can adopt multiple stable conformations under physiological conditions. In our view, this puzzling feature resulted from a sudden environmental change in evolution when the prion, previously an integral membrane protein, got expelled into the extracellular space. Analysis of known vertebrate prions unveils a primordial transmembrane protein encrypted in their sequence, underlying this relocalization hypothesis. Apparently, the time elapsed since this event was insufficient to create a “minimally frustrated” sequence in the new milieu, probably due to the functional constraints set by the importance of the very flexibility that was created in the relocalization. This scenario may explain why, in a structural sense, the prion protein is still en route toward becoming a foldable globular protein.
Resumo:
We extend an earlier model of protein evolution on a rugged landscape to the case in which the landscape exhibits a variable degree of correlation (i.e., smoothness). Correlation is introduced by assuming that a protein is composed of a set of independent blocks or domains and that mutation in one block affects the contribution of that block alone to the overall fitness of the protein. We study the statistical structure of such landscapes and apply our theory to the evolution by somatic hypermutation of antibody molecules composed of framework and complementarity-determining regions. We predict the expected number of replacement mutations in each region.
Resumo:
The evolution of protein function appears to involve alternating periods of conservative evolution and of relatively rapid change. Evidence for such episodic evolution, consistent with some theoretical expectations, comes from the application of increasingly sophisticated models of evolution to large sequence datasets. We present here some of the recent methods to detect functional shifts, using amino acid or codon models. Both provide evidence for punctual shifts in patterns of amino acid conservation, including the fixation of key changes by positive selection. Although a link to gene duplication, a presumed source of functional changes, has been difficult to establish, this episodic model appears to apply to a wide variety of proteins and organisms.
Resumo:
Divergence of protein sequences and gene expression patterns are two fundamental mechanisms that generate organismal diversity. Here, we have used genome and transcriptome data from eight mammals and one bird to study the positive correlation of these two processes throughout mammalian evolution. We demonstrate that the correlation is stable over time and most pronounced in neural tissues, which indicates that it is the result of strong negative selection. The correlation is not driven by genes with specific functions and may instead best be viewed as an evolutionary default state, which can nevertheless be evaded by certain gene types. In particular, genes with developmental and neural functions are skewed toward changes in gene expression, consistent with selection against pleiotropic effects associated with changes in protein sequences. Surprisingly, we find that the correlation between expression divergence and protein divergence is not explained by between-gene variation in expression level, tissue specificity, protein connectivity, or other investigated gene characteristics, suggesting that it arises independently of these gene traits. The selective constraints on protein sequences and gene expression patterns also fluctuate in a coordinate manner across phylogenetic branches: We find that gene-specific changes in the rate of protein evolution in a specific mammalian lineage tend to be accompanied by similar changes in the rate of expression evolution. Taken together, our findings highlight many new aspects of the correlation between protein divergence and expression divergence, and attest to its role as a fundamental property of mammalian genome evolution.
Resumo:
Dissertation presented to obtain the PhD degree in Computational Biology.
Resumo:
It is often supposed that a protein's rate of evolution and its amino acid content are determined by the function and anatomy of the protein. Here we examine an alternative possibility, namely that the requirement to specify in the unprocessed RNA, in the vicinity of intron-exon boundaries, information necessary for removal of introns (e.g., exonic splice enhancers) affects both amino acid usage and rates of protein evolution. We find that the majority of amino acids show skewed usage near intron-exon boundaries, and that differences in the trends for the 2-fold and 4-fold blocks of both arginine and leucine show this to be owing to effects mediated at the nucleotide level. More specifically, there is a robust relationship between the extent to which an amino acid is preferred/avoided near boundaries and its enrichment/paucity in splice enhancers. As might then be expected, the rate of evolution is lowest near intron-exon boundaries, at least in part owing to splice enhancers, such that domains flanking intron-exon junctions evolve on average at under half the rate of exon centres from the same gene. In contrast, the rate of evolution of intronless retrogenes is highest near the domains where intron-exon junctions previously resided. The proportion of sequence near intron-exon boundaries is one of the stronger predictors of a protein's rate of evolution in mammals yet described. We conclude that after intron insertion selection favours modification of amino acid content near intron-exon junctions, so as to enable efficient intron removal, these changes then being subject to strong purifying selection even if nonoptimal for protein function. Thus there exists a strong force operating on protein evolution in mammals that is not explained directly in terms of the biology of the protein.
Resumo:
BACKGROUND: The nuclear receptors are a large family of eukaryotic transcription factors that constitute major pharmacological targets. They exert their combinatorial control through homotypic heterodimerisation. Elucidation of this dimerisation network is vital in order to understand the complex dynamics and potential cross-talk involved. RESULTS: Phylogeny, protein-protein interactions, protein-DNA interactions and gene expression data have been integrated to provide a comprehensive and up-to-date description of the topology and properties of the nuclear receptor interaction network in humans. We discriminate between DNA-binding and non-DNA-binding dimers, and provide a comprehensive interaction map, that identifies potential cross-talk between the various pathways of nuclear receptors. CONCLUSION: We infer that the topology of this network is hub-based, and much more connected than previously thought. The hub-based topology of the network and the wide tissue expression pattern of NRs create a highly competitive environment for the common heterodimerising partners. Furthermore, a significant number of negative feedback loops is present, with the hub protein SHP [NR0B2] playing a major role. We also compare the evolution, topology and properties of the nuclear receptor network with the hub-based dimerisation network of the bHLH transcription factors in order to identify both unique themes and ubiquitous properties in gene regulation. In terms of methodology, we conclude that such a comprehensive picture can only be assembled by semi-automated text-mining, manual curation and integration of data from various sources.
Resumo:
Phenotypic plasticity allows organisms to produce alternative phenotypes under different conditions and represents one of the most important ways by which organisms adaptively respond to the environment. However, the relationship between phenotypic plasticity and molecular evolution remains poorly understood. We addressed this issue by investigating the evolution of genes associated with phenotypically plastic castes, sexes, and developmental stages of the fire ant Solenopsis invicta. We first determined if genes associated with phenotypic plasticity in S. invicta evolved at a rapid rate, as predicted under theoretical models. We found that genes differentially expressed between S. invicta castes, sexes, and developmental stages all exhibited elevated rates of evolution compared with ubiquitously expressed genes. We next investigated the evolutionary history of genes associated with the production of castes. Surprisingly, we found that orthologs of caste-biased genes in S. invicta and the social bee Apis mellifera evolved rapidly in lineages without castes. Thus, in contrast to some theoretical predictions, our results suggest that rapid rates of molecular evolution may not arise primarily as a consequence of phenotypic plasticity. Instead, genes evolving under relaxed purifying selection may more readily adopt new forms of biased expression during the evolution of alternate phenotypes. These results suggest that relaxed selective constraint on protein-coding genes is an important and underappreciated element in the evolutionary origin of phenotypic plasticity.
Resumo:
Low-complexity regions (LCRs) in proteins are tracts that are highly enriched in one or a few aminoacids. Given their high abundance, and their capacity to expand in relatively short periods of time through replication slippage, they can greatly contribute to increase protein sequence space and generate novel protein functions. However, little is known about the global impact of LCRs on protein evolution. We have traced back the evolutionary history of 2,802 LCRs from a large set of homologous protein families from H.sapiens, M.musculus, G.gallus, D.rerio and C.intestinalis. Transcriptional factors and other regulatory functions are overrepresented in proteins containing LCRs. We have found that the gain of novel LCRs is frequently associated with repeat expansion whereas the loss of LCRs is more often due to accumulation of amino acid substitutions as opposed to deletions. This dichotomy results in net protein sequence gain over time. We have detected a significant increase in the rate of accumulation of novel LCRs in the ancestral Amniota and mammalian branches, and a reduction in the chicken branch. Alanine and/or glycine-rich LCRs are overrepresented in recently emerged LCR sets from all branches, suggesting that their expansion is better tolerated than for other LCR types. LCRs enriched in positively charged amino acids show the contrary pattern, indicating an important effect of purifying selection in their maintenance. We have performed the first large-scale study on the evolutionary dynamics of LCRs in protein families. The study has shown that the composition of an LCR is an important determinant of its evolutionary pattern.
Resumo:
Welche genetische Unterschiede machen uns verschieden von unseren nächsten Verwandten, den Schimpansen, und andererseits so ähnlich zu den Schimpansen? Was wir untersuchen und auch verstehen wollen, ist die komplexe Beziehung zwischen den multiplen genetischen und epigenetischen Unterschieden, deren Interaktion mit diversen Umwelt- und Kulturfaktoren in den beobachteten phänotypischen Unterschieden resultieren. Um aufzuklären, ob chromosomale Rearrangements zur Divergenz zwischen Mensch und Schimpanse beigetragen haben und welche selektiven Kräfte ihre Evolution geprägt haben, habe ich die kodierenden Sequenzen von 2 Mb umfassenden, die perizentrischen Inversionsbruchpunkte flankierenden Regionen auf den Chromosomen 1, 4, 5, 9, 12, 17 und 18 untersucht. Als Kontrolle dienten dabei 4 Mb umfassende kollineare Regionen auf den rearrangierten Chromosomen, welche mindestens 10 Mb von den Bruchpunktregionen entfernt lagen. Dabei konnte ich in den Bruchpunkten flankierenden Regionen im Vergleich zu den Kontrollregionen keine höhere Proteinevolutionsrate feststellen. Meine Ergebnisse unterstützen nicht die chromosomale Speziationshypothese für Mensch und Schimpanse, da der Anteil der positiv selektierten Gene (5,1% in den Bruchpunkten flankierenden Regionen und 7% in den Kontrollregionen) in beiden Regionen ähnlich war. Durch den Vergleich der Anzahl der positiv und negativ selektierten Gene per Chromosom konnte ich feststellen, dass Chromosom 9 die meisten und Chromosom 5 die wenigsten positiv selektierten Gene in den Bruchpunkt flankierenden Regionen und Kontrollregionen enthalten. Die Anzahl der negativ selektierten Gene (68) war dabei viel höher als die Anzahl der positiv selektierten Gene (17). Eine bioinformatische Analyse von publizierten Microarray-Expressionsdaten (Affymetrix Chip U95 und U133v2) ergab 31 Gene, die zwischen Mensch und Schimpanse differentiell exprimiert sind. Durch Untersuchung des dN/dS-Verhältnisses dieser 31 Gene konnte ich 7 Gene als negativ selektiert und nur 1 Gen als positiv selektiert identifizieren. Dieser Befund steht im Einklang mit dem Konzept, dass Genexpressionslevel unter stabilisierender Selektion evolvieren. Die meisten positiv selektierten Gene spielen überdies eine Rolle bei der Fortpflanzung. Viele dieser Speziesunterschiede resultieren eher aus Änderungen in der Genregulation als aus strukturellen Änderungen der Genprodukte. Man nimmt an, dass die meisten Unterschiede in der Genregulation sich auf transkriptioneller Ebene manifestieren. Im Rahmen dieser Arbeit wurden die Unterschiede in der DNA-Methylierung zwischen Mensch und Schimpanse untersucht. Dazu wurden die Methylierungsmuster der Promotor-CpG-Inseln von 12 Genen im Cortex von Menschen und Schimpansen mittels klassischer Bisulfit-Sequenzierung und Bisulfit-Pyrosequenzierung analysiert. Die Kandidatengene wurden wegen ihrer differentiellen Expressionsmuster zwischen Mensch und Schimpanse sowie wegen Ihrer Assoziation mit menschlichen Krankheiten oder dem genomischen Imprinting ausgewählt. Mit Ausnahme einiger individueller Positionen zeigte die Mehrzahl der analysierten Gene keine hohe intra- oder interspezifische Variation der DNA-Methylierung zwischen den beiden Spezies. Nur bei einem Gen, CCRK, waren deutliche intraspezifische und interspezifische Unterschiede im Grad der DNA-Methylierung festzustellen. Die differentiell methylierten CpG-Positionen lagen innerhalb eines repetitiven Alu-Sg1-Elements. Die Untersuchung des CCRK-Gens liefert eine umfassende Analyse der intra- und interspezifischen Variabilität der DNA-Methylierung einer Alu-Insertion in eine regulatorische Region. Die beobachteten Speziesunterschiede deuten darauf hin, dass die Methylierungsmuster des CCRK-Gens wahrscheinlich in Adaption an spezifische Anforderungen zur Feinabstimmung der CCRK-Regulation unter positiver Selektion evolvieren. Der Promotor des CCRK-Gens ist anfällig für epigenetische Modifikationen durch DNA-Methylierung, welche zu komplexen Transkriptionsmustern führen können. Durch ihre genomische Mobilität, ihren hohen CpG-Anteil und ihren Einfluss auf die Genexpression sind Alu-Insertionen exzellente Kandidaten für die Förderung von Veränderungen während der Entwicklungsregulation von Primatengenen. Der Vergleich der intra- und interspezifischen Methylierung von spezifischen Alu-Insertionen in anderen Genen und Geweben stellt eine erfolgversprechende Strategie dar.