5 resultados para Banach Sequence Space
em National Center for Biotechnology Information - NCBI
Resumo:
A technique for systematic peptide variation by a combination of rational and evolutionary approaches is presented. The design scheme consists of five consecutive steps: (i) identification of a “seed peptide” with a desired activity, (ii) generation of variants selected from a physicochemical space around the seed peptide, (iii) synthesis and testing of this biased library, (iv) modeling of a quantitative sequence-activity relationship by an artificial neural network, and (v) de novo design by a computer-based evolutionary search in sequence space using the trained neural network as the fitness function. This strategy was successfully applied to the identification of novel peptides that fully prevent the positive chronotropic effect of anti-β1-adrenoreceptor autoantibodies from the serum of patients with dilated cardiomyopathy. The seed peptide, comprising 10 residues, was derived by epitope mapping from an extracellular loop of human β1-adrenoreceptor. A set of 90 peptides was synthesized and tested to provide training data for neural network development. De novo design revealed peptides with desired activities that do not match the seed peptide sequence. These results demonstrate that computer-based evolutionary searches can generate novel peptides with substantial biological activity.
Resumo:
How large is the volume of sequence space that is compatible with a given protein structure? Starting from random sequences, low free energy sequences were generated for 108 protein backbone structures by using a Monte Carlo optimization procedure and a free energy function based primarily on Lennard–Jones packing interactions and the Lazaridis–Karplus implicit solvation model. Remarkably, in the designed sequences 51% of the core residues and 27% of all residues were identical to the amino acids in the corresponding positions in the native sequences. The lowest free energy sequences obtained for ensembles of native-like backbone structures were also similar to the native sequence. Furthermore, both the individual residue frequencies and the covariances between pairs of positions observed in the very large SH3 domain family were recapitulated in core sequences designed for SH3 domain structures. Taken together, these results suggest that the volume of sequence space optimal for a protein structure is surprisingly restricted to a region around the native sequence.
Resumo:
It is shown that the sequence-ordering tendencies induced by design into different fast-folding, thermally stable native structures interfere. This interference results in a type of quasiorthogonality between optimal native structures, which divides sequence space into fast-folding, thermally stable families surrounded by slow-folding, low stability shells. A concrete example of this effect is provided by using a simple α carbon type model in which a complete correspondence is established between sequence and structure. It is speculated that gaps can occur in the space of protein-like sequences separating the sequence families and resulting in a mechanism for stability and diversity of protein sequence information.
Resumo:
RNA secondary structure folding algorithms predict the existence of connected networks of RNA sequences with identical structure. On such networks, evolving populations split into subpopulations, which diffuse independently in sequence space. This demands a distinction between two mutation thresholds: one at which genotypic information is lost and one at which phenotypic information is lost. In between, diffusion enables the search of vast areas in genotype space while still preserving the dominant phenotype. By this dynamic the success of phenotypic adaptation becomes much less sensitive to the initial conditions in genotype space.
Resumo:
To test a different approach to understanding the relationship between the sequence of part of a protein and its conformation in the overall folded structure, the amino acid sequence corresponding to an α-helix of T4 lysozyme was duplicated in tandem. The presence of such a sequence repeat provides the protein with “choices” during folding. The mutant protein folds with almost wild-type stability, is active, and crystallizes in two different space groups, one isomorphous with wild type and the other with two molecules in the asymmetric unit. The fold of the mutant is essentially the same in all cases, showing that the inserted segment has a well-defined structure. More than half of the inserted residues are themselves helical and extend the helix present in the wild-type protein. Participation of additional duplicated residues in this helix would have required major disruption of the parent structure. The results clearly show that the residues within the duplicated sequence tend to maintain a helical conformation even though the packing interactions with the remainder of the protein are different from those of the original helix. It supports the hypothesis that the structures of individual α-helices are determined predominantly by the nature of the amino acids within the helix, rather than the structural environment provided by the rest of the protein.