34 results for Markov chains. Convergence. Evolutionary Strategy. Large Deviations

in National Center for Biotechnology Information - NCBI


Relevance:

100.00%

Publisher:

Abstract:

Natural mixing processes modeled by Markov chains often show a sharp cutoff in their convergence to long-time behavior. This paper presents problems where the cutoff can be proved (card shuffling, the Ehrenfests' urn). It shows that chains with polynomial growth (drunkard's walk) do not show cutoffs. The best general understanding of such cutoffs (high multiplicity of second eigenvalues due to symmetry) is explored. Examples are given where the symmetry is broken but the cutoff phenomenon persists.
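The cutoff described in this abstract can be illustrated numerically on the Ehrenfests' urn. The following is a minimal sketch, not the paper's own computation: it assumes the lazy version of the chain (hold with probability 1/2, otherwise move one uniformly chosen ball), starts with all balls in one urn, and tracks the total variation distance to the binomial stationary distribution. The function names and the choice n = 50 are illustrative.

```python
from math import comb


def ehrenfest_step(dist):
    """Apply one step of the lazy Ehrenfest urn to a distribution over 0..n.

    State k is the number of balls in urn 1. With probability 1/2 nothing
    happens; otherwise a uniformly chosen ball switches urns.
    """
    n = len(dist) - 1
    new = [0.0] * (n + 1)
    for k, p in enumerate(dist):
        if p == 0.0:
            continue
        new[k] += 0.5 * p                          # lazy (holding) step
        if k > 0:
            new[k - 1] += 0.5 * p * k / n          # a ball leaves urn 1
        if k < n:
            new[k + 1] += 0.5 * p * (n - k) / n    # a ball enters urn 1
    return new


def tv_curve(n, steps):
    """Total variation distance to Binomial(n, 1/2) at times 0..steps."""
    stationary = [comb(n, k) / 2**n for k in range(n + 1)]
    dist = [1.0] + [0.0] * n                       # all balls start in urn 2
    curve = []
    for _ in range(steps + 1):
        curve.append(0.5 * sum(abs(p - q) for p, q in zip(dist, stationary)))
        dist = ehrenfest_step(dist)
    return curve


n = 50
curve = tv_curve(n, 400)
for t in (0, 50, 100, 150, 200, 400):
    print(t, round(curve[t], 4))
```

For this lazy chain the drop is expected to concentrate around (1/2) n log n steps (roughly 98 for n = 50): the distance stays near 1 for a long stretch and then falls quickly rather than decaying gradually.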

Relevance:

40.00%

Publisher:

Abstract:

Combination of molecular phylogenetic analyses of Chrysomelina beetles and chemical data of their defensive secretions indicates that two lineages independently developed, from an ancestral autogenous metabolism, an energetically efficient strategy that made the insect tightly dependent on the chemistry of the host plant. However, a lineage (the interrupta group) escaped this subordination through the development of a yet more derived mixed metabolism potentially compatible with a large number of new host-plant associations. Hence, these analyses on leaf beetles document a mechanism that can explain why high levels of specialization do not necessarily lead to “evolutionary dead ends.”

Relevance:

40.00%

Publisher:

Abstract:

It has been suggested that recombination and shuffling between exons has been a key feature in the evolution of proteins. We propose that this strategy could also be used for the artificial evolution of proteins in bacteria. As a first step, we illustrate the use of a self-splicing group I intron with inserted lox-Cre recombination site to assemble a very large combinatorial repertoire (> 10^11 members) of peptides from two different exons. Each exon comprised a repertoire of 10 random amino acid residues; after splicing, the repertoires were joined together through a central five-residue spacer to give a combinatorial repertoire of 25-residue peptides. The repertoire was displayed on filamentous bacteriophage by fusion to the pIII phage coat protein and selected by binding to several proteins, including beta-glucuronidase. One of the peptides selected against beta-glucuronidase was chemically synthesized and shown to inhibit the enzymatic activity (inhibition constant: 17 nM); by further exon shuffling, an improved inhibitor was isolated (inhibition constant: 7 nM). Not only does this approach provide the means for making very large peptide repertoires, but we anticipate that by introducing constraints in the sequences of the peptides and of the linker, it may be possible to evolve small folded peptides and proteins.

Relevance:

30.00%

Publisher:

Abstract:

A technique for systematic peptide variation by a combination of rational and evolutionary approaches is presented. The design scheme consists of five consecutive steps: (i) identification of a “seed peptide” with a desired activity, (ii) generation of variants selected from a physicochemical space around the seed peptide, (iii) synthesis and testing of this biased library, (iv) modeling of a quantitative sequence-activity relationship by an artificial neural network, and (v) de novo design by a computer-based evolutionary search in sequence space using the trained neural network as the fitness function. This strategy was successfully applied to the identification of novel peptides that fully prevent the positive chronotropic effect of anti-β1-adrenoreceptor autoantibodies from the serum of patients with dilated cardiomyopathy. The seed peptide, comprising 10 residues, was derived by epitope mapping from an extracellular loop of human β1-adrenoreceptor. A set of 90 peptides was synthesized and tested to provide training data for neural network development. De novo design revealed peptides with desired activities that do not match the seed peptide sequence. These results demonstrate that computer-based evolutionary searches can generate novel peptides with substantial biological activity.
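Step (v) of this design scheme, an evolutionary search in sequence space driven by a fitness function, can be sketched as follows. This is only an illustration under stated assumptions: `toy_fitness` is a hypothetical stand-in for the trained neural network (it simply counts charged residues), the seed sequence is invented, and the elitist one-parent scheme with point mutations is one common choice rather than the authors' documented procedure.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def toy_fitness(peptide):
    """Hypothetical stand-in for the trained network: count charged residues."""
    return sum(peptide.count(aa) for aa in "DEKR")


def mutate(peptide, rng):
    """Replace one randomly chosen position with a random amino acid."""
    pos = rng.randrange(len(peptide))
    return peptide[:pos] + rng.choice(AMINO_ACIDS) + peptide[pos + 1:]


def evolve(seed, fitness, generations=50, offspring=20, rng_seed=0):
    """Elitist search: keep the parent unless a mutant scores higher."""
    rng = random.Random(rng_seed)
    best = seed
    for _ in range(generations):
        children = [mutate(best, rng) for _ in range(offspring)]
        best = max([best] + children, key=fitness)
    return best


seed_peptide = "GAVLIGAVLI"  # invented 10-residue seed, not from the study
designed = evolve(seed_peptide, toy_fitness)
print(designed, toy_fitness(designed))
```

Because the parent always competes with its offspring, the fitness of the retained sequence never decreases from one generation to the next. Swapping `toy_fitness` for a trained sequence-activity model would leave the search loop unchanged.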

Relevance:

30.00%

Publisher:

Abstract:

Two directed evolution experiments on p-nitrobenzyl esterase yielded one enzyme with a 100-fold increased activity in aqueous-organic solvents and another with a 17°C increase in thermostability. Structures of the wild type and its organophilic and thermophilic counterparts are presented at resolutions of 1.5 Å, 1.6 Å, and 2.0 Å, respectively. These structures identify groups of interacting mutations and demonstrate how directed evolution can traverse complex fitness landscapes. Early-generation mutations stabilize flexible loops not visible in the wild-type structure and set the stage for further beneficial mutations in later generations. The mutations exert their influence on the esterase structure over large distances, in a manner that would be difficult to predict. The loops with the largest structural changes generally are not the sites of mutations. Similarly, none of the seven amino acid substitutions in the organophile are in the active site, even though the enzyme experiences significant changes in the organization of this site. In addition to reduction of surface loop flexibility, thermostability in the evolved esterase results from altered core packing, helix stabilization, and the acquisition of surface salt bridges, in agreement with other comparative studies of mesophilic and thermophilic enzymes. Crystallographic analysis of the wild type and its evolved counterparts reveals networks of mutations that collectively reorganize the active site. Interestingly, the changes that led to diversity within the α/β hydrolase enzyme family and the reorganization seen in this study result from main-chain movements.

Relevance:

30.00%

Publisher:

Abstract:

Two RNases H of mammalian tissues have been described: RNase HI, the activity of which was found to rise during DNA replication, and RNase HII, which may be involved in transcription. RNase HI is the major mammalian enzyme representing around 85% of the total RNase H activity in the cell. By using highly purified calf thymus RNase HI we identified the sequences of several tryptic peptides. This information enabled us to determine the sequence of the cDNA coding for the large subunit of human RNase HI. The corresponding ORF of 897 nt defines a polypeptide of relative molecular mass of 33,367, which is in agreement with the molecular mass obtained earlier by SDS/PAGE. Expression of the cloned ORF in Escherichia coli leads to a polypeptide, which is specifically recognized by an antiserum raised against calf thymus RNase HI. Interestingly, the deduced amino acid sequence of this subunit of human RNase HI displays significant homology to RNase HII from E. coli, an enzyme of unknown function and previously judged as a minor activity. This finding suggests an evolutionary link between the mammalian RNases HI and the prokaryotic RNases HII. The idea of a mammalian RNase HI large subunit being a strongly conserved protein is substantiated by the existence of homologous ORFs in the genomes of other eukaryotes and of all eubacteria and archaebacteria that have been completely sequenced.

Relevance:

30.00%

Publisher:

Abstract:

Many proximate factors determine a bird’s laying date, including environmental and social stimuli as well as individual responses to internal and external factors. However, the relative importance of these factors has not been experimentally demonstrated. Here we show that (i) large differences in the onset of first clutches between different populations result from variation in responses to photoperiod and not from variation in responses to any other proximate factors and (ii) the same response mechanism causes maladaptive laying dates in habitats modified by humans. We present, to our knowledge, the first experimental demonstration that a single response mechanism is responsible for evolutionary adaptive intraspecific variation in a vertebrate life history trait.

Relevance:

30.00%

Publisher:

Abstract:

Nuclear pore complexes (NPCs) are large proteinaceous portals for exchanging macromolecules between the nucleus and the cytoplasm. Revealing how this transport apparatus is assembled will be critical for understanding the nuclear transport mechanism. To address this issue and to identify factors that regulate NPC formation and dynamics, a novel fluorescence-based strategy was used. This approach is based on the functional tagging of NPC proteins with the green fluorescent protein (GFP), and the hypothesis that NPC assembly mutants will have distinct GFP-NPC signals as compared with wild-type (wt) cells. By fluorescence-activated cell sorting for cells with low GFP signal from a population of mutagenized cells expressing GFP-Nup49p, three complementation groups were identified: two correspond to mutant nup120 and gle2 alleles that result in clusters of NPCs. Interestingly, a third group was a novel temperature-sensitive allele of nup57. The lowered GFP-Nup49p incorporation in the nup57-E17 cells resulted in a decreased fluorescence level, which was due in part to a sharply diminished interaction between the carboxy-terminal truncated nup57pE17 and wt Nup49p. Interestingly, the nup57-E17 mutant also affected the incorporation of a specific subset of other nucleoporins into the NPC. Decreased levels of NPC-associated Nsp1p and Nup116p were observed. In contrast, the localizations of Nic96p, Nup82p, Nup159p, Nup145p, and Pom152p were not markedly diminished. Coincidentally, nuclear import capacity was inhibited. Taken together, the identification of such mutants with specific perturbations of NPC structure validates this fluorescence-based strategy as a powerful approach for providing insight into the mechanism of NPC biogenesis.

Relevance:

30.00%

Publisher:

Abstract:

The function of many of the uncharacterized open reading frames discovered by genomic sequencing can be determined at the level of expressed gene products, the proteome. However, identifying the cognate gene from minute amounts of protein has been one of the major problems in molecular biology. Using yeast as an example, we demonstrate here that mass spectrometric protein identification is a general solution to this problem given a completely sequenced genome. As a first screen, our strategy uses automated laser desorption ionization mass spectrometry of the peptide mixtures produced by in-gel tryptic digestion of a protein. Up to 90% of proteins are identified by searching sequence data bases by lists of peptide masses obtained with high accuracy. The remaining proteins are identified by partially sequencing several peptides of the unseparated mixture by nanoelectrospray tandem mass spectrometry followed by data base searching with multiple peptide sequence tags. In blind trials, the method led to unambiguous identification in all cases. In the largest individual protein identification project to date, a total of 150 gel spots—many of them at subpicomole amounts—were successfully analyzed, greatly enlarging a yeast two-dimensional gel data base. More than 32 proteins were novel and matched to previously uncharacterized open reading frames in the yeast genome. This study establishes that mass spectrometry provides the required throughput, the certainty of identification, and the general applicability to serve as the method of choice to connect genome and proteome.
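The first screening step, searching a sequence database with a list of peptide masses, can be sketched in a few lines. This is a simplified illustration, not the authors' software: it assumes trypsin cleaves after K or R except before P, uses standard monoisotopic residue masses, ignores missed cleavages and modifications, and scores each database entry by the number of observed masses it explains. The two database sequences and the tolerance are invented for the example.

```python
# Monoisotopic residue masses in daltons (standard values).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056


def tryptic_digest(seq):
    """Cleave after K or R unless the next residue is P (assumed rule)."""
    peptides, start = [], 0
    for i, aa in enumerate(seq):
        nxt = seq[i + 1] if i + 1 < len(seq) else ""
        if aa in "KR" and nxt != "P":
            peptides.append(seq[start:i + 1])
            start = i + 1
    if start < len(seq):
        peptides.append(seq[start:])
    return peptides


def peptide_mass(pep):
    """Neutral monoisotopic mass: residue masses plus one water."""
    return sum(MONO[aa] for aa in pep) + WATER


def count_matches(observed, seq, tol=0.01):
    """Number of observed masses explained by the digest of seq."""
    theoretical = [peptide_mass(p) for p in tryptic_digest(seq)]
    return sum(any(abs(m - t) <= tol for t in theoretical) for m in observed)


def identify(observed, database, tol=0.01):
    """Return the database entry that explains the most observed masses."""
    return max(database, key=lambda name: count_matches(observed, database[name], tol))


# Invented toy database; real searches run against a full sequence database.
database = {"protA": "MKAPRGGKLLSR", "protB": "MSTKVVWRDEK"}
observed = [peptide_mass(p) for p in tryptic_digest(database["protB"])]
print(identify(observed, database))
```

A real search would also need to handle missed cleavages, measurement error models, and ties between entries, which is where the tandem mass spectrometry step described in the abstract comes in.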

Relevance:

30.00%

Publisher:

Abstract:

In birds and mammals T cells develop along two discrete pathways characterized by expression of either the αβ or the γδ T-cell antigen receptors (TCRs). To gain further insight into the evolutionary significance of the γδ T-cell lineage, the present studies sought to define the chicken TCRγ locus. A splenic cDNA library was screened with two polymerase chain reaction products obtained from genomic DNA using primers for highly conserved regions of TCR and immunoglobulin genes. This strategy yielded cDNA clones with characteristics of mammalian TCR γ chains, including canonical residues considered important for proper folding and stability. Northern blot analysis with the TCRγ cDNA probe revealed 1.9-kb transcripts in the thymus, spleen, and a γδ T-cell line, but not in B or αβ T-cell lines. Three multimember Vγ subfamilies, three Jγ gene segments, and a single constant region Cγ gene were identified in the avian TCRγ locus. Members of each of the three Vγ subfamilies were found to undergo rearrangement in parallel during the first wave of thymocyte development. TCRγ repertoire diversification was initiated on embryonic day 10 by an apparently random pattern of V-Jγ recombination, nuclease activity, and P- and N-nucleotide additions to generate a diverse repertoire of avian TCRγ genes early in ontogeny.

Relevance:

30.00%

Publisher:

Abstract:

We report DNA and predicted protein sequence similarities, implying homology, among genes of double-stranded DNA (dsDNA) bacteriophages and prophages spanning a broad phylogenetic range of host bacteria. The sequence matches reported here establish genetic connections, not always direct, among the lambdoid phages of Escherichia coli, phage φC31 of Streptomyces, phages of Mycobacterium, a previously unrecognized cryptic prophage, φflu, in the Haemophilus influenzae genome, and two small prophage-like elements, φRv1 and φRv2, in the genome of Mycobacterium tuberculosis. The results imply that these phage genes, and very possibly all of the dsDNA tailed phages, share common ancestry. We propose a model for the genetic structure and dynamics of the global phage population in which all dsDNA phage genomes are mosaics with access, by horizontal exchange, to a large common genetic pool but in which access to the gene pool is not uniform for all phage.

Relevance:

30.00%

Publisher:

Abstract:

Despite striking differences in climate, soils, and evolutionary history among diverse biomes ranging from tropical and temperate forests to alpine tundra and desert, we found similar interspecific relationships among leaf structure and function and plant growth in all biomes. Our results thus demonstrate convergent evolution and global generality in plant functioning, despite the enormous diversity of plant species and biomes. For 280 plant species from two global data sets, we found that potential carbon gain (photosynthesis) and carbon loss (respiration) increase in similar proportion with decreasing leaf life-span, increasing leaf nitrogen concentration, and increasing leaf surface area-to-mass ratio. Productivity of individual plants and of leaves in vegetation canopies also changes in constant proportion to leaf life-span and surface area-to-mass ratio. These global plant functional relationships have significant implications for global scale modeling of vegetation–atmosphere CO2 exchange.

Relevance:

30.00%

Publisher:

Abstract:

PDB-REPRDB is a database of representative protein chains from the Protein Data Bank (PDB). The previous version of PDB-REPRDB provided 48 representative sets, whose similarity criteria were predetermined, on the WWW. The current version is designed so that the user may obtain a quick selection of representative chains from PDB. The selection of representative chains can be dynamically configured according to the user’s requirement. The WWW interface provides a large degree of freedom in setting parameters, such as cut-off scores of sequence and structural similarity. One can obtain a representative list and classification data of protein chains from the system. The current database includes 20 457 protein chains from PDB entries (August 6, 2000). The system for PDB-REPRDB is available at the Parallel Protein Information Analysis system (PAPIA) WWW server (http://www.rwcp.or.jp/papia/).
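Selecting representative chains under a user-set similarity cut-off can be pictured with a simple greedy pass. The sketch below is an assumption about the general idea, not PDB-REPRDB's actual algorithm: chains are taken in a given priority order, and a chain becomes a representative only if its similarity to every representative chosen so far is below the cut-off. The similarity measure (ungapped fraction of identical positions), the chain identifiers, and the sequences are all hypothetical.

```python
def identity(a, b):
    """Ungapped fraction of identical positions over the shorter sequence."""
    length = min(len(a), len(b))
    if length == 0:
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / length


def select_representatives(chains, cutoff):
    """Greedy selection: chains is an ordered list of (chain_id, sequence).

    A chain is kept only if it is less similar than `cutoff` to every
    representative already selected.
    """
    representatives = []
    for chain_id, seq in chains:
        if all(identity(seq, rep_seq) < cutoff for _, rep_seq in representatives):
            representatives.append((chain_id, seq))
    return [chain_id for chain_id, _ in representatives]


# Hypothetical chains, assumed to be pre-sorted by structure quality.
chains = [
    ("1abcA", "MKTAYIAK"),
    ("2xyzB", "MKTAYIAR"),
    ("3defC", "GGSPLVWQ"),
]
print(select_representatives(chains, cutoff=0.8))
print(select_representatives(chains, cutoff=0.9))
```

Raising the cut-off keeps more chains and lowering it merges more of them, which mirrors the configurable cut-off scores the interface exposes.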

Relevance:

30.00%

Publisher:

Abstract:

The Dali Domain Dictionary (http://www.ebi.ac.uk/dali/domain) is a numerical taxonomy of all known structures in the Protein Data Bank (PDB). The taxonomy is derived fully automatically from measurements of structural, functional and sequence similarities. Here, we report the extension of the classification to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families). The computational definitions of attractors and functional families are new. In September 2000, the Dali classification contained 10 531 PDB entries comprising 17 101 chains, which were partitioned into five attractor regions, 1375 fold types, 2582 functional families and 3724 domain sequence families. Sequence families were further associated with 99 582 unique homologous sequences in the HSSP database, which increases the number of effectively known structures several-fold. The resulting database contains the description of protein domain architecture, the definition of structural neighbours around each known structure, the definition of structurally conserved cores and a comprehensive library of explicit multiple alignments of distantly related protein families.

Relevance:

30.00%

Publisher:

Abstract:

Systematic conservation planning is a branch of conservation biology that seeks to identify spatially explicit options for the preservation of biodiversity. Alternative systems of conservation areas are predictions about effective ways of promoting the persistence of biodiversity; therefore, they should consider not only biodiversity pattern but also the ecological and evolutionary processes that maintain and generate species. Most research and application, however, has focused on pattern representation only. This paper outlines the development of a conservation system designed to preserve biodiversity pattern and process in the context of a rapidly changing environment. The study area is the Cape Floristic Region (CFR), a biodiversity hotspot of global significance, located in southwestern Africa. This region has experienced rapid (post-Pliocene) ecological diversification of many plant lineages; there are numerous genera with large clusters of closely related species (flocks) that have subdivided habitats at a very fine scale. The challenge is to design conservation systems that will preserve both the pattern of large numbers of species and various natural processes, including the potential for lineage turnover. We outline an approach for designing a system of conservation areas to incorporate the spatial components of the evolutionary processes that maintain and generate biodiversity in the CFR. We discuss the difficulty of assessing the requirements for pattern versus process representation in the face of ongoing threats to biodiversity, the difficulty of testing the predictions of alternative conservation systems, and the widespread need in conservation planning to incorporate and set targets for the spatial components (or surrogates) of processes.