4 resultados para Evolutionary Information Behaviour
em National Center for Biotechnology Information - NCBI
Resumo:
It is shown that the sequence-ordering tendencies induced by design into different fast-folding, thermally stable native structures interfere. This interference results in a type of quasiorthogonality between optimal native structures, which divides sequence space into fast-folding, thermally stable families surrounded by slow-folding, low stability shells. A concrete example of this effect is provided by using a simple α carbon type model in which a complete correspondence is established between sequence and structure. It is speculated that gaps can occur in the space of protein-like sequences separating the sequence families and resulting in a mechanism for stability and diversity of protein sequence information.
Resumo:
The intercistronic region between the maturation and coat-protein genes of RNA phage MS2 contains important regulatory and structural information. The sequence participates in two adjacent stem-loop structures, one of which, the coat-initiator hairpin, controls coat-gene translation and is thus under strong selection pressure. We have removed 19 out of the 23 nucleotides constituting the intercistronic region, thereby destroying the capacity of the phage to build the two hairpins. The deletion lowered coat-protein yield more than 1000-fold, and the titer of the infectious clone carrying the deletion dropped 10 orders of magnitude as compared with the wild type. Two types of revertants were recovered. One had, in two steps, recruited 18 new nucleotides that served to rebuild the two hairpins and the lost Shine-Dalgarno sequence. The other type had deleted an additional six nucleotides, which allowed the reconstruction of the Shine-Dalgarno sequence and the initiator hairpin, albeit by sacrificing the remnants of the other stem-loop. The results visualize the immense genetic repertoire created by, what appears as, random RNA recombination. It would seem that in this genetic ensemble every possible new RNA combination is represented.
Sequence similarity analysis of Escherichia coli proteins: functional and evolutionary implications.
Resumo:
A computer analysis of 2328 protein sequences comprising about 60% of the Escherichia coli gene products was performed using methods for database screening with individual sequences and alignment blocks. A high fraction of E. coli proteins--86%--shows significant sequence similarity to other proteins in current databases; about 70% show conservation at least at the level of distantly related bacteria, and about 40% contain ancient conserved regions (ACRs) shared with eukaryotic or Archaeal proteins. For > 90% of the E. coli proteins, either functional information or sequence similarity, or both, are available. Forty-six percent of the E. coli proteins belong to 299 clusters of paralogs (intraspecies homologs) defined on the basis of pairwise similarity. Another 10% could be included in 70 superclusters using motif detection methods. The majority of the clusters contain only two to four members. In contrast, nearly 25% of all E. coli proteins belong to the four largest superclusters--namely, permeases, ATPases and GTPases with the conserved "Walker-type" motif, helix-turn-helix regulatory proteins, and NAD(FAD)-binding proteins. We conclude that bacterial protein sequences generally are highly conserved in evolution, with about 50% of all ACR-containing protein families represented among the E. coli gene products. With the current sequence databases and methods of their screening, computer analysis yields useful information on the functions and evolutionary relationships of the vast majority of genes in a bacterial genome. Sequence similarity with E. coli proteins allows the prediction of functions for a number of important eukaryotic genes, including several whose products are implicated in human diseases.