Stochastic dynamics modeling of the protein sequence length distribution in genomes: implications for microbial evolution


Autoria(s): Jain, Rinku; Ramakumar, S
Data(s)

15/11/1999

Resumo

In this paper, we report an analysis of the protein sequence length distribution for 13 bacteria, four archaea and one eukaryote whose genomes have been completely sequenced, The frequency distribution of protein sequence length for all the 18 organisms are remarkably similar, independent of genome size and can be described in terms of a lognormal probability distribution function. A simple stochastic model based on multiplicative processes has been proposed to explain the sequence length distribution. The stochastic model supports the random-origin hypothesis of protein sequences in genomes. Distributions of large proteins deviate from the overall lognormal behavior. Their cumulative distribution follows a power-law analogous to Pareto's law used to describe the income distribution of the wealthy. The protein sequence length distribution in genomes of organisms has important implications for microbial evolution and applications. (C) 1999 Elsevier Science B.V. All rights reserved.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/38793/1/Stochastic_dynamics.pdf

Jain, Rinku and Ramakumar, S (1999) Stochastic dynamics modeling of the protein sequence length distribution in genomes: implications for microbial evolution. In: Physica A: Statistical Mechanics and its Applications, 273 (3-4). pp. 476-485.

Publicador

Elsevier Science

Relação

http://dx.doi.org/10.1016/S0378-4371(99)00370-2

http://eprints.iisc.ernet.in/38793/

Palavras-Chave #BioInformatics Centre #Physics
Tipo

Journal Article

PeerReviewed