Bayesian model accounting for within-class biological variability in Serial Analysis of Gene Expression (SAGE)


Autoria(s): Vêncio, Ricardo ZN; Brentani, Helena ; Patrão, Diogo FC; Pereira, Carlos AB
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

26/08/2013

26/08/2013

01/08/2004

Resumo

Abstract Background An important challenge for transcript counting methods such as Serial Analysis of Gene Expression (SAGE), "Digital Northern" or Massively Parallel Signature Sequencing (MPSS), is to carry out statistical analyses that account for the within-class variability, i.e., variability due to the intrinsic biological differences among sampled individuals of the same class, and not only variability due to technical sampling error. Results We introduce a Bayesian model that accounts for the within-class variability by means of mixture distribution. We show that the previously available approaches of aggregation in pools ("pseudo-libraries") and the Beta-Binomial model, are particular cases of the mixture model. We illustrate our method with a brain tumor vs. normal comparison using SAGE data from public databases. We show examples of tags regarded as differentially expressed with high significance if the within-class variability is ignored, but clearly not so significant if one accounts for it. Conclusion Using available information about biological replicates, one can transform a list of candidate transcripts showing differential expression to a more reliable one. Our method is freely available, under GPL/GNU copyleft, through a user friendly web-based on-line tool or as R language scripts at supplemental web-site.

RV is supported by FAPESP 02/046988 fellowship. We thank Tie Koide for critical reading of the manuscript and BIOINFOUSP/RedeVision for computational support.

RV is supported by FAPESP 02/04698-8 fellowship. We thank Tie Koide for critical reading of the manuscript and BIOINFO-USP/Rede-Vision for computational support.

Identificador

1471-2105

http://www.producao.usp.br/handle/BDPI/32732

10.1186/1471-2105-5-119

http://www.biomedcentral.com/1471-2105/5/119

Idioma(s)

eng

Relação

BMC Bioinformatics

Direitos

openAccess

Vêncio et al; licensee BioMed Central Ltd. - This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Tipo

article

original article