Separating the wheat from the chaff: unbiased filtering of background tandem mass spectra improves protein identification


Autoria(s): JUNQUEIRA, Magno; SPIRIN, Victor; BALBUENA, Tiago Santana; WARIDEL, Patrice; SURENDRANATH, Vineeth; KRYUKOV, Grigoriy; ADZHUBEI, Ivan; THOMAS, Henrik; SUNYAEV, Shamil; SHEVCHENKO, Andrej
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

20/10/2012

20/10/2012

2008

Resumo

Only a small fraction of spectra acquired in LC-MS/MS runs matches peptides from target proteins upon database searches. The remaining, operationally termed background, spectra originate from a variety of poorly controlled sources and affect the throughput and confidence of database searches. Here, we report an algorithm and its software implementation that rapidly removes background spectra, regardless of their precise origin. The method estimates the dissimilarity distance between screened MS/MS spectra and unannotated spectra from a partially redundant background library compiled from several control and blank runs. Filtering MS/MS queries enhanced the protein identification capacity when searches lacked spectrum to sequence matching specificity. In sequence-similarity searches it reduced by, on average, 30-fold the number of orphan hits, which were not explicitly related to background protein contaminants and required manual validation. Removing high quality background MS/MS spectra, while preserving in the data set the genuine spectra from target proteins, decreased the false positive rate of stringent database searches and improved the identification of low-abundance proteins.

Identificador

JOURNAL OF PROTEOME RESEARCH, v.7, n.8, p.3382-3395, 2008

1535-3893

http://producao.usp.br/handle/BDPI/27696

10.1021/pr800140v

http://dx.doi.org/10.1021/pr800140v

Idioma(s)

eng

Publicador

AMER CHEMICAL SOC

Relação

Journal of Proteome Research

Direitos

restrictedAccess

Copyright AMER CHEMICAL SOC

Palavras-Chave #proteomics #LC-MS/MS #sequence similarity searches #background spectra filtering #de novo sequencing #MS BLAST #QUADRUPOLE COLLISION CELL #PEPTIDE SEQUENCE TAGS #COMPREHENSIVE ANALYSIS #AFFINITY PURIFICATION #INTERACTION NETWORKS #LC-MS/MS #SPECTROMETRY #PROTEOMICS #SEARCH #NANOELECTROSPRAY #Biochemical Research Methods
Tipo

article

original article

publishedVersion