Supplementary Information : Hogan, Holland, Holloway, Petit and Read : Read Classification for Next Generation Sequencing, ESANN 2013, April 2013


Autoria(s): Hogan, James M.
Resumo

This item provides supplementary materials for the paper mentioned in the title, specifically a range of organisms used in the study. The full abstract for the main paper is as follows: Next Generation Sequencing (NGS) technologies have revolutionised molecular biology, allowing clinical sequencing to become a matter of routine. NGS data sets consist of short sequence reads obtained from the machine, given context and meaning through downstream assembly and annotation. For these techniques to operate successfully, the collected reads must be consistent with the assumed species or species group, and not corrupted in some way. The common bacterium Staphylococcus aureus may cause severe and life-threatening infections in humans,with some strains exhibiting antibiotic resistance. In this paper, we apply an SVM classifier to the important problem of distinguishing S. aureus sequencing projects from alternative pathogens, including closely related Staphylococci. Using a sequence k-mer representation, we achieve precision and recall above 95%, implicating features with important functional associations.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/57694/

Relação

http://eprints.qut.edu.au/57694/1/HogHol2PetReaSupp.pdf

Hogan, James M. Supplementary Information : Hogan, Holland, Holloway, Petit and Read : Read Classification for Next Generation Sequencing, ESANN 2013, April 2013. (Unpublished)

Direitos

Copyright 2013 The Author

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #060102 Bioinformatics #080100 ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING #080301 Bioinformatics Software #next generation sequencing #support vector machines
Tipo

Other