An Efficient Parallel Algorithm for Multiple Sequence Similarities Calculation Using a Low Complexity Method


Autoria(s): Marucci, Evandro A.; Zafalon, Geraldo F. D.; Momente, Julio C.; Neves, Leandro A.; Valencio, Carlo R.; Pinto, Alex R.; Cansian, Adriano M.; Souza, Rogeria C. G. de; Yang Shiyou; Machado, Jose M.
Contribuinte(s)

Universidade Estadual Paulista (UNESP)

Data(s)

18/03/2015

18/03/2015

01/01/2014

Resumo

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Processo FAPESP: 06/59592-0

With the advance of genomic researches, the number of sequences involved in comparative methods has grown immensely. Among them, there are methods for similarities calculation, which are used by many bioinformatics applications. Due the huge amount of data, the union of low complexity methods with the use of parallel computing is becoming desirable. The k-mers counting is a very efficient method with good biological results. In this work, the development of a parallel algorithm for multiple sequence similarities calculation using the k-mers counting method is proposed. Tests show that the algorithm presents a very good scalability and a nearly linear speedup. For 14 nodes was obtained 12x speedup. This algorithm can be used in the parallelization of some multiple sequence alignment tools, such as MAFFT and MUSCLE.

Formato

6

Identificador

http://dx.doi.org/10.1155/2014/563016

Biomed Research International. New York: Hindawi Publishing Corporation, 6 p., 2014.

2314-6133

http://hdl.handle.net/11449/117235

10.1155/2014/563016

WOS:000340143200001

WOS000340143200001.pdf

WOS000340143200001.epub

Idioma(s)

eng

Publicador

Hindawi Publishing Corporation

Relação

Biomed Research International

Direitos

openAccess

Tipo

info:eu-repo/semantics/article