Manageable Workflows for Processing Parallel Sequencing Data


Autoria(s): Krachunov, Milko; Kulev, Ognyan; Simeonova, Valeriya; Nisheva, Maria; Vassilev, Dimitar
Data(s)

03/02/2015

03/02/2015

2014

Resumo

ACM Computing Classification System (1998): D.2.11, D.1.3, D.3.1, J.3, C.2.4.

Data analysis after parallel sequencing is a process that uses combinations of software tools that is often subject to experimentation and on-the-fly substitution, with the necessary file conversion. This article presents a developing system for creating and managing workflows aiding the tasks one encounters after parallel sequences, particularly in the area of metagenomics. The semantics, description language and software implementation aim to allow the creation of flexible, configurable workflows that are suitable for sharing and are easy to manipulate through software or by hand. The execution system design provides user-defined operations and interchangeability between an operation and a workflow. This allows significant extensibility, which can be further complemented with distributed computing and remote management interfaces.

Identificador

Serdica Journal of Computing, Vol. 8, No 1, (2014), 1p-14p

1312-6555

http://hdl.handle.net/10525/2426

Idioma(s)

en

Publicador

Institute of Mathematics and Informatics Bulgarian Academy of Sciences

Palavras-Chave #Next-Generation Sequencing #Metagenomics #Workflow Design #Data Analysis #YAML
Tipo

Article