Manageable Workflows for Processing Parallel Sequencing Data
Data(s) |
03/02/2015
03/02/2015
2014
|
---|---|
Resumo |
ACM Computing Classification System (1998): D.2.11, D.1.3, D.3.1, J.3, C.2.4. Data analysis after parallel sequencing is a process that uses combinations of software tools that is often subject to experimentation and on-the-fly substitution, with the necessary file conversion. This article presents a developing system for creating and managing workflows aiding the tasks one encounters after parallel sequences, particularly in the area of metagenomics. The semantics, description language and software implementation aim to allow the creation of flexible, configurable workflows that are suitable for sharing and are easy to manipulate through software or by hand. The execution system design provides user-defined operations and interchangeability between an operation and a workflow. This allows significant extensibility, which can be further complemented with distributed computing and remote management interfaces. |
Identificador |
Serdica Journal of Computing, Vol. 8, No 1, (2014), 1p-14p 1312-6555 |
Idioma(s) |
en |
Publicador |
Institute of Mathematics and Informatics Bulgarian Academy of Sciences |
Palavras-Chave | #Next-Generation Sequencing #Metagenomics #Workflow Design #Data Analysis #YAML |
Tipo |
Article |