Approaches for containerized scientific workflows in cloud environments with applications in life science
Published: 2021-06-29
Formatted citation
Spjuth O, Capuccini M, Carone M, Larsson A, Schaal W, Novella J, Stein JA, Ekmefjord M, Di Tommaso P, Floden E, Notredame C, Moreno P, Khoonsari PE, Herman S, Kultima K, Lampa S.
Approaches for containerized scientific workflows in cloud environments with applications in life science.
F1000Research.
10, 513 (2021).
DOI: 10.12688/f1000research.53698.1
Abstract
Containers are gaining popularity in life science research as they provide a solution for encompassing dependencies of provisioned tools, simplify software installations for end users and offer a form of isolation between processes. Scientific workflows are ideal for chaining containers into data analysis pipelines to aid in creating reproducible analyses. In this article, we review a number of approaches to using containers as implemented in the workflow tools Nextflow, Galaxy, Pachyderm, Argo, Kubeflow, Luigi and SciPipe, when deployed in cloud environments. A particular focus is placed on the workflow tool’s interaction with the Kubernetes container orchestration framework.