Approaches for containerized scientific workflows in cloud environments with applications in life science
Spjuth O, Capuccini M, Carone M, Larsson A, Schaal W, Novella J, Stein JA, Ekmefjord M, Di Tommaso P, Floden E, Notredame C, Moreno P, Khoonsari PE, Herman S, Kultima K, Lampa S.
Approaches for containerized scientific workflows in cloud environments with applications in life science.
Preprints. 2020, 2020010378 (2020). DOI: 10.20944/preprints202001.0378.v1
Containers are gaining popularity in life science research as they encompass all dependencies of provisioned tools and simplifies software installations for end users, as well as offering a form of isolation between processes. Scientific workflows are ideal to chain containers into data analysis pipelines to sustain reproducible science. In this manuscript we review the different approaches to use containers inside the workflow tools Nextflow, Galaxy, Pachyderm, Luigi, and SciPipe when deployed in cloud environments. A particular focus is placed on the workflow tool’s interaction with the Kubernetes container orchestration framework.