Title: Developing Scalable Bioinformatics Workflows on the Cancer Genomics Cloud
Presenter: Dr. Jeffrey Grover
Title: Genomics Scientist
Organization: Seven Bridges
Abstract:
The Cancer Genomics Cloud (CGC) is a cloud-based bioinformatics ecosystem supported by the National Cancer Institute (NCI). The CGC allows users to run computational workflows defined in the Common Workflow Language (CWL) on a wealth of large datasets, in place, in the cloud. Users may also upload their own data and take advantage of the scalability of cloud computing for their analyses. In addition to the hundreds of publicly available bioinformatics workflows in the CGC Public Apps Gallery, users can develop their own in several ways. These include an integrated graphical user interface for creating workflows, as well as an ecosystem of tools that enable local development and automated deployment of workflows to the CGC. We will detail how to develop efficient workflows for the CGC and how to apply best practices such as version control and continuous integration, using publicly available tools developed by Seven Bridges.
Resources:
https://www.cancergenomicscloud.org/
https://github.com/rabix/sb-ci
https://github.com/rabix/benten
https://github.com/rabix/sbpack
Presentation Recording