
Title: Developing Scalable Bioinformatics Workflows on the Cancer Genomics Cloud

Presenter: Dr. Jeffrey Grover

Title: Genomics Scientist

Organization: Seven Bridges

Abstract

The Cancer Genomics Cloud (CGC) is a cloud-based bioinformatics ecosystem supported by the National Cancer Institute (NCI). The CGC allows users to run computational workflows defined in the Common Workflow Language (CWL) on a wealth of large datasets, in place, in the cloud. Users may also upload their own data and take advantage of the scalability of cloud computing for their data analysis. In addition to the hundreds of publicly available bioinformatics workflows in the CGC Public Apps Gallery, users can employ a variety of methods to develop their own. These include an integrated graphical user interface for creating workflows, as well as an ecosystem of tools enabling local development and automated deployment of workflows to the CGC. We will detail how to develop efficient workflows for the CGC and how to apply best practices such as version control and continuous integration when working with the CGC, using publicly available tools developed by Seven Bridges.

Resources:

https://www.cancergenomicscloud.org/

https://rabix.io/

https://github.com/rabix/sb-ci

https://github.com/rabix/benten

https://github.com/rabix/sbpack

Presentation Recording

Slides

Created by Alan Zheng. Last modified Fri November 12, 2021 5:33 pm by Alan Zheng.