Skip to content

Nextflow logo

Tutorial Nextflow Workflows

We have a collection of notebooks using Nextflow in Gen3 here: https://github.com/uc-cdis/bio-nextflow/tree/master/nextflow_notebooks

Before you start executing any tutorial notebooks, review the information in the Get Set section.

Get set: Download necessary credentials and software

Be ready to execute the tutorial workflows below by gathering credentials and installing necessary software.

Get and replace placeholder values from the Nextflow config

You can find the values to replace the placeholders in thequeue, jobRole and workDir fields in the nextflow.config file in your Nextflow workspace (this is found in /data directory, unless you have moved it to the /pd). These placeholder values will need to be replaced in each of the various tutorial Nextflow notebooks.

Note that you should only copy/paste the value to replace placeholder for each field; do not copy/paste larger sections of the nextflow config, or there could be indentation problems that interfere with the code.

MIDRC credentials

To download GUIDs in the workspace, you will first need to generate a MIDRC credentials on the profile page of the MIDRC portal. For this, please go to data.midrc.org, click on the user icon in the right corner (#1), and open the Profile page (#2). Click on Create API Key (#3). A pop-up window will appear with the key. If you scroll down slightly, you can see the button to download the credentials as a JSON. Credentials are valid for 1 month.

Screenshot showing how to find and save MIDRC credentials

Example Nextflow notebooks

Notebooks with no containers

There are several general Nextflow notebooks that do not use containers at all. If you're new to nextflow and just want to get started with workflow commands, try these notebooks: https://github.com/uc-cdis/bio-nextflow/tree/master/nextflow_notebooks/non_containerized_nextflow_workflows

Notebooks using containers

We have several containerized notebooks using CPU, and other containerized notebooks using GPU.

Notebooks using CPU

You can find the directory with containerized notebooks using CPU here: https://github.com/uc-cdis/bio-nextflow/tree/master/nextflow_notebooks/containerized_cpu_workflows

Some use cases in this directory include:

  • Cancer use case: The chip_cancer example tutorial here includes a Dockerfile, a requirements file, a Nextflow notebook, a collection of python scripts, and a README.
  • DICOM metadata extraction use case: The midrc_batch_demo example tutorial here includes a Dockerfile, a requirements file, two Nextflow notebooks, a collection of python scripts, and a README.

Notebooks using GPU

You can find the directory with containerized notebooks using GPU here: https://github.com/uc-cdis/bio-nextflow/tree/master/nextflow_notebooks/containerized_gpu_workflows

Some use cases in this directory include:

  • Pytorch/cuda test simple use case: The torch_cuda_test example tutorial here includes a Dockerfile, a requirements file, a Nextflow notebook, a simple python script, and a README.
  • COVID Challenge 2022 use case: The covid_challenge_container example tutorial here includes a Dockerfile, a requirements file, a Nextflow notebook, a python script, and a README.