THIS IS A PROOF-OF-CONCEPT REPOSITORY THAT IS UNDER ACTIVE DEVELOPMENT. SYNTAX, ORGANISATION AND LAYOUT MAY CHANGE WITHOUT NOTICE!
nf-dragen is a simple, proof-of-concept pipeline to run the Illumina DRAGEN licensed suite of tools.
The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It uses Docker containers making installation trivial and results highly reproducible. This pipeline has only been tested on AWS Batch.
We have streamlined the process of deploying Nextflow workflows that utilise Illumina DRAGEN on AWS Batch via Tower.
You will need to obtain the following information from the Illumina DRAGEN team:
- Private AMI id in an AWS region with DRAGEN F1 instance availability
- Username to run DRAGEN on the CLI via Nextflow
- Password to run DRAGEN on the CLI via Nextflow
Please see the dragen.nf module implemented in this pipeline for reference.
Any Nextflow processes calling the dragen
command must have:
-
label dragen
(see docs). This is how Tower will determine which processes need to be specifically executed on DRAGEN F1 instances.process DRAGEN { label 'dragen' <truncated> }
-
secret DRAGEN_USERNAME
andsecret DRAGEN_PASSWORD
(see docs). These Secrets will be provided securely to the--lic-server
option when running DRAGEN on the CLI to validate the license.process DRAGEN { secret 'DRAGEN_USERNAME' secret 'DRAGEN_PASSWORD' <truncated> script: """ /opt/edico/bin/dragen \\ --lic-server=\$DRAGEN_USERNAME:\$DRAGEN_PASSWORD@license.edicogenome.com \\ <other_options> """ }
You can use Tower Forge to automatically create a separate AWS Batch queue with dedicated F1 instances to run DRAGEN.
In the Tower UI, go to Compute Environments
-> Add Compute Environment
and fill in the appropriate settings for your AWS Batch environment. Additionally, you will be able to paste your private DRAGEN AMI id as shown in the image below:
Click on Add
to create the Compute Environment.
Please ensure that the
Region
you select contains DRAGEN F1 instances.
As outlined in this blog you can add Secrets to Tower to safely encrypt the username and password information required to run DRAGEN via Nextflow.
In the Tower UI, go to Secrets
-> Add Pipeline Secret
and add both of the Secrets as shown in the images below:
DRAGEN_USERNAME
DRAGEN_PASSWORD
In the Tower UI, go to Launchpad
-> Add Pipeline
. Fill in the appropriate details to add your pipeline and ensure that the Compute Environment and Secrets you created previously are both defined for use by the pipeline:
Click on Add
to create the pipeline and launch it when you are ready!
nf-dragen was originally written by Harshil Patel and Graham Wright and Paolo Di Tommasso, Seqera Labs.
The nf-core pipeline template was used to create the skeleton of this pipeline but there are no plans to contribute it to nf-core at this point.
You can cite the nf-core
publication as follows:
The nf-core framework for community-curated bioinformatics pipelines.
Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.
Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.