I recently helped the Rockman lab basecall their MinION sequencing data on the Prince HPC, leveraging the power of the GPUs available there. This allowed us to bring the total time required for basecalling down to around five hours, from the two weeks(!) it was going to take on the desktop.

Since more people are beginning to perform MinION sequencing here at the Center for Genomics and Systems Biology, I thought it would be helpful to share the procedure for basecalling with GPUs on the HPC.

First, you’ll need to transfer your data to the HPC. I recommend rsync for this if you’re on a Mac. If you’re using Windows, I suggest WinSCP. You’ll also need to know the flowcell and kit that were used for sequencing (see the table at the end of this post for the full list of options), and lastly the output path where you want the basecalled fastq files to go.
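As a rough illustration of the rsync step, the sketch below copies a local fast5 directory to scratch space on the HPC. The login host, NetID, and both paths are placeholders I have assumed for the example, so substitute your own. By default it only prints the command so you can check it before anything is transferred.

```shell
#!/bin/bash
# Placeholder values (assumptions for illustration): replace with your own.
NETID="your_netID"
HPC_HOST="prince.hpc.nyu.edu"                        # assumed login host
LOCAL_DIR="$HOME/minion_runs/run1/fast5"             # assumed local data folder
REMOTE_DIR="/scratch/${NETID}/minion/run1/fast5"     # assumed destination on the HPC

# Dry-run wrapper: with DRY_RUN=1 (the default) the command is printed,
# with DRY_RUN=0 it is executed.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$@"
  else
    "$@"
  fi
}

# -a keeps the directory structure and timestamps, -v lists files as they copy,
# -P shows progress and keeps partial files so an interrupted transfer can resume.
# The trailing slash on the source copies the folder's contents, not the folder itself.
run rsync -avP "${LOCAL_DIR}/" "${NETID}@${HPC_HOST}:${REMOTE_DIR}/"
```

If this were saved as transfer.sh (a hypothetical name), running `bash transfer.sh` would print the command, and `DRY_RUN=0 bash transfer.sh` would start the transfer.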

Copy the script below to your local directory, modify the first four parameters (leave --device "auto" intact), then submit it to Slurm like this:

sbatch script-name.s
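For orientation, here is a minimal sketch of what a Slurm submission script for GPU basecalling with Guppy might look like. It is not the original script from this post: the resource requests, the module name, the e-mail address, and all four paths and run parameters are assumptions made for illustration. The guppy_basecaller options used (--input_path, --save_path, --flowcell, --kit, --device) are standard Guppy options.

```shell
#!/bin/bash
#SBATCH --job-name=guppy_basecall
#SBATCH --nodes=1
#SBATCH --cpus-per-task=4
#SBATCH --gres=gpu:1
#SBATCH --mem=32GB
#SBATCH --time=12:00:00
#SBATCH --mail-type=BEGIN,END,FAIL
#SBATCH --mail-user=your_netID@nyu.edu

# Illustrative placeholders (assumptions): edit these four values for your run.
INPUT_PATH="/scratch/your_netID/minion/run1/fast5_pass"
SAVE_PATH="/scratch/your_netID/minion/run1/basecalled"
FLOWCELL="FLO-MIN106"
KIT="SQK-LSK109"

# Hypothetical module name; check `module avail` for the real one on your cluster.
GUPPY_MODULE="guppy/gpu"

# Dry-run wrapper: with DRY_RUN=1 (the default in this sketch) each command is
# printed instead of executed, so the script can be checked before submission.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$@"
  else
    "$@"
  fi
}

run module purge
run module load "$GUPPY_MODULE"

# --device "auto" lets Guppy pick the GPU that Slurm allocated to the job.
run guppy_basecaller \
  --input_path "$INPUT_PATH" \
  --save_path "$SAVE_PATH" \
  --flowcell "$FLOWCELL" \
  --kit "$KIT" \
  --device "auto"
```

Because this sketch defaults to printing its commands, running it with `bash script-name.s` previews the basecalling command, and submitting it with `sbatch --export=ALL,DRY_RUN=0 script-name.s` would run it for real.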

If you have multiple fast5 directories (for example: fast5_pass and fast5_skip), you can combine the fast5 files into one directory, or you can run the script twice, providing a different input path each time.
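One way to combine the directories is sketched below. It assumes a run folder containing fast5_pass and fast5_skip subdirectories and copies their fast5 files into a new fast5_all directory (a name chosen here for illustration). It also assumes file names are unique across the two directories; cp -n skips any name that already exists rather than overwriting it.

```shell
#!/bin/bash
# combine_fast5 RUN_DIR
# Copies *.fast5 files from RUN_DIR/fast5_pass and RUN_DIR/fast5_skip into
# RUN_DIR/fast5_all, then prints the number of files in the combined directory.
combine_fast5() {
  run_dir="$1"
  dest="$run_dir/fast5_all"
  mkdir -p "$dest"
  for sub in fast5_pass fast5_skip; do
    # Skip a subdirectory that does not exist for this run.
    [ -d "$run_dir/$sub" ] || continue
    # -n: never overwrite a file that is already in the destination.
    find "$run_dir/$sub" -type f -name '*.fast5' -exec cp -n {} "$dest/" \;
  done
  find "$dest" -type f -name '*.fast5' | wc -l
}

# Example call (hypothetical path):
# combine_fast5 /scratch/your_netID/minion/run1
```

The combined directory can then be given as the single input path. Copying doubles the disk space used, so running the script twice with different input paths may be preferable for large runs.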

If you’re doing RNA sequencing, you need to provide the --reverse_sequence argument as well.

The script above should notify you via email when it begins, ends, or runs into problems, but you can also track its status using:

watch squeue -u your_netID

Questions? E-mail me (mk5636) or post in the comments below.

Available Flowcell + Kit Combinations

Flowcell    Kit
FLO-MIN106  SQK-RNA001
FLO-MIN106  SQK-RNA002
FLO-MIN107  SQK-RNA001
FLO-MIN107  SQK-RNA002
FLO-PRO001  SQK-LSK109
FLO-PRO001  SQK-DCS109
FLO-PRO001  SQK-PCS109
FLO-PRO002  SQK-LSK109
FLO-PRO002  SQK-DCS109
FLO-PRO002  SQK-PCS109
FLO-MIN107  SQK-DCS108
FLO-MIN107  SQK-DCS109
FLO-MIN107  SQK-LRK001
FLO-MIN107  SQK-LSK108
FLO-MIN107  SQK-LSK109
FLO-MIN107  SQK-LSK308
FLO-MIN107  SQK-LSK309
FLO-MIN107  SQK-LSK319
FLO-MIN107  SQK-LWP001
FLO-MIN107  SQK-PCS108
FLO-MIN107  SQK-PCS109
FLO-MIN107  SQK-PSK004
FLO-MIN107  SQK-RAD002
FLO-MIN107  SQK-RAD003
FLO-MIN107  SQK-RAD004
FLO-MIN107  SQK-RAS201
FLO-MIN107  SQK-RLI001
FLO-MIN107  VSK-VBK001
FLO-MIN107  VSK-VSK001
FLO-MIN107  SQK-LWB001
FLO-MIN107  SQK-PBK004
FLO-MIN107  SQK-RAB201
FLO-MIN107  SQK-RAB204
FLO-MIN107  SQK-RBK001
FLO-MIN107  SQK-RBK004
FLO-MIN107  SQK-RLB001
FLO-MIN107  SQK-RPB004
FLO-MIN107  VSK-VMK001
FLO-PRO001  SQK-RNA002
FLO-PRO002  SQK-RNA002
FLO-MIN106  SQK-DCS108
FLO-MIN106  SQK-DCS109
FLO-MIN106  SQK-LRK001
FLO-MIN106  SQK-LSK108
FLO-MIN106  SQK-LSK109
FLO-MIN106  SQK-LWP001
FLO-MIN106  SQK-PCS108
FLO-MIN106  SQK-PCS109
FLO-MIN106  SQK-PSK004
FLO-MIN106  SQK-RAD002
FLO-MIN106  SQK-RAD003
FLO-MIN106  SQK-RAD004
FLO-MIN106  SQK-RAS201
FLO-MIN106  SQK-RLI001
FLO-MIN106  VSK-VBK001
FLO-MIN106  VSK-VSK001
FLO-MIN106  SQK-RBK001
FLO-MIN106  SQK-RBK004
FLO-MIN106  SQK-RLB001
FLO-MIN106  SQK-LWB001
FLO-MIN106  SQK-PBK004
FLO-MIN106  SQK-RAB201
FLO-MIN106  SQK-RAB204
FLO-MIN106  SQK-RPB004
FLO-MIN106  VSK-VMK001
