Tag: minion

  • GPU-Accelerated MinION Basecalling On the HPC

    I recently helped the Rockman lab basecall their MinION sequencing data on the Prince HPC, using the GPUs available there. This brought the total basecalling time down to around five hours, from the two weeks(!) it was going to take on the desktop.

    Since more people are beginning to perform MinION sequencing here at the Center for Genomics and Systems Biology, I thought it would be helpful to share the procedure for basecalling with GPUs on the HPC.

    First, you’ll need to transfer your data to the HPC. I recommend rsync for this if you’re on a Mac. If you’re using Windows, I suggest WinSCP. You’ll also need to know the flowcell and kit that were used for sequencing (see the table at the end of this post for the full list of options), and lastly the output path where you want the basecalled fastq files to go.
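
    As an illustration of the transfer step, an rsync call might look like the sketch below. The login host, NetID, and directory names are assumptions made for the example, not values from this post, so substitute your own. The snippet only prints the command so you can check it first; uncomment the last line to actually run it.

```shell
# Hypothetical placeholders: replace with your own NetID, host, and paths.
NETID="your_netID"
HPC_HOST="prince.hpc.nyu.edu"            # assumed login host
LOCAL_DIR="$HOME/minion_run/fast5_pass"  # assumed local fast5 directory
REMOTE_DIR="/scratch/$NETID/minion_run/" # assumed destination on the HPC
REMOTE_TARGET="$NETID@$HPC_HOST:$REMOTE_DIR"

# -a          recurse and preserve timestamps/permissions
# -v          list files as they are transferred
# --partial   keep partially transferred files so a rerun can resume them
# --progress  show per-file progress
# LOCAL_DIR has no trailing slash, so the fast5_pass directory itself is
# created inside REMOTE_DIR.
echo "rsync -av --partial --progress \"$LOCAL_DIR\" \"$REMOTE_TARGET\""

# Uncomment to perform the transfer:
# rsync -av --partial --progress "$LOCAL_DIR" "$REMOTE_TARGET"
```

    If the transfer is interrupted, running the same command again should pick up where it left off rather than starting over.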

    Copy the script below to your working directory on the HPC, modify the first four parameters (leave --device "auto" intact), then submit it to Slurm like this: sbatch script-name.s
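
    A minimal sketch of a job script of this shape follows. It rests on several assumptions: that the basecaller is Oxford Nanopore's guppy_basecaller (this post does not name the program), that the four parameters to edit are the input path, output path, flowcell, and kit, and that one GPU with the time and memory shown is a reasonable request. The paths, job name, e-mail address, and the commented-out module line are all hypothetical.

```shell
#!/bin/bash
#SBATCH --job-name=minion_basecall
#SBATCH --nodes=1
#SBATCH --cpus-per-task=4
#SBATCH --gres=gpu:1
#SBATCH --time=12:00:00
#SBATCH --mem=32GB
#SBATCH --mail-type=BEGIN,END,FAIL
#SBATCH --mail-user=your_netID@nyu.edu

# --- Edit these four parameters (all values here are placeholders) ---
INPUT_PATH="/scratch/your_netID/minion_run/fast5_pass"
SAVE_PATH="/scratch/your_netID/minion_run/basecalled"
FLOWCELL="FLO-MIN106"
KIT="SQK-LSK109"

# Assumption: the basecaller is guppy_basecaller and is already on PATH.
# On many clusters you would first load it with something like:
# module load guppy   # hypothetical module name
BASECALLER="guppy_basecaller"

if command -v "$BASECALLER" >/dev/null 2>&1; then
    # For RNA runs, add the --reverse_sequence argument here as well.
    "$BASECALLER" \
        --input_path "$INPUT_PATH" \
        --save_path "$SAVE_PATH" \
        --flowcell "$FLOWCELL" \
        --kit "$KIT" \
        --device "auto"
else
    # Report the problem in the job log instead of failing silently.
    echo "$BASECALLER not found on PATH; load the appropriate module first" >&2
fi
```

    The #SBATCH lines are read by Slurm at submission time and are plain comments to the shell. The --mail-type setting requests e-mail when the job begins, ends, or fails.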

    If you have multiple fast5 directories (for example: fast5_pass and fast5_skip), you can combine the fast5 files into one directory, or you can run the script twice, providing a different input path each time.
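
    If you go the combining route, one way to do it is sketched below. The function name combine_fast5 and the destination directory name are made up for this example. It copies rather than moves, so the original directories stay untouched, and it skips any file whose name already exists in the destination instead of overwriting it.

```shell
# Copy every .fast5 file from one or more source directories into a single
# destination directory.
# Usage: combine_fast5 DEST SRC [SRC...]
combine_fast5() {
    dest="$1"
    shift
    mkdir -p "$dest"
    for src in "$@"; do
        # Only regular files ending in .fast5 are considered.
        find "$src" -type f -name '*.fast5' | while IFS= read -r f; do
            target="$dest/$(basename "$f")"
            if [ -e "$target" ]; then
                # Same file name seen before: warn and keep the first copy.
                echo "skipping $f: $target already exists" >&2
            else
                cp "$f" "$target"
            fi
        done
    done
}

# Example call (directory names as in the text above):
# combine_fast5 fast5_all fast5_pass fast5_skip
```

    You would then point the input path of the job script at the combined directory.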

    If you’re doing RNA sequencing, you need to provide the --reverse_sequence argument as well.

    The script above should notify you via email when it begins, ends, or if there are any problems, but you can also track its status using:

    watch squeue -u your_netID

    Questions? E-mail me (mk5636) or post in the comments below.

    Available Flowcell + Kit Combinations

    Flowcell    Kit
    FLO-MIN106  SQK-RNA001
    FLO-MIN106  SQK-RNA002
    FLO-MIN107  SQK-RNA001
    FLO-MIN107  SQK-RNA002
    FLO-PRO001  SQK-LSK109
    FLO-PRO001  SQK-DCS109
    FLO-PRO001  SQK-PCS109
    FLO-PRO002  SQK-LSK109
    FLO-PRO002  SQK-DCS109
    FLO-PRO002  SQK-PCS109
    FLO-MIN107  SQK-DCS108
    FLO-MIN107  SQK-DCS109
    FLO-MIN107  SQK-LRK001
    FLO-MIN107  SQK-LSK108
    FLO-MIN107  SQK-LSK109
    FLO-MIN107  SQK-LSK308
    FLO-MIN107  SQK-LSK309
    FLO-MIN107  SQK-LSK319
    FLO-MIN107  SQK-LWP001
    FLO-MIN107  SQK-PCS108
    FLO-MIN107  SQK-PCS109
    FLO-MIN107  SQK-PSK004
    FLO-MIN107  SQK-RAD002
    FLO-MIN107  SQK-RAD003
    FLO-MIN107  SQK-RAD004
    FLO-MIN107  SQK-RAS201
    FLO-MIN107  SQK-RLI001
    FLO-MIN107  VSK-VBK001
    FLO-MIN107  VSK-VSK001
    FLO-MIN107  SQK-LWB001
    FLO-MIN107  SQK-PBK004
    FLO-MIN107  SQK-RAB201
    FLO-MIN107  SQK-RAB204
    FLO-MIN107  SQK-RBK001
    FLO-MIN107  SQK-RBK004
    FLO-MIN107  SQK-RLB001
    FLO-MIN107  SQK-RPB004
    FLO-MIN107  VSK-VMK001
    FLO-PRO001  SQK-RNA002
    FLO-PRO002  SQK-RNA002
    FLO-MIN106  SQK-DCS108
    FLO-MIN106  SQK-DCS109
    FLO-MIN106  SQK-LRK001
    FLO-MIN106  SQK-LSK108
    FLO-MIN106  SQK-LSK109
    FLO-MIN106  SQK-LWP001
    FLO-MIN106  SQK-PCS108
    FLO-MIN106  SQK-PCS109
    FLO-MIN106  SQK-PSK004
    FLO-MIN106  SQK-RAD002
    FLO-MIN106  SQK-RAD003
    FLO-MIN106  SQK-RAD004
    FLO-MIN106  SQK-RAS201
    FLO-MIN106  SQK-RLI001
    FLO-MIN106  VSK-VBK001
    FLO-MIN106  VSK-VSK001
    FLO-MIN106  SQK-RBK001
    FLO-MIN106  SQK-RBK004
    FLO-MIN106  SQK-RLB001
    FLO-MIN106  SQK-LWB001
    FLO-MIN106  SQK-PBK004
    FLO-MIN106  SQK-RAB201
    FLO-MIN106  SQK-RAB204
    FLO-MIN106  SQK-RPB004
    FLO-MIN106  VSK-VMK001