Demultiplexing asymmetrically barcoded reads from multiplex amplicon nanopore sequencing
0
0
Entering edit mode
7 months ago

I've got a fastq file containing multiplexed amplicon sequencing on ONT, so reads are unpaired and can be sense or antisense. The amplicons are asymmetrically barcoded, meaning distinct samples within the fastq file can be distinguished by the forward barcode at one end and the reverse barcode at the other. The structure of the amplicon is this:

pad - forward barcode - forward adapter - target sequence - reverse adapter - reverse barcode - pad

I've been playing around with cutadapt, but can only get it to demultiplex and name based on one barcode.

micromamba activate bioinfo
BARCODES="path/to/barcodes.fasta"
FASTQ="path/to/data.fastq"
OUT="path/to/temp"
cutadapt -e 0.1 \
    -g file:"$BARCODES" \
    -a file:"$BARCODES" \
    -o "$OUT/demux-{name}.fastq" \
    "$FASTQ" > "$OUT/cutadapt_log.txt"

Ultimately I'm hoping to get it to demultiplex using both F and R barcodes, e.g.: demux-fbc01-rbc01.fastq demux-fbc02-rbc01.fastq etc.

If it's possible to also trim off the barcodes, leaving the target sequence only, that would also be great.

Any help much appreciated!

ONT demultiplex • 388 views
ADD COMMENT

Login before adding your answer.

Traffic: 4353 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6