Looking for a package to detect and analyze the pairing of inline barcodes in merged reads
0
0
Entering edit mode
2.5 years ago
J.E. • 0

I have some fastq files with sequences which I reconstructed by merging forward and reverse reads using bbmerge. The 5' and 3' end of said reads must contain a 6bp inline barcode for contamination control.

I'm looking for a package to extract the 6bp at the beginning and end of the sequence, match them to my database of barcodes while allowing for a set number of nucleotide mismatched (in case of amplification errors), and report:

  1. If no recognizable barcodes are present in sequence.
  2. If barcodes are present and match. If there are mutations in the barcodes, the script must report the number of mismatches.
  3. If barcodes are present but do not match. Again, if there are mutations in the barcode, the script must report the number of mismatches.
  4. The size of the insert if barcodes are present.

I can write a bio python script to perform this procedure, but before reinventing the wheel I'd like to know if there a packages that already implemented and standardized the analyses of inline barcodes and perform the tasks described above.

inline illumina barcoding • 516 views
ADD COMMENT

Login before adding your answer.

Traffic: 3077 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6