Question: BWA mapping with selected mismatch allowed?
gravatar for lim24m
3.0 years ago by
lim24m0 wrote:

This was a problem presented to me from a biologist: I have .fastq file with about 1 million DNA short reads. The design of the DNA templates that I sent for sequencing was something like: ... ...GGTATNNNNNNNNNATGT... ... where the N's are randomized sequence of 9 nucleotide bases, A T G or C.

I have to align them right now, either de novo or to a reference genome (we have the reference genome) without looking at the 9 randomized sequence of bases.

How should I go about this? What tools can I use? Are there any existing DNA/RNA alignment tools out there that can do this for me?

Thank you!

rna-seq alignment • 927 views
ADD COMMENTlink modified 3.0 years ago • written 3.0 years ago by lim24m0

Do these randomized nucleotides have a meaning? Something UMI like? Do you still need them downstream?

ADD REPLYlink written 3.0 years ago by WouterDeCoster39k
  • Are the nine nucleotides barcodes to identify samples?
  • Maybe you want to trim or split your sequences to remove those nucleotides?
  • Concerning the title of the question: There is a parameter to allow a certain number of mismatches in bwa aln (see -n), but not in bwa mem.
ADD REPLYlink written 3.0 years ago by dschika290
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 961 users visited in the last hour