Question

Find occurrence of short pattern in fasta with several sequences and ouput it to bam

1

Entering edit mode

8.2 years ago

mschmid ▴ 180

I have a 20 nucleotide pattern. Now I want to find ALL of its occurences in a fasta file with about 200 sequences.

Then I want to ouput the positions either to a gff or into a sam file to be able to load into IGV. What is the most efficient way to do that?

It can have about 1-2 indels and about 2-4 SNPs. It is just important to find where the approximate pattern occurs often...

alignment • 1.3k views

ADD COMMENT • link updated 8.2 years ago by Sean Davis 26k • written 8.2 years ago by mschmid ▴ 180

score 2 · Accepted Answer · 2016-02-24

2

Entering edit mode

8.2 years ago

Sean Davis 26k

How about good old blat or gmap?

ADD COMMENT • link 8.2 years ago by Sean Davis 26k