Find occurrence of short pattern in fasta with several sequences and ouput it to bam
1
1
Entering edit mode
8.2 years ago
mschmid ▴ 180

I have a 20 nucleotide pattern. Now I want to find ALL of its occurences in a fasta file with about 200 sequences.

Then I want to ouput the positions either to a gff or into a sam file to be able to load into IGV. What is the most efficient way to do that?

It can have about 1-2 indels and about 2-4 SNPs. It is just important to find where the approximate pattern occurs often...

alignment • 1.3k views
ADD COMMENT
2
Entering edit mode
8.2 years ago

How about good old blat or gmap?

ADD COMMENT

Login before adding your answer.

Traffic: 1870 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6