find motif locations in the genome
2
1
Entering edit mode
8.1 years ago
igor 13k

There are many motif-finding tools. Usually they compare a set of sequences against a set of motifs and give you the top occurring motifs. However, I have a specific motif in mind and I would like to find the positions where it occurs in the genome (account for mismatches, of course). Is there a tool that will do that?

motif • 4.3k views
ADD COMMENT
0
Entering edit mode
8.1 years ago
GenoMax 141k

Recent thread that may work: Finding specific k-mer in human genome

ADD COMMENT
0
Entering edit mode

Doesn't k-mer imply that the sequence has to be an exact match?

ADD REPLY
0
Entering edit mode

fuzznuc, EMBOSS program referenced in the thread allows you to specify search patterns (with mismatches): http://emboss.sourceforge.net/apps/cvs/emboss/apps/fuzznuc.html I assume you know the sequence of the motif you want to search. You can specify search patterns based on that.

ADD REPLY
0
Entering edit mode
8.1 years ago

The command-line version of UCSC BLAT can be used locally with -minMatch and -minIdentity options to account for mismatches. It exports a PSL file, which contains position information and can be converted into other formats for operations.

ADD COMMENT
0
Entering edit mode

I think blast also should work, doesn't it?

If not what might be the possible problems

ADD REPLY
0
Entering edit mode

I think BLAT will give you a bit more control over the number of allowed mismatches and other settings. It also directly outputs positional information, which I do not believe blast does without extra work.

ADD REPLY
0
Entering edit mode

I think BLAT needs a lot more modifications. Setting -minIdentity=60 and -minMatch=1 fails to match a 20bp sequence with just 1 mismatch.

ADD REPLY

Login before adding your answer.

Traffic: 2015 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6