match, mismatch, gap parameters for short DNA motifs
0
0
Entering edit mode
6.1 years ago
odoluca ▴ 20

Hello all,

I have millions of short DNA sequences (20-150 nt) and I suspect some of these show similarity. I would like to compare them using pairwise alignment method, however, I am having trouble in determining which match score, mismatch penalty and gap penalty (linear method) I should use. My research didnt result any guideline for such short sequences, so any help is appreciated.

motif pairwise alignment DNA • 2.2k views
ADD COMMENT
1
Entering edit mode

Are these in fasta format? You may want to use CD-HIT to reduce redundancy instead of trying to do pair-wise alignments.

ADD REPLY
0
Entering edit mode

I checked the CD-HIT. Its webservers don't allow any file bigger than 50 MBs. So I have to compile myself. However, I already coded a pairwise comparison algorithm, so I want to stick with it, in which otherwise would take longer. Thanks anyways.

ADD REPLY

Login before adding your answer.

Traffic: 2850 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6