Finding homology stretches in FASTA files

8.0 years ago

blasco • 0

I would like to find short stretches of sequence (i.e. 18-20 nt) that are present in both of two FASTA files. The idea is to identify such matches between sequences from otherwise distant organisms or distant metagenomes. I have seen programs that look for similar reads, but those would not identify short shared stretches within the reads. Is there any program that can do that?

next-gen sequencing • 1.2k views


Sounds like you are looking to identify prevalent k-mers (k = 18-20) in your sequences. kmercountexact.sh from BBMap may be worth looking at.

$ kmercountexact.sh in=file.fasta out=counts.txt fastadump=f k=20 overwrite=t
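The command above counts k-mers within a single file; to get the stretches common to both files, the k-mer sets from each file still have to be intersected. As a rough illustration of that idea (not part of BBMap, and not how kmercountexact.sh works internally), here is a minimal Python sketch. The function names are made up for this example, and it assumes plain DNA FASTA input, treats a k-mer and its reverse complement as the same stretch, and skips any window containing characters other than A, C, G, T.

```python
# Sketch only: exact shared k-mers between two FASTA inputs.
# All function names here are hypothetical, chosen for illustration.

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line.upper())
    if header is not None:
        yield header, "".join(chunks)


def canonical(kmer):
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def kmer_set(lines, k=20):
    """Collect the canonical k-mers of every sequence in a FASTA input."""
    valid = set("ACGT")
    kmers = set()
    for _, seq in read_fasta(lines):
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            # Assumption: windows with N or other ambiguity codes are ignored.
            if set(kmer) <= valid:
                kmers.add(canonical(kmer))
    return kmers


def shared_kmers(lines_a, lines_b, k=20):
    """Return the canonical k-mers present in both FASTA inputs."""
    return kmer_set(lines_a, k) & kmer_set(lines_b, k)
```

With two open file handles fa and fb (for example from hypothetical files a.fasta and b.fasta), shared_kmers(fa, fb, k=20) returns the set of 20-mers found in both. Both sets are held in memory, so this is only practical for small inputs; for metagenome-sized data a dedicated k-mer counter is the more realistic route.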

8.0 years ago by GenoMax • 141k
