Question

How to find out the reverse complement of DNA from each FASTA formated sequence file in a directory and generate a new reverse complement FASTA formated files for each of the input files?

0

Entering edit mode

9.1 years ago

Sumit ▴ 20

I have total 2000+ genome sequence files in a directory. I need reverse complement sequence for each of the files and want to generate FASTA formated reverse complement sequence file for each files in the directory.

genome sequence • 4.3k views

ADD COMMENT • link updated 22 months ago by Ram 43k • written 9.1 years ago by Sumit ▴ 20

2

Entering edit mode

Hi, welcome to Biostars. Look into Biopython, Bioperl etc.

ADD REPLY • link 9.1 years ago by Whetting ★ 1.6k

Ram · Accepted Answer · 2015-04-09

4

Entering edit mode

9.1 years ago

Biomonika (Noolean) 3.2k

for file in *.fasta; do seqtk seq -r ${file} > ${file}_revC; done;

not tested, should work, please install seqtk: https://github.com/lh3/seqtk

ADD COMMENT • link updated 22 months ago by Ram 43k • written 9.1 years ago by Biomonika (Noolean) 3.2k

0

Entering edit mode

Alternatively (-l60 to specify fasta line length):

ls *.fasta | sed s,.fasta,, | xargs -i echo seqtk seq -r -l60 {}.fasta \> {}.rev | sh

ADD REPLY • link updated 22 months ago by Ram 43k • written 9.1 years ago by lh3 33k

0

Entering edit mode

I am grateful to you for your help.. Its work....

ADD REPLY • link 9.1 years ago by Sumit ▴ 20