Question

Creating Unique Fasta Headers

0

Entering edit mode

8.1 years ago

peerlesshm • 0

hi,

I have a fast file similar to below; I have already run sed -n and down a regular expression in text wrangler to remove the headers leaving just a > followed by the sequence. The file is pretty massive and i want to know how to make each identifier unique either by using a simple python script or some other command (Not including awk due to assignment restrictions) - I've considered using linenumber = linenumber +1 for a count but I'm unsure.

I want each header to be something like s1234 with the being s1235, s1236 etc or something rather.

Thanks!

>
AGCTCAGATGCTGATCGATAGACTAG
>
GATGCTAGCTAGCTAGATCGATCGAT
>
ACGACTACAGATAGTAGATGATAGAC

fasta python shell • 2.4k views

ADD COMMENT • link updated 12 months ago by Ram 43k • written 8.1 years ago by peerlesshm • 0

score 0 · Answer 1 · 2016-03-20

0

Entering edit mode

8.1 years ago

igor 13k

I think fastx_renamer from the FASTX-Toolkit will do that: http://hannonlab.cshl.edu/fastx_toolkit/commandline.html#fastx_renamer_usage

ADD COMMENT • link 8.1 years ago by igor 13k