Off topic: Trim the ID line of a fasta
6.2 years ago
SaltedPork ▴ 170

I have a fasta file with some very silly IDs (straight from NCBI) that look like this:

>KF859747.1 HIV-1 isolate DEURF09UG005 from Uganda gag protein (gag) gene, complete cds; pol protein (pol) gene, partial cds; vif protein (vif), vpr protein (vpr), tat protein (tat), rev protein (rev), vpu protein (vpu), and envelope glycoprotein (env) genes, complete cds; and nonfunctional nef protein (nef) gene, complete sequence

If all I want in my fasta is the >KF859747 part, are there any nifty one-liners to do this on the command line?
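
To make the goal concrete, here is a minimal sketch in Python of one way the trimming could work. It assumes every header should be cut at the first "." or whitespace character, so the version suffix (".1") is dropped along with the description. The function name `trim_fasta_ids` and the short sequence line are made up for illustration; they are not from the original file.

```python
import re

# Captures ">" plus everything up to the first "." or whitespace character.
_HEADER_ID = re.compile(r"^(>[^.\s]+)")


def trim_fasta_ids(lines):
    """Yield FASTA lines with each header cut down to its bare accession."""
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            match = _HEADER_ID.match(line)
            # Headers with no ID right after ">" are left untouched.
            if match:
                line = match.group(1)
        yield line


# Hypothetical two-line record: shortened header, placeholder sequence.
example = [
    ">KF859747.1 HIV-1 isolate DEURF09UG005 from Uganda gag protein (gag) gene, complete cds\n",
    "ATGGGTGCGAGAGCGTCAGTA\n",
]
trimmed = list(trim_fasta_ids(example))
print("\n".join(trimmed))
```

Sequence lines pass through unchanged. One caveat with this approach: if the file holds several versions of the same accession (say .1 and .2), dropping the version suffix would leave duplicate IDs.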

fasta regex • 968 views