Closed:How to edit large txt file and how to find commons
0
0
Entering edit mode
18 months ago
dimitrischat ▴ 140

Hello all. I got a large text file with sequence names(only names) like this:

>TRINITY_DN7758_c0_g1_i11_len_752_path_[8_0-295_10_296-520_12_521-751]

and i would like to find a command in terminal to change the the above names in that text file to this:

>TRINITY_DN7758_c0_g1_i11 len=752

So, i would have a text file with only sequence names like the above(the edited one). I want to subtract from a fasta file that contains these sequence names+amino acid sequence, in order to keep only the ones that are in the text file.

>TRINITY_DN7758_c0_g1_i11 len=752 path=[8:0-295 10:296-520 12:521-751]
CTGTGAAATGGAGGAATATGCGGTTAAGAAAGGAAAACCATGCTACATAAATTCTC.........

Which commands could i use in terminal in oder to do that?

Thanks in advance!

sequencing assembly • 88 views
ADD COMMENT
This thread is not open. No new answers may be added
Traffic: 1360 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6