Regarding fasta file format
1
1
Entering edit mode
8.4 years ago
seta ★ 1.9k

Hi all,

I have a basic question, I recently got an excel file with two columns, one is gene id and another is protein sequence. I want to make a blast database from these sequences. After copy/pasting those in the Notepad (I know copy/pasting is a wrong task), they are like here:

>839263    MASGGKAKYIIGALIGSFGISYIFDKVISDNKIFGGTTPGTVSNK
>837071    MASLLDKAKDFVADKLTAIPKPEGSVTDVDLKDVNRDSVEYLAKV

Could you please help me out how I can change the excel file to a text file with the correct fasta format?

Thanks

fasta • 2.0k views
ADD COMMENT
1
Entering edit mode

"Notpad": I like this better than the original name. "Not" useful for much.

ADD REPLY
3
Entering edit mode
8.4 years ago
sed -e 's/ \+/\n/' input.txt > output.fa
ADD COMMENT
0
Entering edit mode

Thanks for your response. But the format was not changed, and the error of "BLAST options error: file.fa does not match input format type, default input type is FASTA" still appears when I try to make blast database.

ADD REPLY
1
Entering edit mode

Presumably you have a tab between the columns rather than a space, so change the space in the command to \t.

ADD REPLY
0
Entering edit mode

yes, it's tab, I try it. thanks

ADD REPLY

Login before adding your answer.

Traffic: 1949 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6