Strange characters ^? in fasta format sequence file
1
0
Entering edit mode
5.0 years ago
860101959 ▴ 10

Recently, I received a fasta format sequence file from one of my colleges, But there are some strange characters like ^? in the sequence , does anyone knows why and how can I delete these characters? Because there are a lot of ^? in sequences and I don't want to delete manually.

I tried to recognize these characters using vim by \^\? , ^? or \^? but failed. Since the data is output of MEGA maybe there is some reasons in there.

The sequence is like this: ^?MRATGEKRVLQLHELDEFCLDSYENAKIYKEKTERWHNRHIREKEIEVGQQVLMFNSHLKLFSGKLKSRWSGSFTVVAVFPHSKLERIAEDLLIE

sequence • 1.7k views
ADD COMMENT
2
Entering edit mode
5.0 years ago

what is the output of

file your.fasta

must be : ASCII test

ADD COMMENT
0
Entering edit mode

apart from Pierre Lindenbaum remark, does the file contain the typical fasta header lines (starting with > followed by some text denoting the sequence ID/name) ?

ADD REPLY
0
Entering edit mode

Thanks, I think it is encoding problem.

ADD REPLY
0
Entering edit mode

If an answer was helpful you should upvote it, if the answer resolved your question you should mark it as accepted.

Upvote|Bookmark|Accept

ADD REPLY

Login before adding your answer.

Traffic: 1972 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6