non-standard fasta header format
7.7 years ago
teone182 • 0

Hi there!

I've tried googling my problem, but I couldn't find a proper answer or a meaningful explanation.

The vast majority of the sequences in my ABySS contig FASTA files have the standard header format (e.g. >217 1452 43433).

A few of them have a non-standard header like the following: >23892451 612 30983 145290-,2958386+,6879596+ or >23914100 434 17555 1186186+,...,8272178+

I would be really grateful if you could help me with this and tell me how I should interpret these headers.

Many thanks in advance!

Matteo

abyss assembly • 1.5k views
7.7 years ago
agata88 ▴ 870

See this: https://github.com/bcgsc/abyss/wiki/ABySS-File-Formats#fa

Look at the ADJ format among the ABySS file formats. It seems the FASTA file is connected to the ABySS ADJ file, which is why you have extra information in the headers. It is described in the link above.
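To make the extra field easier to inspect, here is a minimal Python sketch that splits such a header into its parts. It assumes the first three fields are an ID, a length and a coverage value, and that the optional fourth field is a comma-separated list of IDs with a `+` or `-` suffix; those field names and the function name `parse_header` are my own labels for illustration, not something confirmed in this thread. Tokens that are not a signed ID, such as the `...` in the second example header, are kept verbatim.

```python
import re

# Assumed layout: ">ID LENGTH COVERAGE [LINKS]". The field names are labels
# chosen for this sketch, not taken from the ABySS documentation.
SIGNED_ID = re.compile(r"^(\d+)([+-])$")


def parse_header(line):
    """Split one FASTA header line into a dict of its fields."""
    fields = line.lstrip(">").split()
    record = {
        "id": fields[0],
        "length": int(fields[1]),
        "coverage": int(fields[2]),
        "links": [],
    }
    if len(fields) > 3:
        for token in fields[3].split(","):
            match = SIGNED_ID.match(token)
            if match:
                # (numeric ID, orientation sign)
                record["links"].append((match.group(1), match.group(2)))
            else:
                # Anything else (e.g. "...") is kept as-is, without a sign.
                record["links"].append((token, None))
    return record


print(parse_header(">23892451 612 30983 145290-,2958386+,6879596+"))
```

Headers in the standard three-field format simply come back with an empty `links` list.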

Hope it helps :)

Best, Agata


Hi Agata! Thanks for the quick reply and suggestion! I checked the link you posted. It basically says that the additional "non-standard" header fields are the IDs of sequences that overlap with the subject sequence. So far, so good, that makes sense. However, when I look for the overlapping sequence IDs (in my example, say 145290) in my whole-assembly FASTA file, they do not exist, and that should not be possible if the "non-standard" fields follow the ADJ specification. Right? Any other thoughts? Maybe the ABySS developers know the trick here! ;) Take care, Matteo
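For anyone who wants to repeat this check over a whole file rather than by hand, a rough Python sketch follows. It assumes the same layout as the example headers above (ID first, optional fourth field holding comma-separated IDs with a `+` or `-` suffix); the function name `missing_linked_ids` is hypothetical. It only reports which referenced IDs have no header of their own in the same file and does not explain why they are absent.

```python
def missing_linked_ids(fasta_lines):
    """Return IDs referenced in a header's fourth field but absent as records."""
    present = set()
    referenced = set()
    for line in fasta_lines:
        if not line.startswith(">"):
            continue  # sequence line, not a header
        fields = line[1:].split()
        if not fields:
            continue
        present.add(fields[0])
        if len(fields) > 3:
            for token in fields[3].split(","):
                contig_id = token.rstrip("+-")
                # Skip tokens that are not plain numeric IDs, e.g. "...".
                if contig_id.isdigit():
                    referenced.add(contig_id)
    return referenced - present


example = [
    ">217 1452 43433",
    "ACGT",
    ">23892451 612 30983 145290-,217+",
    "ACGT",
]
print(sorted(missing_linked_ids(example)))
```

To run it on a real assembly, pass an open file handle instead of the list, for example `missing_linked_ids(open(path))` with `path` pointing at your contig FASTA file.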
