Question: How to make AGP file with gap sequence?
gravatar for mskim483
21 months ago by
mskim4830 wrote:

Hi all,

I want to make AGP file from assembled scaffold fasta file. But one of my scaffold have gap, poly-N sequence in front of their sequence.


So, fasta2agp programs show this messeage,

Illegal characters in sequence

Are there any possible ways or programs that can make .agp files with front gaps of scaffolds?

genome software error • 787 views
ADD COMMENTlink modified 11 months ago by ido.idobar10 • written 21 months ago by mskim4830

Not that I know of. It then also make little to no sense to start a scaffold with a 'gap' ...

You can of course simply modify it manually if needed

ADD REPLYlink written 21 months ago by lieven.sterck7.8k

If you submit an AGP file related to fasta sequences having trailing Ns it will not be accepted. At least at EBI from my experience. So you must remove them.

ADD REPLYlink written 11 months ago by Juke344.1k
gravatar for ido.idobar
11 months ago by
ido.idobar10 wrote:

For anyone reaching to this thread with the same issue, I've updated the script to remove N's at the start of scaffolds (which makes little sense as mentioned before by mskim483).
It can be downloaded from here:
Or directly into your shell with: wget --no-check-certificate --content-disposition

ADD COMMENTlink modified 11 months ago • written 11 months ago by ido.idobar10
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1203 users visited in the last hour