Is Gap Closing a Mandatory Step for SOAPdenovo Assembly?
12.9 years ago
toshnam ▴ 650

Hi all,

I assembled paired-end genome sequences using SOAPdenovo, and got some scaffold sequences.

I found the GapCloser program, which is designed to close the gaps that emerge during SOAPdenovo's scaffolding step, and ran it on my scaffold sequences.

However, I found that GapCloser only trimmed the 'N's in the scaffold sequences. Should I remove the 'N' sequence from the scaffolds? Is it a mandatory step?

Thanks in advance for your comments.

• 6.5k views
12.9 years ago
Benm ▴ 710

GapCloser re-maps the PE/MP reads to the scaffolds. If one end of a PE/MP pair maps to real sequence (not 'N') and the other end falls in a gap (the run of Ns), it uses the reads that map into the gap to construct the gap's sequence. That is the 'GapCloser' process. So you can remove the 'N' sequence from the scaffolds.
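To check what GapCloser actually changed, it can help to count the N runs in a scaffold before and after running it. Below is a minimal Python sketch, not part of SOAPdenovo or GapCloser; the function names `gap_runs` and `gap_summary` and the two toy sequences are made up for illustration.

```python
import re

def gap_runs(sequence):
    """Return (start, length) for every run of N/n in a scaffold sequence."""
    return [(m.start(), m.end() - m.start())
            for m in re.finditer(r"[Nn]+", sequence)]

def gap_summary(sequence):
    """Count the gaps and the total number of gap bases in one scaffold."""
    runs = gap_runs(sequence)
    return {"gaps": len(runs), "gap_bases": sum(length for _, length in runs)}

# Hypothetical toy scaffolds: the first gap is filled in `after`,
# the second gap is still open.
before = "ACGTNNNNNACGTTNNNGGA"
after = "ACGTACGGTACGTTNNNGGA"

print(gap_summary(before))
print(gap_summary(after))
```

For a real assembly you would apply `gap_summary` to each record of the scaffold FASTA file. Gaps that remain after GapCloser are the ones it could not fill from the reads.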


Are you saying GapCloser is a mandatory step for PE reads because a PE assembly can include gap sequence?


Most gaps should be repeats or regions with no or low sequencing coverage, so their coverage depth may be higher or lower than average. If a gap is a repeat, it generates too many bubbles; if it has low coverage, there are not enough k-mers to build contigs effectively. Either way, the sequence in gaps can't be constructed with the de Bruijn graph algorithm. GapCloser is a corrective tool that allows low-confidence reads to be mapped to the 'reference' sequence, and it masks exact repeats identified from the shotgun data. It is similar to the algorithm of "RePS", which was also developed by BGI.
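The low-coverage case can be illustrated with a toy k-mer count. This is only a sketch of the general idea, not SOAPdenovo's actual implementation; the reads, `k`, and the `min_count` cutoff are all made-up values, and the repeat/bubble case is not covered.

```python
from collections import Counter

def kmer_counts(reads, k):
    """Count every k-mer occurring in the reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts

def solid_kmers(reads, k, min_count):
    """Keep only k-mers seen at least min_count times (hypothetical cutoff)."""
    return {kmer for kmer, n in kmer_counts(reads, k).items() if n >= min_count}

# Hypothetical reads: the left part of the region is covered several times,
# the right part by a single read only.
reads = ["ACGTTGCA", "ACGTTGCA", "CGTTGCAA", "GCAAGGCT"]

solid = solid_kmers(reads, k=4, min_count=2)
print(sorted(solid))
```

The k-mers that occur only in the single read on the right (for example `AAGG`) fall below the cutoff, so a graph built from the remaining k-mers stops there. That is the kind of region that ends up as Ns in the scaffold.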


Thank you for your kind reply.
