How denovo assemblers handle N's in reads?
1
0
Entering edit mode
7.2 years ago
saranpons3 ▴ 70

Hello, I would like to know that how N's in reads are handled by denovo assemblers when they assemble the reads? Thanks in advance.

genome assembly • 1.2k views
ADD COMMENT
0
Entering edit mode
7.2 years ago

Depends on the assembler, but typically they are ignored. So, for example, a 101bp read with an N right smack in the middle, if you were assembling with a kmer length of 60, would yield zero kmers since there are no valid 60bp sequences that do not contain an N. Whereas a N-free 101bp read would yield 42 60-mers. A 101bp read with an N at the end would yield 41 60-mers.

ADD COMMENT
0
Entering edit mode

Thanks for your reply Brian.

It means all the reads(101bp) which are having N's right smack in the middle and if my kmer size is 60, then these reads are of no use because they yield zero kmers? Am i right?

What would happen to reads if they have consecutive N's not in the middle and not at the end but at different part of the reads? Should my assembler not consider the k-mers which are having N's? Even a k-mer contains a single N should be not considered?

Thanks in advance.

ADD REPLY
1
Entering edit mode

It depends on the choices of the assembler author, but generally, all kmers containing even a single N are disregarded.

ADD REPLY

Login before adding your answer.

Traffic: 1512 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6