Masking (NNN...) in published sequence: caused by low read count or by repetitiveness? How to tell which?
7.8 years ago
michael.nagle ▴ 100

Runs of N (NNN...) can appear in place of nucleotides either where the number of reads is too low or, when masking is enabled, over repetitive regions, right?

How can I figure out which of the two is the cause for a given region?

genome masking • 1.4k views
7.8 years ago

I guess the easiest way would be to check, e.g. in the UCSC Genome Browser, whether the masked region corresponds to an element in the RepeatMasker track. You can locate the region by BLATing the unmasked sequence that flanks it. (That is fine unless you need to check more than a few regions, in which case you would want to automate it.)
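If you do need to automate it, a minimal sketch in Python might look like the following. It assumes you have already extracted the coordinates of the N runs and exported the repeat annotations (for instance from the RepeatMasker track) as 0-based, half-open intervals on the same sequence. The function names, the example coordinates, and the 0.9 threshold are all made up for illustration.

```python
def covered_fraction(run, repeats):
    """Fraction of the half-open interval `run` covered by intervals in `repeats`.

    Overlapping repeat intervals are only counted once.
    """
    run_start, run_end = run
    covered = 0
    cursor = run_start  # everything left of the cursor is already counted
    for rep_start, rep_end in sorted(repeats):
        start = max(rep_start, cursor)
        end = min(rep_end, run_end)
        if end > start:
            covered += end - start
            cursor = end
    return covered / (run_end - run_start)


def label_run(run, repeats, threshold=0.9):
    """Rough label for one N run; the threshold is an arbitrary assumption."""
    if covered_fraction(run, repeats) >= threshold:
        return "mostly covered by annotated repeats"
    return "not explained by annotated repeats"


# Hypothetical coordinates, not taken from any real assembly.
repeat_intervals = [(100, 400), (350, 600), (2000, 2100)]
n_runs = [(120, 580), (1000, 1100)]

for run in n_runs:
    print(run, round(covered_fraction(run, repeat_intervals), 2),
          label_run(run, repeat_intervals))
```

A run that the repeat annotation covers almost completely is a good candidate for repeat masking. A run with little or no overlap needs another explanation.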

7.8 years ago
Michael 54k

Most N's in scaffolded assemblies are probably due to gaps between assembled contigs: the contigs have been joined into the same scaffold, but the intervening sequence between them has not been determined.
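One way to get a feel for this is to list the N runs in a sequence and tabulate their lengths. Scaffold gaps are often written with a fixed placeholder length, so many runs of exactly the same length hint at gaps rather than masking. Treat that as a heuristic, not a rule. The sketch below is plain Python with a toy sequence; the function names are my own, and for a real assembly you would read the sequence from the FASTA file first.

```python
import re
from collections import Counter


def find_n_runs(sequence, min_length=1):
    """Return 0-based, half-open (start, end) coordinates of each run of N/n."""
    runs = []
    for match in re.finditer(r"[Nn]+", sequence):
        if match.end() - match.start() >= min_length:
            runs.append((match.start(), match.end()))
    return runs


def run_length_histogram(runs):
    """Count how many runs there are of each length."""
    return Counter(end - start for start, end in runs)


# Toy sequence for illustration only.
example = "ACGT" + "N" * 10 + "ACGTAC" + "N" * 10 + "GG" + "NNN" + "T"
runs = find_n_runs(example)
print(runs)                        # [(4, 14), (20, 30), (32, 35)]
print(run_length_histogram(runs))  # two runs of length 10, one of length 3
```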
