Question: Masking short sequences between gaps in genome assembly
pbigbig190 (United States) wrote, 7 months ago:

Hi,

I have assembled a genome, using Illumina paired-end reads for assembly and mate-pair reads for scaffolding. In the resulting FASTA files, I notice patterns like this:

...NNNNNNNNNNNNNNNNGTGTGTAGGATCTCACNNNNNNNNNNNNNNNNNNNNNN...

I would like to hard-mask those small "island" sequences between gaps when they are shorter than a defined maximum length (e.g. mask if < 200 bp). Could you please give some suggestions?

Thank you very much in advance!


When you say masking, do you mean removing it?

written 7 months ago by Titus770

Hi, I mean hard-masking it, which would turn any A, C, G, or T into N.

written 7 months ago by pbigbig190
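
One possible approach, given the clarification that hard-masking means turning the bases into N: a minimal Python sketch that finds every run of non-N characters with an N directly on both sides and overwrites it with N if it is shorter than the cutoff. The function names (mask_short_islands, read_fasta, mask_fasta) and the 60-column line width are made up for illustration. It also assumes that any single N counts as a gap, that every other character (including IUPAC ambiguity codes) counts as sequence, and that runs touching the start or end of a scaffold are left alone because they are not between two gaps.

```python
import re

# A maximal run of non-N characters with an N directly before and after it.
# Assumption: any single N counts as a gap, and every other character
# (including IUPAC ambiguity codes) counts as sequence.
ISLAND = re.compile(r"(?<=[Nn])[^Nn]+(?=[Nn])")


def mask_short_islands(seq, max_len=200):
    """Replace gap-flanked runs shorter than max_len with N."""
    def repl(match):
        island = match.group()
        return "N" * len(island) if len(island) < max_len else island
    return ISLAND.sub(repl, seq)


def read_fasta(path):
    """Yield (header, sequence) pairs from a FASTA file."""
    header, chunks = None, []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if header is not None:
                    yield header, "".join(chunks)
                header, chunks = line[1:], []
            elif header is not None:
                chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def mask_fasta(in_path, out_path, max_len=200, width=60):
    """Write a copy of in_path with short islands hard-masked."""
    with open(out_path, "w") as out:
        for header, seq in read_fasta(in_path):
            masked = mask_short_islands(seq, max_len)
            out.write(f">{header}\n")
            # Re-wrap the sequence to fixed-width lines.
            for i in range(0, len(masked), width):
                out.write(masked[i:i + width] + "\n")
```

For a whole assembly, something like mask_fasta("assembly.fasta", "assembly.masked.fasta", max_len=200) would write a masked copy (both file names are placeholders). It is worth comparing the total N count before and after on a test scaffold to confirm that only the intended islands were changed.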