Question: Masking short sequences between gaps in genome assembly
Asked 12 months ago by pbigbig200 (United States):

Hi,

I have assembled a genome, using Illumina paired-end reads for the assembly and mate-pair reads for scaffolding. In the resulting FASTA files, I notice patterns like this:

...NNNNNNNNNNNNNNNNGTGTGTAGGATCTCACNNNNNNNNNNNNNNNNNNNNNN...

I would like to hard-mask those small "island" sequences between gaps when they are shorter than a defined maximum length (e.g. mask if < 200 bp). Could you please give some suggestions?

Thank you very much in advance!

genome assembly masking

When you say masking, do you mean removing it?

written 12 months ago by Titus890

Hi, I mean hard-masking it, which would turn any A, C, G, or T into N.

written 12 months ago by pbigbig200
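
One possible way to do the hard-masking described above, as a minimal sketch in plain Python (standard library only). The function names mask_islands and mask_fasta, the 60-column line width, and the file names in the usage note are made up for illustration. The sketch assumes that a gap is any run of N or n, that an "island" is a run of any other characters with a gap on both sides, and that sequence at the very start or end of a scaffold should be left alone because it is not between two gaps.

```python
import re

# A maximal run of non-gap characters with a gap character (N or n)
# immediately before and immediately after it.
ISLAND = re.compile(r"(?<=[Nn])[^Nn]+(?=[Nn])")


def mask_islands(seq, max_len=200):
    """Turn every gap-flanked island shorter than max_len into N's."""
    def repl(match):
        island = match.group(0)
        # Keep the length unchanged so scaffold coordinates stay valid.
        return "N" * len(island) if len(island) < max_len else island
    return ISLAND.sub(repl, seq)


def read_fasta(path):
    """Yield (header, sequence) pairs from a FASTA file."""
    header, chunks = None, []
    with open(path) as handle:
        for line in handle:
            line = line.rstrip()
            if line.startswith(">"):
                if header is not None:
                    yield header, "".join(chunks)
                header, chunks = line, []
            elif header is not None:
                chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def mask_fasta(in_path, out_path, max_len=200, width=60):
    """Write a copy of in_path with short islands hard-masked."""
    with open(out_path, "w") as out:
        for header, seq in read_fasta(in_path):
            masked = mask_islands(seq, max_len)
            out.write(header + "\n")
            # Re-wrap the sequence to a fixed line width.
            for start in range(0, len(masked), width):
                out.write(masked[start:start + width] + "\n")
```

For example, mask_fasta("scaffolds.fasta", "scaffolds.masked.fasta", max_len=200) would write a masked copy and leave the input untouched. Each scaffold is held in memory as one string, which should be fine for typical assemblies but is worth keeping in mind for very large sequences.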