Question: How does pilon treat single N characters in polishing genome?
gravatar for ewilbanks
8 months ago by
ewilbanks0 wrote:

Hi folks,

Does anyone know how pilon treats single N characters?

I'm using pilon and some illumina data to polish a pacbio assembled genome with some single N characters as ambiguous bases. I'm confused about how pilon considers these. For many of N instances there should be good support to correct this to an A, C, T, or G, but these aren't being touched by my current attempts. Ideas? Pilon is correcting other ambiguous bases (e.g. R, Y, K) to the correct base, but is ignoring Ns. These single Ns aren't gaps, but ambiguous bases from assembling together overlapping contigs using Geneious's assembler.

The command I'm running is:

java -Xmx120g -jar ~/software/anaconda2/pkgs/pilon-1.22-1/share/pilon-1.22-1/pilon-1.22.jar \
    --genome ref.fasta \
    --frags aln.sorted.bam \
    --unpaired u.sorted.bam \
    --changes --vcf --tracks \
    --threads 16 \
    --fix bases,amb \
    --outdir pilon_02
ADD COMMENTlink modified 8 months ago • written 8 months ago by ewilbanks0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1116 users visited in the last hour