GATK "We encountered a non-standard non-IUPAC base in the provided reference: '13"
0
0
Entering edit mode
8.5 years ago
Phismil ▴ 20

Hello Team,

I am trying to run GATK

java -jar GenomeAnalysisTK.jar -T RealignerTargetCreator -R reference.fasta -I header.bam -o realigned.intervals

My reference is a trinity assembled genome with GO terms in fasta header and already indexed. I get error:

Bad input: We encountered a non-standard non-IUPAC base in the provided reference: '13

Any idea what is going wrong?

Regards

GATK • 6.7k views
ADD COMMENT
3
Entering edit mode

There's a non-standard non-IUPAC base in your reference.fasta? Output of:

grep -v '^>' reference.fasta | sed 's/[a,A,g,G,c,C,t,T,u,U]//g'

?

Alternatively, it could be related to the encoding of your file, in which case e.g. dos2unix could be a solution. Google knows a lot about your problem..

ADD REPLY
2
Entering edit mode

This will be due to a \r (ascii 13), so you nailed it with dos2unix.

ADD REPLY
0
Entering edit mode

Thanks for your suggestions !

ADD REPLY

Login before adding your answer.

Traffic: 1437 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6