Question: Converting TCGA Bam files to fastq: Picard does not work!
gravatar for jonessara770
21 months ago by
jonessara770130 wrote:


I am trying to convert bam files from TCGA to fastq. Picard gives the following error:

picard.sam.SamToFastq done. Elapsed time: 0.78 minutes.
To get help, see
Exception in thread "main" picard.PicardException: Illegal mate state: H090WADXX130325:1:1106:10520:95300
    at picard.sam.SamToFastq.assertPairedMates(
    at picard.sam.SamToFastq.doWork(
    at picard.cmdline.CommandLineProgram.instanceMain(
    at picard.sam.SamToFastq.main(

The error is due to more than one pair of reads having the same query name.

it has been suggested to use bedtools bamtofastq. This produce the fastq files, however there are duplicated read names that makes my pipeline to crash in downstream steps…

I also tested “resolvepair” script but it does not produce anything…

I would like to either remove these duplicate read names or rename them. Do you have a solution to solve this issue?


sequencing wes • 835 views
ADD COMMENTlink modified 5 months ago by rmh19950 • written 21 months ago by jonessara770130

did you use the latest version of picard ? did you use VALIDATION_STRINGENCY=LENIENT ?

ADD REPLYlink written 21 months ago by Pierre Lindenbaum119k

Thanks for your reply! I ran it again with this version (picard-2.9.0/picard.jar SamToFastq VALIDATION_STRINGENCY=LENIENT) but get the same error.

ADD REPLYlink written 21 months ago by jonessara770130

Do you know which aligner created the BAM? I have found BAM to FASTQ conversion almost impossible in some cases of BAMs originating from RNA-SEQ. I believe its related to conflicting interpretations of mates and pairs.

ADD REPLYlink written 21 months ago by jomo018470

yes, these are aligned by BWA meme

ADD REPLYlink written 21 months ago by jonessara770130

In BWA site I see this Q/A:

With BWA-MEM/BWA-SW, my tools are complaining about multiple primary alignments. Is it a bug? It is not. Multi-part alignments are possible in the presence of structural variations, gene fusion or reference misassembly. However, representing multi-part alignments in SAM has not been finalized. To make BWA work with your tools, please use option `-M' to flag extra hits as secondary.

I believe "SAM has not been finalized" for multi-part alignments is basically the "conflicting interpretations of mates and pairs".

ADD REPLYlink modified 21 months ago • written 21 months ago by jomo018470
gravatar for rmh1995
5 months ago by
rmh19950 wrote:

I understand this is very late, but I believe UNC has provided some code to solve this issue. UBU. GenoMax disucsses the problem in a little detail here I hope this helps anyone currently looking for this solution!

ADD COMMENTlink written 5 months ago by rmh19950
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1721 users visited in the last hour