Entering edit mode
9.3 years ago
eem0306
•
0
I tried to convert 'strelka result file(I already convert strelka.result.file.sam to .bam)' to fastq file.
But, before doing this, I had to sort strelka.result.file.bam
.
So I did,
@samtools sort -n N20.result.sorted.h.bam N20.result.sorted.h.bam.qsort
and then, I got this message
[bam_translate] PG tag "bwa.1.2" on read "C09DFACXX111207:2:2105:7467:160838" encountered with no corresponding entry in header, tag lost
[bam_translate] PG tag "bwa.1.2" on read "C09DFACXX111207:2:2105:7459:162052" encountered with no corresponding entry in header, tag lost
[bam_translate] PG tag "bwa.1.2" on read "C09DFACXX111207:2:2105:7696:2958" encountered with no corresponding entry in header, tag lost
[bam_translate] PG tag "bwa.1.2" on read "C09DFACXX111207:2:2105:7459:163515" encountered with no corresponding entry in header, tag lost
[bam_translate] PG tag "bwa.1.2" on read "C09DFACXX111207:2:2105:7473:193148" encountered with no corresponding entry in header, tag lost
Please, help me
OK.. I'll keep going anyway! Thanks always~!! :)
Anyway, I sorted the bam file and then make fastq file using this sorted bamfile.
Consequently, I got 2 fastq files such as
N80.result.sorted.h.qsort.end1.fq
, andN80.result.sorted.h.qsort.end2.fq
So, what I wonder is the size of the fastq files that are half of the sorted bam file.
Is this because the sorted bam file was splited into 2 paired-end fastq files? or I got the wrong result?
There's no simple relationship between a BAM file size and the size of the fastq version of the alignments in it. The BAM files is compressed, which means that the uncompressed fastq files could easily be larger than it. However, the BAM file also has more information in it and how well it compresses will depend on how everything is structured and arranged.
Just as an example, I'm currently looking at a 2.6GB BAM file. Converting that to fastq produces two 5.5GB uncompressed files (they're about 1.5GB each when compressed).