[samopen] SAM header is present: xxx sequences.
1
1
Entering edit mode
9.3 years ago

Hi,

I am using the line below to convert SAM to BAM and then sort them:

samtools view -bS $mySAM | samtools sort -n -m 6000000000 - $myBAM

Eventhough the process seem to be generating an output, I receive the following log:

[bam_header_read] EOF marker is absent. The input is probably truncated.
[samopen] SAM header is present: 211 sequences.

Is the fact that I am inputting an aligned RNASeq.sam file and it indicates that 211 sequences have SAM header something I need to worry about?

I appreciate your feedback,

Thanks,
Noushin

samtools sam bam • 7.0k views
ADD COMMENT
3
Entering edit mode
9.3 years ago

The "EOF marker is absent" line is a bug in samtools that always occurs when using piped input. Just ignore it.

ADD COMMENT
0
Entering edit mode

Thanks a lot Brian! Is it also normal that only 211 sequences in SAM file appear to have SAM headers?

ADD REPLY
3
Entering edit mode

The number of sequences with headers depends on the reference, not the reads. So for example if you are working with the human genome, excluding auxiliary stuff like unplaced contigs, samtools will report 25 sequences - 22 autosomes, 2 sex chromosomes, and one mitochondrial sequence. This is a result of the reference, and independent of your reads.

ADD REPLY
0
Entering edit mode

Thank you very much for clarifying! That totally makes sense now.

ADD REPLY

Login before adding your answer.

Traffic: 3230 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6