Entering edit mode
10.8 years ago
Whoknows
▴
960
Hi
I used samtools to extract uniquely mapped reads from a Tophat accepted_hits.bam file by command :
samtools view q 10 accepted_hits.bam > UniqueHits.bam
but during Cuffdiff procedure it makes this error
Cufflinks requires that if your file has SQ records in
the SAM header that they appear in the same order as the chromosomes names
in the alignments.
If there are no SQ records in the header, or if the header is missing,
the alignments must be sorted lexicographically by chromsome
name and by position.
- Is my samtools extraction command makes Sam file in output?? And if yes
Can I use below command for creating bam output?? (and then Sorted bam from that)
samtools view -b -q 10 accepted_hits.bam > Unique.bam samtools sort Unique.bam Unique.sorted
Thanks
My
accepted_hits.bamsize is 2.2GB butUnique.bamis 8.6 GB so I'm not sure it could be correct,If
Unique.bamis sorted, why cuffdiff makes error?Without the
-bflag, that's a SAM file. Tophat defaults to sorting alignments, so unless you told it not to do that thenaccepted_hits.bamis already sorted (and filtering a sorted file with samtools will always produce a sorted file).I'm not sure maybe it because of my genome sequence headers, my genome is Wheat and they might have long header for each sequence