Purely based on required header information and data/stats pulled from the BAM file, is there any way to guess the sequencing platform (454, Illumina, Ion Torrent, etc.) used to generate data for a BAM file? Does a tool already exist that does this?
So far all I can find is average read lengths and number produced, and error rate, which vary from platform to platform. Also I thought encoding quality may be useful in this guess too. I found this How To Determine The Version Used To Generate Solexa/Illumina Fastq Files? to be useful, though this too is just a guess of what the encoding could be.
Any ideas of other stats that may be useful would be extremely appreciated.