12.8 years ago
Ya
First, the definition of the Sequence Alignment/Map (SAM) format. It is a TAB-delimited text format. Apart from the header lines, which start with the ‘@’ symbol, each alignment line consists of:
Column  Field  Description
1       QNAME  Query template/pair NAME
2       FLAG   bitwise FLAG
3       RNAME  Reference sequence NAME
4       POS    1-based leftmost POSition/coordinate of clipped sequence
5       MAPQ   MAPping Quality (Phred-scaled)
6       CIGAR  extended CIGAR string
7       MRNM   Mate Reference sequence NaMe (‘=’ if same as RNAME)
8       MPOS   1-based Mate POSition
9       LEN    inferred Template LENgth (insert size)
10      SEQ    query SEQuence on the same strand as the reference
11      QUAL   query QUALity (ASCII-33 gives the Phred base quality)
12      OPT    variable OPTional fields in the format TAG:VTYPE:VALUE
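To make the column layout concrete, here is a minimal Python sketch that splits one alignment line on TABs and names the fields as listed above. The helper names (`parse_sam_line`, `phred_scores`) and the example read are made up for illustration, and the sketch assumes the 11 fixed columns come first with any optional fields after them; it is not a replacement for a real SAM parser.

```python
# Field names as used in the column list above (illustrative, not a library API).
SAM_FIELDS = ["QNAME", "FLAG", "RNAME", "POS", "MAPQ", "CIGAR",
              "MRNM", "MPOS", "LEN", "SEQ", "QUAL"]
# Fields that hold integers in an alignment line.
INT_FIELDS = {"FLAG", "POS", "MAPQ", "MPOS", "LEN"}


def parse_sam_line(line):
    """Return a dict for one alignment line, or None for a header line."""
    if line.startswith("@"):
        return None  # header lines start with '@'
    cols = line.rstrip("\n").split("\t")
    if len(cols) < len(SAM_FIELDS):
        raise ValueError("expected at least 11 TAB-delimited columns")
    record = {}
    for name, value in zip(SAM_FIELDS, cols):
        record[name] = int(value) if name in INT_FIELDS else value
    # Everything after the fixed columns is an optional TAG:VTYPE:VALUE field.
    # Split on the first two ':' only, since VALUE may itself contain ':'.
    record["OPT"] = [tuple(field.split(":", 2)) for field in cols[len(SAM_FIELDS):]]
    return record


def phred_scores(qual):
    """Decode a QUAL string: ASCII code minus 33 gives the Phred base quality."""
    if qual == "*":  # '*' means no qualities were stored
        return []
    return [ord(ch) - 33 for ch in qual]


# Made-up example read, built with explicit TABs so the delimiters are visible.
example = "\t".join(["read1", "99", "chr1", "100", "60", "4M",
                     "=", "250", "154", "ACGT", "IIII", "NM:i:0"])
rec = parse_sam_line(example)
print(rec["RNAME"], rec["POS"], rec["CIGAR"], rec["OPT"])
print(phred_scores(rec["QUAL"]))
```

Keeping the optional fields as a separate list reflects that their number varies from line to line, while the first 11 columns are always present.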
Let's use this thread to add information on the SAM format that may not always be obvious or well documented.