This is the description I received from the SRA staff (Adam Stine).
The spot model is Illumina GA centric. The flowcells have the locations where the adapters have stuck them to the glass of the lane. There are X and Y coordinates that identify these 'spots'. As the camera reads the fluorescent flashes during sequencing, the coordinates indicate which spot the new base is added to. All of the bases for a single location constitute the spot. There may be one or more divisions of those bases for technical reads (adapters, primers, barcodes, etc) and there will always be at least one biological read (forward, reverse). I usually think of the technical reads as the "known" sequence and the biological as the "unknown". When we store the data, the bases for a single spot are all stored as one string with the description of where the breaks occur as well as the type of read each segment represents. The spot length is the expected total length for all reads (used as a check to make sure we have all the data). As an example, a 2x150 run with a 6bp barcode and 12bp primer on the forward read would have 4 reads.
0 - barcode basecoord 1
1 - primer basecoord 7
2 - forward basecoord 19
3 - reverse basecoord 151
But you only need to explain SRA about the barcode and primer is you submit sequences that contains it..In my case, a third party provided me with the BAM files and I do not have the untrimmed sequences.
So the SPOT datamodel is useful for supplying untrimmed BAM.. yet, enable you to specify where the biological reads begin.
In my case, I have 2X100 bp without index and I am only supplying the Application read with the adapter trimmed. so I simply submit.
0 - forward basecoord 1 (Application read)
1 - reverse basecoord 101 (Application read)