I have been trying to figure out which end of a gene specified in a BED file is the transcription start position. I believe that for the forward (+) strand the transcription start site(TSS) should be in the BED file's start column while for the gene on the reverse strand (-), the TSS would be in the 'end' column of the bed. Is that true? So for example,
chr1 100 5000 Gene1 0 +
chr1 30000 49023 Gene2 0 -
Suppose the information above is from a bed file: chromosome, start, end, feature name, score, strand. Assuming either the start or end coordinate specifies the TSS, is it correct to conclude that 100 is the TSS for Gene1 and 49022 is the TSS for Gene2? (I say 49022 because the last coordinate of a BED feature is not included in the feature.)
that is correct