Hi. I am a beginner in bioinformatics.
I looked at the GenBank record for the TP53 gene from here. I had read that "isoform a", for example, covers all the exons (1-11) of the gene. However, in the annotation of that GenBank record, none of the CDS features for any isoform starts below coordinate 10897.
Similarly, most of the mRNA transcripts start with the 1..114 segment and then jump to the 10000+ region. I believe several exons, such as exons 4 and 5, lie between coordinates 1000 and 10000. Is there a mistake in the data, or am I missing something conceptually?
Any help would be appreciated.
Thanks for the references. I think I was wrong to assume that exons lie in the 1000-10000 coordinate range. From further reading, that range appears to be intronic.
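For anyone else who gets confused by this, here is a minimal sketch of how I now read an mRNA or CDS location line: each `start..end` pair inside `join(...)` is one exon segment, and whatever lies between two consecutive segments is intron. The helper names `parse_join` and `introns` are my own, and the location string is made up for illustration. Only the `1..114` segment and the 10897 start come from the record I was looking at; the end coordinate 10970 is a placeholder. It handles only simple `join(a..b,...)` locations, not `complement(...)` or single-base positions.

```python
import re

def parse_join(location):
    """Extract (start, end) exon segments from a simple GenBank location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def introns(exons):
    """Return the gaps between consecutive exon segments (1-based, inclusive)."""
    return [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]

# Hypothetical location: 1..114 and the 10897 start are from the question,
# the 10970 end is an invented placeholder.
mrna_location = "join(1..114,10897..10970)"

exons = parse_join(mrna_location)
print(exons)           # [(1, 114), (10897, 10970)]
print(introns(exons))  # [(115, 10896)]
```

So the jump from 114 to 10897 just means that everything from 115 to 10896 is spliced out of that transcript.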