Hello! I have a quick question regarding TCGA's level 3 miRNA sequencing data.
I notice that the data lists unique count values for miRNAs that share identical mature sequences but are encoded from distinct genomic loci. As these miRNAs are derived from distinct precursors, I assume that the relative abundance of the precursor sequences is used to estimate the contribution each precursor makes to the mature sequence read count. For example, if the relative abundance of pre-mir-92a-1 to pre-mir-92a-2 is 2:1, then it can be surmised that approximately 2/3 of the mature miR-92a reads come from pre-mir-92a-1 and 1/3 from pre-mir-92a-2, assuming the precursors are of equal length.
For the TCGA data, does anyone know whether this was the method that was used to assign counts to unique miRNAs sharing the same mature sequence?
I apologize if the question appears naive. My knowledge in this area is quite limited.
Any help would be greatly appreciated!