Entering edit mode
9.7 years ago
juncheng
▴
220
As far as I know, to calculate allele specific expression (ASE), people take advantage of the fact that we have known SNPs, based on the ratio of SNPs expressed, we can calculate ASE.
The question is how if we have more than one known SNPs on the same transcript (or Gene?), and the rations are likely to be different, how do we treat this, we make a naive average or how to do it exactly?
Best.
Jun
Thanks. Here is a example:
AlleleFreq is calculated by dividing the count of ALT/REF. As you can see, the ratio is not identical across the same transcript. How would you then calculate allele frequency then?
By the way, the data is single cell cancer sample RNAseq.
There's no need to calculate allele frequency. You split the read counts covering each informative SNP according to the allele they support and then count accordingly.
Sorry to use "Allele frequency" term here. What I actually did here shown as "AlleleFreq" is dividing the read counts from one allele to the total read count from that loci.
I added the read count at each base. Could you just give me a example how to calculate allele specific expression at this case?
Assuming that those positions are the only ones that represent that gene, the counts would be 26 for one allele and 63 for the other. You test per-gene, not per-position.
Thanks, I know understood what you mean. But is there the problem of haplotype? The genotype of the two alleles could be TTAT/ CCGC or TTGC/CCAT or TCGT/ CTAC...?
Unfortunately I don't know what your question is. I'm assuming that the reference allele describes one haplotype. If that's not the case, then calculate things accordingly.
Thanks again, you helped me a lot.