I have RNASEQ sequencing data (tumour) in which each sample has approximately 20 million reads. My target is to identify SNPs and predict the isoform expression variability using RNAseq data. Before doing all analysis , I wanted to check coverage of sequence data. Just to check coverage , I used formaula N*L/G where N no of reads per sample, L read length (150 as paired end data 75+75), and human genome size. For sample, with 26130832 reads count, I got coverage 1.3. Does this 1.3 indicates sequence coverage is 1X ? Am I right ? My target is to identity SNPs. Can I proceed with this much amount of data per sample?
Thank you in advance