Low mapping rate
0
0
Entering edit mode
11 weeks ago

I have no experience with RNA-Seq of human plasma, but I tried it with kaggle notebook.

Covid-19 RNA-Seq using HISAT2

I used GRCh38.primary_assembly.genome.fa as reference sequence, but the mapping rate was low (1.18-12.41%).

Does anyone know the reason for the low mapping rate?

human RNA-seq • 365 views
2
Entering edit mode

Is this your own data? You should basically take a few unmapped reads from these alignments (10 or so is fine) and then blast them at NCBI to see what they align to. You can get those reads by using --un-conc filename option with hisat2.

Human plasma should not contain a lot of human reads?

0
Entering edit mode

Thank you for your valuable input. The data used is public database.

The paper mentioned "by pretreatment with rtStar™ tRF&tiRNA Pretreatment Kit (Arraystar Inc., MD, USA)". I thought that the data was apparently for microRNA analysis.

As you suggested, I will also check the unmapped reads with blast. Thank you very much for your kind reply.

2
Entering edit mode

If the data is for miRNA then you probably need to be using a different aligner (e.g use bowtie v.1.x to do gapless alignments) and find a way of making sure to trim the data properly. miRNA data may contain a special adapter that is added directly to miRNA that needs to be trimmed before alignment of the data. If you are not looking to do novel miRNA discovery then you could also use the reference miRNA from miRBase or RNAcentral for human genome.

0
Entering edit mode

Thank you for your comments. It is very informative. I will try to use the reference miRNA first. I am very appreciate your opinion.