Question: How to find mapped regions in the reference genome
0
gravatar for Ashi
4 months ago by
Ashi10
United States
Ashi10 wrote:

Hi,

I am very new to NGS analysis.

I have used BWA for mapping. My reference genome is a set of simulated long reads of E.Coli (I have used Badreads tool for this) and I have mapped short reads to this. Now I want to know for each long read which portions have been mapped. My ultimate goal is to find a distribution of the percentage mapping of the long reads. For example if the long read is 10k length and i know how much 3k length (characters of the long read) has been mapped, then I can say this long read is (10k/3k) covered by the short reads.

Is there any way to do this?

Thank you.

ADD COMMENTlink written 4 months ago by Ashi10
0
gravatar for genomax
4 months ago by
genomax91k
United States
genomax91k wrote:

If you have already done the mapping then calculate the coverage based on the answers here: Tools To Calculate Average Coverage For A Bam File?

ADD COMMENTlink written 4 months ago by genomax91k

Hi @genomax thank you for the help. Using the samtools and awk mentioned in the link that you gave me I got an output like this: Average = 1.08026

What I want is the coverage for each long read that I gave as the reference for mapping in the BWA. Is there a way to get that?

ADD REPLYlink written 4 months ago by Ashi10

Take a look at samtools idxstats in that case.

ADD REPLYlink written 4 months ago by genomax91k

Yes thank you. That works. :)

ADD REPLYlink written 4 months ago by Ashi10
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 812 users visited in the last hour