Question: extract region bam
gravatar for J.F.Jiang
11 months ago by
J.F.Jiang720 wrote:

Hi all,

I want to extract bam of specific amplicon to evaluate the according amplicon performance. I used to use samtools view xx.bam chr:start-end to extract the bam, however, if the two amplicon are totally overlapped, this command will return us the whole reads covering the two region.

For example:

first amplicon:

------------------------------------> <----------------------------------------

second amplicon:



I firstly convert the bam to bed using bedtools, and get those reads name that are exactly match with the start position of the first amplicon. Then paired reads that belong the amplicon 1 were then extract from the bam file, as well as amplicon 2.

Is there any convenient method to seperately extract the exact reads belong to the two amplicons?

Thanks, Junfeng

bam region • 461 views
ADD COMMENTlink modified 11 months ago by h.mon21k • written 11 months ago by J.F.Jiang720
gravatar for h.mon
11 months ago by
h.mon21k wrote:

A couple of suggestions:

1) use from BBTools to split the sequencing before mapping. You could try something like: in=amplicons.fq ref=primers.fa pattern=out%.fq k=21 restrictleft=25 \
    nzo=f refstats=refstats.txt

See the discussion starting from post #25 on this SeqAnswers thread.

2) Maybe and could also do what you want (I am tagging Brian Bushnell and genomax because they can probably clarify this). See discussion starting at post #19 on the same thread as above

ADD COMMENTlink written 11 months ago by h.mon21k


Either or is spliting the raw sequencing file based on matched primers. However, I still want to begin from bam file.

ADD REPLYlink written 11 months ago by J.F.Jiang720
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1274 users visited in the last hour