Question

Extract Alignment By Read Id From A Sam File

2


11.1 years ago

Nicolas Rosewick 11k

Hi,

Is there a fast way to extract alignments from a SAM file using read IDs (about 100 read IDs on average)? If the read IDs are in a file (one per line), I could do:

cat in.sam | grep -f idFile.txt > out.sam

but with a big SAM file (~40 GB) it takes a lot of time. Is there a faster method to extract these alignments?

Thanks,

N.

sam read id • 13k views

updated 11.1 years ago by Pierre Lindenbaum 161k • written 11.1 years ago by Nicolas Rosewick 11k

2


Well, not really a duplicate: that question was about BAM, not SAM.

11.1 years ago by Pierre Lindenbaum 161k

0


Duplicate of: Extracting subsets of reads from a BAM file

11.1 years ago by Pierre Lindenbaum 161k

score 11 · Answer 1 · 2013-04-05

11


11.1 years ago

Pierre Lindenbaum 161k

Faster?

LC_ALL=C grep -w -F -f idFile.txt < in.sam > subset.sam
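For context on the command above: `-F` treats each ID as a fixed string rather than a regular expression, `-w` requires whole-word matches, and `LC_ALL=C` avoids locale-aware character handling, which is typically what slows grep down on large files. Note that grep still matches an ID anywhere on the line, not only in the read-name column. If that matters, a stricter variant is to compare only the first tab-separated field (QNAME). The following is a minimal sketch, not a benchmarked replacement; the function name `filter_by_qname` is hypothetical, and it assumes the ID file is non-empty with exactly one ID per line and no trailing whitespace.

```shell
# Hypothetical helper: filter_by_qname IDS_FILE SAM_FILE
# Prints only the alignment lines whose first column (QNAME)
# exactly equals one of the IDs listed in IDS_FILE.
filter_by_qname() {
    LC_ALL=C awk -F '\t' '
        NR == FNR { ids[$1]; next }   # first file: remember each read ID
        $1 in ids                     # second file: print lines whose QNAME is listed
    ' "$1" "$2"
}
```

Usage would look like `filter_by_qname idFile.txt in.sam > subset.sam`. Header lines are not printed by this sketch, because their first field (for example `@HD`) is never a read ID.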

11.1 years ago by Pierre Lindenbaum 161k

1


+1 for C locale.

11.1 years ago by Aaronquinlan 12k

0


Amazingly simple! Thanks so much.

10.3 years ago by rob234king ▴ 610

0


Thanks so much for this! Any suggestions on how to also keep the SAM header in the output subset.sam file?
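One possible way to keep the header, offered as a sketch rather than a tested recipe for 40 GB files: print the header block first, then append the grep matches. It relies on SAM header lines starting with `@` and sitting at the top of the file. The function name `subset_with_header` is hypothetical.

```shell
# Hypothetical helper: subset_with_header IDS_FILE SAM_FILE
# Prints the SAM header, then the alignments matching the IDs.
subset_with_header() {
    # The header is the leading block of "@" lines; stop at the first alignment
    # so the large file is not scanned twice in full.
    awk '/^@/ { print; next } { exit }' "$2"
    # Same grep as in the answer; drop header lines that happen to contain
    # an ID so they are not printed a second time.
    LC_ALL=C grep -w -F -f "$1" "$2" | grep -v '^@'
}
```

Usage would look like `subset_with_header idFile.txt in.sam > subset.sam`.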