Question: Extracting reads from Pacbio Sequel BAM by read name
0
gravatar for roryC
4 months ago by
roryC0
University of Edinburgh
roryC0 wrote:

Hi,

Does anyone know of a good way to filter reads from a Pacbio BAM file (the Sequel data format)? I have identified some contaminant reads which I would like to remove before mapping the BAM file to my draft assembly using pbalign. I want to use Arrow, so I think it is important to keep the original BAM format for mapping (or at least I'm getting errors with Arrow after converting to FASTA and mapping that with pbalign).

Picard FilterSamReads doesn't seem to work with the Pacbio format, neither does pysam.

Cheers!

bam pacbio pbalign arrow sequel • 292 views
ADD COMMENTlink written 4 months ago by roryC0
1

Hello,

Picard FilterSamReads doesn't seem to work with the Pacbio format, neither does pysam.

"doesn't work" is never a good description of the problem. Please give us more details.

fin swimmer

ADD REPLYlink modified 4 months ago • written 4 months ago by finswimmer8.1k
0
gravatar for roryC
4 months ago by
roryC0
University of Edinburgh
roryC0 wrote:

Thanks,

On further inspection it is due to the lack of a SM tag in the Pacbio BAM header, which Picard is expecting: https://github.com/PacificBiosciences/blasr/issues/212

The work-around for now looks like running picard with "VALIDATION_STRINGENCY=LENIENT"

ADD COMMENTlink written 4 months ago by roryC0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1547 users visited in the last hour