Question: In Gkno'S Fastq-Vcf Pipeline, Can I Write To Disk An Intermediary Bam Aligment Currently Sent To The Stream?
1
gravatar for Carlos Borroto
4.1 years ago by
Carlos Borroto1.5k
Washington Metropolitan Area
Carlos Borroto1.5k wrote:

Hi,

I'm using gkno's 'fastq-vcf' pipeline as a way to run a serial of steps for variant calling. gkno allows the use of bash streams to pipe data between tools without ever writing to disk.

I was wondering if there is away to keep the BAM alignment coming out of "left-align-indels" step in this pipeline. I can see these three steps are writing to the stream: "filter-bam", "realign-gaps", "left-align-indels"

And "variant-call"(freebayes) is the receptor of the final stream. I would like to instead save the intermediary BAM file for further visualization and analysis. If this is possible, do you think this change would cause a big performance hit?

I did look at bamleftalign's(tool used for "left-align-indels" step), it seems like the only option is to write to the stream. Is that right?

• 1.4k views
ADD COMMENTlink modified 4.0 years ago by alistairnward210 • written 4.1 years ago by Carlos Borroto1.5k
2
gravatar for alistairnward
4.0 years ago by
alistairnward210
United States
alistairnward210 wrote:

I have modified how output streams are handled and included an additional pipeline - fastq-bam. This pipeline is identical to the fastq-vcf pipeline except that it terminates after left-aligning indels and produces a BAM file as output. Please update to the most recent version of GKNO and these changes should be available.

ADD COMMENTlink written 4.0 years ago by alistairnward210
1
gravatar for Alex Reynolds
4.1 years ago by
Alex Reynolds19k
Seattle, WA USA
Alex Reynolds19k wrote:

Perhaps you could insert an instance of the UNIX tee command, piping streamed intermediate data to a named pipe that you can process separately. As this is a bash or other shell script, you should be able to read the script and insert the tee statement where you need it.

ADD COMMENTlink modified 4.1 years ago • written 4.1 years ago by Alex Reynolds19k

That's a good suggestion. I might have to integrate 'tee' as a tool in gkno. If I do, that could meant I can easily grab the stream coming out of tools which currently only write to the stream.

Thanks, I will explore this option.

ADD REPLYlink written 4.1 years ago by Carlos Borroto1.5k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1287 users visited in the last hour