Workflow for Variant Analysis
0
1
Entering edit mode
9.0 years ago
gkuffel22 ▴ 100

I have posted this question before and did not receive any responses. Let me try to articulate the issue more effectively this time. I have 200 samples (specifically from 200 different coyotes). I am trying to detect SNPs in the MHC gene. I have created a workflow in galaxy and would like to be able to use this to automate all of the steps required. At the end I would like all of the data to be compiled in one vcf file listed by sample name. The problem is I cannot find a way to have each sample maintain it's unique sample name through the workflow without manually entering this at some point. Any ideas?

Variant-Analysis Galaxy Variant-Detection • 2.1k views
ADD COMMENT
1
Entering edit mode
  1. Command line gives you more control than Galaxy does.
  2. Ask on https://biostar.usegalaxy.org to reach the relevant audience.
ADD REPLY
1
Entering edit mode

You might want to check the internals of the GATK haplotype caller tool. It can accept multiple input BAM files, which is similar to what you need. The general idea is to accept a variable length argument and just replace the colnames() when producing the output (presuming you want to keep the results in the same file). As Ram said, you'd be best off asking on the galaxy site, they're a pretty helpful group.

ADD REPLY

Login before adding your answer.

Traffic: 2417 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6