Question: Workflow for Variant Analysis
gkuffel22 (United States) wrote, 4.6 years ago:

I posted this question before and received no responses, so let me try to state the issue more clearly. I have 200 samples, each from a different coyote, and I am trying to detect SNPs in the MHC gene. I have created a workflow in Galaxy and would like to use it to automate all of the required steps. At the end, I would like all of the data compiled into a single VCF file, listed by sample name. The problem is that I cannot find a way for each sample to keep its unique sample name through the workflow without entering it manually at some point. Any ideas?


a. The command line gives you more control than Galaxy does.

b. Ask on https://biostar.usegalaxy.org to reach the relevant audience.

Comment written 4.6 years ago by RamRS
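
As a rough illustration of the command-line route, the sketch below derives a sample name from each BAM file name and builds a `samtools addreplacerg` command that writes that name into the read group's SM tag, so downstream callers can pick it up without manual entry. The file names, the `tagged` output directory, and the rule "sample name = file name up to the first dot" are assumptions made for illustration, not details from the thread. The commands are only built and printed, not executed.

```python
from pathlib import Path


def sample_name(path):
    """Derive a sample name from a file name (assumed rule: text before the first dot)."""
    return Path(path).name.split(".")[0]


def read_group_command(bam_path, out_dir="tagged"):
    """Build (but do not run) a samtools command that sets the ID and SM read-group tags."""
    name = sample_name(bam_path)
    out_path = str(Path(out_dir) / f"{name}.rg.bam")  # hypothetical output location
    return [
        "samtools", "addreplacerg",
        "-r", f"ID:{name}",
        "-r", f"SM:{name}",
        "-o", out_path,
        str(bam_path),
    ]


if __name__ == "__main__":
    # Hypothetical input files; in practice these might come from Path(".").glob("*.bam").
    for bam in ["coyote_001.bam", "coyote_002.bam"]:
        print(" ".join(read_group_command(bam)))
```

Each returned list could be passed to `subprocess.run(cmd, check=True)` once the naming rule has been checked against the real file names.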

You might want to check the internals of the GATK HaplotypeCaller tool. It can accept multiple input BAM files, which is similar to what you need. The general idea is to accept a variable-length argument and just replace the colnames() when producing the output (presuming you want to keep the results in the same file). As Ram said, you'd be best off asking on the Galaxy site; they're a pretty helpful group.

Comment written 4.6 years ago by Devon Ryan
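
The "replace the column names on output" idea can be sketched in a few lines of Python. This is one possible reading of the suggestion, not the GATK or Galaxy implementation: it assumes each input is an uncompressed, single-sample VCF and rewrites the sample column of the `#CHROM` header line. The function name `rename_vcf_sample` and the example records are hypothetical.

```python
def rename_vcf_sample(vcf_lines, new_name):
    """Yield VCF lines with the sample column of a single-sample VCF renamed."""
    for line in vcf_lines:
        if line.startswith("#CHROM"):
            fields = line.rstrip("\n").split("\t")
            # Columns 0-8 are fixed (CHROM through FORMAT); column 9 is the sample.
            if len(fields) != 10:
                raise ValueError("expected exactly one sample column")
            fields[9] = new_name
            line = "\t".join(fields) + "\n"
        yield line


if __name__ == "__main__":
    # Hypothetical minimal VCF content, held in memory for the demonstration.
    example = [
        "##fileformat=VCFv4.2\n",
        "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tunnamed\n",
        "chr1\t100\t.\tA\tG\t50\tPASS\t.\tGT\t0/1\n",
    ]
    for out_line in rename_vcf_sample(example, "coyote_001"):
        print(out_line, end="")
```

Applied once per file with a name taken from the file name, this gives every single-sample VCF a unique column header before the files are merged into one multi-sample VCF.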