Hi,
I am running some sequencing data through the Galaxy pipeline and generated a VCF (I think, see below). I am trying to convert the VCF to a FASTA file. I was wondering if there is a method in the Galaxy toolbox to make the conversion. I tried VCF to Tab Delimited and then Tabular to FASTA and got basically nothing. In trying to backtrack to see where my workflow went wrong, I considered my results from the BOWTIE alignment. I am not sure BOWTIE produced any result, but I did not receive an error message so my thought is that it worked. Any guidance would be greatly appreciate and thank you in advance for your help.
Best,
John
VCF output:
##fileformat=VCFv4.1
##fileDate=20171103
##source=Naive Variant Caller version 0.0.2
##reference=file:///galaxy-repl/main/files/022/009/dataset_22009740.dat
##INFO=<ID=AC,Number=A,Type=Integer,Description="Allele count in genotypes, for each ALT allele, in the same order as listed">
##INFO=<ID=AF,Number=A,Type=Float,Description="Allele Frequency, for each ALT allele, in the same order as listed">
##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">
##FORMAT=<ID=AC,Number=.,Type=Integer,Description="Allele count in genotypes, for each ALT allele, in the same order as listed">
##FORMAT=<ID=AF,Number=.,Type=Float,Description="Allele Frequency, for each ALT allele, in the same order as listed">
##FORMAT=<ID=NC,Number=.,Type=String,Description="Nucleotide and indel counts">
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT __NONE__
This is just a header for a VCF file. Was this all that was there in the file? If it was then your bowtie run itself must not have produced any valid data, as you suspect.
I believe you are correct. As an update, I went back and traced all the files prior to bowtie and there is data present that I can visualize using the eyeball icon. When I do this for bowtie the file is blank, however, I received no error message when it ran. Does that mean it is not aligning my sequence or simply that bowtie didn't work and should be run again?
It could be many things. If you click on the "bowtie" step and then on "i" icon what do you see?
Nothing, it is just a blank field, but prior to the bowtie step there is data in all the previous steps.
Try re-running the bowtie job then. Make sure you select all the right options. What kind of data is this? You may want to choose bowtie2, if that is available since it will do gapped alignments.
Sorry, thought you meant when I clicked on the eyeball. I just realized what you were saying, here is what is under the "i" icon:
This galaxy (are you using the public galaxy at PSU or internal mirror) may be set to produce a sorted BAM file directly. In that case you would not be able to see anything since this would be a binary file. Do you see a file size when you click on the name of the step? Example below.
I am not sure if such a tool is available in galaxy. You can try GATK tool, if you have access to bam files locally: https://software.broadinstitute.org/gatk/documentation/tooldocs/current/org_broadinstitute_gatk_tools_walkers_fasta_FastaAlternateReferenceMaker.php