Hi! I am currently working on my undergraduate study about the frequency of missense mutations in early and advanced stages of early luminal breast cancer. The vcf file contains 47 transcriptomic samples--12 early (stage II) and 35 advanced (III)--and were annotated using SnpEff eff on Galaxy. Is there a tool I can use to efficiently count the mutations by sample and by position per chromosome? I would also appreciate any suggestions for downstream or enrichment analysis for my study.
EDIT: I am more than willing to walk you through the RNA-Seq pipeline I am using on Galaxy. Any assistance or suggestions are much appreciated, as I am admittedly new to bioinformatics.