Question: Same SNPs appear in both cancer and normal files when viewed in IGV
6 days ago, db351223 wrote:

I am working on horn cancer. I have 25 cancer samples and 5 normal samples from the Bos indicus (Kankrej) cattle breed. I sequenced them on an Illumina MiSeq and analysed the data with the following pipeline:

  1. Mapped the reads with the STAR aligner, using the Bos taurus reference genome.
  2. Sorted and indexed the resulting .bam files with SAMtools.
  3. Called variants with three tools: SAMtools, VarScan and FreeBayes.
  4. Kept the SNPs called by all three tools, using vcf-isec.
  5. This gave a total of 25 VCF files for horn cancer and 5 VCF files for horn normal, each containing SNPs.
  6. Kept the SNPs present in 90% of the horn cancer samples and the SNPs present in 80% of the horn normal samples, using vcf-isec.
  7. To get SNPs specific to horn cancer, I subtracted the horn normal VCF file from the horn cancer VCF file using bedtools. I confirmed the subtraction with CLC Genomics Workbench and VCFtools and found almost the same SNPs, so the result should contain SNPs that are present only in horn cancer.
  8. To confirm this, I looked at the positions of these SNPs in IGV (Integrative Genomics Viewer).
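
Steps 6 and 7 amount to set operations on called sites, which can be sketched in a few lines of Python. This is only an illustration under assumptions: the sites and per-sample calls are invented, "present in 90%" is read as "called in at least 90% of samples", and the helper name `recurrent_sites` is hypothetical. It shows one way a site can survive the subtraction while still being visible in normal reads: it is called in some normal samples, but in fewer than 80% of them.

```python
from collections import Counter

def recurrent_sites(sample_calls, min_fraction):
    """Return the sites called in at least min_fraction of the samples."""
    counts = Counter(site for calls in sample_calls for site in set(calls))
    needed = min_fraction * len(sample_calls)
    return {site for site, n in counts.items() if n >= needed}

# Hypothetical sites, written as (chromosome, position, alt allele).
shared_site = ("chr1", 1000, "T")     # also present in normal tissue
candidate_site = ("chr2", 2000, "A")  # never called in a normal sample

# 25 cancer samples: both sites are called in 23 of them (92%).
cancer = [{shared_site, candidate_site}] * 23 + [set()] * 2
# 5 normal samples: the shared site is called in only 3 of them (60%).
normal = [{shared_site}] * 3 + [set()] * 2

cancer_set = recurrent_sites(cancer, 0.90)
normal_set = recurrent_sites(normal, 0.80)

# The "subtraction" of step 7: sites in the cancer set but not the normal set.
cancer_specific = cancer_set - normal_set
print(sorted(cancer_specific))
```

Here the shared site falls below the 80% cutoff in the normal group, so it is never subtracted and ends up in the "cancer-specific" list even though three normal samples carry it.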

In IGV I loaded the reference genome that I used, together with the cancer and normal .bam files from the mapping step, and looked at the SNP positions. These SNPs should appear only in the cancer samples and not in the normal samples, but they show up in both. The SNPs found relative to the reference were present in both groups, which is why I used the subtraction command to get cancer-specific SNPs, yet the final output file still contains SNPs that are present in both conditions. How can I consider them cancer-specific SNPs? Does anyone have an idea why this is happening? Please guide me.


It is unclear which files you have loaded in IGV. Can you elaborate?

written 6 days ago by WouterDeCoster

The reference file, the cancer samples' .bam files that I got after mapping, and the normal samples' .bam files that I got after mapping.

written 6 days ago by db351223
6 days ago, Chris Miller (Washington University in St. Louis, MO) wrote:

"To get SNPs specific to horn cancer, I have subtract Horn Normal vcf file from Horn Cancer vcf file using bedtools."

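Don't do this. Germline variant callers make assumptions about variant allele frequency (VAF) distributions, for example that a real variant sits near 50% or 100% of the reads in a diploid genome, and those assumptions don't lend themselves well to calling cancer variants. You need to use a true somatic caller that compares tumor and normal reads directly: MuTect, Strelka, VarScan (in tumor/normal mode), etc.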
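
To make the difference concrete, here is a minimal sketch of the kind of per-site comparison a tumor/normal analysis performs, in the spirit of the Fisher's exact test p-value that VarScan's somatic mode reports. It is not any caller's actual implementation: the read counts are invented, the function name `fisher_exact_greater` is hypothetical, and real callers also model sequencing error, contamination and strand bias.

```python
from math import comb

def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test on the 2x2 table [[a, b], [c, d]].

    a, b = variant and reference read counts in the tumor
    c, d = variant and reference read counts in the normal
    Returns the probability of seeing at least `a` variant reads in the
    tumor if tumor and normal shared a single allele frequency.
    """
    tumor_reads = a + b
    variant_reads = a + c
    total = a + b + c + d
    upper = min(tumor_reads, variant_reads)
    numerator = sum(
        comb(tumor_reads, k) * comb(total - tumor_reads, variant_reads - k)
        for k in range(a, upper + 1)
    )
    return numerator / comb(total, variant_reads)

# Invented read counts at two sites, each covered by 40 reads per sample.
# Site 1: 12 variant reads in the tumor, none in the normal.
p_somatic_like = fisher_exact_greater(12, 28, 0, 40)
# Site 2: about half the reads carry the variant in both samples,
# which is what a heterozygous germline SNP looks like.
p_germline_like = fisher_exact_greater(20, 20, 19, 21)

print(f"site 1 p-value: {p_somatic_like:.2e}")
print(f"site 2 p-value: {p_germline_like:.2f}")
```

Site 1 gets a very small p-value because the variant reads are confined to the tumor. Site 2 does not, even though a germline caller would report a SNP there in both samples; that is the pattern you are seeing in IGV.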

See this previous question for more information: Difference Between Somatic And Germline Variant Calling?
