Question

RNA variant calling - SNP and indels files

0

Entering edit mode

8 months ago

littlebioinformatician • 0

Hello, I am performing a variant calling pipeline for RNA-seq data of genetically modified mice from a C57Bl/6 background. I am using nextflow optimized pipeline with GATK4 good practices (https://nf-co.re/rnavar/1.0.0/) and this pipeline requires both dbSNP VCF and known indels VCF files for the reference genome (in this case I'm using GRCm38). Which is the most suitable source to obtain this information? Does this information depend on the mouse strain? Thank you so much for your help!

mouse variant_calling genomics RNA • 979 views

ADD COMMENT • link 8 months ago by littlebioinformatician • 0

1

Entering edit mode

what the error log just in case if you are not getting desired output?

ADD REPLY • link 8 months ago by 1769mkc ★ 1.3k

0

Entering edit mode

Thank you for your reply. It directly does not run the pipeline. I had to deactivate the module of GATK base calibration to run the pipeline

ADD REPLY • link 8 months ago by littlebioinformatician • 0

score 1 · Answer 1 · 2025-02-24

1

Entering edit mode

8 months ago

swbarnes2 15k

Known SNPs are probably more important for sequencing wild individuals, not pure-bred strains. It's not like you need to filter away known variants as compared to a reference to identify novel ones;they should all be novel.

You could try making dummy empty vcfs, just with headers. Then they won't try to filter away anything. Or a vcf with just the genetic changes you know this background possesses.