Parameter to filter a VCF of SNPs?
1
1
Entering edit mode
3.4 years ago
Picasa ▴ 610

Hi,

I have done a SNPs analysis and now I have a VCF.

I would like to filter it but I don't know what is usually done. For instance I know that I can filter by:

  • Remove samples that have > x% missing data
  • Remove SNPs that have > x% missing data
  • Remove SNPs with >x% MAF

Do you have have other recommendation?

And also what are the standard parameters (x %) ?

Thanks for your help.

vcf SNP parameters • 1.4k views
ADD COMMENT
2
Entering edit mode
3.4 years ago

Remove samples that have > x% missing data

search for the bad samples How to check which samples has more uncalled genotypes in multi-sample vcf and remove them with bcftools

Remove SNPs that have > x% missing data

gatk select variants https://software.broadinstitute.org/gatk/documentation/tooldocs/3.8-0/org_broadinstitute_gatk_tools_walkers_variantutils_SelectVariants.php with option --maxNOCALLfraction

Remove SNPs with >x% MAF

gatk select variants https://gatkforums.broadinstitute.org/gatk/discussion/5567/selectvariants-on-af

ADD COMMENT

Login before adding your answer.

Traffic: 3092 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6