While doing a variant assessment for patients exomes (~16 vcf files - 16 patients), I find that some variants predicted to be pathogenic, however, the phenotype is not associated with these variants.
I also found that some pathogenic predicted variants do exist in more than a patient! so I'm thinking of doing an intersection of all the vcf files and use file containing the common variants in my variant assessment workflow!
- Is this approach OK?
- I want to build up a database for the common variants in our population, will this strategy help?
- What are the recommended tools?
Appreciated to your usual help!