Entering edit mode
3 days ago
yesquokkan
•
0
Hello,
I am looking for SNPs in a specific bacterial species genome.
How can I determine whether the number of SNPs detected in my dataset is reasonable?
I understand that the expected number of SNPs can vary by species, but how can I establish an appropriate baseline or reference for the species I’m studying?
Thank you in advance.
There are a lot of factors that can impact the number of identified SNPs. These include things like:
The list goes on. So finding an expected number would likely require someone knowledgeable with the specific system and species you are using.
If your bacterial species is a (human) pathogen, you might want to check outbreak analysis papers of that species.
In general, you find there how many SNPs are reasonable to define 2 assemblies as being part of a single source.