3.7 years ago
Bioinfo_learner
I have performed an all-vs-all BLASTp on 23 strains of a species. I did NOT add genomes incrementally. I have identified the core genome of all strains, but I now want to make an accumulation curve (a graph showing how the size of the core genome changes as strains are added one by one; I couldn't attach a picture).
Do I have to repeat the entire BLASTp process, or can I still make the accumulation curve from blastResultsfinal.tsv?
METHODOLOGY I followed for BLASTp:
- Proteome of Strain 1 vs goodProteinsdb
- Proteome of Strain 2 vs goodProteinsdb, and so on for the remaining strains.
- Merged all BLASTp results into 1 single file (Output: blastResultsfinal.tsv)
- Found orthologs via orthoMCL and MCL using blastResultsfinal.tsv (Output: groups.txt)
- Found the core, accessory, and singleton genomes from groups.txt.
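
For illustration, here is a minimal sketch of how an accumulation curve might be derived from groups.txt alone, without rerunning BLASTp. It assumes the usual OrthoMCL layout, one group per line as `group_id: strain|protein strain|protein ...`, where the text before `|` is the strain label. The function names `parse_groups` and `core_accumulation`, the number of permutations, and the seed are hypothetical choices made for this example, not part of OrthoMCL. It shuffles the strain order many times and, after each added strain, counts the ortholog groups present in every strain added so far. Proteins that are not listed in groups.txt are not counted, so the values are numbers of shared groups rather than numbers of proteins.

```python
import random
from statistics import mean


def parse_groups(lines):
    """Map each ortholog group ID to the set of strains it contains."""
    groups = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue
        group_id, members = line.split(":", 1)
        # Assumption: members look like "strain|protein_id", as in OrthoMCL output.
        groups[group_id.strip()] = {m.split("|", 1)[0] for m in members.split()}
    return groups


def core_accumulation(groups, strains, n_permutations=100, seed=0):
    """Mean number of core groups after adding 1, 2, ..., N strains.

    Strain order is shuffled n_permutations times because the curve
    depends on the order in which strains are added.
    """
    rng = random.Random(seed)
    strain_sets = list(groups.values())
    curves = []
    for _ in range(n_permutations):
        order = list(strains)
        rng.shuffle(order)
        included = set()
        counts = []
        for strain in order:
            included.add(strain)
            # A group is core (so far) if it contains every included strain.
            counts.append(sum(1 for g in strain_sets if included <= g))
        curves.append(counts)
    # Average the counts position by position across all permutations.
    return [mean(step) for step in zip(*curves)]
```

To use it, the lines of groups.txt would be passed to `parse_groups`, and the resulting dictionary, together with the list of 23 strain labels, to `core_accumulation`. The returned list can then be plotted against the number of strains with any plotting tool.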