I'm new to bioinformatics, have only been to one iPlant workshop and I'm trying to get a head start on this project for next year.
For this project, I'll need to use comparative genomics software to find pathogenicity genes, which we plan to silence with HIGS for pest control, while making sure these genes are not conserved in symbionts.
I've been recommended VennBLAST. I've been able to find the genome of one pathogen of interest and two symbionts of interest. Until this project starts and I can start sequencing myself next year, I'll only be able to work with data from NCBI and other researchers. How many genomes of pathogens and symbionts will I need to identify candidate pathogenicity genes without much noise?
Where would you recommend I begin and what early steps should I take? Thanks for the help.