I read that one can search SRA database to retrieve sequence data from it and make comparisons with one's own sequencing reads. So if I found a strain of pathogenic E. coli, and I want to know if it is a totally new strain, other than initially blasting and mapping to reference genomes, I can search SRA to see if anyone else might be working on it or retrieve related data for comparison.
What I understand is that SRA is an archive of NGS data from ongoing projects which includes raw reads and draft assemblies. But I don't know how to use it or if what I am thinking of using it for is correct.
How do I compare my strain sequence read with those deposited in SRA, to see if there is a similar strain in there?
How do I even choose a experiment set to blast my sequence(s) against to begin with? Do I have to do a literature search to find relevant papers for SRA Experiment set (SRX) first?
Appreciate any direction on this topic.
Thank you for the link to that page. I'll look through the different search strategies listed within.