The pathogen specific resources might be a useful starting point. For example:
If you are including viruses then UniProtKB may be a useful source, since it details organism/host for viruses. Sadly they don't seem to have included other organism/host relationships.
Other possible sources that come to mind are:
- metabolic pathway databases. For example KEGG PATHWAY has a set of pathways related to infectious disease that could be useful.
- microarray expression databases. For example ArrayExpress contains details of experiments looking for changes in gene expression related to various disease states (try a search for terms like 'infected').
While I suspect that these will suffer from similar limitations to the PPI data they are worth looking at. Additional pairings from the analysis of the INSDC databases, suggested Roderic, will extend coverage. As will text-mining of the literature.
modified 7.0 years ago
7.0 years ago by
Hamish • 3.1k