How to get required information from InterProscan results in xml format?

0

Entering edit mode

8.1 years ago

seta ★ 1.9k

Hi all,

. I locally run InterProScan (interproscan-5.14-53.0) for finding the protein domain on translated de novo assembled transcriptome. I used the following command:

./interproscan.sh -i file1.pep -t p -o file1_intro.xml -f xml -appl Pfam, ProDom, ProSiteProfiles -goterms -iprlookup

It generated a 60 MB file, but I don't know how to extract the required information from this output. It was my first experience and I have no idea how to handle the results. Could you please let me know if I should sort or filter the output based on something like e-value, similar to blast. If your response is positive, please kindly tell me how I can do it? Also, I want to know which protein domains/families existed in my query file and what are their frequencies?

Thanks

alignment InterProscan transcriptome • 2.6k views

ADD COMMENT • link 8.1 years ago by seta ★ 1.9k

0

Entering edit mode

Hello, did you filter your result?? because I have the same question. I don know if working with all the GO terms that I got as a results of running interproscan to do an GO enrichement analysis, or first do the filter based on the evalue.

ADD REPLY • link 3.3 years ago by Latifa • 0

Login before adding your answer.