How to set up online BLAST to filter out PREDICTED and hypothetical proteins
4.8 years ago
hodayabeer ▴ 10

Hi, I am trying to BLAST some proteins with 'blastp' against several organisms, using the non-redundant protein sequences database (nr).

I found a solution that is said to apply only to blastn: an Entrez Query of all[filter] NOT predicted[title]. Unfortunately it doesn't work for my blastp search, and I still get 'PREDICTED' proteins in the results.

I would be glad to hear if anyone has a solution to this.

Thank you in advance

blastp • 2.7k views

It would be easier to simply run the BLAST search as usual and then filter the hits you don't want out of the text result file after the fact.


Perhaps you can use the option to exclude the XM/XP models? If this is a computationally expensive search, you may just want to stick to the solution of jrj.healey and filter the results after the fact. That way, if you ever need the entire result set, you can always go back to it without having to run the search again.
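
For what it's worth, filtering after the fact could look something like the sketch below. It assumes the hits were saved as tab-separated text with the subject accession in the second column and the subject title in the fifth (for example, a local BLAST+ run with -outfmt "6 qseqid sacc evalue bitscore stitle"); the column positions, the keyword list, and the function names are my own assumptions, so adjust them to whatever your result file actually contains.

```python
import re

# Assumed keyword list; extend or narrow it for your own data.
UNWANTED_TITLE = re.compile(r"\b(predicted|hypothetical)\b", re.IGNORECASE)
# RefSeq model accessions start with XM_ (mRNA) or XP_ (protein).
MODEL_ACCESSION = re.compile(r"^X[MP]_")


def keep_hit(accession, title):
    """Return True if the hit is not a RefSeq model and its title has no unwanted keyword."""
    if MODEL_ACCESSION.match(accession):
        return False
    return UNWANTED_TITLE.search(title) is None


def filter_hits(lines, acc_col=1, title_col=4):
    """Yield the tab-separated hit lines that pass keep_hit.

    acc_col and title_col are zero-based column indexes (assumed layout).
    Blank lines and '#' comment lines are skipped.
    """
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if keep_hit(fields[acc_col], fields[title_col]):
            yield line
```

To use it, open the saved hit file, pass the file object to filter_hits, and write the yielded lines to a new file. The unfiltered file stays untouched, so the full result set is still there if it is needed later.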

4.8 years ago
Mensur Dlakic ★ 27k

There is no such option in NCBI's online BLAST to filter out predicted proteins. You may have to download the NR database, remove the proteins you don't want, and perform local BLASTp searches.
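
As a rough sketch of the "remove the proteins you don't want" step, assuming NR has been exported as a plain FASTA file: the keyword list and the function name below are illustrative assumptions, not anything NCBI provides. A record is dropped if its header line contains either keyword anywhere.

```python
import re

# Assumed keyword list; matching is case-insensitive and on whole words.
UNWANTED = re.compile(r"\b(predicted|hypothetical)\b", re.IGNORECASE)


def filter_fasta(lines):
    """Yield the lines of FASTA records whose header has no unwanted keyword.

    A record is a '>' header line plus the sequence lines that follow it.
    """
    keep = False
    for line in lines:
        if line.startswith(">"):
            # Decide once per record, based on the header only.
            keep = UNWANTED.search(line) is None
        if keep:
            yield line
```

The filtered FASTA can then be turned into a local database with makeblastdb -in filtered.fasta -dbtype prot (the file name is a placeholder) and searched with a local blastp. Since NR is large, the function streams line by line instead of loading the file into memory.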

I hope you realize that many proteins have the words "predicted" and "hypothetical" in their annotations without actually being predicted or hypothetical proteins. Also, some proteins would have few or no matches against a database devoid of predicted and hypothetical proteins.
