Which Mass-Spectrometry Search Algorithm Do You Use For Peptide/Protein Identification?
6
10
Entering edit mode
13.0 years ago

The wide choice of search engines for mass-spectrometry data in proteomics make it hard to choose one. Which software do you use for peptide/protein identification and why?

proteomics software • 6.3k views
ADD COMMENT
1
Entering edit mode
13.0 years ago

We use mascot for the peptide identification and use Scaffold for the protein identification. Scaffold, is a commercial implementation of peptide prophet and seems to do a very good job of clustering the bands together and identifying the right proteins.

Also there are a lot of annotations, graphical comparisons, etc included.

I must admit we haven't done a thorough comparison to what's out there in the last year because this works so well.

ONe thing we keep going back to testing is the Trans-Proteomic Pipeline (TPP). This open source tool is also a very good tool. We just prefer the little features of Scaffold a bit more.

ADD COMMENT
1
Entering edit mode
13.0 years ago

It seems that the question does not receive much attention so I would like to pop it up a little. :)

As for our laboratory, we rely on X!Tandem for peptide identification and use utilites from TPP and OpenMS (http://www.openms.de) + custom Python scripts for the rest of the work.

The reason for using X!Tandem is not really deep. We just took what was the most popular open tool (and X!Tandem is quite popular according to the HUPO and ABRF studies). And we have not yet seen a study which clearly shows the superiority of one peptide ID tool over the others.

ADD COMMENT
1
Entering edit mode
12.6 years ago
Craig ▴ 30

I've had great success with OMSSA. Once you get past the command-line interface it is pretty simple to use. It provides good results and is multi-threaded so it's relatively fast. I've never had an issue with its results files being too big, although by default it does output the top 30 hits per spectrum so that could be an explanation. If you have issues with it, post a question at http://www.sharedproteomics.com/forum/ and it will get answered quickly.

ADD COMMENT
0
Entering edit mode

Unfortunately, it seems like this project is dead:

OMSSA was a tandem mass spectra search engine. This search engine is no longer available. Due to budgetary constraints NCBI has discontinued OMSSA. Historical binaries are available from our ftp server ftp://ftp.ncbi.nih.gov/pub/lewisg/omssa/CURRENT/. If you have any specific questions, please feel free to contact us at info@ncbi.nlm.nih.gov.

ADD REPLY
0
Entering edit mode
12.9 years ago
Brianbalgley ▴ 110

More is better... Several papers have shown that the algorithms ID overlapping, complementary populations of spectra. We use OMSSA, X!Tandem, X!Hunter, Comet and Myrimatch so far. You will ID more peptides and validate many peptides which are ID'ed by multiple algorithms. Then use IDPicker, MassSieve or Scaffold to parse and combine the outputs. We have an app which spins up MPI clusters on AWS to do this, proteomecluster.com (not open source, sorry). An open source alternative is PepArML. It adds on a machine learning function post-search to pull out more peptides. We have a version of MassSieve that can filter on PepArML outputs if you are interested.

ADD COMMENT
0
Entering edit mode
12.9 years ago
Roman Zenka ▴ 10

We use Mascot, Sequest, X!Tandem (and rarely OMSSA) together, combining the results in Scaffold.

Why? We like Scaffold, so we support what Scaffold supports: Mascot is the golden standard, many seem to swear by Sequest, X!Tandem is free and easy to use, OMSSA was a bit tricky because of huge result files (Scaffold could not parse the binary output).

ADD COMMENT
0
Entering edit mode
12.9 years ago
Fra • 0

Hello,

is there anybody who compared (Mascot) Scaffold's results with TPP's ones?

I've found that Scaffolds identifies a larger amount of proteins (2-3 times) than TPP.

Thank You.

ADD COMMENT

Login before adding your answer.

Traffic: 2941 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6