Question

Which Mass-Spectrometry Search Algorithm Do You Use For Peptide/Protein Identification?

10

Entering edit mode

13.0 years ago

Anton Goloborodko ▴ 280

The wide choice of search engines for mass-spectrometry data in proteomics make it hard to choose one. Which software do you use for peptide/protein identification and why?

proteomics software • 6.3k views

ADD COMMENT • link updated 12.9 years ago by Craig ▴ 30 • written 13.0 years ago by Anton Goloborodko ▴ 280

score 1 · Answer 1 · 2011-05-12

We use mascot for the peptide identification and use Scaffold for the protein identification. Scaffold, is a commercial implementation of peptide prophet and seems to do a very good job of clustering the bands together and identifying the right proteins.

Also there are a lot of annotations, graphical comparisons, etc included.

I must admit we haven't done a thorough comparison to what's out there in the last year because this works so well.

ONe thing we keep going back to testing is the Trans-Proteomic Pipeline (TPP). This open source tool is also a very good tool. We just prefer the little features of Scaffold a bit more.

score 1 · Answer 2 · 2011-05-12

It seems that the question does not receive much attention so I would like to pop it up a little. :)

As for our laboratory, we rely on X!Tandem for peptide identification and use utilites from TPP and OpenMS (http://www.openms.de) + custom Python scripts for the rest of the work.

The reason for using X!Tandem is not really deep. We just took what was the most popular open tool (and X!Tandem is quite popular according to the HUPO and ABRF studies). And we have not yet seen a study which clearly shows the superiority of one peptide ID tool over the others.

score 1 · Answer 3 · 2011-09-28

1

Entering edit mode

12.6 years ago

Craig ▴ 30

I've had great success with OMSSA. Once you get past the command-line interface it is pretty simple to use. It provides good results and is multi-threaded so it's relatively fast. I've never had an issue with its results files being too big, although by default it does output the top 30 hits per spectrum so that could be an explanation. If you have issues with it, post a question at http://www.sharedproteomics.com/forum/ and it will get answered quickly.

ADD COMMENT • link 12.6 years ago by Craig ▴ 30

0

Entering edit mode

Unfortunately, it seems like this project is dead:

OMSSA was a tandem mass spectra search engine. This search engine is no longer available. Due to budgetary constraints NCBI has discontinued OMSSA. Historical binaries are available from our ftp server ftp://ftp.ncbi.nih.gov/pub/lewisg/omssa/CURRENT/. If you have any specific questions, please feel free to contact us at info@ncbi.nlm.nih.gov.

ADD REPLY • link 10.6 years ago by Michael 54k

score 0 · Answer 4 · 2011-05-18

More is better... Several papers have shown that the algorithms ID overlapping, complementary populations of spectra. We use OMSSA, X!Tandem, X!Hunter, Comet and Myrimatch so far. You will ID more peptides and validate many peptides which are ID'ed by multiple algorithms. Then use IDPicker, MassSieve or Scaffold to parse and combine the outputs. We have an app which spins up MPI clusters on AWS to do this, proteomecluster.com (not open source, sorry). An open source alternative is PepArML. It adds on a machine learning function post-search to pull out more peptides. We have a version of MassSieve that can filter on PepArML outputs if you are interested.

score 0 · Answer 5 · 2011-06-15

We use Mascot, Sequest, X!Tandem (and rarely OMSSA) together, combining the results in Scaffold.

Why? We like Scaffold, so we support what Scaffold supports: Mascot is the golden standard, many seem to swear by Sequest, X!Tandem is free and easy to use, OMSSA was a bit tricky because of huge result files (Scaffold could not parse the binary output).

score 0 · Answer 6 · 2011-06-20

0

Entering edit mode

12.9 years ago

Fra • 0

Hello,

is there anybody who compared (Mascot) Scaffold's results with TPP's ones?

I've found that Scaffolds identifies a larger amount of proteins (2-3 times) than TPP.

Thank You.

ADD COMMENT • link 12.9 years ago by Fra • 0