Dear Biostars community,
We would like to introduce new software for peptide sequence clustering. In short, it takes peptide sequences as input, identifies clusters of peptides that share sequence motifs, and generates a multiple sequence alignment for each resulting cluster. So if you have something like this:
YRPLPTVPRYIE
SRYLSSVPERNY
YLFRALPDVPRN
WYHGLPAVPMYN
YAPMLPTRYVGE
WVVRPLPVVGAF
GPLLPVRSADLL
YMPALPSRNWGP
VPPVPARSHGVF
SYVPGVPLRNLA
YRALPVFPTMSE
WRSLPVIPTYEF
You'll get this:
--YRPLPTVPRYIE
--YRALPVFPTMSE
YLFRALPDVPRN--
-WYHGLPAVPMYN-
WVVRPLPVVGAF--
--WRSLPVIPTYEF

---GPLLPVRSADLL
--YMPALPSRNWGP-
--YAPMLPTRYVGE-
---VPPVPARSHGVF
-SYVPGVPLRNLA--
SRYLSSVPERNY---
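To make the shared motif in an aligned cluster easier to see, here is a minimal Python sketch that reads the first cluster from the example output column by column and reports the most common residue in each column. This is only an illustration of how to inspect the alignment. It is not part of the software and says nothing about how the software itself finds clusters. The function name `column_consensus` is made up for this example.

```python
from collections import Counter

# First cluster from the example output, one aligned row per string.
cluster = [
    "--YRPLPTVPRYIE",
    "--YRALPVFPTMSE",
    "YLFRALPDVPRN--",
    "-WYHGLPAVPMYN-",
    "WVVRPLPVVGAF--",
    "--WRSLPVIPTYEF",
]


def column_consensus(rows, gap="-"):
    """Return (residue, fraction_of_rows) for every alignment column.

    The residue is the most common non-gap character in the column;
    the fraction is computed over all rows, so gaps lower it.
    Hypothetical helper, not an API of the announced software.
    """
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all aligned rows must have the same length")

    consensus = []
    for column in zip(*rows):  # iterate over columns, not rows
        counts = Counter(c for c in column if c != gap)
        if counts:
            residue, n = counts.most_common(1)[0]
        else:
            residue, n = gap, 0  # column made up entirely of gaps
        consensus.append((residue, n / len(rows)))
    return consensus


consensus = column_consensus(cluster)

# 0-based indices of columns where every row carries the same residue.
conserved = [i for i, (_, frac) in enumerate(consensus) if frac == 1.0]

for i, (residue, frac) in enumerate(consensus):
    print(f"column {i:2d}: {residue} ({frac:.0%})")
```

For this cluster, the only columns shared by all six peptides are the adjacent L and P, which is the kind of motif the alignment is meant to expose.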
The software is primarily meant for processing data from combinatorial peptide library experiments (such as phage display) sequenced using NGS, but it places no restrictions on the origin of the data. It can process up to millions of unique peptide sequences.
software + manual + source code: