MSA filtering for protein-protein contact prediction
1
0
Entering edit mode
3.8 years ago

Hello! I am attempting to extract a contact prediction between two proteins that have been shown to interact (but no structure has been determined). I am currently obtaining a list of sequences from ProteinA and ProteinB (separately) and generating a MSA for each individually. Should I filter my sequences or edit my MSA alignment in some way for DCA to work better between the two proteins? I ask because I don't know if I should use trimAl or try to remove sequences 90% redundant.

alignment • 612 views
ADD COMMENT
0
Entering edit mode
3.8 years ago
Mensur Dlakic ★ 27k

It would be helpful to tell us what is the prediction method you plan to use. But generally speaking, you don't need to do any trimming or redundancy removal, as most DCA methods know how to weigh sequences. The most important thing for good contact prediction is to get a deep and diverse alignment. There is no hard definition of what a deep alignment is, but for practical purposes let's say >1000 sequences. Depending on proteins of interest, it may be a good idea to search through metagenomic and environmental databases.

ADD COMMENT
0
Entering edit mode

Thank you! For the prediction method, I am planning on using plmDCA (https://github.com/pagnani/PlmDCA). I'll try to find many sequences for the alignment then!

ADD REPLY

Login before adding your answer.

Traffic: 2447 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6