Question

Colabfold: predicting structure of homotrimer vs 3 linked units

0

Entering edit mode

8 months ago

Stroodes ▴ 10

Hi, I have a protein with pdb available shown as a homotrimer. I'm interested in making a concatenated version of this protein, with 3 units linked with a long Gly linker (i.e. ChainA:ChainA:ChainA vs Chain A-(Gly)n-Chain A-(Gly)n-Chain A)

When I run 3 repeats of the protein on colabfold it works fine (better if I allow it to use templates as PDB is available), however when I try to run the linked trimer it fails. In my mind, with a long enough linker, it should give very similar results. Does anyone have any tips for improving structure prediction in this scenario?

I don't want to give away the protein of interest yet, but imagine running this with a streptavidin dimer/tetramer where it's a single protein linked by a 20 or 30x Gly linker.

Many thanks in advance!

colabfold alphafold • 1.4k views

ADD COMMENT • link updated 8 months ago by biomarco ▴ 50 • written 8 months ago by Stroodes ▴ 10

1

Entering edit mode

What are the errors you are getting for the failed Gly linked trimer? What step is failing? Are you potentially running out of memory in the GPU you are using?

Potentially trying to use one of the larger GPUs could alleviate the issue, something like a V100 or A100 if it is a memory error.

You could also try running the analysis as a homodimer first to check the effect of different length linkers. That said, the first multimers were created using this concept, and this is why AF2-Multimer was developed. To my knowledge the non-linker multimer structures are better predicted.

ADD REPLY • link 8 months ago by dthorbur ★ 1.9k

0

Entering edit mode

Thanks, that's good idea to run the dimer first (kinda obvious now that I think of it). The structures are just messy combinations, whereas the actual protein complex has substantial intersubunit contacts with threading loops and threefold symmetry.

I should have added that I'm running this through the Cosmic website. On Google colab I run out of free GPU time before I can get a homotrimer. I suspect that running a linked dimer/trimer on multimeter mode would help (option on Google colab alphafold, not sure on Cosmic). I should just pay for Google colab I guess, the cost is negligible compared to the kits and time needed to clone these constructs.

Given that there's already a cryo structure of this, is there any way to force it onto a template? I'm just trying to get a rough idea of necessary linker length (the termini already have a few flexible AAs).

I'm new to this so I'm also struggling to get my head around the parameters such as suitable # of models vs recycles.

ADD REPLY • link 8 months ago by Stroodes ▴ 10

1

Entering edit mode

Yeah, I suspect you're exceeding the limits of free allowances and that's the reason it's failing.

Setting up GCP and colabfold is pretty easy and for a single complex it'll be cheap if you use the cost-efficient T4s. You usually can get some free credits too so may not even have to pay for this project. I think the length limit is ~2500 AAs at the moment.

From experience, I doubt you'll manage to get a good resolution of such a complex interaction. We couldn't replicate the interaction between Roq1 and XopQ for example. And whilst it's close and general orientation is correct, there are a lot of fine scale differences. This is especially true for unstructured looping regions in my work. Good luck!

ADD REPLY • link 8 months ago by dthorbur ★ 1.9k

score 1 · Answer 1 · 2023-08-17

1

Entering edit mode

8 months ago

Mensur Dlakic ★ 27k

I don't think you need to model this at all. If you know the subunit arrangement, simply create an extended linker that can stretch enough from the C-terminus of subunit 1 to the N-terminus of subunit 2, and the same for subs 2 and 3.

An average distance of the the (G4S)4 linker is 20-50 angstroms, so that should give you some idea what is needed once you calculate the distance between your residues in question.

ADD COMMENT • link 8 months ago by Mensur Dlakic ★ 27k

0

Entering edit mode

Thanks for this, the second link is particularly useful. I guess I wanted to model it because it would be cool to make use of alphafold, and the pretty pictures might help convince my supervisor that this is possible.

It's also not a perfect straight line between both termini (minor deviation around the outside required). I'm also concerned that too long a linker will allow multimerisation in the opposite direction (i.e. clockwise vs anticlockwise), as is seen for concatenated cys-loop receptors. For now I'll probably start with a long linker and validate experimentally before optimising. I'll let you guys know if I manage to model the dimer.

ADD REPLY • link 8 months ago by Stroodes ▴ 10

score 1 · Answer 2 · 2023-08-18

1

Entering edit mode

8 months ago

Stroodes ▴ 10

Just a quick update to say that I got this working. I only tried the dimer so far, with a very long Gly linker on Cosmic Alphafold multimer. Of the 5 models, 3 were the intended orientation whereas 2 were the opposite (anti/clockwise). I'll play around with linker length to see if I can force the correct confirmation and eventually try the trimer.

Thanks for the help everyone!

ADD COMMENT • link 8 months ago by Stroodes ▴ 10

0

Entering edit mode

If you want your fusion protein to stick to the trimer's arrangement, one thing you could try is to edit the pdb in such a way that you put all the residues in one single chain, leaving gaps of n residues at the linker sites. In such way you could potentially use the resulting structure as a template for a more conventional homology modeling software that would fill the gaps with the predetermined number of Gly you put in the sequence, and the positioning of the initial subunits would be strictly respected. Not sure it will work out, but I would give it a try.

ADD REPLY • link 8 months ago by biomarco ▴ 50