Hi,
I have two queries. If someone could help me find the solution, it would be great.
- I am working on camelid VHH sequences. I've pre-processed the NGS raw reads (QC filter, merge PE reads, de-duplicate). From the filtered dataset of 1000s of nanobodies (VHH), I want to select a few top 20-30 high-affinity functional nanobodies using a computational approach first, then validate by experimentation. What should be the criteria for such selection? How can I select top candidate sequences? If I consider homology modeling and molecular docking, it seems impractical to do this on 1000s of sequences. Plus, since I am a newbie in this field, so using ML algorithms to predict affinities of unknown sequences is not really my cup of tea.
- From a fatsq file of VHH and VH sequences, I want to filter out VHH sequences to use for downstream analysis. There are hallmark residues of VHH that could be used as a parameter to filter VHH, but this parameter alone is not enough. I want to use the additional cysteine motif (another hallmark of VHH only) other than the conserved, which also lies in VH, as a supporting parameter to make the filtration more valid. But I don't know the motif in the first place. Does anyone know the hallmark cysteine motif of VHH?
Thanking in advance.
For anyone who was wondering what
nanobodies
are: