Question

Help and idea with TCR/BCR analysis

18 months ago

zizigolu ★ 4.3k

Hi

I need your personal point of view please

I have bulk RNA-seq data (PBMC) from two patients, which I ran through MiXCR

For the same two patients I have 5' single-cell data, which I ran through TRUST4

I am seeing more clones in the scRNA-seq data and higher clonality in the bulk RNA-seq data

I labelled my bulk RNA-seq samples TCR and my scRNA-seq samples TRUST

[Figures: Rplot02, Rplot07]

Do you personally have any idea how this could be interpreted?

Thank you for any help

BCR TCR scRNA-seq • 1.9k views

updated 15 months ago by mizraelson ▴ 60 • written 18 months ago by zizigolu ★ 4.3k

score 2 · Answer 1 · 2023-01-25

If you have non-enriched (I mean not V(D)J-enriched after emulsion) 5' single-cell 10x data, prepared according to the 10x manual, then, very approximately, you should catch 30-50% of the T-cells, with 20-30% of the cells having both TRA and TRB chains (the main factor here seems to be the quality of the size selection performed in the wet lab; depending on it you can get significantly more or less), and somewhat more B-cells (as the level of expression is higher). This is not to mention that PBMC contains ~50% T-cells and ~10% B-cells (again, actual numbers in the samples may differ significantly; these are averages). For a V(D)J-enriched library, virtually 100% of the T-/B-cells and TCRs/BCRs should be reconstructed by MiXCR.

As for the bulk RNA-seq, the yield depends on many factors and can be anywhere between 1 CDR3 per 10^5 reads and 1 CDR3 per 10^7 reads in the sample.
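To make that range concrete, here is a rough back-of-the-envelope sketch. The sequencing depth is a made-up example value, not a number from this thread, and the function name is only illustrative.

```python
def expected_cdr3_reads(total_reads: int, reads_per_cdr3: float) -> float:
    """Expected number of CDR3-covering reads at a given yield (one CDR3 per N reads)."""
    return total_reads / reads_per_cdr3


# Hypothetical bulk RNA-seq depth, chosen only for illustration.
depth = 50_000_000

low = expected_cdr3_reads(depth, 1e7)   # pessimistic end: 1 CDR3 per 10^7 reads
high = expected_cdr3_reads(depth, 1e5)  # optimistic end: 1 CDR3 per 10^5 reads

print(f"{depth:,} reads -> roughly {low:.0f} to {high:.0f} CDR3-covering reads")
```

With only a handful to a few hundred CDR3 reads, a bulk sample will mostly show the most expanded clones, which by itself pushes any clonality estimate up.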

And the last important point in this respect is that single-cell and bulk RNA-seq datasets are obviously prepared from different sets of cells, so it might be even harder to find the intersection between them because of cell sampling. This will depend heavily on the repertoire structure, i.e. how many expanded clones there are in the mix.
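The sampling effect can be sketched with a simple probability model. This assumes each sampled cell is an independent draw from the same underlying repertoire, which is a simplification; the sample sizes below are hypothetical and not taken from the thread.

```python
def p_detect(freq: float, n_cells: int) -> float:
    """Probability that at least one of n_cells sampled cells belongs to a clone of frequency freq."""
    return 1.0 - (1.0 - freq) ** n_cells


def p_shared(freq: float, n_a: int, n_b: int) -> float:
    """Probability that the clone is sampled in both of two independent samples."""
    return p_detect(freq, n_a) * p_detect(freq, n_b)


# Hypothetical numbers of T cells effectively sampled by each assay.
N_SINGLE_CELL = 3_000
N_BULK = 100_000

for freq in (1e-2, 1e-4, 1e-6):
    p = p_shared(freq, N_SINGLE_CELL, N_BULK)
    print(f"clone frequency {freq:.0e}: P(seen in both samples) = {p:.4f}")
```

Expanded clones are almost certain to appear in both samples, while rare clones are unlikely to be shared even if both assays were error-free.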

As for the comparison with TRUST and other software packages, there are several very important types of problems associated with the analysis of such low-yield libraries that, if not properly accounted for, will lead to incorrect conclusions about the datasets in question.

First, there are many non-TCR/-IG sequences that look like one. Such sequences may yield false CDR3s and, what is more dangerous, reproducible false CDR3s that will look like overlap between samples. MiXCR was thoroughly tuned (on real and in-silico generated data) to prevent this from happening, so for RNA-seq it gives zero false CDR3s of this sort.

Second, to increase the total yield, it is beneficial to find partial sequences covering only parts of a CDR3 and to assemble the whole CDR3 from such halves. This procedure should, again, be very strictly controlled, because all CDR3s consist of similar parts (V, D and J genes) and false intersections are easy to find. The resulting sequence will be a chimeric sequence that is not actually present in the sample (a false positive). This type of false positive will just falsely increase the diversity, and is not that easy to spot without control datasets.

Third, the most obvious source of false diversity is sequencing and amplification errors, which create similar CDR3s with one or two substitutions or, less often, indels.
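To illustrate the last point, sequencing and amplification errors, here is a deliberately simplified sketch of collapsing low-count CDR3 variants into a much more abundant neighbour. It is not MiXCR's actual algorithm: the function names, the single-substitution limit and the 10% count ratio are all assumptions made for this example, and indels are ignored.

```python
def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def collapse_errors(counts: dict, max_mismatches: int = 1, max_ratio: float = 0.1) -> dict:
    """Merge rare CDR3s into an abundant CDR3 of the same length that differs by
    at most max_mismatches substitutions, if the rare one has at most
    max_ratio times the abundant one's original count."""
    # Visit clonotypes from most to least abundant (ties broken alphabetically).
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    kept = {}
    for seq, n in ordered:
        parent = None
        for cand in kept:  # insertion order, so most abundant candidates come first
            if (len(cand) == len(seq)
                    and hamming(cand, seq) <= max_mismatches
                    and n <= max_ratio * counts[cand]):
                parent = cand
                break
        if parent is None:
            kept[seq] = n
        else:
            kept[parent] += n
    return kept


# Toy CDR3 amino-acid sequences with made-up counts.
raw = {
    "CASSLGQYEQYF": 100,
    "CASSLGQYEQFF": 2,   # one substitution away from the top clone, very rare
    "CASSPGTDTQYF": 40,
    "CASSLGAYEQYF": 30,  # one substitution away, but too abundant to be an error
}
corrected = collapse_errors(raw)
print(len(raw), "clonotypes before,", len(corrected), "after")
```

Without a step of this kind, every error variant is counted as its own clone, which increases the number of clones and lowers the apparent clonality.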

All those sources of false positives are very strictly controlled in MiXCR (by tuned aligners, NDN-aware partial-assembly algorithms and multi-layer error correction, respectively). MiXCR results have shown a high level of reliability in many studies.

Also, MiXCR supports single-cell analysis, so it makes more sense to compare data analysed with the same software.

score 1 · Answer 2 · 2022-10-19

18 months ago

Jeremy ▴ 890

My first thought is that clonotypes with a count of 1 could just be sequencing errors. I know you're not supposed to cross-post, but this seems like a perfect question for the AIRR community. You can join their Slack channel by sending an email to them as described in the following link:

AIRR Slack
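One way to check the idea about count-1 clonotypes is to recompute the number of clones and the clonality with and without singletons. The sketch below assumes clonality is defined as 1 minus the normalized Shannon entropy, which is a common definition but may not be the metric used in the plots above; the counts are invented.

```python
import math


def clonality(counts) -> float:
    """1 - normalized Shannon entropy of clonotype counts (0 = even, 1 = monoclonal)."""
    counts = [c for c in counts if c > 0]
    if not counts:
        raise ValueError("no clonotypes with a positive count")
    if len(counts) == 1:
        return 1.0  # a single clonotype is fully clonal by convention
    total = sum(counts)
    entropy = -sum((c / total) * math.log(c / total) for c in counts)
    return 1.0 - entropy / math.log(len(counts))


# Invented clonotype counts with a tail of singletons.
counts_all = [50, 20, 5, 1, 1, 1, 1, 1]
counts_no_singletons = [c for c in counts_all if c > 1]

for label, counts in (("all clonotypes", counts_all),
                      ("singletons removed", counts_no_singletons)):
    print(f"{label}: {len(counts)} clones, clonality = {clonality(counts):.3f}")
```

If the gap in clone numbers between the two datasets shrinks a lot once singletons are removed, errors or very low-support calls are a plausible contributor.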

Thank you

Now I have become a member there (free for two years)

How can I ask my questions there?

18 months ago by zizigolu ★ 4.3k

In the Slack workspace, you can choose "#computational_questions" under "Channels" in the left sidebar. Then you can post a question just like on Biostars.

18 months ago by Jeremy ▴ 890