Why seed and fuzzy-search in MIGEC?
Entering edit mode
4.1 years ago
CY ▴ 720

I was playing around with test data of MISEG (for the purpose of analyzing TCR data with UMI). The barcode.txt records the adaptor sequence + UMI (marked as N). The adapter sequences is either lower or upper cased indicating fuzzy or seed search according to the manual.

I am curious about the portion of sequence before UMI.

  1. As far as I understand, the sequencing before UMI should be i7 index (library index). So all the sequences (around 20 bases) before UMI are i7 index? Should not the library index already be removed during demultiplex?

  2. Why fuzzy or seed search? What is the intuitive explanation for this and what are the sequences corresponding to these two part?

MIGEC MUI tcr • 674 views

Login before adding your answer.

Traffic: 2028 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6