Question: How to move UMI tags to the header
Asked 13 months ago by newbinf:

I have complete sequences from 10x Chromium that look like the following:

|----my 16 bp cell barcode----|---8bp 10x UMI---|--SO--|-------------------cDNA (100 bp)----------------------| SO = switch oligo

The reads are demultiplexed and don't have the Illumina barcodes in the sequence. I want to map the cDNA (the sequence of interest), but I also want to preserve the information carried by my cell barcode, the 10x UMI, and the switch oligo (about 40 bp in total), so that I can identify duplicates and determine which cell each read originated from.

I have found trimming tools (cutadapt and Trimmomatic) that can delete the 40 bp. However, because I want to retain this information, I do not want to simply trim the sequences. On the other hand, keeping the sequences has led to 70-90% of reads not mapping because of the 40 bp of non-genomic sequence.

I have thought about 'cutting and pasting' the information into the read header, but I am not aware of a program that already does this.

How do I align these reads while preserving the data provided by the 40 bp of barcode sequence?

Tags: rna-seq, alignment, umi
Answer from genomax (United States), 13 months ago:

Have you looked at UMI-tools (https://github.com/CGATOxford/UMI-tools)?
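
For readers who want to see what 'cutting and pasting' the barcode into the header amounts to, here is a minimal Python sketch. It is not UMI-tools' implementation, just an illustration under stated assumptions: the read layout is the one described in the question (16 bp cell barcode, then 8 bp UMI, then switch oligo), the total prefix length of 40 bp is taken from the "about 40 bp" estimate and may need adjusting, and the `_<barcode>_<UMI>` suffix on the read name is an assumed naming choice. The function names `read_fastq` and `move_prefix_to_header` are hypothetical helpers, not part of any library.

```python
from typing import Iterable, Iterator, Tuple

# Layout assumed from the question: 16 bp cell barcode + 8 bp UMI + switch oligo.
CELL_BARCODE_LEN = 16
UMI_LEN = 8
PREFIX_LEN = 40  # assumption: "about 40 bp" of non-cDNA sequence in total

FastqRecord = Tuple[str, str, str, str]  # (header, sequence, plus line, qualities)


def read_fastq(lines: Iterable[str]) -> Iterator[FastqRecord]:
    """Yield 4-line FASTQ records from an iterable of lines (e.g. an open file)."""
    stripped = (line.rstrip("\n") for line in lines)
    # zip over the same iterator four times groups consecutive lines in fours.
    for header, seq, plus, qual in zip(stripped, stripped, stripped, stripped):
        yield header, seq, plus, qual


def move_prefix_to_header(
    record: FastqRecord,
    cb_len: int = CELL_BARCODE_LEN,
    umi_len: int = UMI_LEN,
    prefix_len: int = PREFIX_LEN,
) -> FastqRecord:
    """Append the cell barcode and UMI to the read name and trim the prefix."""
    header, seq, plus, qual = record
    if len(seq) <= prefix_len:
        raise ValueError(f"read shorter than the {prefix_len} bp prefix: {header}")
    cell_barcode = seq[:cb_len]
    umi = seq[cb_len:cb_len + umi_len]
    # Keep any comment after the first space; tag only the read name itself.
    name, _, comment = header.partition(" ")
    new_header = f"{name}_{cell_barcode}_{umi}"
    if comment:
        new_header += " " + comment
    # Sequence and quality strings must be trimmed by the same amount.
    return new_header, seq[prefix_len:], plus, qual[prefix_len:]


if __name__ == "__main__":
    example = [
        "@read1 1:N:0\n",
        "ACGTACGTACGTACGT" + "TTTTGGGG" + "C" * 16 + "GATTACA" + "\n",
        "+\n",
        "I" * 40 + "FFFFFFF" + "\n",
    ]
    for rec in read_fastq(example):
        print("\n".join(move_prefix_to_header(rec)))
```

The trimmed reads can then be aligned as usual, and the barcode and UMI travel with each read in its name, so they remain available after mapping for assigning reads to cells and identifying duplicates.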


Reply from newbinf, 13 months ago:

After skimming through the documentation, this looks like what I need. Thank you!