Question

RNA seq sample reads don't map

0

Entering edit mode

6.0 years ago

sheliostrow • 0

I am kinda new to the RNA seq field. we have an experiment, with 18 mouse samples. I used the common pipeline: FastQC, cutadapt, STAR, htseq and deseq2. for some reason, one of the 18 samples didn't align well to the mouse dataset, though it passed the QC, only 8%. all other were around 90%. I though the sample was contaminated so I blasted 2000 unmapped reads and found no match what so ever.

what could have happened?

rna-seq • 1.0k views

ADD COMMENT • link 6.0 years ago by sheliostrow • 0

0

Entering edit mode

Can you post a couple example reads from the highly problematic sample? What sequencer were these run on? At the end of the day figuring out what went wrong is mostly to avoid that happening next time. For this dataset, you'll want to just exclude the sample with 8% alignment.

ADD REPLY • link 6.0 years ago by Devon Ryan 104k

0

Entering edit mode

sample reads:

@NB500985:63:H55NWBGX5:1:11101:17562:1029 1:N:0:ACTTGA
TTTTTNTTTAACCACAAAGCAAATGGTAATAATTTTAATTCAACCACCGATCAAAAAGAAGGAAGGTAAATGTTCTCCACAGAC
+
AAAAA#EEEAEEEEEEE6EEEE6EEEEEEEE<EEEAEEEEAEEE<AE<A/EEEEEEEEE/E/EEEEEEEEEAEA<EE<AAE//<
@NB500985:63:H55NWBGX5:1:11101:4276:1030 1:N:0:ACTTGA
TTTCTNTATTTGTCGTTCGATTTTCATAATTTTTGAATATGTTGCATTTGTTTACGTCACAAACCTTTTGATCGAACAAAATTT
+
AAAAA#EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE

the sequencer was the Illumina NextSeq500

you are right about moving on with the analysis not including the problematic sample. but, as you said, i want to know what went wrong and in which stage.

ADD REPLY • link updated 6.0 years ago by Devon Ryan 104k • written 6.0 years ago by sheliostrow • 0

0

Entering edit mode

The first read is from a mouse, the second seems to be nothing known. Note that NextSeqs produce a bit more noise than the other Illumina sequencers, but since you don't seem to have GGGG stretches I suspect that's not the culprit here.

ADD REPLY • link 6.0 years ago by Devon Ryan 104k

0

Entering edit mode

another thing is that in the fastqc report i see a low CG percentage - 36%, and a graph of Per base sequence content https://ibb.co/fjTw8c

ADD REPLY • link 6.0 years ago by sheliostrow • 0

0

Entering edit mode

You need to post the image somewhere and then link to it.

ADD REPLY • link 6.0 years ago by Devon Ryan 104k

0

Entering edit mode

https://ibb.co/fjTw8c

ADD REPLY • link 6.0 years ago by sheliostrow • 0

0

Entering edit mode

As long as the other samples looked similar that's fine. The bit at the beginning looks like the normal "random hexamer priming" effect.