Question

How To Deal With Multiple Hits When Performing Mirna Expression Analysis

3

Entering edit mode

13.4 years ago

Doctoroots ▴ 800

Hi all, ive encountered a problem while working on miRNA expression analysis from an illumina run. usually when i work on the illumina sequence data, i remove all the reads that do not map to anywhere in the reference sequence or that have multiple alignments (non unique). this leaves me with only the uniquely mapped reads to analyze, which reduces the ambiguity of the results and makes the analysis more conclusive.

BUT, it seems that when aligning to miRNA reference (miRBase) there are miRNAs in the reference that are very similar, so similar that most of the reads that align to one of them, aligns to the others as well thus causing the reads to be non unique and to be discarded.

this of course cause a major deviation in the expression results and must be delt with.

how do you suggest i should deal with this issue?

mirna gene alignment • 4.3k views

ADD COMMENT • link updated 13.1 years ago by Alastair Kerr 5.3k • written 13.4 years ago by Doctoroots ▴ 800

1

Entering edit mode

"multiple alignment" usually means aligning multiple homologous sequences against each other. I think you mean "multiple hits".

ADD REPLY • link 13.4 years ago by Michael Kuhn 5.0k

0

Entering edit mode

you are right, thank you

ADD REPLY • link 13.4 years ago by Doctoroots ▴ 800

Ram · Answer 1 · 2010-11-18

2

Entering edit mode

13.4 years ago

Alastair Kerr 5.3k

I normalise by the number of times a read maps, so that a reads only counts once in total. See my explanation in this thread Note that this approach works best if you have told the mapper to show all repeats. Also some tools such as bowtie will do a random assignment for you to get a similar result.

ADD COMMENT • link updated 4.6 years ago by Ram 43k • written 13.4 years ago by Alastair Kerr 5.3k

score 1 · Answer 2 · 2010-11-18

1

Entering edit mode

13.4 years ago

Michael Kuhn 5.0k

I would use same procedure of aligning reads against your reference to align the reference against itself. So if you have A and A' in the reference and both are similar (according to your cutoffs), then if a read hits both A and A' I would not discard it as non-unique.

ADD COMMENT • link 13.4 years ago by Michael Kuhn 5.0k

0

Entering edit mode

Wouldn't a read hitting both A and A' mean that by definition A and A' are similar?

ADD REPLY • link 13.4 years ago by Aaron Statham ★ 1.1k

0

Entering edit mode

yeah, but thats the issue were discussing, similar elements in the reference cause reads with multiple optimal hits. Michael is suggesting to keep the reads that align to similar elements (that you found by aligning the reference to itself).

ADD REPLY • link 13.4 years ago by Doctoroots ▴ 800