Tool to Create Sequence Logo Plot with Indels or Alignment-Aware Position Frequency Matrix
1
0
Entering edit mode
3 months ago

Given an alignment file (e.g. SAM, BAM), has anyone come across a tool that can create the equivalent of a sequence logo plot that also includes indels? Or, similarly, a tool to generate a position frequency matrix that includes indels?

My goal is to illustrate how a set of reads relates to the reference genome in aggregate over a given region.

I'm imagining a hybrid of Samtools Tview (e.g. below) and sequence logo plots such that insertions and deletions in multiple reads are collapsed and represented by frequency.

enter image description here

sequence-logo • 449 views
ADD COMMENT
0
Entering edit mode

you could combine a sequence logo with a splice graph but there's nothing about either of them that allows you to relate neighboring positions with each other

ADD REPLY
0
Entering edit mode
3 months ago
Mensur Dlakic ★ 27k

Most logo programs work without displaying indels. However, it may work if you can extract the alignment from this region and make a hidden Markov model (HMM) from it. There are tools to model indels in HMMs.

ADD COMMENT

Login before adding your answer.

Traffic: 1611 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6