Plotting "sequence alignments" in Python (matplotlib)?
9.0 years ago
knpayne2 • 0

Hi all,

My boss has given me a task to do the following:

  1. Take FASTA files and parse them using Biopython.
  2. From each sequence, take the part that lies between two restriction sites (KpnI and BamHI).
  3. Then, I am to plot the sequences together on one plot, sorted by length, and highlight the bases that correspond to a certain kind of amino acid sequence that these code for.
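
For what it's worth, steps 1 and 2 could be sketched roughly as below. This is only a sketch under a few assumptions that are not stated in the task: the recognition sequences GGTACC (KpnI) and GGATCC (BamHI) are searched on the given strand only, KpnI comes first, and "between" means the bases strictly between the two recognition sequences rather than between the exact cut positions. The helper names (read_fasta, between_sites, extract_inserts) are made up for illustration.

```python
# Recognition sequences; both are palindromic, so searching one strand finds them.
KPNI_SITE = "GGTACC"
BAMHI_SITE = "GGATCC"


def read_fasta(path):
    """Return {record id: sequence string} using Biopython's SeqIO parser."""
    # Imported here so the pure-string helpers below work without Biopython.
    from Bio import SeqIO

    return {rec.id: str(rec.seq).upper() for rec in SeqIO.parse(path, "fasta")}


def between_sites(seq, left=KPNI_SITE, right=BAMHI_SITE):
    """Return the bases between the first `left` site and the next `right` site.

    Assumption: the result excludes both recognition sequences.
    Returns None if either site is missing.
    """
    seq = seq.upper()
    start = seq.find(left)
    if start == -1:
        return None
    start += len(left)
    end = seq.find(right, start)
    if end == -1:
        return None
    return seq[start:end]


def extract_inserts(records):
    """Extract each insert, drop sequences lacking a site, sort by insert length."""
    inserts = {name: between_sites(seq) for name, seq in records.items()}
    found = [(name, ins) for name, ins in inserts.items() if ins is not None]
    return sorted(found, key=lambda item: len(item[1]))
```

Calling extract_inserts(read_fasta("my_reads.fasta")), where the file name is a placeholder, would then give a list of (name, insert) pairs with the shortest insert first.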

The end result is supposed to look like a pretty version of: (the parts in brackets are supposed to be the highlighted bases)

Seq 1 | ATCGGATC .... [ATCG .. ] ...

Seq 2 | ACCATC ... [ some more highlighted bases, not necessarily in the same position, or with the same length] ...


Seq p | Some more bases.

My boss would like this to be put together with python, preferably with matplotlib. I am a lowly statistician by training, and could probably crack something like this off in R, but am not as familiar with matplotlib.

From looking at some examples, I imagine I could try something like this plot, but I'm unsure how to get started. Has anyone come across a similar problem?
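
One possible starting point in matplotlib is to draw each base as a text cell and put a coloured rectangle behind the highlighted ones. The sketch below assumes the highlighted regions are already known as 0-based, half-open (start, end) spans per sequence; how to find them from the amino acid sequence is not covered. The function names and the default colour are illustrative choices, not anything prescribed by the task.

```python
def highlight_mask(length, spans):
    """Return a list of booleans, True where a base falls inside any span.

    Spans are 0-based and half-open (start, end), like Python slices.
    """
    mask = [False] * length
    for start, end in spans:
        for i in range(max(start, 0), min(end, length)):
            mask[i] = True
    return mask


def plot_sequences(named_seqs, spans_by_name, highlight_color="gold"):
    """Draw one row per (name, sequence) pair, shortest first, one cell per base."""
    import matplotlib.pyplot as plt
    from matplotlib.patches import Rectangle

    rows = sorted(named_seqs, key=lambda item: len(item[1]))
    longest = max((len(seq) for _, seq in rows), default=0)
    fig, ax = plt.subplots(
        figsize=(max(4, 0.15 * longest), max(1.5, 0.3 * len(rows)))
    )

    for y, (name, seq) in enumerate(rows):
        mask = highlight_mask(len(seq), spans_by_name.get(name, []))
        for x, (base, lit) in enumerate(zip(seq, mask)):
            if lit:
                # Unit square behind the highlighted base.
                ax.add_patch(Rectangle((x, y), 1, 1, color=highlight_color))
            ax.text(x + 0.5, y + 0.5, base, ha="center", va="center",
                    family="monospace", fontsize=8)

    ax.set_xlim(0, longest)
    ax.set_ylim(len(rows), 0)  # inverted axis: shortest sequence on the top row
    ax.set_yticks([y + 0.5 for y in range(len(rows))])
    ax.set_yticklabels([name for name, _ in rows])
    ax.set_xlabel("Position (bp)")
    for side in ("top", "right", "left"):
        ax.spines[side].set_visible(False)
    return fig, ax
```

For example, plot_sequences([("Seq 1", "ATCGGATC")], {"Seq 1": [(4, 8)]}) returns a figure and axes with the last four bases highlighted, and fig.savefig can then write it to a file. Drawing one text object per base is simple but gets slow for long sequences, so for inserts of more than a few hundred bases drawing coloured bars without letters may be the better choice.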

matplotlib python sequencing gene • 4.4k views

a. If your boss has given you a task, you should try something yourself before asking for help.

b. If you're asking for help, you should give us a lot more details, including the steps you've taken to solve the problem, so we know you're not taking a shortcut.


hahaha, sorry I hit enter too quickly.

