Software for plotting gene presence absence matrix
3
0
Entering edit mode
7.7 years ago
natasha ▴ 110

Hi

I have a gene presence-absence matrix of accessory genes, for multiple strains of bacteria. I would like to produce a good figure with this data, and maybe include a tree as well . I was wondering what the best software programmes are available to do this? As I have >4000 genes, I don't each gene to be labelled!

Thanks

R Python accessory genes matrix • 4.2k views
ADD COMMENT
0
Entering edit mode

how many strains do you have ?

ADD REPLY
0
Entering edit mode

50 strains. I make a mistake, I have 1800 genes to plot not 4000.

ADD REPLY
1
Entering edit mode
7.7 years ago

Maybe try a PCA on your data. 4000 observations is quiet big to plot in a figure..

or you could do both (a. PCA ; b. heatmap)

ADD COMMENT
0
Entering edit mode
7.7 years ago

Perhaps a heatmap would do ?

ADD COMMENT
1
Entering edit mode

For 4000 observations? I doubt whether that would send a clear message. Although a heatmap would be a nice combination with a hierarchical clustering tree.

You want to produce a good figure, but you should think first which message you want that figure to contain.

ADD REPLY
0
Entering edit mode

Because I am looking at bacteria of the the same ST, the accessory genes are what make each isolate unique from one another. Sorry I made a mistake -- I only have 1800 genes to plot (4000 is the core genome)

ADD REPLY
0
Entering edit mode

That's still a lot. What is the message your figure should send? You could try as suggested a PCA with an additional hierarchical clustering (tree) and a PCA and see how that looks. There are many R packages that can do that for you.

ADD REPLY
0
Entering edit mode
7.7 years ago

How about a gene presence/absence network, or project the presence/absence onto a known phylogeny, like Figures 2 and 3 in the CoPAP paper? http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692100/

ADD COMMENT

Login before adding your answer.

Traffic: 2670 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6