How run MACS2 with Pan Troglodytes (Chimp) genome?
0
0
Entering edit mode
7.6 years ago
LuisNagano ▴ 90

I want to run the peak calling from MACS2 with the Pan Troglodytes (Chimpanzee) genome, but in the MACS2 manual, there are only 4 genomes to use (hs, mm, ce, dm) in -g, how must I run MACS2 for chimp genome?

Peak Calling MACS2 Chimpanzee Pan Troglodytes • 2.2k views
ADD COMMENT
0
Entering edit mode

It's the mappable genome size or effective genome size which is defined as the genome size which can be sequenced. Because of the repetitive features on the chromsomes, the actual mappable genome size will be smaller than the original size, about 90% or 70% of the genome size.

ADD REPLY
0
Entering edit mode

MACS2 should accept the -g parameter expressed as actual number rather than one of the 4 strings that denote the species for which the effective genome size was pre-calculated.

The chimp genome size is around 3.3e9 (a bit larger than human). Going with assumptions and estimates, if you assume the effective genome size to be 70% (as is for mouse) then you would specify -g 2.31e9; if the effective genome size was 90% (as is for human) then you would specify -g 2.97e9.

In all this, I wouldn't really know how to simply calculate the effective genome size (in fact, wouldn't it depend on read length?). I just saw this post which maybe talks about this: Effective genome size of UCSC hg38

ADD REPLY

Login before adding your answer.

Traffic: 1814 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6