Question: How much does effective genome size affect the macs2 output?
gravatar for urjaswita
15 months ago by
United States
urjaswita60 wrote:

This seems to be a simple question, but I couldn't find an answer anywhere. How much does the accuracy of effective genome size affect macs2 output for ChiP-Seq data? For example, if I use total genome size as effective genome size, or total bases - Ns, will it really affect the output a lot? Or just trivially?

Thanks for your insights.


ADD COMMENTlink modified 11 months ago by Biostar ♦♦ 20 • written 15 months ago by urjaswita60

You would overestimate the significance of your peaks I would say. Effective gs for human is 2.7Gb, and total is I think 3.3Gb. Macs uses a poisson distribution to model the peaks, so play around with the formula in the paper, using an imaginary summit readcount as lambda and varying genome sizes, so you should get a feeling for how much the p values are affected. Why is this important btw?

ADD REPLYlink modified 11 months ago • written 15 months ago by ATpoint9.4k

Thanks a lot. Actually I am working on H3K4me3 and H3K27me3 data on some newly sequence animal genomes. I tried to get the effective genome size using GEM, but couldn't run it (I'm still new to this kind of analysis), and the documentation of GEM is not great. I found here How do I compute the effective genome size? that genome size -Ns could be an option.

Could you please advise on a easy to run tool to calculate this stats? Thanks again for your answer!

ADD REPLYlink written 15 months ago by urjaswita60

Have you read all of the answers to the linked question? They present several good options - have you tried/checked any of them? If your species of interest have their total haploid DNA content determined here, then you can convert that directly to bp to get the effective genome size. If not, you could try to determine that amount experimentally yourself.

Have you tried just removing all Ns from the genome and counting the remaining bases and using that value?

ADD REPLYlink written 14 months ago by jared.andrews071.5k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1264 users visited in the last hour