Data size for ChIPseq and RNAseq data
1
0
Entering edit mode
6.9 years ago
adelest.ad • 0

Hello,

I would like to have an estimation of the size occupied by all the files of a ChIPseq/RNAseq project after the different steps of mapping/peak calling (fasq==>bam==>bed/bam/bigwig etc..). For example if I have a ChIPseq 1IP +Input in duplicate and 2 RNAseqs in duplicate? If someone has an idea about the different files and their sizes it would be very nice. Thank you in advance for your reply

RNA-Seq ChIP-Seq • 3.2k views
ADD COMMENT
3
Entering edit mode
6.9 years ago

There's no estimate that can be given for this, it depends entirely on the read depth and length and what is actually kept. In the simplest case, guesstimate ~5GB for an gzipped fastq file and the same for a sorted BAM file of that data. A bigWig file is normally <1GB unless your data is really sparse and you're doing base-pair resolution.

ADD COMMENT
0
Entering edit mode

Thank you very much for your quick answer. I understand that this depend on many parameters but the question come from my boss that need to have an idea of the volume we will occupied. In my case for ChIPseq/RNAseq we will be with 25 millions of reads and 75bp. With the number you gave me I will be able to make the calcul. Could you just tell me for the RNAseq and ChIPseq if I forgot some files: RNAseq:fasq.gz==>bam ChIPseq:fasq.gz==>bam==>bam or bed bigwig

ADD REPLY
0
Entering edit mode

Any other files will be of insignificant size (e.g., read counts from featureCounts).

ADD REPLY
0
Entering edit mode

Thank you for your reply and your time. Have a nice day.

ADD REPLY
0
Entering edit mode

If an answer was helpful you should upvote it, if the answer resolved your question you should mark it as accepted. Upvote|Bookmark|Accept

ADD REPLY

Login before adding your answer.

Traffic: 3082 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6