Methods to analyze EM-seq data
11 weeks ago
Lluís R. ★ 1.2k

I'm analyzing a dataset with enzymatic methyl-seq data (EM-seq).

I received the data already pre-processed from a colleague and asked her how to proceed with the statistical analysis. Given the results we obtained, my boss is now asking me to confirm the analysis plan.

The package used (DSS) resorts to a t-test, which has two drawbacks:

  1. It does not allow me to create a "volcano plot" (the plot is just a continuous line of the statistic used against the corresponding p-value).
  2. It doesn't "just" use M-values as my boss is used to (it is not possible to use raw heatmaps or similar approaches).
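For readers following along, here is a minimal Python sketch of the quantities mentioned in the list above. It assumes per-site methylated and unmethylated read counts are available, and that per-group mean methylation levels and p-values can be extracted from whatever test was run. The function names and the offset `alpha` (default 1, to avoid taking the log of zero) are illustrative assumptions, not part of DSS. Plotting the methylation difference rather than the test statistic on the x-axis is one way to avoid the "continuous line" effect, since a p-value is a monotone function of its own statistic.

```python
import math


def beta_value(meth, unmeth):
    """Fraction of methylated reads at a site, between 0 and 1."""
    total = meth + unmeth
    if total == 0:
        raise ValueError("site has no coverage")
    return meth / total


def m_value(meth, unmeth, alpha=1.0):
    """log2 ratio of methylated to unmethylated counts.

    alpha is an assumed offset that keeps the ratio finite
    when one of the counts is zero.
    """
    return math.log2((meth + alpha) / (unmeth + alpha))


def volcano_point(mean_beta_a, mean_beta_b, p_value):
    """(x, y) coordinates for a volcano plot.

    x is the methylation difference between groups (effect size),
    y is -log10 of the p-value.
    """
    return (mean_beta_b - mean_beta_a, -math.log10(p_value))
```

M-values computed this way could feed a heatmap, but low-coverage sites give noisy values, so a coverage filter beforehand is probably sensible.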

I found several Bioconductor packages for methylation analysis. Most of them seem designed for arrays, and I am not sure whether they have been adapted to EM-seq (the way limma was built for microarrays and later adapted to RNA-seq via the voom approach).

Other packages are for whole-genome bisulfite sequencing (WGBS) but seem to be used for arrays too.

Looking for papers, I found relatively few that use EM-seq, and those give very little detail on how the data were analyzed.

What are the current recommended practices for analyzing EM-seq data? I would appreciate pointers to reviews comparing analysis methods for EM-seq and/or WGBS. If there are no such reviews, any good and recent paper using these methods would be fine too (special points if it is in the context of viral infections).

methylation WGBS EM-seq • 512 views
11 weeks ago
mark.ziemann ★ 1.9k

For all intents and purposes, EM-seq is just like WGBS, although the quality tends to look better with EM-seq. If you want to do differential methylation calling, you will first need to map the reads to the genome and extract the C/T counts at each site. For this I recommend the BISCUIT suite of tools. Then you can load the data into R and conduct the differential methylation analysis with methylKit or another package for differential methylation of bisulfite-seq data.
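To make the "C/T counts per site, then a test" idea concrete, here is a minimal Python sketch of a per-site comparison between two samples using a two-sided Fisher's exact test on the 2x2 table of methylated and unmethylated read counts. This is only an illustration of the simplest case (one sample per group). It is not how DSS or methylKit handle replicated designs, and the function names are assumptions made for this example.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the table [[a, b], [c, d]].

    a, b: methylated and unmethylated read counts in sample 1
    c, d: methylated and unmethylated read counts in sample 2
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x methylated reads in sample 1
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one
    # (small relative tolerance guards against floating-point ties)
    total = sum(p for p in (prob(x) for x in range(lo, hi + 1))
                if p <= p_obs * (1 + 1e-9))
    return min(1.0, total)


def methylation_difference(a, b, c, d):
    """Difference in methylated fraction, sample 2 minus sample 1."""
    return c / (c + d) - a / (a + b)
```

For example, `fisher_exact_two_sided(3, 1, 1, 3)` compares a site with 3 of 4 reads methylated in one sample against 1 of 4 in the other. With biological replicates, a model that accounts for between-sample variability is usually preferable to pooling counts into a single table, and p-values across sites need multiple-testing correction.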


Many thanks for the links. The dataset I got is already in a DSS object for processing in R. I'll check the methylKit package. I wasn't sure whether these packages make assumptions that do not hold for EM-seq.

