Differential methylation rate analysis using modkit for m6A
1
1
Entering edit mode
11 days ago
baibhu1234 ▴ 20

Hello everyone,

I am using modkit to analyse the results from Dorado m6A_DRACH methylation base-calling.

(1) I have generated the bedmethyl file from bam file. Now i need a filter criteria for "coverage" and "mod_rate" to get rid of noisy predictions.

Can we directly use the filter on column "Nvalid_cov" as >=20 reads? or do we need to normalise it for per million reads?

(2) for Differential methylation analysis between conditions i am using dmr pair, following command

modkit dmr pair -a c6_r1.bed.gz -a c6_r2.bed.gz -a c6_r3.bed.gz -b dr6_r1.bed.gz -b dr6_r2.bed.gz -b dr6_r3.bed.gz -o dmr_result --ref Genome.fa --base A --threads 96 --log-filepath dmr_result.log
  1. How does modkit make the unified list of sites from both conditions with replicates?
  2. How does modkit tools handle the sites which are present in one condition and not in another?
  3. What kind of test modkit applies to get the DMR sites?

Thanks

m6A modkit Differential-methylation-analysis • 397 views
ADD COMMENT
0
Entering edit mode

Looks like you already opened an issue on modkit GitHub. That is probably the best place to ask this: https://github.com/nanoporetech/modkit/issues/364

If you get an answer there please come back to this thread and post it here.

ADD REPLY
1
Entering edit mode
10 days ago
baibhu1234 ▴ 20

This is what i got reply from Modkit developers

"You don't actually need to filter your input data for DMR. The model won't assign a high score or significant p-value to sites with very low coverage. You can find details about the model in the documentation. That being said, you may want to simply ignore positions with low valid coverage so you don't have them in the output, there is a --min-valid-coverage option for that."

"You do not have to perform any normalization, however there are --max-coverages and --cap-coverages options if you have very imbalanced data. With your command, the replicates are matched (meaning you have 3 of each), so you will see the balanced output as well."

"A site must be present in at least 1 replicate from each condition If a site is not present in any of the replicates in one condition, it will not be scored (there's nothing to compare!)."

Again question arises, that if we are only using the sites present in both conditions then what about the sites which are only modified in one condition only, they might be very interesting to see as they have very contrast modification between them.

ADD COMMENT
1
Entering edit mode

Thanks!

ADD REPLY

Login before adding your answer.

Traffic: 2661 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6