Total CpG genome count
0
0
Entering edit mode
5.8 years ago
Baylie_321 ▴ 30

Hiya, I'm trying to get a total count of all CpGs present in a non-model organism genome. I have used bam2nuc in bismark to count dinucleotides - am I correct in thinking the count of CGs will be the total CpG count of the genome?

Best wishes,

Rebekh

RRBS Bismark Methylation CpG • 3.6k views
ADD COMMENT
2
Entering edit mode

If you want to count only CGs from fasta (nt) in R:

library (Biostrings)
data(yeastSEQCHR1)
yeast1 <- DNAString(yeastSEQCHR1)
dinucleotideFrequency(yeast1)['CG']
 CG 
7089

in python:

s = 'ATATTGCGAAAGAACGTAATTTTATCGAAAAATCGATGTcgcgcg'
print(s.upper().count("CG"))

7
ADD REPLY
0
Entering edit mode

Couldn't you also count GCs nucleotides as it will be a CpG on the reverse strand ?

ADD REPLY
0
Entering edit mode

I was meaning more is the programme suitable for this

ADD REPLY
0
Entering edit mode

A GC is a GC on the other strand, not a CpG.

ADD REPLY
0
Entering edit mode

Cheers :) I'll give the C program a go!

ADD REPLY
0
Entering edit mode

Hello rebekah_321,

Don't forget to follow up on your threads.

If an answer was helpful, you should upvote it; if the answer resolved your question, you should mark it as accepted. You can accept more than one if they work.

Upvote|Bookmark|Accept

ADD REPLY
0
Entering edit mode

I tried emboss - but doesn't it just report CpG islands?

ADD REPLY
0
Entering edit mode

Yes I have used emboss to predict CpG islands - now I want all CpGs not just islands

ADD REPLY
0
Entering edit mode

Yes I have used emboss to predict CpG islands - now I want all CpGs not just islands

ADD REPLY

Login before adding your answer.

Traffic: 3832 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6