This should be a stupid question, but I wonder why the amount of A is close to the amount of T , and same for C, G.
I downloaded the hg19 genome, which contains sequences from 1 strand of all chromosomes. So, there is nothing to do with complementarity because I only have 1 strand!
I create a script which compute the amount of each bases on hg19, I get this following results:
a 854963149 c 592966724 g 593325228 t 856055361 n 239850802 total 3137161264
And as you can see, A is close to T (approx. 27%) and C is close to G ( approx. 18%).
I expected different values for each bases.. or 25% for each one! So, could you explain why?