Hi,
I was looking to find the expected value/frequency of finding a subsequence, within a sequence of length L.
For example, assuming that all nucleotides are equally probable, how many times would we expect to find the pattern 'ATTG' in a sequence of length = 20.
I tried looking in biostrings and IRanges (bioconductor), but didn't find what I was looking for.
many thanks!