I have NGS data of a gene, it's sequenced by SBH technology.as sequencing library, a hexamer library is used (4 in power of 6). like this :
TTTGAGGTGCAGATAGCTTGCTTTATTTTGTTGTTACTATCTCAAGGAGG TCCAACAATTATAACTAACAATTGAATTTATACTTGCATGAAAAGAACTA CATCAAATTGACATTTTGGGCAATTAGTAATATTGTTTAAAATTTAACAA CAGCTTTATTTTGTTGTTGTTCTTTACTTTTTGCTGTGGCTCATTGCTTA GGTGCCCAGGTTTTTCAGGTGCAATTAAAATTTAGAACTACCACACAAAG GCATTGGCTGCACTCTGGGACCTCCAAGAGTTGGCACTGCTCTGGCATAG GAATACTTGAATAGCTTGGTTAAATGAAGGGATGGCCAGGAGATGTTACT . . .
I want to calculate the following things :
i) how many percent of gene I can discover uniquely with this hexamer library (assume all hexamer are used in library)
ii) how many different hexamer are present in this gene
Can somebody guide me how can I calculate them ?