Question: create null sequences with gkmSVM for mouse genome mm10
Asked 21 months ago by fusion.slope:

Hello,

Has anybody ever tried to generate a null model of DNA sequences for mouse using the gkmSVM package? It works perfectly for human, but not for mouse. I am wondering whether any of you have used this package for mouse and hit the same problem.

Here is the code I run with the genNullSeqs function, followed by the error it produces:

library(gkmSVM)
library(BSgenome.Mmusculus.UCSC.mm10)
library(BSgenome.Mmusculus.UCSC.mm10.masked)
library(IRanges)

genome=BSgenome.Mmusculus.UCSC.mm10.masked

fileBedBreaks="Rep1.intersec.Rep2.cov.major2.sort.uniq.bed"
fileFastaPos="Rep1.intersec.Rep2.cov.major2.all.sort.uniq.bed.pos.fa"
fileBedNeg="Rep1.intersec.Rep2.cov.major2.all.sort.uniq.Random.gkmSVM.bed"
fileFastaNeg="Rep1.intersec.Rep2.cov.major2.all.sort.uniq.Random.gkmSVM.fa"
genNullSeqs(inputBedFN=fileBedBreaks,nMaxTrials=5,xfold=2,genome=genome,
            outputPosFastaFN=fileFastaPos,outputBedFN=fileBedNeg,outputNegFastaFN=fileFastaNeg)

Error in normalizeDoubleBracketSubscript(i, x, exact = exact) : subscript "TRF" matches no name

TRF presumably refers to tandem repeats, since it stands for Tandem Repeats Finder. But what does the error mean?

Thanks in advance for any reply

Answered 4 months ago by sarahmcclymont:

I hit this same problem and figured out that it's because the BSgenome.Mmusculus.UCSC.mm10.masked package on Bioconductor (as of May 2020) includes only two masks (AGAPS and AMB), whereas previous genomes included the TRF mask as well, which is what gkmSVM is looking for.
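One way to confirm this on your own installation is to list the masks attached to a chromosome before calling genNullSeqs. The following is only a minimal sketch: `missing_masks` is a hypothetical helper (not part of gkmSVM), and the assumption that TRF is the only mask gkmSVM needs is inferred from the error message rather than from the package source.

```r
# Hypothetical helper (not part of gkmSVM): report which required masks
# are absent from the set of mask names found on a chromosome.
# Assumption: "TRF" is the mask gkmSVM needs, as the error message suggests.
missing_masks <- function(available, required = "TRF") {
  setdiff(required, available)
}

# Only inspect the genome if the masked mm10 package is installed.
if (requireNamespace("BSgenome.Mmusculus.UCSC.mm10.masked", quietly = TRUE)) {
  library(BSgenome.Mmusculus.UCSC.mm10.masked)
  genome <- BSgenome.Mmusculus.UCSC.mm10.masked

  # Each chromosome of a masked genome carries a collection of named masks.
  available <- names(masks(genome[["chr1"]]))
  print(available)

  # A non-empty result means genNullSeqs is likely to fail as described above.
  print(missing_masks(available))
}
```

If `missing_masks` returns "TRF", the installed package is the build without the tandem repeat mask.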

Thankfully, on GitHub, Zhang (Frank) Cheng has added the TRF mask to the mm10 genome at https://github.com/biomystery/BSgenome.Mmusculus.UCSC.mm10.masked

I had to uninstall the Bioconductor version of the masked mm10 package and reinstall it using remotes::install_github("biomystery/BSgenome.Mmusculus.UCSC.mm10.masked")
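Spelled out, the swap might look like the sketch below. The wrapper function `reinstall_masked_mm10` is hypothetical and exists only to keep the steps together; the package and repository names are the ones given in this answer. It assumes the Bioconductor build is currently installed and that you have permission to modify your R library.

```r
# Hypothetical wrapper (illustration only): replace the Bioconductor build
# of the masked mm10 package with the GitHub fork that adds the TRF mask.
reinstall_masked_mm10 <- function(
    pkg  = "BSgenome.Mmusculus.UCSC.mm10.masked",
    repo = "biomystery/BSgenome.Mmusculus.UCSC.mm10.masked") {
  # Remove the existing build if it is installed.
  if (pkg %in% rownames(installed.packages())) {
    remove.packages(pkg)
  }
  # remotes provides install_github(); install it first if it is missing.
  if (!requireNamespace("remotes", quietly = TRUE)) {
    install.packages("remotes")
  }
  remotes::install_github(repo)
}

# Usage (not run here because it modifies the package library):
# reinstall_masked_mm10()
```

It is probably safest to run this in a fresh R session and to restart R afterwards, so that a previously loaded copy of the package is not still in use.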


Reply from fusion.slope, 4 months ago:

Awesome! Now the null sequence model works for the mouse genome too!