Question: Pseudo-peaks in upstream regions
0
gravatar for boczniak767
2.8 years ago by
boczniak767680
Poland
boczniak767680 wrote:

Hi, I'd like to create a bed file defining subsequence (randomly taken 100-400bp, different for each input sequence) of 1kb sequences in bed file. I've looked of course at bedtools, and searched the web but haven't find anything useful.

bedtools random will generate fixed length sequences restricted by the chromosomes' boundaries

As I'd like to use bed file to extract sequences from genome I've also considered extraction of 1kb fasta sequences (using my bed file) and trimming it. But haven't find the answer how to do that pseudo-random trimming.

sequence bed • 673 views
ADD COMMENTlink modified 2.8 years ago • written 2.8 years ago by boczniak767680

It seems that I've found the solution. Because I need such pseudo-peaks as a background for analysis of peaks detected in real data, the easiest solution is to call peaks the same way on some random data. I'll try samtools merge and samtools -s commands to create the subsample of randomized alignments from my bam files and call peaks using my standard parameters.

ADD REPLYlink modified 2.8 years ago • written 2.8 years ago by boczniak767680

I think it's the end of my monologue ;-) It's turns out that using peaks called on random data is not efficient - in fact I've got less ranges than from real data, although I used bigger file.

Eventually I've used bedtools shuffle with -incl option with 2k upstream gene sequences. As input -i I've used randomized positions with regard to genes from multiplied peak file.

ADD REPLYlink written 2.8 years ago by boczniak767680
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1849 users visited in the last hour