Question: How To Write Data In A Granges Object To A Bed File.
7
gravatar for Ram
5.4 years ago by
Ram130
Germany
Ram130 wrote:

Can anybody suggest how to write Granges object list to bed file?

Thanks a lot.

bed • 28k views
ADD COMMENTlink modified 2.7 years ago by endrebak760 • written 5.4 years ago by Ram130
3
  1. Open whatever internet search engine you prefer (e.g., google);
  2. Enter: Granges to bed file;
  3. Profit.
ADD REPLYlink modified 5.4 years ago • written 5.4 years ago by PoGibas4.8k
3

Your link currently gives this thread as the first hit. Profit indeed.

ADD REPLYlink modified 7 months ago • written 7 months ago by eric.kern13150
23
gravatar for Devon Ryan
5.4 years ago by
Devon Ryan90k
Freiburg, Germany
Devon Ryan90k wrote:

Given a GRanges object:

gr <- GRanges(seqnames = Rle(c("chr1", "chr2", "chr1", "chr3"), c(1, 3, 2, 4)),
  ranges = IRanges(1:10, end = 7:16, names = head(letters, 10)),
  strand = Rle(strand(c("-", "+", "*", "+", "-")), c(1, 2, 2, 3, 2)))

You can simply:

df <- data.frame(seqnames=seqnames(gr),
  starts=start(gr)-1,
  ends=end(gr),
  names=c(rep(".", length(gr))),
  scores=c(rep(".", length(gr))),
  strands=strand(gr))

write.table(df, file="foo.bed", quote=F, sep="\t", row.names=F, col.names=F)

to write that to foo.bed. The only trick is remembering the BED uses 0-based coordinates. If you have a GRangesList rather than a GRanges object, just use unlist(gr) in place of gr (things should still be in the same order).

ADD COMMENTlink modified 5.4 years ago • written 5.4 years ago by Devon Ryan90k

Thanks you so much for your reply!! I will try with it.

ADD REPLYlink written 5.4 years ago by Ram130

Dear dpryan,

I have similar kind if Granges mentioned above:

GRanges with 2515 ranges and 6 metadata columns:
       seqnames               ranges strand   |             Conc
          <Rle>            <IRanges>  <Rle>   |        <numeric>
   851     chrI [15059848, 15071787]      *   | 16.4150832178115
  1412   chrIII [  249517,   252803]      *   | 9.93391864180872
  3416     chrX [  108921,   114715]      *   | 14.5870600573661
  2224    chrIV [  851252,   855627]      *   | 10.5743489064907
  1604   chrIII [ 4431526,  4439773]      *   | 11.0537011405054
   ...      ...                  ...    ... ...              ...
  2453    chrIV [ 7011494,  7013670]      *   | 9.54973169373811
  2897     chrV [ 1743061,  1744363]      *   | 8.42611771396342
  3075     chrV [ 8460316,  8461383]      *   | 8.19169221695555
  2163   chrIII [13529231, 13531151]      *   | 9.38284126048039
  2655    chrIV [11863005, 11864250]      *   | 8.41453042457874

But with this i am finding difficult to convert to bed file? Can you suggest anything for this.

Thanks a lot for your help!

ADD REPLYlink modified 5.4 years ago by Devon Ryan90k • written 5.4 years ago by Ram130

The same basic process should work. If you want the Conc values to be stored in the scores column of the BED file, then just replace the scores=... line above with something like scores=elementMetadata(gr)$Conc, (assuming the GRanges is named gr).

ADD REPLYlink modified 5.4 years ago • written 5.4 years ago by Devon Ryan90k

Thanks a lot for your reply. But the problem in it is that there are 2515 ranges and in result it is only showing 10 ranges, how it can be possible? Thanks

ADD REPLYlink written 5.4 years ago by Ram130
2

The following works for me:

> cat foo.txt
name    start    stop    Conc
chrI    15059848    15071787    16.4150832178115
chrIII    249517    252803    9.93391864180872
chrX    108921    114715    14.5870600573661
chrIV    851252    855627    10.5743489064907
chrIII    4431526    4439773    11.0537011405054
chrIV    7011494    7013670    9.54973169373811
chrV    1743061    1744363    8.42611771396342
chrV    8460316    8461383    8.19169221695555
chrIII    13529231    13531151    9.38284126048039
chrIV    11863005    11864250    8.41453042457874
chrIV    11863006    11864251    9.41453042457874
chrIV    11863007    11864252    7.41453042457874

And then in R:

library(GenomicRanges)
d <- read.delim("foo.txt", header=T)
gr <- GRanges(seqnames=Rle(d$name),
    ranges = IRanges(d$start, end=d$stop),
    strand = Rle(strand(c(rep("*", length(d$name))))),
    Conc = d$Conc)

df <- data.frame(seqnames=seqnames(gr),
  starts=start(gr)-1,
  ends=end(gr),
  names=c(rep(".", length(gr))),
      scores=elementMetadata(gr)$Conc,
      strands=strand(gr))
    write.table(df, file="foo.bed", quote=F, sep="\t", row.names=F, col.names=F)

You might have to convert the "*" strands to ".", I don't recall off-hand what the BED format requires there.

ADD REPLYlink modified 5.4 years ago • written 5.4 years ago by Devon Ryan90k

today I needed this, googled it and got this hit, what  useful site :-)

ADD REPLYlink written 5.0 years ago by Istvan Albert ♦♦ 80k
3

Ditto!  Especially when you need to be able to do this asap because someone's conference is over the weekend and they want some additional graphs!!

Thank you Istvan and Devon for tirelessly helping everybody out!

ADD REPLYlink written 4.1 years ago by Anna S500
8
gravatar for vj
5.4 years ago by
vj390
UK
vj390 wrote:

You can try the rtracklayer package. It gives you options to export in various formats including the bed format.

ADD COMMENTlink written 5.4 years ago by vj390

In particular the export() function is what the OP is looking for.

ADD REPLYlink written 2.7 years ago by Giovanni M Dall'Olio26k
0
gravatar for Zhilong Jia
3.7 years ago by
Zhilong Jia1.4k
London
Zhilong Jia1.4k wrote:

get  a DataFrame object by mcols(gr) and  then write out.

ref: http://bioconductor.org/packages/release/bioc/vignettes/GenomicRanges/inst/doc/GenomicRangesIntroduction.pdf

 

ADD COMMENTlink written 3.7 years ago by Zhilong Jia1.4k

mcols() gives you the extra (meta) columns, but not the coordinates, which are what's really needed for a BED file. In fact the minimal BED file representation of GRanges object doesn't require any of those columns.

ADD REPLYlink written 3.7 years ago by Devon Ryan90k

You're right. The result of `mcols()` actually give me what I need instead of bed file. I forget the author's question. Thank you.

ADD REPLYlink written 3.7 years ago by Zhilong Jia1.4k

You may need to substract 1 from the start coordinates.

ADD REPLYlink written 2.7 years ago by Giovanni M Dall'Olio26k
0
gravatar for endrebak
2.7 years ago by
endrebak760
endrebak760 wrote:

If you only have one metadata column and you would like to keep it, this modification of Devon Ryan's answer works:

df <- data.frame(seqnames=seqnames(gr),
starts=start(gr)-1,
ends=end(gr),
names=c(rep(".", length(gr))),
scores=elementMetadata(gr)[,1],
strands=strand(gr)

For my data this gives:

  seqnames   starts     ends names scores strands
1     chrY 10515750 10515760     .      1       *
2     chrY 10519610 10519620     .      1       *
3     chrY 10534770 10534780     .      1       *
4     chrY 10540160 10540170     .      1       *
5     chrY 10554860 10554870     .      1       *
6     chrY 10560630 10560640     .      1       *
ADD COMMENTlink written 2.7 years ago by endrebak760
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 904 users visited in the last hour