Question: Comparative boxplots with R
0
6.1 years ago by
l0ka10
Italy
l0ka10 wrote:
sample NAF TAF
00450 0.5211098 0.5310629
00450 0.5193542 0.5286942
00450 0.5262824 0.5199457
00450 0.5230269 0.5252758
00450 0.5169092 0.5223112
00160 0.5221299 0.5324644
00160 0.5319794 0.5531024
00160 0.5233437 0.5358770
00160 0.5242215 0.5224607
00160 0.5152723 0.5229491
00810 0.5127049 0.5222062
00810 0.5263669 0.5320754
00810 0.5157763 0.5267149
00810 0.5433680 0.5671679
00810 0.5242678 0.5248383

Hi, I have a big data frame like this one above. I have to do a boxplot to compare NAF with TAF, by sample name. There are around 100 different samples, so I should split the data. How should I do?

With this code:

`boxplot(NAF ~ sample, TAF ~ sample, data=data, las=2, varwidth=T)`

it plots only NAF, not TAF...

R • 4.8k views
modified 6.1 years ago by EagleEye6.7k • written 6.1 years ago by l0ka10

I think I had done it once , by lattice graphics, I can't remember full code but the basic one which could help you could be like this

#melt the data

library(reshape2)

data=melt(data)

library(lattice)

bwplot(sample ~ NAF | TAF, data=data)

just play around with parameters of bwplot

ADD REPLYlink modified 6.1 years ago • written 6.1 years ago by Manvendra Singh2.1k
3
6.1 years ago by
Martombo2.7k
Seville, ES
Martombo2.7k wrote:

my suggestion would be to use the packages reshape2 and ggplot2. with those you can easily reassemble the data and plot it in a nicer way than the R default.

you just need the following commands:

```library(ggplot2)
library(reshape2)

reshaped_data=melt(data,id="sample")

p=ggplot(reshaped_data,aes(x=as.factor(sample),y=value,fill=variable))

p+geom_boxplot()```
2
6.1 years ago by
London, UK
Giovanni M Dall'Olio27k wrote:

There are many ways to do this.

For example, with ggplot2:

```library(ggplot2)
ggplot(d) + geom_boxplot(aes(x='NAF', y=NAF)) + geom_boxplot(aes(x='TAF', y=TAF)) + facet_wrap(~sample, ncol=2) + theme_bw() + scale_x_discrete('x axis label') + scale_y_continuous('value')```

An alternative is to reshape your dataset in a 'long' format:

```library(reshape2)
d.long = melt(d, id.vars='sample')
ggplot(d, aes(x=variable, y=value)) + geom_boxplot()
# you can even use lattice
library(lattice)
bwplot(value~variable|sample, d.long, layout=c(2,2))
```

You can use the ncol and nrow parameters in facet_wrap (or with the layout argument if you use lattice) to adjust the number of panels displayed in each row/column. If you have a lot of samples, the best thing to do is to add another column classifying the samples into smaller groups, and then plot one page per group.

heh I was 2 mins late...

you posted while I was writing :),

My comment is useless now :)

Sorry, next time you will be faster than me ;-)

ADD REPLYlink written 6.1 years ago by Giovanni M Dall'Olio27k

Thanks, it works very well!

1
6.1 years ago by
Siva1.7k
United States
Siva1.7k wrote:

You can try BoxPlotR which generates side-by-side box plots. This tool can be run online or locally.

1
6.1 years ago by
EagleEye6.7k
Sweden
EagleEye6.7k wrote:

I hope this will work:

mydata_frame <- data.frame(values=c(input[,2],input[,3]),vars = rep(c("NAF","TAF"), times = c(length(input[,2]),length(input[,2]))))

vars = rep(c("NAF","TAF"), times = c(length(input[,2]),length(input[,2]))))

values=c(input[,2],input[,3])

boxplot(values ~ vars, data = mydata_frame)