Question: Creating a targets matrix
0
gravatar for Sethzard
14 months ago by
Sethzard10
Sethzard10 wrote:

As part of a group software development project we have been given 9 sets (3 sample types each containing 3 replicates) of assembled bat transcriptome sequences and their FPKM scores. The samples are the bats before being infected, 8 hours after being infected and 24 hours after being infected. The aim is to produce a list of the most differentially expressed genes and visual representations of the relatedness between samples (PCA plots or the like).

We have blasted the samples and extracted the FPKM scores into lists. We then put the scores into a single matrix, scoring 0.01 for any genes which weren't present in the blast. We then imported this into R and created a data matrix. We have logged the FPKM scores in the data matrix and converted into an expression set. We were planning on using limma to analyse the differences between treatment groups. Assuming we do what is the best way of creating a targets matrix (something which limma seems to require) for the analysis?

If anyone has a better idea of how to perform the analysis which can be done quickly I wouldn't say no but due to having been left in the lurch by someone we're running short on time.

Thanks in advance.

rna-seq limma • 581 views
ADD COMMENTlink modified 14 months ago by theobroma221.1k • written 14 months ago by Sethzard10

You have been given a sub-par setting because FPKM should not be used for differential gene expression analysis, instead such analysis should be based on raw counts which can be transformed as required. In particular, FPKM are not suitable for analysis by limma, instead you need voom transformed raw counts. I recommend to ask you supervisor for the raw data, then do a DE analysis using DEseq or limma.

ADD REPLYlink written 14 months ago by Michael Dondrup43k

Sadly I am all too aware that it's suboptimal but he's said that that's what he wants us to use. I've seen a few places you can use a set of logged scores in Limma like here but I'm struggling to work out how to do it with what we've been given.

ADD REPLYlink written 14 months ago by Sethzard10
2

That is sad indeed. I still think one should not use anything but state-of-the-art approaches in teaching, and therefore, I suggest to point your instructor to this thread. This community has a dedication to teaching and can support your project, but we also have a responsibility to deliver high quality. I hope this feed-back will be valuable for all parties involved in your project.

ADD REPLYlink modified 14 months ago • written 14 months ago by Michael Dondrup43k
3
gravatar for theobroma22
14 months ago by
theobroma221.1k
theobroma221.1k wrote:

Creating a targets matrix is very easy, the same is true for the contrasts matrix!! See the limma user guide, chapter 8, pg. 36. If you still haven't a clue after reading, post your targets matrix and/ or contrasts matrix code and we can go from there. Thanks.

ADD COMMENTlink written 14 months ago by theobroma221.1k
1

I have read that manual so many times yet somehow passed over that bit. THANK YOU!

ADD REPLYlink written 14 months ago by Sethzard10
0
gravatar for theobroma22
14 months ago by
theobroma221.1k
theobroma221.1k wrote:

Since you have FPKM and the library size and you did blast on those sequences, in the blast result you can get the gene length. Take the FPKM value and multiply it by the gene length and the library size. This will give you the count value. You can use this matrix of non-negative integers for limma. I hope you can pass your class too!

ADD COMMENTlink written 14 months ago by theobroma221.1k

We don't have the library size given to us. By have the library size do you mean calculate the library size from the lengths of each sequence? And thanks :)

ADD REPLYlink written 14 months ago by Sethzard10

Library size would be the total number of reads sequenced per sample.

ADD REPLYlink written 14 months ago by WouterDeCoster27k
0
gravatar for theobroma22
14 months ago by
theobroma221.1k
theobroma221.1k wrote:

Here is a response from Dr. Smyth, the author and creator of limma: https://support.bioconductor.org/p/56275/

ADD COMMENTlink written 14 months ago by theobroma221.1k

I've seen that post, which is why I'm not trying to voom it. I'm trying to use the Log2 scale as he suggested and I've run face first into a wall.

ADD REPLYlink written 14 months ago by Sethzard10
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 627 users visited in the last hour