Can someone give me a clear-cut answer to the question below?
Can I use the normalised .FPKM matrix file (built from RSEM abundance counts, and which feeds into edgeR) to annotate the top X sequences up-regulated/down-regulated between 2 conditions in Blast2GO Pro?
I have been reading that FPKM values are unreliable. What I am currently doing is working with the normalised FPKM matrix file mentioned above in Excel, filtering out sequences present between conditions to try to group the up-regulated/down-regulated sequences by their GO mapping and annotations in Blast2GO Pro. I work out the fold change, then its log2 value. Would this be bullshit? I have wonderful heat maps generated by edgeR showing differential expression, but can I be confident in using the same FPKM values that were used to generate those heat maps for the annotation analysis? Could someone with a bit of knowledge about this please help me?
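For reference, the Excel calculation described above (fold change, then log2) can be sketched in Python. This only shows the arithmetic; it is not a statistical test and says nothing about whether FPKM is an appropriate input. The transcript IDs, the FPKM values and the pseudocount of 1 are all made up for illustration.

```python
import math


def log2_fold_change(fpkm_a, fpkm_b, pseudocount=1.0):
    """Return log2(B / A), adding a pseudocount to both values.

    The pseudocount (an assumption here, not part of the original
    workflow) avoids division by zero when a transcript has FPKM 0.
    """
    return math.log2((fpkm_b + pseudocount) / (fpkm_a + pseudocount))


# Hypothetical FPKM values: transcript ID -> (condition A, condition B).
fpkm = {
    "transcript_1": (3.0, 15.0),
    "transcript_2": (7.0, 1.0),
    "transcript_3": (0.0, 0.0),
}

for name, (a, b) in fpkm.items():
    print(name, round(log2_fold_change(a, b), 3))
```

A positive value means higher expression in condition B, a negative value means higher expression in condition A.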
I have used the Trinity/RSEM/edgeR pipeline. Alternative methods are welcome, but I need somewhat speedy replies.
Any help appreciated. Thanks.
Are you just using edgeR for heatmaps (in which case, why bother, just use heatmap.2) or are you trying to use it for statistics too?
I am using edgeR for heat maps, but I also want statistics from it. I want to find the top 1000 up-regulated genes between 2 conditions, and once I have found them, feed their sequences into Blast2GO Pro using the Trinity .fasta file I already have that contains them.
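Pulling a chosen set of sequences out of the Trinity .fasta file could look roughly like the sketch below. It uses plain Python with no extra libraries, and it assumes the sequence ID is the first whitespace-delimited token of each header line. The IDs and sequences in the demo are invented.

```python
def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif line and header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def subset_fasta(lines, wanted_ids):
    """Yield only the records whose ID is in wanted_ids.

    The ID is taken to be the first token of the header (an assumption).
    """
    for header, seq in read_fasta(lines):
        tokens = header.split()
        if tokens and tokens[0] in wanted_ids:
            yield header, seq


# Invented example records standing in for a real Trinity.fasta file.
demo = [
    ">c0_g1_i1 len=12\n",
    "ACGTAC\n",
    "GTACGT\n",
    ">c1_g1_i1 len=8\n",
    "ACGTACGT\n",
]

for header, seq in subset_fasta(demo, {"c1_g1_i1"}):
    print(">" + header)
    print(seq)
```

For a real file, pass an open file handle instead of `demo` and write the printed records to a new .fasta file for upload.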
I posted a new comment while you were writing this reply, so I'll just refer to it below.
Or would it make more sense to use the edgeR.de.results files?
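If the DE results table is used, picking the top N up-regulated entries might be sketched as follows. This assumes a tab-delimited table with a header row containing `id`, `logFC` and `FDR` columns; those column names, the 0.05 FDR cut-off and the demo rows are assumptions, so check the header of your own file first (tables written by R with row names have no header entry for the first column).

```python
import csv
import io


def top_upregulated(handle, n, max_fdr=0.05):
    """Return up to n IDs with FDR <= max_fdr, sorted by descending logFC.

    Only rows with logFC > 0 are kept. Which condition counts as "up"
    depends on how the comparison was set up, so check the sign convention.
    """
    rows = csv.DictReader(handle, delimiter="\t")
    hits = [(r["id"], float(r["logFC"]), float(r["FDR"])) for r in rows]
    hits = [h for h in hits if h[2] <= max_fdr and h[1] > 0]
    hits.sort(key=lambda h: h[1], reverse=True)
    return [h[0] for h in hits[:n]]


# Invented rows standing in for a real DE results file.
demo_table = (
    "id\tlogFC\tFDR\n"
    "t1\t4.2\t0.001\n"
    "t2\t-3.0\t0.0005\n"
    "t3\t6.1\t0.2\n"
    "t4\t1.5\t0.01\n"
)

print(top_upregulated(io.StringIO(demo_table), 2))
```

The returned IDs can then be used to select the matching sequences from the Trinity .fasta file.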
BTW, regarding using edgeR (or limma/voom or DESeq(2)) with FPKM/RPKM values, I'll just link to one of Gordon Smyth's many replies on the subject from the bioconductor email list.