Dear experts,
although (or exactly because of that) I read a lot of manuals and papers dealing with RNA-Seq for the analysis of differential expressed genes, I´m unfortunately don´t really sure about the pipeline/parameter I should use. So, I really hope that some of you can give me any advice.
My experimental setup in short: 2 WT samples and 2 patient samples (same family, same unknown mutation), 150 bp paired-end. I´m searching for genes that are differentially expressed. Especially, I´m searching for a gene which is not expressed in the patients, but in the WT (perhaps because of a deletion of an exon or suchlike). I merged the WT and patient samples and analyze the files with the help of TopHat2 and Cuffdiff2. As I learned in some forum discussions, I can´t use any statistics for filtering, because I don´t have biological replicates. So I´ve just calculated the TPMs and ranked them regarding their log2fold changes. But I have different problems: many of the genes have in one condition FPKM/TPM = 0 and I don´t know, how to proceed with them. Should I add a pseudocount? Or should I eliminate them? But regarding my questioning I ´don´t want to cut them off actually. Can anyone help me with this topic?
Thanks a lot in advance!!
Did you try to make a heatmap by cummeRbund library in R? This library is prepared for reading the diff_out files from cuffdiff program.
Here is a link to tutorial : http://compbio.mit.edu/cummeRbund/manual_2_0.html Best, Agata
I don't think I understand correctly ... how showing the differential expression (log2fc=+/-inf ) may be not necessary in heatmap? What would you like to have on this heatmap?