Question: Using Bioconductor package for gene expression analysis
1
gravatar for genomics Newbie
4 months ago by
genomics Newbie20 wrote:

We have RNA-Seq sequences in bam format and would like to perform gene expression analysis for specific genes. The most important step for us is to prepare count matrices.

Questions…

  1. Is Bioconductor’s rnaseqGene the best tool to use for gene expression with RNA-Seq sequences? It appears to encompass all steps?
  2. Is gene expression analysis for RNA-seq data with DESeq2 a better tool to just obtain counts?
  3. After the gene expression analysis is complete, which tools are best to perform survival analysis using the counts ?

Thank you.

rna-seq gene expression • 267 views
ADD COMMENTlink modified 4 months ago by Charles Warden6.1k • written 4 months ago by genomics Newbie20
1

The best thing to do is try to either obtain raw FASTQs (from which the BAMs were generated) or the raw counts (generated from the BAMs). If you are starting with BAMs, you are likely to encounter errors because you will need to obtain a gene annotation file (usually GTF) that is compatible with your BAM.

Then, you can check some other discussions like this one: Can anyone suggest a good tutorial to learn RNA-seq analysis?

ADD REPLYlink written 4 months ago by igor7.3k

What is your starting data?, i.e., what data do you have right now? That will dictate the program ('tool') that you eventually use.

New programs are released almost daily, so, the field is flooded with a diverse range of programs to use.

ADD REPLYlink written 4 months ago by Kevin Blighe37k

We currently have rna-seq sequences in bam format.

ADD REPLYlink written 4 months ago by genomics Newbie20

Great! How were they produced? I am imagining that you will say TopHat, Tophat2, or HISAT2

ADD REPLYlink written 4 months ago by Kevin Blighe37k

The sequences were provided to us in .bam format.

ADD REPLYlink written 4 months ago by genomics Newbie20
1

Okay, but it is actually of high importance to understand how the BAMs were produced. That is, which alignment program was used, and how it was used.

ADD REPLYlink written 4 months ago by Kevin Blighe37k
2
gravatar for Charles Warden
4 months ago by
Charles Warden6.1k
Duarte, CA
Charles Warden6.1k wrote:

I think rnaseqGene is more of a workflow / tutorial than a package, but I would say that it is an OK starting point.

In general, there isn't going to be one single strategy that can be guaranteed to work for your data (for example, I would at least test a couple other differential expression strategies, such as limma-voom and edgeR), and taking some time to test out different analyses on your data should help you gain confidence in your results.

However, if you don't have any experience in the area, I think this is a decent place to begin learning about some commonly used programs. Another possible example would be RnaSeqGeneEdgeRQL, but I don't really know what to say about the overall tutorial (except you should look beyond any individual workflow). I would also recommend making sure you are familiar with the R-base functions (sometimes, those can be useful, even if specialized programs also exist).

ADD COMMENTlink written 4 months ago by Charles Warden6.1k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1341 users visited in the last hour