Hello! I am having some trouble figuring out how to use Salmon. I have around 30 different samples which I trimmed using bbmap then aligned them using STAR. I have all of the BAM files from this alignment. Should I merge them all before running Salmon or because they are all unique samples, do I run them separately? I also aligned them to the UMD3.1 cattle genome. Is this ok for Salmon or should I align it to a different reference? I am very new to all this and trying to teach myself as I go. So if anyone has any other sites that could help me out, that would be great!
Thanks!
There are a few conceptual issues that might help you in the analysis:
You said you aligned the reads with STAR to the UMD3.1 cattle genome. With such alignments you cannot quantify using salmon. To use Salmon you'll need to work with a transcriptome, available from here ftp://ftp.ensembl.org/pub/release-90/fasta/bos_taurus/ (you'll want to download cDNA).
If your interest is in finding unannotated splice sites or transcripts in the cow then you ought to be aligning to the genome as you did; you could then run a variety of tools to analyze the results; salmon doesn't do that.
Salmon can be used to take STAR alignments to the transcriptome and quantify them. That is, you could feed it the STAR alignments to the ENSEMBL cDNA. It can also quantify directly from the reads by pseudoalignment (the distinction is explained here https://liorpachter.wordpress.com/2015/11/01/what-is-a-read-mapping/). I don't know of a benchmark that has tested whether alignment with STAR + Salmon is better or worse than Salmon with pseudoalignment.
You asked about whether to quantify the samples jointly or not. That depends on the downstream analysis you will perform with them.