Dear Biostars community, I am very new to genetic analysis. I have just finished extract human RNA gene read counts from fastq files to raw count files in order to conduct differential analysis in edgeR package. This link describes how I did that.
However, after I obtained the read counts file, I notice that I got a sequence of lines in the count file like this:
... uc001adk.4 10 uc001adl.3 0 uc001adm.6 0 uc001ado.4 0 uc001adp.4 0 ...
@Pierre Lindenbaum point out to me that ucxxxxxx.x is an transcript id (thank you very much :) ). Also, I have noticed that there are others like:
which are known as ensembl gene id.
May I know what are the differences or connections between these two?
And What is the connection between these different gene id and gene symbols?
Thank you all in advance!