Dear Biostars community, I am very new to genetic analysis. I have just finished extract human RNA gene read counts from fastq files to raw count files in order to conduct differential analysis in edgeR package. This link describes how I did that.
However, after I obtained the read counts file, I notice that I got a sequence of lines in the count file like this:
...
uc001adk.4 10
uc001adl.3 0
uc001adm.6 0
uc001ado.4 0
uc001adp.4 0
...
@Pierre Lindenbaum point out to me that ucxxxxxx.x is an transcript id (thank you very much :) ). Also, I have noticed that there are others like:
ENSG00000162367
which are known as ensembl gene id.
May I know what are the differences or connections between these two?
And What is the connection between these different gene id and gene symbols?
Thank you all in advance!
in fact , this is a transcript id
Thanks, I will edit this. So, the transcript id can then be converted to gene id?