Entering edit mode
26 days ago
pinheirofabiano
▴
110
I have a raw counts table from a RNA-Seq experiment, but the gene IDs are really confusing.. I have both the gene IDs and the gene names, but some gene IDs are in ensembl notation, while others are in MSTRG notation..
some gene_names are missing.. I can't find HEPCIDIN (HAMP), for example...
some gene_names are blank, especially with the MSTRG notation...
some genes appear in duplicate...
which(Raw_count$gene_name=="S100A8")
1 80455 80510
some of them are together in a single MSTRG ID...
Can you help me understand the logical of this kind of notation? How can I find my GOIs?
More on MSTRG identifiers -- How to deal with MSTRG tag without relevant gene name?