Question: Transcripts ID matching in tximport for edgeR
gravatar for Sharon
3.3 years ago by
Sharon510 wrote:

I am using tximport to convert transcripts counts from Salmon to gene counts for edgeR. Here is how my Salmon counts look like:

Name    Length  EffectiveLength TPM     NumReads
NM_000014.5     4945    4714.81 16.5914 4870.8
NM_000015.2     1317    962.778 0.0333619       2
NM_000016.5     2623    1865.02 5.28102 613.276
NM_000017.3     1964    1925.07 1.67686 201

But my tx2gene is like that, with no version (.x) after the transcript id:


tximport throws the following error :

None of the transcripts in the quantification files are present in the first column of tx2gene. Check to see that you are using the same annotation for both

My question is: Is it safe to remove the version .x numbers from my Salmon counts to match tx2gene file? The transcripts in Salmon will match tx2gene if I removed the version .x. Like for example convert NM_000014.5 in Salmon counts to NM_000014?


edger rna-seq salmon tximport • 1.8k views
ADD COMMENTlink modified 3.3 years ago by WouterDeCoster45k • written 3.3 years ago by Sharon510
gravatar for WouterDeCoster
3.3 years ago by
WouterDeCoster45k wrote:

Yes, safe.

You are looking for the ignoreTxVersion option of tximport:

logical, whether to split the tx id on the ’.’ character to remove version information, for easier matching with the tx id in gene2tx (default FALSE)

ADD COMMENTlink written 3.3 years ago by WouterDeCoster45k

Thanks a lot DeCoster. Works well now.

ADD REPLYlink written 3.3 years ago by Sharon510

You're welcome ron. Glad to help.

ADD REPLYlink written 3.3 years ago by WouterDeCoster45k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1686 users visited in the last hour