CLCBio Genomics workbench - protein coding genes only
2
0
Entering edit mode
4.5 years ago

Hi,

I map RNA-seq fastq files to the mouse genome (GRCm38) using the RNA-seq Analysis tool available in CLC Genomics workbench 9. As a result I get mapping results to ~46K genes. This is including non-coding genes. I would like to only map to protein-coding genes as I'm working with sequencing data of libraries that are prepared with Illumina TruSeq kit that captures mRNA by Oligodt beads.

Does anyone have experience with RNA-seq analysis in CLC Genomics? Is there a way of only mapping protein-coding genes in CLC Genomics

Best, Annika

RNA-Seq CLCBio Protein_coding • 1.2k views
ADD COMMENT
2
Entering edit mode

Since you are using a commercial product the company should support you with problems like this. But perhaps you are lucky and someone here can answer it too...

ADD REPLY
1
Entering edit mode
2.3 years ago

Annika - you can use a track file of annotation types for your reference genome to mask out / in only what you want to see. Alternatively, you can subselect the expression table results to only those feature types ("CDS" types in this case) that you want to keep and then do all your downstream analysis on subset results.

ADD COMMENT
0
Entering edit mode
4.5 years ago
h.mon 33k

There are several polyA ncRNA, so the kit would capture them as well. Anyway, best would be to map against all genome, then later restrict the analysis to the regions of interest, to avoiding mapping biases.

As WouterDeCoster pointed, few people here use CLC software, so chances are you wont get help specifically for it. But CLC support is attentive, should you contact them I am sure they will help you quickly.

ADD COMMENT

Login before adding your answer.

Traffic: 1624 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6