Question: Question about exon/intron counting in RNA-seq dataset analysis
6 months ago by
nanoide30
nanoide30 wrote:

Hi, I'm currently analyzing some RNA-seq datasets and want expression at the gene level. I got raw counts with featureCounts and then normalized counts with DESeq2.

My question is about huge genes with very long introns. I'm afraid the normalization takes the whole gene length into account, so expression from the exons of genes with huge introns would be underestimated. I believe DESeq2 normalizes by the total number of reads, but I don't know whether having huge introns might be a confounder in other parts of the analysis. Maybe I could run the analysis at the exon level and then somehow combine the counts per gene? I don't know if that's crazy.

Any thoughts? Thanks for any help

6 months ago by
caggtaagtat860
caggtaagtat860 wrote:

I think you can use featureCounts to count reads on exons and then summarize the counts of all exons of each gene, so intron length should not be a problem.

Edit: as long as there are exons annotated in your gtf file

Each entry in the provided annotation file is taken as a feature (e.g. an exon). A meta-feature is the aggregation of a set of features (e.g. a gene). The featureCounts program uses the gene_id attribute available in the GTF format annotation (or the GeneID column in the SAF format annotation) to group features into meta-features, ie. features belonging to the same meta-feature have the same gene identifier.
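To make the feature/meta-feature idea concrete, here is a minimal Python sketch of the counting logic, not the actual featureCounts implementation. The exon coordinates, reads, and the function name `count_reads_per_gene` are all made up for illustration. It assumes a single chromosome and strand, counts a read once per gene if it overlaps any exon of that gene by at least one base, and skips reads that hit more than one gene.

```python
from collections import defaultdict

# Hypothetical toy annotation: (gene_id, exon_start, exon_end), inclusive coordinates
EXONS = [
    ("geneA", 100, 200),
    ("geneA", 5000, 5100),  # separated from the first exon by a long intron
    ("geneB", 9000, 9300),
]

# Hypothetical reads as (start, end) on the same chromosome and strand
READS = [(150, 199), (190, 240), (3000, 3049), (5050, 5099), (9100, 9149)]


def count_reads_per_gene(exons, reads):
    """Count each read once for the gene whose exons it overlaps."""
    # Group exons (features) by gene_id to form genes (meta-features)
    gene_exons = defaultdict(list)
    for gene_id, start, end in exons:
        gene_exons[gene_id].append((start, end))

    counts = {gene_id: 0 for gene_id in gene_exons}
    for r_start, r_end in reads:
        # Genes having at least one exon that overlaps the read
        hits = {
            gene_id
            for gene_id, intervals in gene_exons.items()
            if any(r_start <= e_end and e_start <= r_end for e_start, e_end in intervals)
        }
        # Purely intronic reads hit nothing; ambiguous reads are skipped
        if len(hits) == 1:
            counts[hits.pop()] += 1
    return counts


print(count_reads_per_gene(EXONS, READS))
```

In this toy case the read at 3000-3049 falls inside the intron of geneA and is not counted, so the intron length has no effect on the gene-level count.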


Thank you for your useful answer! Will take it into account. Regards

written 6 months ago by nanoide30
6 months ago by
swbarnes27.0k
United States
swbarnes27.0k wrote:

DESeq2 knows nothing about your features except their names. How could it? You never give it that information. If the GTF you gave to featureCounts had proper exon and intron information, you are fine.
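For illustration, the default DESeq2 normalization is the median-of-ratios method, which only looks at the count matrix. Below is a rough pure-Python sketch of that calculation, not DESeq2's own code. The function name `size_factors` and the toy counts are assumptions made up for the example. Note that gene length never appears anywhere in it.

```python
import math
from statistics import median


def size_factors(counts):
    """Median-of-ratios size factors.

    counts: list of rows (genes), each a list of raw counts per sample.
    """
    n_samples = len(counts[0])
    # Genes with a zero in any sample are skipped, since their log is undefined
    usable = [row for row in counts if all(c > 0 for c in row)]

    log_ratios = [[] for _ in range(n_samples)]
    for row in usable:
        logs = [math.log(c) for c in row]
        log_geo_mean = sum(logs) / n_samples  # log of the gene's geometric mean
        for j, value in enumerate(logs):
            log_ratios[j].append(value - log_geo_mean)

    # One factor per sample: the median ratio to the per-gene geometric mean
    return [math.exp(median(col)) for col in log_ratios]


# Hypothetical toy matrix: the second sample was sequenced twice as deeply
COUNTS = [[10, 20], [100, 200], [50, 100], [0, 3]]
factors = size_factors(COUNTS)
normalized = [[c / f for c, f in zip(row, factors)] for row in COUNTS]
print(factors)
print(normalized)
```

After dividing by the size factors, the first three genes have the same normalized value in both samples, which is what you would expect when the only difference between the samples is sequencing depth.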
