Problem featurecounts 0 assigned with canfam4, only less than 20% with canfam3 dog genome
0
0
Entering edit mode
3.1 years ago

I ran into a problem analyzing RNA-seq using dog genome (using galaxy). I had a trouble trying to align and then use featurecounts. In the table below you can see our numbers for CanFAM 4 (1rst column) and CanFAM 3.0.

So we aligned using canfam3 within galaxy and canfam 4 that we downloaded from NCBI in fna.gz format. Alignment with hisat2 went well with a majority of aligned reads. However using featurecounts with annotations from UCSC, as you can see for canFAM3, we have some assigned reads but less than 20 % with a lot of no_features but also multimapping.

Using CanFAM4, which has many more transcripts, we thought we would increase the assigned category, but instead we lose everything in assigned, have many more multimapping and no_features and lose the ambiguity group.

Dog is less well annotated than human or mouse but with canfam 4 we should have better assignment of reads over transcripts. Why do we have 0 assigned now? and so many multimapping and no_features? Is it possible our annotations and genome don’t match well?

FYI, our RNA-seq are paired and stranded. Do you need any other info?

We are trying now to download canFAM4 genome from genbank instead of NCBI maybe we should do the same with annotations (but I’m not sure which one to use, I know when I am in UCSC)?

Assigned    0   10  824  955
Unassigned_Unmapped 1 462  466  2  285  520
Unassigned_Read_Type    0   0
Unassigned_MultiMapping 17  176  987    11  328  584
Unassigned_NoFeatures   51  033  117    40  504  328
Unassigned_Overlapping_Length   0   0
Unassigned_Ambiguity    0   13  927

thanks for any help.

rna-seq • 590 views
ADD COMMENT
0
Entering edit mode

in the table above readers cannot tell what information is contained in what column, you have to be more specific

as general advice, make sure you are using the correct annotation file with the correct alignment file.

when you have multiple reference files, multiple alignments, and multiple annotation files, it does not take much to use the wrong file and get the wrong results

ADD REPLY

Login before adding your answer.

Traffic: 1781 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6