Why did I get more unique genes than unique transcripts w/ STAR -> featureCounts?
Entering edit mode
9 days ago
O.rka ▴ 650

Here's my STAR command:

STAR --genomeDir human/GRCh38.p13_GENCODE.39_ERCC92 --readFilesIn veba_output/preprocess/31O0P_S1/output/trimmed_1.fastq.gz veba_output/preprocess/31O0P_S1/output/trimmed_2.fastq.gz --outFileNamePrefix star_output/31O0P_S1/intermediate/1__star/ --runThreadN 1 --outReadsUnmapped Fastx  --readFilesCommand zcat && samtools sort -@ 1 -T star_output/31O0P_S1/tmp/samtools_sort star_output/31O0P_S1/intermediate/1__star/Aligned.out.sam > star_output/31O0P_S1/intermediate/1__star/mapped.sorted.bam

Here's my transcript-level featureCounts:

featureCounts -a human/GRCh38.p13_GENCODE.39_ERCC92/gencode.v39.primary_assembly.annotation.ERCC92.gtf -o star_output/31O0P_S1/intermediate/2__featurecounts/featurecounts.transcripts.tsv -F GTF -g transcript_id --tmpDir star_output/31O0P_S1/tmp/featurecounts -T 1  star_output/31O0P_S1/intermediate/1__star/mapped.sorted.bam

Here's my gene-level featureCounts:

featureCounts -a human/GRCh38.p13_GENCODE.39_ERCC92/gencode.v39.primary_assembly.annotation.ERCC92.gtf -o star_output/31O0P_S1/intermediate/2__featurecounts/featurecounts.genes.tsv -F GTF -g gene_id --tmpDir star_output/31O0P_S1/tmp/featurecounts -T 1  star_output/31O0P_S1/intermediate/1__star/mapped.sorted.bam

In every case I'm getting more unique genes than transcripts which doesn't make any sense to me since transcripts are a subset of genes enter image description here

rnaseq genomics star fastq alignment • 142 views

Login before adding your answer.

Traffic: 1021 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6