HTSeq-count: gene models/transcripts are in FASTA format not gtf/gff
0
0
Entering edit mode
4.5 years ago
sbrashidx • 0

I've generated .SAM files with RNA-seq reads aligned to a reference genome using STAR. I ultimately want to pull out differential expressed genes from my dataset.

I've chosen to use HTSeq-count to generate count files to ultimately use in either DEseq2 or EdgeR. It seems I need to provide a gff/gtf file with transcripts or gene models. I have access to both the gene models and transcripts for the genome, but both files are in FASTA format.

What can I do to solve this issue? Is there another program, besides HTSeq I could use to generate count files?

Thank you so much!

HTSeq RNA-Seq • 1.0k views
ADD COMMENT
0
Entering edit mode

You could use kallisto and then import into DESeq2 via tximport. The DESeq2 vignette at bioconductor has an example for this.

ADD REPLY
0
Entering edit mode

Which reference genome did you use? I am sure you can get the matching gtf/gff files somewhere.

ADD REPLY

Login before adding your answer.

Traffic: 2663 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6