from fasta to gtf format?
0
0
Entering edit mode
7 weeks ago
debitboro ▴ 200

Dear Biostars community,

I have a fasta file containing the sequences of a set of genes (with coordinates in the header). The fasta file is obtained after performing de novo assembly by a former colleague. I want to see if those genes are differentially expressed between two samples. Since I don't have the .gtf file of those genes, I don't know how to perform the DE analysis.

What I did:

I performed an alignment of the two samples against the reference genome (the same genome already used to obtain the fasta file containing the genes), and obtained .bam files. To get the matrix of count per read per sample, I need to provide a gtf file to htseq-count. But I don't know how to convert the fasta file to a gtf file?

Any help?

gtf DE fasta • 125 views
ADD COMMENT
0
Entering edit mode

I have a fasta file containing the sequences of a set of genes (with coordinates in the header).

If your reference only contained the gene fasta you could simply use samtools idxstats to get read counts for each entry.

ADD REPLY
0
Entering edit mode

I have a fasta file containing the sequences of a set of genes (with coordinates in the header).

If you show some example, we can suggest how to convert them to GFF if possible

ADD REPLY

Login before adding your answer.

Traffic: 1286 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6