Hello,
I was running HTSeq count script and I keep getting an error saying "Error: Feature EDO49941-1 does not contain a 'gene_id' attribute"? I'm guessing its because I used a .gff3 instead of .gtf. So if I were to convert the file, should I use the method makeTxDdfromGFF() from the genomicFeatures package or is there a better way?
UPDATED QUESTION: I tried using the -i command line option but keep getting the error below, so I was hoping some one just help me figure out how to use the -i command line option so it works with my .gff3 file? Thanks.
Error: Feature EDO49941-1 does not contain a 'ID' attribute
Here is the command I used:
htseq-count -i ID  /home/ubuntu/data/rnaseq/nematostella/venus/alignment/1_STARAligned.out.sam /home/ubuntu/data/rnaseq/nematostella/venus/genome/Nematostella_vectensis.GCA_000209225.1.33.gff3
Here are a couple lines of my .gff3 file:
NEMVEscaffold_1 JGI gene    25970   37222   .   -   .   ID=gene:NEMVEDRAFT_v1g196074;biotype=protein_coding;description=Predicted protein  [Source:UniProtKB/TrEMBL%3BAcc:A7REN5];gene_id=NEMVEDRAFT_v1g196074;logic_name=jgi_nemve
NEMVEscaffold_1 JGI gene    40959   43943   .   -   .   ID=gene:NEMVEDRAFT_v1g196075;biotype=protein_coding;description=Predicted protein  [Source:UniProtKB/TrEMBL%3BAcc:A7REN6];gene_id=NEMVEDRAFT_v1g196075;logic_name=jgi_nemve
NEMVEscaffold_1 JGI pseudogene  46093   47031   .   +   .   ID=gene:NEMVEDRAFT_v1g5443;biotype=pseudogene;gene_id=NEMVEDRAFT_v1g5443;logic_name=jgi_nemve
NEMVEscaffold_1 JGI gene    47232   48589   .   +   .   ID=gene:NEMVEDRAFT_v1g237689;biotype=protein_coding;description=Predicted protein  [Source:UniProtKB/TrEMBL%3BAcc:A7REN7];gene_id=NEMVEDRAFT_v1g237689;logic_name=jgi_nemve
Thanks.
Have you tried
-i gene_idwhich is the default.Yeah I started with that, -i gene_id. But it says "gene_id" is not found so I wasn't confused? I thought I may need to change it since it was .gff3 file.