Question: Error in piRNA gtf file while featurecounts step in STAR aligner?
0
gravatar for geethapriyanka7
11 months ago by
geethapriyanka70 wrote:

Hello I have a problem during featurecounts step in STAR aligner. I got BAM file through indexing and mapping of my data, but to quantify piRNA counts using featurecounts I'm facing gtf file(exon line missing from 3rd column)error.

WARNING no features were loaded in format GTF.
|| Failed to open the annotation file /home/bioinformatics/Desktop/pirnadb.v1_7_6.hg38.gtf, or its format is incorrect, or it contains no 'exon' features.

When I compared miRNA gtf file wih piRNA i found out that Start-codon column is missing from piRNAdb gtf file. As a alternative step I have checked gff3 file through ht-seq counts tool, which ended up giving same error.

Hope you would suggest me any other way to obtain quality reads of piRNA and a way to download apt gtf file.

Thanking you in advance.

alignment R • 378 views
ADD COMMENTlink modified 11 months ago by michael.ante3.6k • written 11 months ago by geethapriyanka70
0
gravatar for michael.ante
11 months ago by
michael.ante3.6k
Austria/Vienna
michael.ante3.6k wrote:

Hi,

It seems, that your GTF doesn't follow the (loose) standards. See e.g. the UCSC format FAQs about GFF2 and GTF.

The easiest way to repair your GTF is to use bioawk like :

bioawk -c gff '{$feature="exon"; print} ' pirnadb.v1_7_6.hg38.gtf

Nevertheless, the gene_id and transcript_id are missing in the attributes section and need to be included as well. Assuming, tzhe piRNA code is unique, you can use sed to insert the missing ids:

sed 's/piRNA_code \(\"hsa-piR-[0-9][0-9]*\"\;\)/gene_id \1 transcript_id \1/g'

You can pipe these two commands together bioawk ... | sed ... > new_pirnadb.gtf

I hope this will solve the issue.

Cheers,

Michael

ADD COMMENTlink written 11 months ago by michael.ante3.6k

hi, Michael Thank you for your suggestion. I did try your method. But I cant add the Gene_id and Transcripts_id in my output file. Hope you can see through this error.

Thank you in advance. Regards, Geetha.

ADD REPLYlink written 11 months ago by geethapriyanka70

Hi Geetha,

Did you receive an error? Can you provide the first couple of lines from your input and output gtf?

ADD REPLYlink written 11 months ago by michael.ante3.6k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1213 users visited in the last hour