Question: How to integrate augustus and InterProScan annotation?
0
gravatar for ciemanek
21 months ago by
ciemanek110
The Netherlands/Amsterdam
ciemanek110 wrote:

Hi all,

I have a set of SNPs for an organism for which annotation is still in progress. I obtained set of SNPs of interest and to get functional information I have GFF file with genes predicted by Augustus and GFF3 file with proteins annotation from InterProScan. Is there any tool that would let me integrate this information or will I be forced to write my own script? I didn't manage to find what I need so far but maybe I'm missing something.

So far I annotated my SNPs with augustus GFF file using snpEff but I can't do the same with InterProScan GFF3 file as I am having more or less this kind of output of building database for my organism:

Reading GFF3 data file : 'path_to_file' Total: 0 markers added.

Create exons from CDS (if needed): 
Exons created for 0 transcripts.

Deleting redundant exons (if needed): 
    Total transcripts with deleted exons: 0

Collapsing zero length introns (if needed): 
    Total collapsed transcripts: 0
Reading sequences   :
Reading FASTA file: 'path_to_reference'
    Reading sequence 'scf_1', length: 2205567
    Adding genomic sequences to exons:  Done (0 sequences added, 0 ignored).
    Reading sequence 'scf_2', length: 1244474
    Adding genomic sequences to exons:  Done (0 sequences added, 0 ignored).
    Reading sequence 'scf_3', length: 1199189
    Adding genomic sequences to exons:  Done (0 sequences added, 0 ignored).
    Reading sequence 'scf_4', length: 3287760
    Adding genomic sequences to exons:  Done (0 sequences added, 0 ignored).

And later on:

Total: 0 sequences added, 0 sequences ignored.
Adjusting transcripts: 
Adjusting genes: 
Adjusting chromosomes lengths: 
Ranking exons: 
Create UTRs from CDS (if needed): 
Correcting exons based on frame information.

Remove empty chromosomes: 
    Removing empty chromosome: 'g4789.t1'
    Removing empty chromosome: 'g7757.t1'
    Removing empty chromosome: 'g5919.t1'

It's the same for all scaffolds and genes - it looks to me like my genes are treated as chromosomes. How could I use this GFF3 file to biuld snpEff database for the annotation?

Regards, Agata

ADD COMMENTlink written 21 months ago by ciemanek110
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1041 users visited in the last hour