Question

Functional annotation pipeline

6.9 years ago

sbchua.1990 ▴ 50

I have assembled a transcriptome using Trinity and clustered it using CD-HIT with a 0.95 identity threshold to reduce redundancy.

For the clustered transcripts, I performed blastx against the nr database (restricted to the Basidiomycota taxon because of limited computational power). I imported the BLAST results into Blast2GO to retrieve GO terms.

Using the same clustered transcripts, I used TransDecoder (default settings) to predict protein sequences. The predicted protein sequences (the TransDecoder output) were then used as queries at http://weizhong-lab.ucsd.edu/metagenomic-analysis/server/cog/ and http://www.kegg.jp/blastkoala/ for COG and KEGG functional annotation, respectively.

I have seen others run blastx against both the COG and KEGG databases, so I am not sure about my different approach.

Am I heading in the right direction?

RNA-Seq Assembly sequence blast • 2.8k views

ADD COMMENT • link updated 6.8 years ago by colindaven 6.4k • written 6.9 years ago by sbchua.1990 ▴ 50

Nothing wrong with your approach, but you will be left with the task of integrating the data.

I would use Trinotate to annotate a Trinity assembly. It annotates with BLAST and HMMER searches against UniProt and Pfam, respectively, and then infers GO annotations from those. Additional BLAST searches may be loaded as well, but these are not used for GO annotation.
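
For the integration task mentioned above, here is a minimal Python sketch of one way to combine the three result sets into a single table keyed by transcript. It is an illustration, not part of any of the tools named in this thread. It assumes each tool's output has already been parsed into a dictionary, that GO terms are keyed by transcript ID while COG and KEGG hits are keyed by TransDecoder protein ID, and that protein IDs are the transcript ID plus a `.p<N>` suffix (older TransDecoder releases name ORFs differently, so check your own headers). The function names and the example IDs are hypothetical.

```python
import re
from collections import defaultdict


def transcript_id(protein_id):
    """Strip a trailing '.p<N>' ORF suffix (assumed TransDecoder naming)."""
    return re.sub(r"\.p\d+$", "", protein_id)


def merge_annotations(go_by_transcript, cog_by_protein, kegg_by_protein):
    """Collect GO, COG and KEGG annotations under one transcript ID."""
    merged = defaultdict(lambda: {"GO": set(), "COG": set(), "KEGG": set()})
    # GO terms come from blastx on transcripts, so the keys are used as-is.
    for tid, terms in go_by_transcript.items():
        merged[tid]["GO"].update(terms)
    # COG and KEGG hits come from predicted proteins; map them back first.
    for source, table in (("COG", cog_by_protein), ("KEGG", kegg_by_protein)):
        for pid, terms in table.items():
            merged[transcript_id(pid)][source].update(terms)
    return dict(merged)


# Hypothetical example data, for illustration only.
go = {"TRINITY_DN1_c0_g1_i1": ["GO:0005524"]}
cog = {"TRINITY_DN1_c0_g1_i1.p1": ["COG0515"]}
kegg = {
    "TRINITY_DN1_c0_g1_i1.p1": ["K08884"],
    "TRINITY_DN2_c0_g1_i1.p2": ["K00001"],
}
merged = merge_annotations(go, cog, kegg)
```

Sets are used so that several ORFs from the same transcript do not produce duplicate entries.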

ADD REPLY • link 6.9 years ago by h.mon 35k

Thanks for your reply. I will consider your recommendation.

ADD REPLY • link 6.9 years ago by sbchua.1990 ▴ 50

score 0 · Answer 1 · 2017-07-17

6.8 years ago

colindaven 6.4k

I like InterProScan for functional annotation. It is surprisingly quick and easy to use, as long as you don't try to run it on a cluster.

https://github.com/ebi-pf-team/interproscan
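
As a rough illustration of how InterProScan results could feed into a GO annotation table, the Python sketch below collects GO terms per protein from the tab-separated output. It assumes the TSV was produced with GO lookup enabled, so that the 14th column holds the GO annotations, and it pulls GO identifiers out with a regular expression rather than relying on a particular separator. The function name is hypothetical.

```python
import csv
import re
from collections import defaultdict

GO_PATTERN = re.compile(r"GO:\d{7}")


def go_terms_from_interproscan(handle):
    """Map each protein accession (column 1) to the GO IDs found in column 14."""
    go_by_protein = defaultdict(set)
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 14:
            # Assumption: rows without a GO column carry no GO information.
            continue
        go_by_protein[row[0]].update(GO_PATTERN.findall(row[13]))
    return dict(go_by_protein)
```

Usage would look like `with open("proteins.tsv") as fh: go = go_terms_from_interproscan(fh)`, where `proteins.tsv` is a placeholder file name.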
