Functional annotation pipeline
1
0
Entering edit mode
6.9 years ago
sbchua.1990 ▴ 50

I have assembled a transcriptome using Trinity and clustering it using CD-hit with 0.95 threshold to reduce redundancy.

For the clustered transcripts, I performed blastx against nr database (restricted to basidiomycota taxon due to computational power). I imported the blast result into blast2GO to retrieve GO terms.

Using the same clustered transcripts, I used Transdecoder (default settings) to predict protein sequences. Then, the protein sequences (Transdecoder outputs) were used as queries in http://weizhong-lab.ucsd.edu/metagenomic-analysis/server/cog/ and http://www.kegg.jp/blastkoala/ for functional annotations for both COG and KEGG.

I have seen others perform blastx on both COG and KEGG database therefore not too sure about my different approaches.

Am I heading toward right direction?

RNA-Seq Assembly sequence blast • 2.8k views
ADD COMMENT
2
Entering edit mode

Nothing wrong with your approach, but you will be left with the task of integrating the data.

I would use Trinotate to annotate a Trinity assembly. It annotates with blast and hmmer searches against uniprot and pfam, respectively, and the infers GO annotations from those. Additional blast searches may be loaded as well, but these are not used for GO anotation.

ADD REPLY
0
Entering edit mode

Thanks for your reply. I would consider your recommendation.

ADD REPLY
0
Entering edit mode
6.8 years ago

I like Interproscan for functional annotation. It is surprisingly quick and easy to use, as long as you don't try to run it on a cluster.

https://github.com/ebi-pf-team/interproscan

ADD COMMENT

Login before adding your answer.

Traffic: 2263 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6