Hi all,
I came across this post on finding protein domains from gene fusions,so used this software "Oncofuse".
Oncofuse: Prediction Of Driver Gene Fusions From Ngs Data
My input is the STAR-fusion output. My command is following which runs,but the output from Oncofuse is an empty file.
java -Xmx1G -jar $onco_dir/Oncofuse.jar $fusionresults_dir/star-fusion.fusion_candidates.final.abridged starfusion AVG $onco_dir/out_onco.txt
Log file is below:
more oncofuse.sh.o515938.1
Dec 22, 2015 12:20:12 PM org.codehaus.groovy.runtime.m12n.MetaInfExtensionModule newModule
WARNING: Module [groovy-all] - Unable to load extension class [org.codehaus.groovy.runtime.NioGroovyMethods]
[Tue Dec 22 12:20:13 EST 2015] Loading RefSeq data, assuming hg19
[Tue Dec 22 12:20:13 EST 2015] Reading input file, assuming STARFUSION format
Ignoring line #fusion_name    JunctionReads    SpanningFrags    Splice_type    LeftGene    LeftBreakpoint    RightGene    RightBreakpoint
Ignoring line NCOA4--RET    57    35    ONLY_REF_SPLICE    NCOA4^ENSG00000138293.15    chr10:51582939:+    RET^ENSG00000165731.13    chr10:43612032:+
Ignoring line DHRS3--CCDC27    28    14    ONLY_REF_SPLICE    DHRS3^ENSG00000162496.4    chr1:12638746:-    CCDC27^ENSG00000162592.4    chr1:3683099:+
Ignoring line RET--NCOA4    9    6    ONLY_REF_SPLICE    RET^ENSG00000165731.13    chr10:43610184:+    NCOA4^ENSG00000138293.15    chr10:51584616:+
Ignoring line RP11-680G10.1--GSE1    4    2    ONLY_REF_SPLICE    RP11-680G10.1^ENSG00000261567.1    chr16:85391249:+    GSE1^ENSG00000131149.13    chr16:85682158:+
Ignoring line RP11-680G10.1--GSE1    1    2    ONLY_REF_SPLICE    RP11-680G10.1^ENSG00000261567.1    chr16:85391249:+    GSE1^ENSG00000131149.13    chr16:85667520:+
[WARNING] No valid fusions in input file!
[Tue Dec 22 12:20:13 EST 2015] Mapping breakpoints to known genes (this is going to filter a lot)
[Tue Dec 22 12:20:13 EST 2015] 0 fusions mapped out of 0. Dropped fusion candidates: 0 - 5'FPG not mapped, 0 - 3'FPG not mapped, 0 - mapped to same gene, 0 - discordant FPG directions
[Tue Dec 22 12:20:13 EST 2015] Getting features for FPG parts
[Tue Dec 22 12:20:13 EST 2015] Loading expression-related data
[Tue Dec 22 12:20:14 EST 2015] Loading domain and protein interaction interface-related data
[Tue Dec 22 12:20:15 EST 2015] Loading gene ontology data (time-consuming)
[Tue Dec 22 12:20:17 EST 2015] Annotating FPG parts
[Tue Dec 22 12:20:17 EST 2015] =Stage #1: raw data
[Tue Dec 22 12:20:17 EST 2015] ==Domain & PII related
[Tue Dec 22 12:20:17 EST 2015] ==Tissue specific
[Tue Dec 22 12:20:17 EST 2015] =Stage #2: some computations
[Tue Dec 22 12:20:17 EST 2015] ==Domain & PII related (time-consuming)
[Tue Dec 22 12:20:17 EST 2015] ==Tissue specific
[Tue Dec 22 12:20:17 EST 2015] Initializing classifier
[Tue Dec 22 12:20:17 EST 2015] Classifying fusions
[Tue Dec 22 12:20:17 EST 2015] Writing output
Let me know if any suggestions regarding using this software.
Thanks,
Ron
Hi,
I removed the Splice type column,it does not work.But the example file you mentioned gives results.It has extra columns
probably because of that?
Yes, it appears that the example file was from older version/format. Changing the columns so that they are the same as in example file should work.
So I would add two columns with 0's.Hope it does not impact the results.
I added the 2 columns to get the output.Can we get any graphical output?