Entering edit mode
21 months ago
Melissa
•
0
Hi folks,
I assembled a transcriptome in Trinity v2.8.5 using the --include_supertranscripts
parameter. These are the deflines for the .fasta file:
>TRINITY_DN8_c3_g1_i1 len=330 path=[0:0-329]
>TRINITY_DN8_c1_g1_i1 len=271 path=[0:0-270]
>TRINITY_DN8_c2_g1_i1 len=357 path=[0:0-356]
>TRINITY_DN8_c0_g1_i4 len=2132 path=[0:0-1596 2:1597-1673 3:1674-1734 4:1735-1789 8:1790-1797 9:1798-1927 11:1928-2025 12:2026-2066 13:2067-2096 15:2097-2131]
These are the deflines for the .SuperTrans.fasta
file:
>TRINITY_DN8_c1_g1
>TRINITY_DN8_c0_g1
>TRINITY_DN8_c0_g2
>TRINITY_DN8_c0_g3
>TRINITY_DN10_c1_g1
>TRINITY_DN10_c2_g1
To my understanding, g1
indicates gene 1, i1
indicates isoform 1 of gene 1, len
is the length of the transcript and path
indicates the nodes of the de Buijn graph transversed by the transcript. What is indicated by DN8
, D10
and c0
, c1
, c2
, etc? Ive been searching around but cannot find an explanation. Thank you!
Per ChatGPT
Let's break down the components of this header:
For Supertranscript headers :
Length Information (len=1200):
Supertranscript Indicator (super=1):
Path Information (path=[1234:0-547 567:200-899]):