Question: How can I transfer gene models to a new assembly?
7 months ago
O.rka210 wrote:

Here's my data:

sample_A: Canonical assembly with gene models (sample_A.fasta, sample_A.gff3)

sample_B: Mutant and de-novo assembly. No gene models (sample_B.fasta)

I want to transfer the gene models from sample_A to sample_B.

I thought this would be straightforward but it's definitely not. There are some instances where exon_2 comes before exon_1 or where a particular exon maps multiple times on the de-novo assembly.

Is there a tool that will do this? Ideally, I would like a tool that does the following:

program --ref_assembly sample_A.fasta --ref_annotations sample_A.gff3 --query_assembly sample_B.fasta --percent_identity 0.98 > sample_B.gff3

Here is an example of an edge case from mapping the exons of transcript FUN_000463-T1 (from sample_A.gff3 and sample_A.fasta) to the new assembly (sample_B.fasta). Notice the exon ordering:

Here's the left side zoomed in:

Here's the right side zoomed in:

Notice the exon ordering.
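
Cases like this could probably be flagged automatically before any annotation is transferred. Below is a minimal Python sketch, assuming the exon hits for one transcript have already been parsed into tuples. The function name `check_exon_hits`, the tuple layout, and the coordinates are all hypothetical and do not come from any particular aligner's output.

```python
from collections import defaultdict


def check_exon_hits(hits):
    """Flag multi-mapped and out-of-order exons for a single transcript.

    hits: iterable of (exon_number, contig, start, end, strand) tuples
          (a hypothetical layout; adapt it to your aligner's output).
    Returns (multi_mapped, out_of_order), where out_of_order is a list of
    (exon_i, exon_j) pairs whose relative placement is inconsistent.
    """
    by_exon = defaultdict(list)
    for exon, contig, start, end, strand in hits:
        by_exon[exon].append((contig, start, end, strand))

    # Exons with more than one placement on the new assembly
    multi_mapped = sorted(e for e, locs in by_exon.items() if len(locs) > 1)

    # Only uniquely placed exons are used for the order check
    unique = [(e, locs[0]) for e, locs in sorted(by_exon.items()) if len(locs) == 1]

    out_of_order = []
    for (e1, (c1, s1, _, st1)), (e2, (c2, s2, _, st2)) in zip(unique, unique[1:]):
        if c1 != c2 or st1 != st2:
            # Neighbouring exons landed on different contigs or strands
            out_of_order.append((e1, e2))
            continue
        # On '+', later exons should start further right; on '-', further left
        in_order = s2 > s1 if st1 == "+" else s2 < s1
        if not in_order:
            out_of_order.append((e1, e2))
    return multi_mapped, out_of_order


# Made-up hits: exon 2 lands upstream of exon 1, exon 3 maps twice
hits = [
    (1, "contig_7", 5200, 5350, "+"),
    (2, "contig_7", 4100, 4300, "+"),
    (3, "contig_7", 6000, 6200, "+"),
    (3, "contig_12", 900, 1100, "-"),
]
multi_mapped, out_of_order = check_exon_hits(hits)
print(multi_mapped, out_of_order)
```

Transcripts that come back with either list non-empty are the ones worth inspecting by eye, as in the screenshots above.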

7 months ago by genomax91k

You can try RATT. Success will depend on quality of your assemblies.

7 months ago by genomax91k

Thank you. I'm looking at it right now and it's pretty confusing to run. I installed with conda but it appears a lot of the files aren't there. I also found this tutorial:

Do you know of any other tools for this? I've heard of liftover, but there is little documentation on using it with a new organism.

7 months ago by O.rka210

I've updated my question a bit to be more specific.

7 months ago by O.rka210
7 months ago
Juke344.8k wrote:

There is a list of tools in Table 5 of this publication:

If you need the transcripts, you can extract them from your GFF, e.g. with AGAT: -g infile.gff -f infile.fasta --cdna
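
To illustrate what extracting transcripts from a GFF involves, here is a minimal standard-library Python sketch. It is not AGAT and does not reproduce its behaviour. It assumes a simple GFF3 in which `exon` features carry a `Parent` attribute and each transcript sits on one contig and one strand. The helper names `read_fasta` and `spliced_transcripts` and the toy data are hypothetical.

```python
from collections import defaultdict

_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def read_fasta(lines):
    """Parse FASTA lines into {sequence_name: sequence}."""
    seqs, name = {}, None
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            name = line[1:].split()[0]  # keep only the first word of the header
            seqs[name] = []
        elif name is not None and line:
            seqs[name].append(line)
    return {k: "".join(v) for k, v in seqs.items()}


def spliced_transcripts(gff_lines, genome):
    """Concatenate the exons of each transcript (Parent) into one sequence."""
    exons = defaultdict(list)
    for line in gff_lines:
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 9 or fields[2] != "exon":
            continue
        attrs = dict(kv.split("=", 1) for kv in fields[8].split(";") if "=" in kv)
        # An exon may list several parents, separated by commas
        for parent in attrs.get("Parent", "").split(","):
            if parent:
                exons[parent].append(
                    (fields[0], int(fields[3]), int(fields[4]), fields[6])
                )

    out = {}
    for tx, parts in exons.items():
        parts.sort(key=lambda p: p[1])  # genomic order by start coordinate
        # GFF3 coordinates are 1-based and inclusive
        seq = "".join(genome[c][s - 1:e] for c, s, e, _ in parts)
        if parts[0][3] == "-":
            seq = seq.translate(_COMPLEMENT)[::-1]  # reverse complement
        out[tx] = seq
    return out


# Toy data, for illustration only
genome = read_fasta([">chr1 demo", "AAACCC", "GGGTTT"])
gff = [
    "##gff-version 3",
    "chr1\tdemo\texon\t1\t3\t.\t+\t.\tID=e1;Parent=T1",
    "chr1\tdemo\texon\t7\t9\t.\t+\t.\tID=e2;Parent=T1",
    "chr1\tdemo\texon\t1\t3\t.\t-\t.\tID=e3;Parent=T2",
    "chr1\tdemo\texon\t4\t6\t.\t-\t.\tID=e4;Parent=T2",
]
transcripts = spliced_transcripts(gff, genome)
print(transcripts)
```

For real annotations a maintained tool such as AGAT is the safer choice, since it handles many GFF quirks that this sketch ignores.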

Although it is not listed in the publication, you can also use MAKER. See Basic Protocol 4, "Mapping Annotations to a New Assembly", in "Genome Annotation and Curation Using MAKER and MAKER-P".

6 months ago by Juke344.8k

Thank you for the suggestions! I will continue to look through these. It looks like "CESAR" is the most recent of these tools (2016). I've had issues running older tools that haven't been maintained in a while. I'm looking at "transMap" right now, but it's a bit confusing. So is transMap a part of ? I haven't seen any tutorials describing exactly how to do this. I'm a bit new to these suites, as I'm more familiar with funannotate.

7 months ago by O.rka210
