Question: Merge two genome annotation files: one annotated by contig, the other annotated by scaffold.
0
gravatar for joelepaul
2 days ago by
joelepaul0
joelepaul0 wrote:

Hi @ll!

After doing what was recommended to me in my previous post ( Match predicted sequences to reference genome to generate data for annotation GTF ), I ended up with two annotation files. EVM produced an annotation on a scaffold level ("scaffold1234"), BLAT produced an annotation on the contig level ("RDRX12302"). Now I am wondering how do I merge the information of these two GTF files? I do know which contig is part of which scaffold (the reference genome lists them as: ">RDRX12302 isolate A scaffold1234, whole genome shotgun sequence", but I imagine that the position numbers put out by BLAT are sensitive to contig identity and cannot just copied over to the respective scaffold names.

I would be grateful if you could point me into the correct direction.

Thank you! Joe

annotation • 47 views
ADD COMMENTlink modified 2 days ago by lieven.sterck7.9k • written 2 days ago by joelepaul0
0
gravatar for lieven.sterck
2 days ago by
lieven.sterck7.9k
VIB, Ghent, Belgium
lieven.sterck7.9k wrote:

As far as I see it, this will be a two step procedure.

step 1 : 'move' the annotations from the contig level to the scaffolds. Tools that come to mind here are for instance lift-over (from the AllMaps package) or lift-off (recently published by salzberg)

step 2: integrate/merge all annotations from the scaffold level. Not 100% sure but I think there is likely a tool in the AGAT suite that can do this.

ADD COMMENTlink written 2 days ago by lieven.sterck7.9k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 976 users visited in the last hour