Question: Best tools for lifting over genome coordinates for non model organism (custom) reference genomes
gravatar for William
22 months ago by
William4.3k wrote:

What are currently the best tools for lifting over genome coordinates from one custom genome build to another custom genome build.

I found this 4 year old post that mentions UCSC liftOver as the top tool but that seems to be limited to model organisms as you need the chain file from UCSC . I can't find anywhere how to create these chain files your self: Converting Genome Coordinates From One Genome Version To Another (Ucsc Liftover, Ncbi Remap, Ensembl Api)

Is it possible to create these UCSC liftOver chain files by your self for custom genome builds?

Are there other tools with the same quality and functionality as UCSC liftOver that support creating these "chain" files yourself for custom reference genomes? How difficult and time / resource consuming is it to build these chain files?

Do these chain files and the lift over process take strand into account? If I have a VCF with an A/T SNP positions do they flip it to T/A SNP position if the new reference genome has a genome region flipped to the different DNA strand?

Does lift over support INDEL variants?

liftover • 924 views
ADD COMMENTlink written 22 months ago by William4.3k


CrossMap is a program for convenient conversion of genome coordinates (or annotation files) between different assemblies (such as Human hg18 (NCBI36) <> hg19 (GRCh37), Mouse mm9 (MGSCv37) <> mm10 (GRCm38)). It supports most commonly used file formats including SAM/BAM, Wiggle/BigWig, BED, GFF/GTF, VCF.

ADD REPLYlink written 22 months ago by Medhat7.9k

CrossMap first determines the correspondence between genome assemblies from UCSC chain file

Also for crossmap I need a UCSC chain file. How do I create these myself for two custom genome builds of the same species?

ADD REPLYlink modified 22 months ago • written 22 months ago by William4.3k
1 and

ADD REPLYlink written 22 months ago by genomax57k

These protocols are for small genomes only. We really need a way to produce chains from AGP or a clear workflow to create the chain from fasta data of larger size (and comprising many contigs/scaffolds) which is not appropriate for using the links above without expertise I did not find.

Anyone having created chain files from FASTA for genome/assemblies > 1Gb?

ADD REPLYlink written 16 months ago by Stephane Plaisance370
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1708 users visited in the last hour