Question: Different position of the same alternative locus
10 weeks ago, ivar.grytten wrote:

Where do people get the position of GRCh38 alternative loci?

I've found multiple sources, and they seem to give different positions of some alternative loci.

If I download the patch 9 dataset from http://www.ebi.ac.uk/ena/data/view/GCA_000001405.24 (by clicking "regions" in the top right corner), I find that the alternative locus KI270915 is positioned on chr 19:54698377-54869056:

LRC_KIR KI270915.1 170665 1-170665 19 54698377-54869056

However, if I download patch 9 from NCBI (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/001/405/GCA_000001405.24_GRCh38.p9), the same alternative locus is positioned on chr 19:54025634-55084318:

LRC_KIR 19 54025634 55084318 alt-scaffold KI270915.1 NT_187669.1 ALT_REF_LOCI_21

Please correct me if I have misunderstood the format of these files.
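To make the comparison concrete, here is a minimal sketch that parses the two lines quoted above and compares the intervals. The column meanings are assumptions read off the quoted lines (they are not taken from either site's format documentation), and `Placement`, `parse_ena_line` and `parse_ncbi_line` are hypothetical helper names.

```python
from typing import NamedTuple


class Placement(NamedTuple):
    """Chromosome interval reported for one alt scaffold (1-based, inclusive)."""
    region: str
    scaffold: str
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1


def parse_ena_line(line: str) -> Placement:
    # Assumed columns: region, scaffold, scaffold length,
    # scaffold range, chromosome, chromosome range.
    region, scaffold, _length, _alt_range, chrom, chrom_range = line.split()
    start, end = (int(x) for x in chrom_range.split("-"))
    return Placement(region, scaffold, chrom, start, end)


def parse_ncbi_line(line: str) -> Placement:
    # Assumed columns: region, chromosome, start, stop, scaffold role,
    # GenBank accession, RefSeq accession, assembly unit.
    region, chrom, start, end, _role, genbank, _refseq, _unit = line.split()
    return Placement(region, genbank, chrom, int(start), int(end))


ena = parse_ena_line("LRC_KIR KI270915.1 170665 1-170665 19 54698377-54869056")
ncbi = parse_ncbi_line(
    "LRC_KIR 19 54025634 55084318 alt-scaffold KI270915.1 NT_187669.1 ALT_REF_LOCI_21"
)

# True when the ENA interval lies entirely inside the NCBI interval.
ena_inside_ncbi = ncbi.start <= ena.start and ena.end <= ncbi.end

print(ena.span, ncbi.span, ena_inside_ncbi)
```

Under these assumptions, the ENA interval spans 170,680 bp, close to the scaffold length of 170,665 given on the same line, while the NCBI interval spans 1,058,685 bp and fully contains the ENA interval. So the two files give intervals of very different sizes for this locus rather than slightly shifted copies of the same placement.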

I have not checked all the alternative loci, but most of them seem to have identical coordinates between the two sources, apart from a few.

Does anyone know why these would be different?

Also, does anyone know how the positions of these alternative loci are decided (see this unanswered question: "How are the GRC alt locus to primary assembly alignments generated?")?


10 weeks ago, deanna.church (Bethesda, MD) wrote:

What NCBI produces is exactly what the GRC submits. Many other sources will re-align the alt loci to the primary assembly. We don't really treat alignments as a first-class data object (no accessions, versions, etc.), so people feel free to do this. I think it just leads to confusion (as noted in the question).

That is not to say there is no room for improvement in the alignment process, but many of these alignments are in really tough regions. The alignments are generated by a tool at NCBI called NGAlign. You can also find it in the Genome Workbench tool distributed by NCBI. Some of the alignments end up being manually curated to adjust particularly challenging regions.


10 weeks ago, ivar.grytten replied:

Thanks, that answers my question.